Conserved domains on  [gi|490419153|ref|WP_004291462|]

MULTISPECIES: N-6 DNA methylase [Bacteria]


List of domain hits

Name Accession Description Interval E-value
COG4646 COG4646
Adenine-specific DNA methylase, N12 class [Replication, recombination and repair];
517-1869 0e+00

Adenine-specific DNA methylase, N12 class [Replication, recombination and repair];


Pssm-ID: 443684 [Multi-domain]  Cd Length: 1711  Bit Score: 720.88  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  517 IYATLDWDTNPPINGFYEMMMGLTPERRKELRELARQHNEKQVAEKTEVKAVPETSREQPRQEETQPEAVAAPAVTDTPS 596
Cdd:COG4646   411 LGSLKDAFPKARARRAARIAADLAASVAASLAAAARALATAEILEITEIREADRLEAEADEDLKDLEVVFAELEGEGAII 490
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  597 EAVGTFlfpdieAEKPKEEVVDlsprayhrtpEMHLREGSLVADRGRhnIGYLKDitpygATFQPLDLKGYQKEKALLYV 676
Cdd:COG4646   491 DVNRYF------ADHPEVTPPA----------DPSVKDGSYTFEDGV--LYVDEA-----HNFKNLEVPATKMRRVAGLI 547
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  677 SLRDAYERLYRYESLRREANVPWREHLNTCYDEFVMRYGNLNAKQNVKLVMMDAGGRDILSLERM--ENGKFVKADIFEH 754
Cdd:COG4646   548 PLRDAVRELIEAQAEDDGSQKALRMRLNCRYDAFVAKYGPINSRPNLRAFRDDPDYPLLLSLEEYdeETGTARKADIFTK 627
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  755 PVSFAVEShANVGSPEEALSASLNKYGTVNLDYMREITDSTAEDLLTALQGRIYYNPLVT--GYEIKDRFIAGNVIEKAE 832
Cdd:COG4646   628 RVIRPPTE-TSVDTAAEALAVSLNERGRVDLDYMAELTGTPISNSLAELYGMIYLDPDTLedGWVTFDEYLSGNVREKLA 706
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  833 RIEAWMGDNPENermpeVKQALEALKDAEPQRIAFEDLDFNFGERWIPTGVYAAYMSRLFDTEVKIAYSASMDEFSVVCG 912
Cdd:COG4646   707 AARAAAELDPRT-----FGENVTALELVQPEDLEPSEIDVRLGATWIPKTRFEAFIRELLGTPITVSYSPETGEWSVKGK 781
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  913 YRTMKITDEFLVKGYyrnyDGMHLLKHALHNTCPDMMKSIGKDehgndIKMRDSEGIQLANAKIDEIRNGFSEWLEEQsP 992
Cdd:COG4646   782 NGNAAATSTYGTERA----NAPELLEDALNVADIRIADPVPDE-----RRVLNTEETEAAKEKQEAIKEAFAEWVWED-P 851
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  993 QFKERLVTMYNRKFNCFVRPRYDGSHQTFPDLNLKglasrgiKSVYPSQMDCVWMLKQNGGGICDHEVGTGKTLIMCIAA 1072
Cdd:COG4646   852 ERAERLVRLYNDKFNSIVPREYDGSHLKFPGDSRK-------ISLRPHQKNAVARILYGGNTLLAHEVGAGKTFTMVAAA 924
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1073 HEMKRLNLAHKPMII----GLKANVAEIAATYQAAYpnarILYASEKDFSTANRVRFFNNIKNNDYDCVIMSHDQFGKIP 1148
Cdd:COG4646   925 MELRRLGLANKPMIVvpnhLLEQDAPSKLNLYAAAN----ILIATKTDFEKGTRLVFCADIATGDYDAVIIGHIQFEKIP 1000
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1149 ----QSPELQQRILQAELDTVEENLEVLRQQGKNVSRAMLKGLEKRKHNLEAKLEKVEHAIKSRTDDVVDFKQMGIDHIF 1224
Cdd:COG4646  1001 asgeRQEEILEEQIAEILKAIKELKAVVRKRFTVKQLESTKKLGAGKLKQLDLLALKDLDVPWEPLDVDQLFGRGSRQGN 1080
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1225 IDESHQFKNLTFNTRHDRVAGLGNSEGSQKALNMLFAIRTIQERTGKDLGATFLSGTTISNSLTELYLLFKYLRPKELER 1304
Cdd:COG4646  1081 NNFLVTKMRNVAGLAFSDAAKLSDYFGKQRYRDELTAGKGVVVATGTDESNLMYELYTAQAYLQLLLLGKQGLTNFDTWA 1160
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1305 QDIRCFDAWAAIFAKKTTDFEFNVTNNVVQKERFRYFIKVPELAAFYNEITDYRTAEDVGVDRPAKNEILHHIPPTPEQE 1384
Cdd:COG4646  1161 STLEELVTAAELAPERTAYRANTREAKAVNLPEEDVMIKEAEDAKTADELLLPTPEKISGGVATKPSEVQKELLEELEER 1240
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1385 DFIQKLMQFAKTGDATLLGRLPLSETEEKAKMLIATDYARKMALDMRMIDPNYEDHPDNKASHCAKMIAEYYQKYDAQKG 1464
Cdd:COG4646  1241 AAIVRKNDGEPDRDNMLVITDDGRKAALDQRLDIKTLPDDEGSLVALCVTNIDRIWEDNPESKLTQLVFCDLSTPKGDGT 1320
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1465 TQFVFSDLGTYQPGDgWNVYSEIKRKLTEDYGIPPSEVRFIQECKTDKARKAVIDAMNAGTVRVLFGSTSMLGTGVNAQK 1544
Cdd:COG4646  1321 FNDLEDIREKLIEEE-IAELEIAFIHLALDDQEKAELFARDRLGAVEKLRISTAKMGAGTNVRLLLEATHDLDVPWRPRD 1399
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1545 RCVAIHHLDTPWRPSDLQQRDGRGVRAGNEIAKHFAGNNVDVIIYAVEKSLDSYKFNLLHCKQTFISQLKSGAMGARTID 1624
Cdd:COG4646  1400 AEQRAGRGRRQGNENEEVEEIRYVTENTFDAYLWQAAETKQKFIAQIMTSKSPVRSLEDVDEAALSYAERKALAAGRPKE 1479
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1625 EGAMDEKSGMNFSEYMALLSGNTDLLDKAKLEKRIASLEGERKSFNKGKRDSEFKlesktgelrnntafIDAMTEDWNRF 1704
Cdd:COG4646  1480 KEKMDLDIEVLKLKLLDAAALEQLYAEEDKLRKSYLDEEEALEERIEAATKDLRL--------------ARAASQEEADE 1545
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1705 LSVVQTDKEGNRLNIIKVDGVDSADEKVIGKRLQEIAKNATTGGLYTQVGELYGFPIKVVSERILKEGLEFTDNRFVVEG 1784
Cdd:COG4646  1546 QESASKEAAAGEKKAAAAELLAALQAAGLIVLDGGRTPRGEKGGGLLARALLEAATLLLPIEEAEGSEGADATGDRRTGA 1625
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1785 NYKYTYNNGHLAMADPLAAARNFLNAMERIPSIIDQYKAKNEVLEMEIPQLQEIAGKVWKKEDELKQLKSELAALDRKIQ 1864
Cdd:COG4646  1626 AAEIELAAEALILNLAERLERALRDGAEEEEIAPRELEAALKEEAALLARAGELAELELDKADLEAELEALADDLAEAAE 1705

                  ....*
gi 490419153 1865 LELAP 1869
Cdd:COG4646  1706 EERKE 1710
YtxK super family cl28092
Adenine-specific DNA N6-methylase [Replication, recombination and repair];
97-297 1.86e-15

Adenine-specific DNA N6-methylase [Replication, recombination and repair];


The actual alignment was detected with superfamily member COG0827:

Pssm-ID: 440589 [Multi-domain]  Cd Length: 327  Bit Score: 79.61  E-value: 1.86e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153   97 LKASVLTAF-YTPkeitDTIADVLAD-----YSVRPARMLEPSAGVGVFVDSMLRHSPN-ADVMAFEKDLLTGTILRHLY 169
Cdd:COG0827    85 MKESVQPNHqMTP----DAIGLLIGYlvekfTKKEGLRILDPAVGTGNLLTTVLNQLKKkVNAYGVEVDDLLIRLAAVLA 160
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  170 PDQKmRTCGF--EKIERP-FNNYFDLAVSNIPFGDIAVFDaefqRSDSFGRRSAQKT--IHNYFFLKGLDAVRDGGIVAF 244
Cdd:COG0827   161 NLQG-HPVELfhQDALQPlLIDPVDVVISDLPVGYYPNDE----RAKRFKLKADEGHsyAHHLFIEQSLNYLKPGGYLFF 235
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 490419153  245 ITSQGVLNSTKTS-VRNELFSQANLVSAIRLPNNLFTDNAGTEvgsDLIVLQKN 297
Cdd:COG0827   236 LVPSNLFESDQAAqLREFLKEKAHIQGLIQLPESLFKNEAAAK---SILILQKK 286
PspC_subgroup_2 super family cl41463
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
395-630 2.23e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


The actual alignment was detected with superfamily member NF033839:

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.46  E-value: 2.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  395 KVAVQNKVERPAIKLETVSS----AQTVETPTE--KPQPADEKPEIEPRPQysagvqltlldlwgmteevsQPKTSKKKK 468
Cdd:NF033839  254 KVEIENTVHKIFADMDAVVTkfkkGLTQDTPKEpgNKKPSAPKPGMQPSPQ--------------------PEKKEVKPE 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  469 TVKKAVTAKSTPPKPKVTVTPTAPTAKPamenkEVKAE-NTAKPadpddiyaTLDWDTNPPINGFYEMMMGLTPERRKEL 547
Cdd:NF033839  314 PETPKPEVKPQLEKPKPEVKPQPEKPKP-----EVKPQlETPKP--------EVKPQPEKPKPEVKPQPEKPKPEVKPQP 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  548 RELARQHNEKQVAEKTEVKAVPETSREQ--PRQEETQPEAVAAPavtDTPSEAVGTflfpdiEAEKPKEEVvdlSPRAYH 625
Cdd:NF033839  381 ETPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQP---EKPKPEVKP------QPEKPKPEV---KPQPEK 448

                  ....*
gi 490419153  626 RTPEM 630
Cdd:NF033839  449 PKPEV 453
 
YtxK COG0827
Adenine-specific DNA N6-methylase [Replication, recombination and repair];
97-297 1.86e-15

Adenine-specific DNA N6-methylase [Replication, recombination and repair];


Pssm-ID: 440589 [Multi-domain]  Cd Length: 327  Bit Score: 79.61  E-value: 1.86e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153   97 LKASVLTAF-YTPkeitDTIADVLAD-----YSVRPARMLEPSAGVGVFVDSMLRHSPN-ADVMAFEKDLLTGTILRHLY 169
Cdd:COG0827    85 MKESVQPNHqMTP----DAIGLLIGYlvekfTKKEGLRILDPAVGTGNLLTTVLNQLKKkVNAYGVEVDDLLIRLAAVLA 160
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  170 PDQKmRTCGF--EKIERP-FNNYFDLAVSNIPFGDIAVFDaefqRSDSFGRRSAQKT--IHNYFFLKGLDAVRDGGIVAF 244
Cdd:COG0827   161 NLQG-HPVELfhQDALQPlLIDPVDVVISDLPVGYYPNDE----RAKRFKLKADEGHsyAHHLFIEQSLNYLKPGGYLFF 235
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 490419153  245 ITSQGVLNSTKTS-VRNELFSQANLVSAIRLPNNLFTDNAGTEvgsDLIVLQKN 297
Cdd:COG0827   236 LVPSNLFESDQAAqLREFLKEKAHIQGLIQLPESLFKNEAAAK---SILILQKK 286
HELICc smart00490
helicase superfamily c-terminal domain;
1512-1572 1.39e-06

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 47.98  E-value: 1.39e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 490419153   1512 KARKAVIDAMNAGTVRVLFgSTSMLGTGVN-AQKRCVAIhhLDTPWRPSDLQQRDGRGVRAG 1572
Cdd:smart00490   24 EEREEILDKFNNGKIKVLV-ATDVAERGLDlPGVDLVII--YDLPWSPASYIQRIGRAGRAG 82
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
1483-1572 1.94e-06

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 48.36  E-value: 1.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  1483 VYSEIKRKLTEDY-----GIPpseVRFIQECKTDKARKAVIDAMNAGTVRVLFgSTSMLGTGVNAQKRCVAIHhLDTPWR 1557
Cdd:pfam00271   20 IFSQTKKTLEAELllekeGIK---VARLHGDLSQEEREEILEDFRKGKIDVLV-ATDVAERGLDLPDVDLVIN-YDLPWN 94
                           90
                   ....*....|....*
gi 490419153  1558 PSDLQQRDGRGVRAG 1572
Cdd:pfam00271   95 PASYIQRIGRAGRAG 109
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
395-630 2.23e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.46  E-value: 2.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  395 KVAVQNKVERPAIKLETVSS----AQTVETPTE--KPQPADEKPEIEPRPQysagvqltlldlwgmteevsQPKTSKKKK 468
Cdd:NF033839  254 KVEIENTVHKIFADMDAVVTkfkkGLTQDTPKEpgNKKPSAPKPGMQPSPQ--------------------PEKKEVKPE 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  469 TVKKAVTAKSTPPKPKVTVTPTAPTAKPamenkEVKAE-NTAKPadpddiyaTLDWDTNPPINGFYEMMMGLTPERRKEL 547
Cdd:NF033839  314 PETPKPEVKPQLEKPKPEVKPQPEKPKP-----EVKPQlETPKP--------EVKPQPEKPKPEVKPQPEKPKPEVKPQP 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  548 RELARQHNEKQVAEKTEVKAVPETSREQ--PRQEETQPEAVAAPavtDTPSEAVGTflfpdiEAEKPKEEVvdlSPRAYH 625
Cdd:NF033839  381 ETPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQP---EKPKPEVKP------QPEKPKPEV---KPQPEK 448

                  ....*
gi 490419153  626 RTPEM 630
Cdd:NF033839  449 PKPEV 453
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
421-612 1.03e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  421 PTEKPQPADEKPEIEPRPQYSAGvqltlldlwGMTEEVSQPKTSKKKKTVKKAVTAKSTPPKPKVTVTPTAPTAKPAMEN 500
Cdd:NF033839  330 PEVKPQPEKPKPEVKPQLETPKP---------EVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 400
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  501 KEVKAENTAKPaDPDDIYATLDWDTNPPingfyemmmglTPERRKELRELARQHNEKQVAEKTEVKAVPETSREQPRQEE 580
Cdd:NF033839  401 QPEKPKPEVKP-QPEKPKPEVKPQPEKP-----------KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQP 468
                         170       180       190
                  ....*....|....*....|....*....|..
gi 490419153  581 TQPEavaaPAVTDTPSEAVGTFLFPDIEAEKP 612
Cdd:NF033839  469 EKPK----PEVKPQPEKPKPDNSKPQADDKKP 496
 
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
1211-1567 1.02e-12

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 73.34  E-value: 1.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1211 DVVDFKQMGIDHIFIDESHQFKNltfntrhdrvaglgnsegsqkalnmlfaIRTIQERTGKDLGATF---LSGTTISNSL 1287
Cdd:COG0553   350 DIELLAAVDWDLVILDEAQHIKN----------------------------PATKRAKAVRALKARHrlaLTGTPVENRL 401
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1288 TELYLLFKYLRPKELERQDircfdAWAAIFAKkttdfeFNVTNNVVQKERFRYFIKvPELAAfyneitdyRTAEDVGVDR 1367
Cdd:COG0553   402 EELWSLLDFLNPGLLGSLK-----AFRERFAR------PIEKGDEEALERLRRLLR-PFLLR--------RTKEDVLKDL 461
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1368 PAKNEILHHIPPTPEQEDFIQKLMQFAktgDATLLGRLPlseTEEKAKMLIATDYARKMALDMRMIDPNYEDHPDN--KA 1445
Cdd:COG0553   462 PEKTEETLYVELTPEQRALYEAVLEYL---RRELEGAEG---IRRRGLILAALTRLRQICSHPALLLEEGAELSGRsaKL 535
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153 1446 SHCAKMIAEYyqkydAQKG------TQFVfsDLGTYqpgdgwnvyseIKRKLtEDYGIPPSevrFIQECKTDKARKAVID 1519
Cdd:COG0553   536 EALLELLEEL-----LAEGekvlvfSQFT--DTLDL-----------LEERL-EERGIEYA---YLHGGTSAEERDELVD 593
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 490419153 1520 AMNAGT-VRVLFGSTSMLGTGVNAQKRCVAIhHLDTPWRPSDLQQRDGR 1567
Cdd:COG0553   594 RFQEGPeAPVFLISLKAGGEGLNLTAADHVI-HYDLWWNPAVEEQAIDR 641
HsdM COG0286
Type I restriction-modification system, DNA methylase subunit [Defense mechanisms];
105-297 2.97e-12

Type I restriction-modification system, DNA methylase subunit [Defense mechanisms];


Pssm-ID: 440055 [Multi-domain]  Cd Length: 243  Bit Score: 68.68  E-value: 2.97e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  105 FYTPKEITDTIADVLADYSVRpaRMLEPSAGVGVF----VDSMLRH--SPNADVMAF--EKDLLTGTILR-----HLYPD 171
Cdd:COG0286    25 FYTPREVVRLMVELLDPKPGE--TVYDPACGSGGFlveaAEYLKEHggDERKKLSLYgqEINPTTYRLAKmnlllHGIGD 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  172 QKMRTCGFEKIERPFNNYFDLAVSNIPFGDIavFDAEFQRSDSFGRRSAQKTIHNY----FFLKGLDAVRDGGIVAFITS 247
Cdd:COG0286   103 PNIELGDTLSNDGDELEKFDVVLANPPFGGK--WKKEELKDDLLGRFGYGLPPKSNadllFLQHILSLLKPGGRAAVVLP 180
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 490419153  248 QGVLNSTK-TSVRNELFSQANLVSAIRLPNNLFtdnAGTEVGSDLIVLQKN 297
Cdd:COG0286   181 DGVLFRGAeKEIRKKLLENDLLEAIIGLPSNLF---YNTGIPTCILFLTKG 228
HELICc smart00490
helicase superfamily c-terminal domain;
1512-1572 1.39e-06

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 47.98  E-value: 1.39e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 490419153   1512 KARKAVIDAMNAGTVRVLFgSTSMLGTGVN-AQKRCVAIhhLDTPWRPSDLQQRDGRGVRAG 1572
Cdd:smart00490   24 EEREEILDKFNNGKIKVLV-ATDVAERGLDlPGVDLVII--YDLPWSPASYIQRIGRAGRAG 82
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
1483-1572 1.94e-06

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 48.36  E-value: 1.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  1483 VYSEIKRKLTEDY-----GIPpseVRFIQECKTDKARKAVIDAMNAGTVRVLFgSTSMLGTGVNAQKRCVAIHhLDTPWR 1557
Cdd:pfam00271   20 IFSQTKKTLEAELllekeGIK---VARLHGDLSQEEREEILEDFRKGKIDVLV-ATDVAERGLDLPDVDLVIN-YDLPWN 94
                           90
                   ....*....|....*
gi 490419153  1558 PSDLQQRDGRGVRAG 1572
Cdd:pfam00271   95 PASYIQRIGRAGRAG 109
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
395-630 2.23e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.46  E-value: 2.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  395 KVAVQNKVERPAIKLETVSS----AQTVETPTE--KPQPADEKPEIEPRPQysagvqltlldlwgmteevsQPKTSKKKK 468
Cdd:NF033839  254 KVEIENTVHKIFADMDAVVTkfkkGLTQDTPKEpgNKKPSAPKPGMQPSPQ--------------------PEKKEVKPE 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  469 TVKKAVTAKSTPPKPKVTVTPTAPTAKPamenkEVKAE-NTAKPadpddiyaTLDWDTNPPINGFYEMMMGLTPERRKEL 547
Cdd:NF033839  314 PETPKPEVKPQLEKPKPEVKPQPEKPKP-----EVKPQlETPKP--------EVKPQPEKPKPEVKPQPEKPKPEVKPQP 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  548 RELARQHNEKQVAEKTEVKAVPETSREQ--PRQEETQPEAVAAPavtDTPSEAVGTflfpdiEAEKPKEEVvdlSPRAYH 625
Cdd:NF033839  381 ETPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQP---EKPKPEVKP------QPEKPKPEV---KPQPEK 448

                  ....*
gi 490419153  626 RTPEM 630
Cdd:NF033839  449 PKPEV 453
DEXDc smart00487
DEAD-like helicases superfamily;
1029-1144 1.26e-05

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 48.26  E-value: 1.26e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153   1029 LASRGIKSVYPSQMDCV-WMLKQNGGGICDHEVGTGKTLIMCIAAHEMKRLNLAHKPMIIG-LKANVAEIAATYQAAYPN 1106
Cdd:smart00487    1 IEKFGFEPLRPYQKEAIeALLSGLRDVILAAPTGSGKTLAALLPALEALKRGKGGRVLVLVpTRELAEQWAEELKKLGPS 80
                            90       100       110
                    ....*....|....*....|....*....|....*...
gi 490419153   1107 ARILYASEkdFSTANRVRFFNNIKNNDYDCVIMSHDQF 1144
Cdd:smart00487   81 LGLKVVGL--YGGDSKREQLRKLESGKTDILVTTPGRL 116
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1811-1898 2.10e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 45.39  E-value: 2.10e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153   1811 MERIPSIIDQYKAKNEVLEMEIPQLQEIAGKVWK-KEDELKQLKSELAALDRKIQLELApptpEVAEKENEGQQLKPEAE 1889
Cdd:smart00787  167 LELLNSIKPKLRDRKDALEEELRQLKQLEDELEDcDPTELDRAKEKLKKLLQEIMIKVK----KLEELEEELQELESKIE 242

                    ....*....
gi 490419153   1890 DVRNRQAQY 1898
Cdd:smart00787  243 DLTNKKSEL 251
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
421-612 1.03e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  421 PTEKPQPADEKPEIEPRPQYSAGvqltlldlwGMTEEVSQPKTSKKKKTVKKAVTAKSTPPKPKVTVTPTAPTAKPAMEN 500
Cdd:NF033839  330 PEVKPQPEKPKPEVKPQLETPKP---------EVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 400
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 490419153  501 KEVKAENTAKPaDPDDIYATLDWDTNPPingfyemmmglTPERRKELRELARQHNEKQVAEKTEVKAVPETSREQPRQEE 580
Cdd:NF033839  401 QPEKPKPEVKP-QPEKPKPEVKPQPEKP-----------KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQP 468
                         170       180       190
                  ....*....|....*....|....*....|..
gi 490419153  581 TQPEavaaPAVTDTPSEAVGTFLFPDIEAEKP 612
Cdd:NF033839  469 EKPK----PEVKPQPEKPKPDNSKPQADDKKP 496
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
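
The hit list above pairs each domain with a residue interval on the query and an E-value, and the search was run with an E-value threshold of 0.01. The following is a minimal sketch of how those summary lines could be read and checked against that threshold. The `DomainHit` class and the `parse_hit` and `overlaps` helpers are hypothetical names introduced here for illustration and are not part of any NCBI tool. Treating the threshold as inclusive is also an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the domain hit list (hypothetical container)."""
    name: str
    accession: str
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float


def parse_hit(name_line: str, interval_line: str) -> DomainHit:
    """Parse a 'Name Accession' line and an 'Interval E-value' line."""
    name, accession = name_line.split()
    interval, evalue = interval_line.split()
    start, end = (int(part) for part in interval.split("-"))
    return DomainHit(name, accession, start, end, float(evalue))


def overlaps(a: DomainHit, b: DomainHit) -> bool:
    """True if the two hits share at least one query residue."""
    return a.start <= b.end and b.start <= a.end


# Summary lines copied from the hit list in this section.
RAW_HITS = [
    ("YtxK COG0827", "97-297 1.86e-15"),
    ("HepA COG0553", "1211-1567 1.02e-12"),
    ("HsdM COG0286", "105-297 2.97e-12"),
    ("HELICc smart00490", "1512-1572 1.39e-06"),
    ("Helicase_C pfam00271", "1483-1572 1.94e-06"),
    ("PspC_subgroup_2 NF033839", "395-630 2.23e-06"),
    ("DEXDc smart00487", "1029-1144 1.26e-05"),
    ("Spc7 smart00787", "1811-1898 2.10e-04"),
    ("PspC_subgroup_2 NF033839", "421-612 1.03e-03"),
]

# Threshold from the "Preset Options" line; inclusive comparison is assumed.
E_VALUE_THRESHOLD = 0.01

hits = [parse_hit(name, interval) for name, interval in RAW_HITS]
passing = [hit for hit in hits if hit.evalue <= E_VALUE_THRESHOLD]

if __name__ == "__main__":
    for hit in passing:
        print(f"{hit.name:16s} {hit.accession:10s} "
              f"{hit.start:5d}-{hit.end:<5d} {hit.evalue:.2e}")
```

The `overlaps` helper can be used to spot hits that cover the same query region, for example `overlaps(hits[0], hits[2])` compares the YtxK and HsdM intervals.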
