|
Name |
Accession |
Description |
Interval |
E-value |
| CARMIL_C |
pfam16000 |
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ... |
778-1066 |
6.36e-84 |
|
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).
Pssm-ID: 464966 [Multi-domain] Cd Length: 299 Bit Score: 276.65 E-value: 6.36e-84
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000 1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000 81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000 161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000 241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
|
|
| Carm_PH |
pfam17888 |
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ... |
31-118 |
5.36e-33 |
|
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.
Pssm-ID: 436119 Cd Length: 94 Bit Score: 123.16 E-value: 5.36e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888 7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86
|
....*...
gi 156938257 111 SALSKVCP 118
Cdd:pfam17888 87 TALKKIFP 94
|
|
| RNA1 |
COG5238 |
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ... |
274-657 |
1.99e-16 |
|
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444072 [Multi-domain] Cd Length: 434 Bit Score: 83.69 E-value: 1.99e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238 88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238 168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238 241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238 319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238 366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
339-600 |
6.89e-13 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 71.23 E-value: 6.89e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116 28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116 99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116 177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
|
250 260 270
....*....|....*....|....*....|....
gi 156938257 568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116 248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1122-1372 |
1.31e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247 2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1195 DDAGPGSwKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHSCQSPSPASQDGEEEKE-GTLFPERTLPARNAKLQ-DPA 1272
Cdd:PHA03247 2762 TTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALpPAASPAGPLPPPTSAQPtAPP 2840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1273 LAPWPPKPVAVPRGRQPPQEPgVREEAEAGDAAPGVNKPR----LRLSSQQDQEEPEVQG-PPDPGRR----TAPLKPKR 1343
Cdd:PHA03247 2841 PPPGPPPPSLPLGGSVAPGGD-VRRRPPSRSPAAKPAAPArppvRRLARPAVSRSTESFAlPPDQPERppqpQAPPPPQP 2919
|
250 260
....*....|....*....|....*....
gi 156938257 1344 TRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1238-1369 |
1.21e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 46.30 E-value: 1.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839 359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156938257 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1369
Cdd:NF033839 437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1236-1371 |
5.96e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 40.91 E-value: 5.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839 324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1371
Cdd:NF033839 404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| CARMIL_C |
pfam16000 |
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ... |
778-1066 |
6.36e-84 |
|
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A) it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).
Pssm-ID: 464966 [Multi-domain] Cd Length: 299 Bit Score: 276.65 E-value: 6.36e-84
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000 1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000 81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQGRPRPPRTTPPGPGRPSMPAPGTRQENGMA 1006
Cdd:pfam16000 161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTKGRPKRNKTRAPTRPPGKVGPAQDGEQNGLS 240
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 1007 TRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1066
Cdd:pfam16000 241 GRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
|
|
| Carm_PH |
pfam17888 |
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ... |
31-118 |
5.36e-33 |
|
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20 and a C-terminal linker (Linker), residues 129-147 in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain involved in direct binding to the PM (plasma membrane) and a HD (helical domain) responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.
Pssm-ID: 436119 Cd Length: 94 Bit Score: 123.16 E-value: 5.36e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888 7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86
|
....*...
gi 156938257 111 SALSKVCP 118
Cdd:pfam17888 87 TALKKIFP 94
|
|
| RNA1 |
COG5238 |
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ... |
274-657 |
1.99e-16 |
|
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444072 [Multi-domain] Cd Length: 434 Bit Score: 83.69 E-value: 1.99e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238 88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238 168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238 241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238 319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238 366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
|
|
| RNA1 |
COG5238 |
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ... |
189-477 |
8.18e-14 |
|
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444072 [Multi-domain] Cd Length: 434 Bit Score: 75.21 E-value: 8.18e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 189 REFNLLDFSHLESRDLALMVAALAYNQWFTKLYCKDLRLGSEVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFG 268
Cdd:COG5238 154 NAVHLLGLAARLGLLAAISMAKALQNNSVETVYLGCNQIGDEGIEELAEALTQNTTVTTLWLKRNPIGDEGAEILAEALK 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 269 ENGScvLHALTLSHNPIEDKGFLSLSqQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPgll 348
Cdd:COG5238 234 GNKS--LTTLDLSNNQIGDEGVIALA-EALKNNTTVETLYLSGNQIGAEGAIALAKALQGNT----TLTSLDLSVNR--- 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 349 ATDE-ANALYSFLAQPNALVHLDLS----GTDCVIdLLLGALLHGccSHLTYLNLARNscshRKGREAPPAFKQFFSSAY 423
Cdd:COG5238 304 IGDEgAIALAEGLQGNKTLHTLNLAyngiGAQGAI-ALAKALQEN--TTLHSLDLSDN----QIGDEGAIALAKYLEGNT 376
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 156938257 424 TLSHVNLSATKLPLEALRALLQGLSLNShlsdLH-LDLSSCELRSAGAQALQEQL 477
Cdd:COG5238 377 TLRELNLGKNNIGKQGAEALIDALQTNR----LHtLILDGNLIGAEAQQRLEQLL 427
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
339-600 |
6.89e-13 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 71.23 E-value: 6.89e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116 28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116 99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116 177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
|
250 260 270
....*....|....*....|....*....|....
gi 156938257 568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116 248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
428-628 |
7.57e-11 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 65.07 E-value: 7.57e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 428 VNLSATKLPLEALRALlqGLSLNSHLSDLHLDLSSCELRS--AGAQALQEQLGAVTCVGSLDLSDNGFDSDLLTLVPALG 505
Cdd:cd00116 28 LRLEGNTLGEEAAKAL--ASALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLL 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 506 KNKSLKHLFLGKN------FNVKAKTLEEILHKLVQLIQE-----------------EDCSLQSLSVADSRLKLR-TSIL 561
Cdd:cd00116 106 RSSSLQELKLNNNglgdrgLRLLAKGLKDLPPALEKLVLGrnrlegascealakalrANRDLKELNLANNGIGDAgIRAL 185
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156938257 562 INALGSNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALES-NHTLR 628
Cdd:cd00116 186 AEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSpNISLL 253
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
334-663 |
1.32e-09 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 61.87 E-value: 1.32e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 334 SSLRYLDLSKNPGLlatdeanalysflAQPNALVHLDLSGTDCV-IDLLLGALlhgccSHLTYLNLARNSCShrkgrEAP 412
Cdd:COG4886 96 TNLTELDLSGNEEL-------------SNLTNLESLDLSGNQLTdLPEELANL-----TNLKELDLSNNQLT-----DLP 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 413 PAFKQFFSsaytLSHVNLSATKL-----PLEALRALlQGLSL-NSHLSDLHLDLSSCelrsagaQALQEqlgavtcvgsL 486
Cdd:COG4886 153 EPLGNLTN----LKSLDLSNNQLtdlpeELGNLTNL-KELDLsNNQITDLPEPLGNL-------TNLEE----------L 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 487 DLSDNGFDSdlltLVPALGKNKSLKHLFLGKNfnvKAKTLEEILhklvQLIqeedcSLQSLSVADSRLKLrtsilINALG 566
Cdd:COG4886 211 DLSGNQLTD----LPEPLANLTNLETLDLSNN---QLTDLPELG----NLT-----NLEELDLSNNQLTD-----LPPLA 269
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 567 SNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNHTLRFMSFPVSDISQAYRSAPE 646
Cdd:COG4886 270 NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLAL 349
|
330
....*....|....*..
gi 156938257 647 RTEDVWQKIQWCLVRNN 663
Cdd:COG4886 350 LTLLLLLNLLSLLLTLL 366
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
210-493 |
1.58e-07 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 54.67 E-value: 1.58e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 210 ALAYNQWFTKLYCKDLRLGS--EVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFGengSCVLHALTLSHNPIED 287
Cdd:cd00116 46 ALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLR---SSSLQELKLNNNGLGD 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 288 KGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPglLATDEANALYSFLAQPNALV 367
Cdd:cd00116 123 RGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANR----DLKELNLANNG--IGDAGIRALAEGLKANCNLE 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 368 HLDLSgtDCVID-----LLLGALLHGCCshLTYLNLARNSCSHRKGREAPPAFKqffSSAYTLSHVNLSATKLPLEALRA 442
Cdd:cd00116 197 VLDLN--NNGLTdegasALAETLASLKS--LEVLNLGDNNLTDAGAAALASALL---SPNISLLTLSLSCNDITDDGAKD 269
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 156938257 443 LLQGLSLNSHLsdLHLDLSSCELRSAGAQALQEQLGAVTC-VGSLDLSDNGF 493
Cdd:cd00116 270 LAEVLAEKESL--LELDLRGNKFGEEGAQLLAESLLEPGNeLESLWVKDDSF 319
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
185-501 |
3.50e-07 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 53.90 E-value: 3.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 185 AEDNREFNLLDFS--HLESRDLALMVAALAYnQWFTKLYCKDLR------LGSEVLEQVLHtlskSGSLEELVLDNAGLK 256
Cdd:cd00116 47 LRPQPSLKELCLSlnETGRIPRGLQSLLQGL-TKGCGLQELDLSdnalgpDGCGVLESLLR----SSSLQELKLNNNGLG 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 257 TDFVQKLAGVFGENgSCVLHALTLSHNPIEDKGFLSLSQqLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANpafaSSL 336
Cdd:cd00116 122 DRGLRLLAKGLKDL-PPALEKLVLGRNRLEGASCEALAK-ALRANRDLKELNLANNGIGDAGIRALAEGLKAN----CNL 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 337 RYLDLSKNpgLLATDEANALYSFLAQPNALVHLDLSgtDCVIDLLlgallhgCCSHLtylnlarnscshrkgreappafk 416
Cdd:cd00116 196 EVLDLNNN--GLTDEGASALAETLASLKSLEVLNLG--DNNLTDA-------GAAAL----------------------- 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 417 qffssaytlshvnlsatklpLEALRALLQGLslnshlsdLHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGFDSD 496
Cdd:cd00116 242 --------------------ASALLSPNISL--------LTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEE 293
|
....*
gi 156938257 497 LLTLV 501
Cdd:cd00116 294 GAQLL 298
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1122-1372 |
1.31e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMG--TEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAW-----QKRRSS 1194
Cdd:PHA03247 2682 RPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPP 2761
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1195 DDAGPGSwKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHSCQSPSPASQDGEEEKE-GTLFPERTLPARNAKLQ-DPA 1272
Cdd:PHA03247 2762 TTAGPPA-PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALpPAASPAGPLPPPTSAQPtAPP 2840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1273 LAPWPPKPVAVPRGRQPPQEPgVREEAEAGDAAPGVNKPR----LRLSSQQDQEEPEVQG-PPDPGRR----TAPLKPKR 1343
Cdd:PHA03247 2841 PPPGPPPPSLPLGGSVAPGGD-VRRRPPSRSPAAKPAAPArppvRRLARPAVSRSTESFAlPPDQPERppqpQAPPPPQP 2919
|
250 260
....*....|....*....|....*....
gi 156938257 1344 TRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1121-1337 |
5.86e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.75 E-value: 5.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1121 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1200
Cdd:PRK07764 597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1201 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1280
Cdd:PRK07764 672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156938257 1281 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1337
Cdd:PRK07764 750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
|
|
| PHA03169 |
PHA03169 |
hypothetical protein; Provisional |
1122-1336 |
3.53e-05 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 48.04 E-value: 3.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1122 GGRGPSFRRKMGTEGSEPGEGGPAPGTA--QQPRVHGVALPGLERAKGWSF----DGKREGPGPDQEGSTQAWQKRRSSD 1195
Cdd:PHA03169 34 GRRRGTAARAAKPAPPAPTTSGPQVRAVaeQGHRQTESDTETAEESRHGEKeergQGGPSGSGSESVGSPTPSPSGSAEE 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1196 DAGPGSWKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHScQSPSPASQ-DGEEEKEGTLFPERTLPARNAKLQDPALA 1274
Cdd:PHA03169 114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES-HNPSPNQQpSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 156938257 1275 PWPPKPVAVPRGRQPPQEPGvREEAEAGDAAPGVNKPRlRLSSQQDQEEPEVQGPPDPGRRT 1336
Cdd:PHA03169 193 PQSETPTSSPPPQSPPDEPG-EPQSPTPQQAPSPNTQQ-AVEHEDEPTEPEREGPPFPGHRS 252
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
239-576 |
7.50e-05 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 46.85 E-value: 7.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 239 LSKSGSLEELVLDNAGLKT--DFVQKLAGvfgengscvLHALTLSHNPIEDkgflsLSQQLLCFPSgLTKLCLAKTAIS- 315
Cdd:COG4886 109 LSNLTNLESLDLSGNQLTDlpEELANLTN---------LKELDLSNNQLTD-----LPEPLGNLTN-LKSLDLSNNQLTd 173
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 316 -PRGLQALgqtfganpafaSSLRYLDLSKNPgllATDEANAlysfLAQPNALVHLDLSGTD-CVIDLLLGALlhgccSHL 393
Cdd:COG4886 174 lPEELGNL-----------TNLKELDLSNNQ---ITDLPEP----LGNLTNLEELDLSGNQlTDLPEPLANL-----TNL 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 394 TYLNLARNscshrKGREAPpafkqFFSSAYTLSHVNLSATKlplealralLQGLSLNSHLSDL-HLDLSSCELRSAGAQA 472
Cdd:COG4886 231 ETLDLSNN-----QLTDLP-----ELGNLTNLEELDLSNNQ---------LTDLPPLANLTNLkTLDLSNNQLTDLKLKE 291
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 473 LQE-------QLGAVTCVGSLDLSDNGFDSDLLTLVPALGKNKSLKHLFLGKNFNVKAKTLEEILHKLVQLIQEEDCSLQ 545
Cdd:COG4886 292 LELllglnslLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGL 371
|
330 340 350
....*....|....*....|....*....|.
gi 156938257 546 SLSVADSRLKLRTSILINALGSNTCLAKVDL 576
Cdd:COG4886 372 LGLLEATLLTLALLLLTLLLLLLTTTAGVLL 402
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1238-1369 |
1.21e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 46.30 E-value: 1.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1238 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1309
Cdd:NF033839 359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156938257 1310 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1369
Cdd:NF033839 437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1119-1367 |
1.77e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 1.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1119 GFGGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKgwsfdgkreGPGPDQEGSTQAWQKRRSSDDAG 1198
Cdd:PHA03247 262 GEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLAL---------PAPPDPPPPAPAGDAEEEDDEDG 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1199 PGSWKPPPPPQSTKPSFSAMRRAEATW----HIAEESAPNHSCQSPSPASQDGEEEKEG-TLFPERTLPARNAKLQDPAL 1273
Cdd:PHA03247 333 AMEVVSPLPRPRQHYPLGFPKRRRPTWtppsSLEDLSAGRHHPKRASLPTRKRRSARHAaTPFARGPGGDDQTRPAAPVP 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1274 APWP-PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVnkprlrlssqqdqeEPEVQGPPDPGRRTAPLKPKRTRRAqscdk 1352
Cdd:PHA03247 413 ASVPtPAPTPVPASAPPPPATPLPSAEPGSDDGPAP--------------PPERQPPAPATEPAPDDPDDATRKA----- 473
|
250
....*....|....*.
gi 156938257 1353 LEPDR-RRPPDPTGTS 1367
Cdd:PHA03247 474 LDALReRRPPEPPGAD 489
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
1280-1369 |
7.62e-04 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 42.68 E-value: 7.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1280 PVAVPRGRQPPqePGVREEAEAGDAAPGVNKP-RLRLSSQQDQEEPEVQGPPDPGRRTAPlKPKRTRRAQSCDKLEPDRR 1358
Cdd:PRK11633 58 AATQALPTQPP--EGAAEAVRAGDAAAPSLDPaTVAPPNTPVEPEPAPVEPPKPKPVEKP-KPKPKPQQKVEAPPAPKPE 134
|
90
....*....|.
gi 156938257 1359 RPPDPTGTSEP 1369
Cdd:PRK11633 135 PKPVVEEKAAP 145
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1104-1372 |
3.00e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 3.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1104 SSPCWSPEEESSLLPGFGGGRGPSFRRK---MGTEGSEPGEGGPAP-GTAQQPRVHGVALPGLER-AKGWSFDGKREGPG 1178
Cdd:PHA03307 206 PPRRSSPISASASSPAPAPGRSAADDAGassSDSSSSESSGCGWGPeNECPLPRPAPITLPTRIWeASGWNGPSSRPGPA 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1179 PDQEGStqawqkRRSSDDAGPGSwkpppppqSTKPSFSAMRRAeatwhiAEESAPNHSCQSPSPASQDgeeekegtlFPE 1258
Cdd:PHA03307 286 SSSSSP------RERSPSPSPSS--------PGSGPAPSSPRA------SSSSSSSRESSSSSTSSSS---------ESS 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1259 RTLPARNAKLQDPALAPWPPKPVAV---PRGRQPPQEPGVREEAEAGDAAP-GVNKPRLRLSSQQDQeepevqgppdPGR 1334
Cdd:PHA03307 337 RGAAVSPGPSPSRSPSPSRPPPPADpssPRKRPRPSRAPSSPAASAGRPTRrRARAAVAGRARRRDA----------TGR 406
|
250 260 270
....*....|....*....|....*....|....*...
gi 156938257 1335 RTAPLKPKRTRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1372
Cdd:PHA03307 407 FPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPWPGSP 444
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1134-1372 |
4.06e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 41.76 E-value: 4.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1134 TEGSEPGEGGPA--PGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPGSWKPPPPPQST 1211
Cdd:PRK07003 363 TGGGAPGGGVPArvAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDA 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1212 KPSFSAMRRAEATWHIAEESAPNHSCQSPS-------------PASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPP 1278
Cdd:PRK07003 443 ADGDAPVPAKANARASADSRCDERDAQPPAdsgsasapasdapPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAP 522
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1279 KPVAVPRGRQPPQEPGV-REEAEAGDAAPGVNKPR---LRLSSQQDQE---------EPEVQGPPDPGRRTAPLKPKRTR 1345
Cdd:PRK07003 523 AAAAPPAPEARPPTPAAaAPAARAGGAAAALDVLRnagMRVSSDRGARaaaaakpaaAPAAAPKPAAPRVAVQVPTPRAR 602
|
250 260
....*....|....*....|....*....
gi 156938257 1346 RAQSCDKLEPDRRRPP--DPTGTSEPGTD 1372
Cdd:PRK07003 603 AATGDAPPNGAARAEQaaESRGAPPPWED 631
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1145-1369 |
4.55e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.59 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1145 APGTAQQP--RVHGVALPGlerakgwsfDGKREGPGPDQEGSTQaWQKRRSSDDAGPGSWKPPPPPQSTKPSFSAMRRAE 1222
Cdd:PHA03378 589 APSYAQTPwpVPHPSQTPE---------PPTTQSHIPETSAPRQ-WPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVE 658
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1223 ATWHIAEESAPNHSCQSPSPASQDgeeekegTLFPERTLPARnakLQDPALAPWPPKPVAVPRGR-QPPQEPGVREEAEA 1301
Cdd:PHA03378 659 ITPYKPTWTQIGHIPYQPSPTGAN-------TMLPIQWAPGT---MQPPPRAPTPMRPPAAPPGRaQRPAAATGRARPPA 728
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1302 GdaAPGVNKPRlrlssqqdqeepevQGPPDPGRRTAPlKPKRTRRAQSCdklePDRRRPPD-----PTGTSEP 1369
Cdd:PHA03378 729 A--APGRARPP--------------AAAPGRARPPAA-APGRARPPAAA----PGRARPPAaapgaPTPQPPP 780
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1236-1371 |
5.96e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 40.91 E-value: 5.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156938257 1236 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1309
Cdd:NF033839 324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156938257 1310 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1371
Cdd:NF033839 404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
|
|
|