|
Name |
Accession |
Description |
Interval |
E-value |
| PLAT_polycystin |
cd01752 |
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ... |
1130-1248 |
1.01e-55 |
|
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, found in fish, invertebrates, and mammals, including humans. They are widely expressed in various cell types, and their biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.
Pssm-ID: 238850 Cd Length: 120 Bit Score: 189.41 E-value: 1.01e-55
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1130 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1209
Cdd:cd01752 2 LYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPEKPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSWY 81
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 1210 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1248
Cdd:cd01752 82 LSRVIVRDLQTGKKWFFLCNDWLSVEEGDGTVERTFPVA 120
|
|
| Polycystin_dom |
pfam20519 |
Polycystin domain; This domain represents the polycystin domain from group II of Transient ... |
1708-1893 |
1.02e-35 |
|
Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.
Pssm-ID: 466668 Cd Length: 199 Bit Score: 135.63 E-value: 1.02e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1708 FSEIKTVEDFYPWANGTLLPNLYGD-----YRGFITDGNSFLLGNVLIRQTRIPNDIFFPGSLHKQMKSPPQHQ-----E 1777
Cdd:pfam20519 2 LLTVTDLDDIWDWLSSVLLPALHSNktpsgLPGSFIAYESLLLGVPRLRQLRVRNSSCLVHDKFVREINECHAGysppsE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1778 DRENYGAGWVPPDTNITKvdSIWHYQNQESLGGYPIQGELATYSGGGYVVRLGRNHSAATRVLQHLEQRRWLDHCTKALF 1857
Cdd:pfam20519 82 DRKLYSALPYKPVHYGSK--YWFIYTPPGLLMGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVF 159
|
170 180 190
....*....|....*....|....*....|....*.
gi 115583681 1858 VEFTVFNANVNLLCAVTLILESSGVGTFLTSLQLDS 1893
Cdd:pfam20519 160 VDFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQS 195
|
|
| PKD_channel |
pfam08016 |
Polycystin cation channel; This family contains the cation channel region from group II of ... |
1898-2118 |
2.88e-34 |
|
Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.
Pssm-ID: 462341 [Multi-domain] Cd Length: 225 Bit Score: 132.02 E-value: 2.88e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1898 QSSERGFAWIVSQVVYYLLVCYYAFIQGCRLKRQRLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLHKKMQQYHCDRD 1977
Cdd:pfam08016 2 YVTNRSLFILLCEIVFVVFFLYFVVEEILKIRKHRPSYLRSVWNLLDLAIVILSVVLIVLNIYRDFLADRLIKSVEASPV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1978 RFISFYEALRVNSAVTHLRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSI 2057
Cdd:pfam08016 82 TFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLFGTQA 161
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 2058 SDYQSFFRSIVTVVGLLMGTSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAF 2118
Cdd:pfam08016 162 PNFSNFVKSILTLFRTILGDFGYNEIFSGNRVLGPLLFLTFVFLVIFILLNLFLAIINDSY 222
|
|
| PLAT_repeat |
cd01756 |
PLAT/LH2 domain repeats of a family of proteins with unknown function. In general, PLAT/LH2 ... |
1131-1248 |
5.83e-29 |
|
PLAT/LH2 domain repeats of a family of proteins with unknown function. In general, PLAT/LH2 consists of an eight-stranded beta-barrel, and its proposed function is to mediate interaction with lipids or membrane-bound proteins.
Pssm-ID: 238854 Cd Length: 120 Bit Score: 113.03 E-value: 5.83e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1131 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTV-FERGALDVFLLSTGSwLGDLHGLRLWHDNSGDSPSWY 1209
Cdd:cd01756 3 YEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKKSNNKNkFERGQTDKFTVEAVD-LGKLKKIRIGHDNSGLGAGWF 81
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 1210 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1248
Cdd:cd01756 82 LDKVEIREPGTGDEYTFPCNRWLDKDEDDGQIVRELYPS 120
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
325-652 |
9.19e-26 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 116.55 E-value: 9.19e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 325 PASSSPpqVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-PQGTSETPASNSP-PQGTSET 402
Cdd:pfam05109 461 PASTGP--TVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPtPNATSPTPAVTTPtPNATSPT 538
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 403 PGFSSPPQ-VTTATLVSSSP-PQVTSETPASSSPT--------QVTSETPASSSPTqVTSDTPASNSPPQ---GTSDTPG 469
Cdd:pfam05109 539 LGKTSPTSaVTTPTPNATSPtPAVTTPTPNATIPTlgktsptsAVTTPTPNATSPT-VGETSPQANTTNHtlgGTSSTPV 617
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:pfam05109 618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTS 697
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 550 SPPQVTSETPASSSPTNmtSDTPASSSPTNMTSDTPasssPTNMTS-DTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:pfam05109 698 SPAPRPGTTSQASGPGN--SSTSTKPGEVNVTKGTP----PKNATSpQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHG 771
|
330 340
....*....|....*....|....*.
gi 115583681 629 SKAQED--SPLGVISTHPQMSFQSST 652
Cdd:pfam05109 772 ARTSTEptTDYGGDSTTPRTRYNATT 797
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
355-729 |
2.01e-25 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 115.40 E-value: 2.01e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 355 SPPQGTLDTPSSSSPPQGTSdTPASSSPPQGTSETpaSNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSP 434
Cdd:pfam05109 440 AAPNTTTGLPSSTHVPTNLT-APASTGPTVSTADV--TSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTP 516
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 435 T-QVTSETPASSSPT-QVTSDTPASNSP--------PQGTSDTPGFSSPT-QVTTATLVSSSPPQ-VTSDTPASSSP--- 499
Cdd:pfam05109 517 TpNATSPTPAVTTPTpNATSPTLGKTSPtsavttptPNATSPTPAVTTPTpNATIPTLGKTSPTSaVTTPTPNATSPtvg 596
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 500 ---PQV---------TSDTPASSSPPQvTSETPASSSPPQVTSDTSASIS-PPQVISDTPASSSPPQVTSETP--ASSSP 564
Cdd:pfam05109 597 etsPQAnttnhtlggTSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSlRPSSISETLSPSTSDNSTSHMPllTSAHP 675
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 565 T---NMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPwpvitevTRP-ESTIPAGRSLANITSKaqeDSPLGVI 640
Cdd:pfam05109 676 TggeNITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTS-------TKPgEVNVTKGTPPKNATSP---QAPSGQK 745
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 641 STHPQMsfqSSTSQALDETAGERvptipdfqaHSEFQKACAILQRLRDFLPTSPTSAQKNNSWSSQTPAVSCPFQPLGRL 720
Cdd:pfam05109 746 TAVPTV---TSTGGKANSTTGGK---------HTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTF 813
|
....*....
gi 115583681 721 TTTEKSSHQ 729
Cdd:pfam05109 814 TSPPVTTAQ 822
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
159-660 |
2.28e-23 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 109.64 E-value: 2.28e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 159 GPGPLLPMKRRGAETERHMipgngPPlamcHQPAPPelfetlcfPIDPASSAppkATHRMTITSLTGRPQVTSDtlaSSS 238
Cdd:PHA03247 2550 DPPPPLPPAAPPAAPDRSV-----PP----PRPAPR--------PSEPAVTS---RARRPDAPPQSARPRAPVD---DRG 2606
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 239 PPQGTSdtPASSSPPQVTSAtsasssppqgtsDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSpp 318
Cdd:PHA03247 2607 DPRGPA--PPSPLPPDTHAP------------DPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL-- 2670
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 319 qgtSDTPASSSPPQvtsatsasssppqgtSDTPASSSPPQGTLdTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG 398
Cdd:PHA03247 2671 ---GRAAQASSPPQ---------------RPRRRAARPTVGSL-TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQA 2731
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 399 TSETPGFSSPPQVTTATLVSSSP-----PQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:PHA03247 2732 SPALPAAPAPPAVPAGPATPGGParparPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 474 TQVTTATLVSSSP----PQVTSDTPASSSPPQVTSDTP-------------ASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:PHA03247 2812 LAPAAALPPAASPagplPPPTSAQPTAPPPPPGPPPPSlplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 537 SPPQVISDTPASSSPPQvtsETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP----PWPVITEVT 612
Cdd:PHA03247 2892 SRSTESFALPPDQPERP---PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsgavPQPWLGALV 2968
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 115583681 613 RPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQS-STSQALDETA 660
Cdd:PHA03247 2969 PGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSwASSLALHEET 3017
|
|
| PLAT |
pfam01477 |
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ... |
1131-1246 |
6.20e-23 |
|
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows that this domain binds to procolipase (pfam01114), which mediates membrane association. It therefore appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
Pssm-ID: 396180 Cd Length: 115 Bit Score: 95.58 E-value: 6.20e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1131 YLIQVYTGYRRRAATTAKVVITLYGSEGHS--EPHHLCDPEktvFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSW 1208
Cdd:pfam01477 1 YQVKVVTGDELGAGTDADVYISLYGKVGESaqLEITLDNPD---FERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEW 77
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 1209 YVSQVIV-SDMTTRKKWHFQCNCWLAVDLGNcERDRVFT 1246
Cdd:pfam01477 78 FLKSITVeVPGETGGKYTFPCNSWVYGSKKY-KETRVFF 115
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
345-619 |
1.88e-20 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 97.72 E-value: 1.88e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSPPqgtldTPSSSSPPQGTSDTPASSS-PPQGTSETPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQ 423
Cdd:pfam17823 104 EGAADGAASRALA-----AAASSSPSSAAQSLPAAIAaLPSEAFSAPRAAACRANASAAP--RAAIAAASAPHAASPAPR 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 424 V--------TSETPASSSPTQVTSETPASSSPTQVTSD------TPAS--------NSPPQGTSDTPGFSSPTQVTTATL 481
Cdd:pfam17823 177 TaassttaaSSTTAASSAPTTAASSAPATLTPARGISTaatatgHPAAgtalaavgNSSPAAGTVTAAVGTVTPAALATL 256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 482 VSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSasisppQVISDTPASSSPPQVTSetpaS 561
Cdd:pfam17823 257 AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPII------QVSTDQPVHNTAGEPTP----S 326
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 562 SSPTNMTSDTPASSSPTNMTSDTpasssPTNMTSDTPASSSPPWPVITEVTRPESTIP 619
Cdd:pfam17823 327 PSNTTLEPNTPKSVASTNLAVVT-----TTKAQAKEPSASPVPVLHTSMIPEVEATSP 379
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
142-629 |
1.92e-20 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 99.63 E-value: 1.92e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 142 QAAAFPPQGASIWRNEFGPGPllpmkrRGAETERHMIPGNGPPLAMCHQPAPPELF---ETLCFPIDPASSAPPKATHRM 218
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSP------AANEPDPHPPPTVPPPERPRDDPAPGRVSrprRARRLGRAAQASSPPQRPRRR 2686
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 219 TItsltgRPQVTSDTLASSSPPQGTsdTPASSSPPQVTSATSASSSPPQGTSD--TPASSSPPQVTSATSA----SSSPP 292
Cdd:PHA03247 2687 AA-----RPTVGSLTSLADPPPPPP--TPEPAPHALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATpggpARPAR 2759
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 293 QGTSDTPASSSPPQVtsatsasssppqgtsdtPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqg 372
Cdd:PHA03247 2760 PPTTAGPPAPAPPAA-----------------PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP-- 2820
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 373 tSDTPASSSPPQgTSETPASNSPPQG---TSETPGFSSPPqvtTATLVSSSPPQVTSETPASSSPTQVTS-ETPASSSPT 448
Cdd:PHA03247 2821 -AASPAGPLPPP-TSAQPTAPPPPPGpppPSLPLGGSVAP---GGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRST 2895
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 449 QVTSDTPASNSPPQgtsdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQV 528
Cdd:PHA03247 2896 ESFALPPDQPERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR 2971
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 529 TSDTSASISPPQVISDTPASSSPPQVTSETP-----ASSSPTNMTSDTPASSSPTNM--TSDTPASSSPTNMTSDTPASS 601
Cdd:PHA03247 2972 VAVPRFRVPQPAPSREAPASSTPPLTGHSLSrvsswASSLALHEETDPPPVSLKQTLwpPDDTEDSDADSLFDSDSERSD 3051
|
490 500 510
....*....|....*....|....*....|....
gi 115583681 602 S------PPWPVITEVTRPESTIPAGRSLANITS 629
Cdd:PHA03247 3052 LealdplPPEPHDPFAHEPDPATPEAGARESPSS 3085
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
179-604 |
2.97e-20 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 98.70 E-value: 2.97e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 179 PGNGPPLAMCHQPAPPELFETLCFPIDPASSAPPKATHrmtitSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSA 258
Cdd:PHA03307 67 PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG-----SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 259 TSASSSPPQGTSDTPASSSPPQvtsatsasssppqGTSDTPASSSPPQvtsatsasssppQGTSDTPASSSPPQVTSATS 338
Cdd:PHA03307 142 GSPGPPPAASPPAAGASPAAVA-------------SDAASSRQAALPL------------SSPEETARAPSSPPAEPPPS 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 339 ASSSPPQGTSdtPASSSPPQGTLDTPSSSSPPQGTSDTPASSSppqGTSETPASNSPPQGTSETPGFSSPPQVTTATLVS 418
Cdd:PHA03307 197 TPPAAASPRP--PRRSSPISASASSPAPAPGRSAADDAGASSS---DSSSSESSGCGWGPENECPLPRPAPITLPTRIWE 271
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 419 SSPPQVTSETPASSSPtqVTSETPASSSPTQVTSDTPASNSPPQGTSDtpGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PHA03307 272 ASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGPAPSSPRASSS--SSSSRESSSSSTSSSSESSRGAAVSPGPSP 347
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 499 PPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PHA03307 348 SRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFY 427
|
410 420
....*....|....*....|....*.
gi 115583681 579 NMTSDTPASSSPtnmtsdTPASSSPP 604
Cdd:PHA03307 428 ARYPLLTPSGEP------WPGSPPPP 447
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
237-636 |
9.76e-20 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 97.16 E-value: 9.76e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 237 SSPPQGTSDTPASSSPPQVtsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGtsdtPASSSPPQVTSATSASSS 316
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQL------------VSDSAELAAVTVVAGAAACDRFEPPTG----PPPGPGTEAPANESRSTP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 317 PPQGTSDTPASSSPPQvtsatsaSSSPPQGTSDTPA-SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP 395
Cdd:PHA03307 89 TWSLSTLAPASPAREG-------SPTPPGPSSPDPPpPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 396 PQGTSETPGFSSPP-QVTTATLVSSSPPQvtSETPASSSPTQVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPT 474
Cdd:PHA03307 162 VASDAASSRQAALPlSSPEETARAPSSPP--AEPPPSTPPAAASPRPPRRSSPISASASSPAP-APGRSAADDAGASSSD 238
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 475 QVTTATLVSSSPPQvtSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPqVISDTPASSSPPQV 554
Cdd:PHA03307 239 SSSSESSGCGWGPE--NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS-SPGSGPAPSSPRAS 315
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP--------WPVITEVTRPESTIPAGRSLAN 626
Cdd:PHA03307 316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPrkrprpsrAPSSPAASAGRPTRRRARAAVA 395
|
410
....*....|
gi 115583681 627 ITSKAQEDSP 636
Cdd:PHA03307 396 GRARRRDATG 405
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
171-619 |
1.73e-19 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 96.14 E-value: 1.73e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 171 AETERHMIPGNGPPLAMCHQPAppelFETLCFPIDPASSAPPKATH---RMTITSLTGRPQVTSDTlaSSSPPQGTSDTP 247
Cdd:pfam05109 412 ATTTTHKVIFSKAPESTTTSPT----LNTTGFAAPNTTTGLPSSTHvptNLTAPASTGPTVSTADV--TSPTPAGTTSGA 485
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 248 ASSSPPQVTSATSASSSPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPAS 327
Cdd:pfam05109 486 SPVTPSPSPRDNGTESKAPDMTSPTSAVTTP------------TPNATSPTPAVTTP------------TPNATSPTLGK 541
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 328 SSPpqvtsATSASSSPPQGTSDTPASSSP-PQGTLDTPSSSSPPQG-TSDTPASSSPPQGTSETPA--SNSPPQGTSETP 403
Cdd:pfam05109 542 TSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTVGETSPQAntTNHTLGGTSSTP 616
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 404 GFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPGFSSPTQVTTAtlvS 483
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPS-----TSDNSTSHMPLLTSAHPTGGENITQVTPA---S 688
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPPQVTSDTPASSSPPQVTSDTPASSSppqvTSETPASSSPPQVTSDTSAsiSPPQVISDTpaSSSPPQVTSETPASSS 563
Cdd:pfam05109 689 TSTHHVSTSSPAPRPGTTSQASGPGNSS----TSTKPGEVNVTKGTPPKNA--TSPQAPSGQ--KTAVPTVTSTGGKANS 760
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 564 PTNMTSDT----PASSSP-TNMTSDTPASSSPTNMTSDTPASSS----PPWPVIT-EVTRPESTIP 619
Cdd:pfam05109 761 TTGGKHTTghgaRTSTEPtTDYGGDSTTPRTRYNATTYLPPSTSsklrPRWTFTSpPVTTAQATVP 826
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
144-607 |
1.92e-19 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 96.55 E-value: 1.92e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 144 AAFPPQGasiWRNEFGPGPLLPMkrrgAETERHMIPGNGPplamchQPAPPELFetlcfPIDPASSAPPKATHRMTITSL 223
Cdd:PHA03247 2676 ASSPPQR---PRRRAARPTVGSL----TSLADPPPPPPTP------EPAPHALV-----SATPLPPGPAAARQASPALPA 2737
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSasssppqgtsdtPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:PHA03247 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA------------PAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 304 PPQVTSATSASSSPpqgTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGtldtPSSSSPPQGTSDTP----AS 379
Cdd:PHA03247 2806 DPPAAVLAPAAALP---PAASPAGPLPPP--------------TSAQPTAPPPPPG----PPPPSLPLGGSVAPggdvRR 2864
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 380 SSPPQGTSETPASNS-PPQGTSETPGFSSPPQvttaTLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN 458
Cdd:PHA03247 2865 RPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 459 SPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISP 538
Cdd:PHA03247 2941 PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPP 3020
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 539 PQVISDT---PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PHA03247 3021 PVSLKQTlwpPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPL 3092
|
|
| PLAT |
cd00113 |
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ... |
1130-1232 |
5.91e-19 |
|
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight-stranded beta-barrel. The domain can be found in various domain architectures, as in the case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats. The putative function of this domain is to facilitate access to sequestered membrane- or micelle-bound substrates.
Pssm-ID: 238061 Cd Length: 116 Bit Score: 84.31 E-value: 5.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1130 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHhLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1209
Cdd:cd00113 2 RYTVTIKTGDKKGAGTDSNISLALYGENGNSSDI-PILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWY 80
|
90 100
....*....|....*....|...
gi 115583681 1210 VSQVIVSDMTTRKKWHFQCNCWL 1232
Cdd:cd00113 81 CESITVQALGTKKVYTFPVNRWV 103
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
192-622 |
4.39e-18 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 91.77 E-value: 4.39e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 192 APPELFETLCFPIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPqvtsATSASSSPPQGTSD 271
Cdd:PHA03307 53 VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSP----DPPPPTPPPASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 272 TPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvtsatsasssppqGTSDTPASSSPPQvtsatsasSSPPQgTSDTP 351
Cdd:PHA03307 129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA-------------SDAASSRQAALPL--------SSPEE-TARAP 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ-----VTTATLVSSSPPQVTS 426
Cdd:PHA03307 187 SSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSgcgwgPENECPLPRPAPITLP 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 ETPASSSPTQVTSETPASSSPtqVTSDTPASNSPPQGTSDTPGFSSPtqvttATLVSSSPPQVTSDTPASSSPPQVTSDT 506
Cdd:PHA03307 267 TRIWEASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGPAPSS-----PRASSSSSSSRESSSSSTSSSSESSRGA 339
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 507 PASSSPPQVTSETPASSSPPQVTSDTSASISP---PQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD 583
Cdd:PHA03307 340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPsraPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
|
410 420 430
....*....|....*....|....*....|....*....
gi 115583681 584 TPASSSPTNMTSDTPASSSpPWPvitEVTRPestiPAGR 622
Cdd:PHA03307 420 GAASGAFYARYPLLTPSGE-PWP---GSPPP----PPGR 450
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
207-576 |
2.93e-17 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 87.71 E-value: 2.93e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 207 ASSAPP------KATHRMTITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP 278
Cdd:pfam17823 62 AATAAPapvtltKGTSAAHLNSTEVTAEHTPHGTDLSEPAtrEGAADGAASRAL-----AAAASSSPSSAAQSLPAAIAA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 279 PQVTSATSASSSPPQgtsdTPASSSPpqvtsatsasSSPPQGTSDTPASSSPPQvtsatSASSSPPQGTSDTPASSSPPQ 358
Cdd:pfam17823 137 LPSEAFSAPRAAACR----ANASAAP----------RAAIAAASAPHAASPAPR-----TAASSTTAASSTTAASSAPTT 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 359 GTLDTPSSSSPPQGTSDTPASSSPPQ---GTSETPASNSPPQGTSETPGFSSPPQV-----------TTATLVSSSPPQV 424
Cdd:pfam17823 198 AASSAPATLTPARGISTAATATGHPAagtALAAVGNSSPAAGTVTAAVGTVTPAALatlaaaagtvaSAAGTINMGDPHA 277
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 425 TSETPASSSPTQVTSETPASSS------PT-QVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASS 497
Cdd:pfam17823 278 RRLSPAKHMPSDTMARNPAAPMgaqaqgPIiQVSTDQPVHNTAGEPTP-SPSNTTLEPNTPKSVASTNLAVVTTTKAQAK 356
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 498 SPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisdtpassSPPQVTSE-TP--ASSSPTNMTS---DT 571
Cdd:pfam17823 357 EPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILL---------APEQVATEaTAgtASAGPTPRSSgdpKT 427
|
....*
gi 115583681 572 PASSS 576
Cdd:pfam17823 428 LAMAS 432
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
376-668 |
3.64e-17 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 87.32 E-value: 3.64e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 376 TPASSSPPQGTSETPAS--NSPPQGTSETP---GFSSPP-QVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSS-PT 448
Cdd:pfam17823 63 ATAAPAPVTLTKGTSAAhlNSTEVTAEHTPhgtDLSEPAtREGAADGAASRALA----AAASSSPSSAAQSLPAAIAaLP 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 449 QVTSDTPASNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVtsdtpASSSPPQVTSDTPASSSPPQVTSETPASSSP--P 526
Cdd:pfam17823 139 SEAFSAPRAAACRANASAAP--RAAIAAASAPHAASPAPRT-----AASSTTAASSTTAASSAPTTAASSAPATLTParG 211
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 527 QVTSDT-----SASISPPQVISDTPAS-SSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD------TPASSSPTNMT 594
Cdd:pfam17823 212 ISTAATatghpAAGTALAAVGNSSPAAgTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDpharrlSPAKHMPSDTM 291
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 595 SDTPASSSPP------------WPVITEVTRPeSTIPAGRSLANITSKAQEDSPLGVISThpqmsfqsSTSQALDETAGE 662
Cdd:pfam17823 292 ARNPAAPMGAqaqgpiiqvstdQPVHNTAGEP-TPSPSNTTLEPNTPKSVASTNLAVVTT--------TKAQAKEPSASP 362
|
....*.
gi 115583681 663 rVPTIP 668
Cdd:pfam17823 363 -VPVLH 367
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
182-564 |
4.44e-17 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 88.05 E-value: 4.44e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 182 GPPLAMCHQPAP-PELFETLCFPIDPASSAPPKATHrmtitslTGRPQVTSDTLASSSP-PQGTSDTPASSSPpqvtsat 259
Cdd:pfam05109 465 GPTVSTADVTSPtPAGTTSGASPVTPSPSPRDNGTE-------SKAPDMTSPTSAVTTPtPNATSPTPAVTTP------- 530
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 260 sasssPPQGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDTPASSSPP--QVTS 335
Cdd:pfam05109 531 -----TPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTvgETSP 600
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 336 ATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSP-----PQ 410
Cdd:pfam05109 601 QANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAhptggEN 680
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 411 VTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPT------QVTSDTPASN-SPPQGTSDTPGfSSPTQVTTATLVS 483
Cdd:pfam05109 681 ITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPKNaTSPQAPSGQKT-AVPTVTSTGGKAN 759
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPPQVTSDTPASSSPPQVTSDTPASSSPPQvTSETPASSSPPQvtsdTSASISPPQVISDTPASSSppQVTSETPASSS 563
Cdd:pfam05109 760 STTGGKHTTGHGARTSTEPTTDYGGDSTTPR-TRYNATTYLPPS----TSSKLRPRWTFTSPPVTTA--QATVPVPPTSQ 832
|
.
gi 115583681 564 P 564
Cdd:pfam05109 833 P 833
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
179-576 |
7.66e-17 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 87.51 E-value: 7.66e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 179 PGNGPPLAMChQPAPPELFETLCFPIDPASSAPPKATHRMT--ITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPPQ 254
Cdd:pfam03154 186 PPPPGTTQAA-TAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTPTLHPQRLPSPHPPlqPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 255 VTSATSASssppqgtSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvt 334
Cdd:pfam03154 265 PLPQPSLH-------GQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ-- 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 335 satsasssppqgtSDTPASSSP-PQGTLDTPSSSSPPQgtsdTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTT 413
Cdd:pfam03154 336 -------------SQQPPREQPlPPAPLSMPHIKPPPT----TPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKP 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSP--TQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS 491
Cdd:pfam03154 399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPvlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 492 DTPASSSPPQVTSDTPASSSPPQVTSETPASssppqvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDT 571
Cdd:pfam03154 479 SGPPTSTSSAMPGIQPPSSASVSSSGPVPAA---------VSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNT 549
|
....*
gi 115583681 572 PASSS 576
Cdd:pfam03154 550 PSHAS 554
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
206-577 |
8.54e-17 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 87.28 E-value: 8.54e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 206 PASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDtpaSSSPPQVTSATSASSSPPQGTSDTPASSSPpqvtsat 285
Cdd:pfam05109 461 PASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTE---SKAPDMTSPTSAVTTPTPNATSPTPAVTTP------- 530
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 286 sasssPPQGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDTPASSSPPQGTlDT 363
Cdd:pfam05109 531 -----TPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTVGE-TS 599
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 364 PSSSSPPQ---GTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETP--ASSSPT--- 435
Cdd:pfam05109 600 PQANTTNHtlgGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllTSAHPTgge 679
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS-PTQVTTATLVSSSPPQVTSDTPASS----SPPQVTSDTPASS 510
Cdd:pfam05109 680 NITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSStSTKPGEVNVTKGTPPKNATSPQAPSgqktAVPTVTSTGGKAN 759
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 511 SP---PQVTSETPASSSPPqvTSDTSASISPPQVISDT-----PASSSP--PQVTSETPASSSpTNMTSDTPASSSP 577
Cdd:pfam05109 760 STtggKHTTGHGARTSTEP--TTDYGGDSTTPRTRYNAttylpPSTSSKlrPRWTFTSPPVTT-AQATVPVPPTSQP 833
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
203-603 |
1.11e-15 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 82.81 E-value: 1.11e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTlaSSSPPQGTSDTPA----SSSPPQV-----TSATSASSSPPQGTSDTP 273
Cdd:pfam03546 67 PRKGAPPVPPGKTGPAAAQAQAGKPEEDSES--SSEESDSDGETPAaatlTTSPAQVkplgkNSQVRPASTVGKGPSGKG 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 274 ASSSPPQVTSATSASSSPPQGTSDTPASS--------SPPQVTSATSASSSPPQGTSDTP---ASSSPPQVTSATSASSS 342
Cdd:pfam03546 145 ANPAPPGKAGSAAPLVQVGKKEEDSESSSeesdsegeAPPAATQAKPSGKILQVRPASGPakgAAPAPPQKAGPVATQVK 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 343 PPQGTSDT-------------PASSSPPQG--TLDTPSS-SSPPQGTSDTPASSSPPQGTSETPASNSppQGTSETPGFS 406
Cdd:pfam03546 225 AERSKEDSesseessdseeeaPAAATPAQAkpALKTPQTkASPRKGTPITPTSAKVPPVRVGTPAPWK--AGTVTSPACA 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 407 SPPQVTTAT----LVSSSPPQVTSETPASSSPTQVTSET-------PASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQ 475
Cdd:pfam03546 303 SSPAVARGAqrpeEDSSSSEESESEEETAPAAAVGQAKSvgkglqgKAASAPTKGPSGQGTAPVPPGKTG-----PAVAQ 377
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 476 VTTATLVSSSPPQVTSD------TPASSSPPQVTSDTPASSSPPQVTSETPASSSP-------PQVTSDTSASISPPQVI 542
Cdd:pfam03546 378 VKAEAQEDSESSEEESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAPgkvvaaaAQAKQGSPAKVKPPART 457
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 543 SDTPASSSPPQ---------VTSETPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546 458 PQNSAISVRGQasvpavgkaVATAAQAQKGPVGGPQEEDSESSEEESDSEeeAPAQAKPSGKTPQVRAASAP 529
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
211-608 |
1.55e-15 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 83.28 E-value: 1.55e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 211 PPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTpASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSS 290
Cdd:pfam03154 94 PERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDE-GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPP 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 291 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSD-----TPASSSPPQVTSATSASSSPPQGTSDT-----PASSSPPQG- 359
Cdd:pfam03154 173 VLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSvppqgSPATSQPPNQTQSTAAPHTLIQQTPTLhpqrlPSPHPPLQPm 252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 360 TLDTPSSSSPPQGTSDT-------PASSS------------PPQG---TSETPASNSPPQGTSETPGFSSPPQVTTATLV 417
Cdd:pfam03154 253 TQPPPPSQVSPQPLPQPslhgqmpPMPHSlqtgpshmqhpvPPQPfplTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQS 332
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 418 SSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgFSSPTQVTT-------ATLVSSSPP--- 487
Cdd:pfam03154 333 QLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSP-FQMNSNLPPppalkplSSLSTHHPPsah 411
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 488 ----QVTSDTPASSSPP-------QVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:pfam03154 412 ppplQLMPQSQQLPPPPaqppvltQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPG 491
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 557 ETPASSSPTNMTSDTPASSS----PTNMTSDTPASSSPTNMTSDTPASSSPPWPVI 608
Cdd:pfam03154 492 IQPPSSASVSSSGPVPAAVScplpPVQIKEEALDEAEEPESPPPPPRSPSPEPTVV 547
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
323-709 |
3.01e-15 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 81.27 E-value: 3.01e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 323 DTPASSSPPQVTSATSASSSppQGTSdTPASSSPPQGTLDTPSSSSPPQGTS--------DTPASSSPPQGTSETPASNS 394
Cdd:pfam03546 37 ETPAAKTPLQAKPSGKTPQV--RAAS-APAKESPRKGAPPVPPGKTGPAAAQaqagkpeeDSESSSEESDSDGETPAAAT 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 395 PPQGTSETPGFSSPPQVTTATLVSSSP------PQVTSETPASSSPTQVTSETPASSSPTQvTSDTPASNSPPQGTSDTP 468
Cdd:pfam03546 114 LTTSPAQVKPLGKNSQVRPASTVGKGPsgkganPAPPGKAGSAAPLVQVGKKEEDSESSSE-ESDSEGEAPPAATQAKPS 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 469 GFSSPTQVTT--ATLVSSSPPQVTSdtPASSsppQVTSDTP---ASSSPPQVTSE-------TPASSSPPQVTSDTSAsi 536
Cdd:pfam03546 193 GKILQVRPASgpAKGAAPAPPQKAG--PVAT---QVKAERSkedSESSEESSDSEeeapaaaTPAQAKPALKTPQTKA-- 265
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 537 SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT------- 609
Cdd:pfam03546 266 SPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAAAVGQaksvgkg 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 610 ---------------EVTRPESTIPAGRSLANITSKAQEDS--------PLGVISTHPQMSFQSSTSQALDETAGERVPT 666
Cdd:pfam03546 346 lqgkaasaptkgpsgQGTAPVPPGKTGPAVAQVKAEAQEDSesseeesdSEEAAATPAQVKASGKTPQAKANPAPTKASS 425
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 115583681 667 IPDfqAHSEFQKACAILQRLRDFLPT---SPTSAQKNNSWSSQTPA 709
Cdd:pfam03546 426 AKG--AASAPGKVVAAAAQAKQGSPAkvkPPARTPQNSAISVRGQA 469
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
373-609 |
5.42e-15 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 80.57 E-value: 5.42e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 373 TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 453 DTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPpqVTSDT 532
Cdd:COG3469 81 TATAAAAAA---------TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSA--GSTTT 149
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSsptnmtsdTPASSSPPWPVIT 609
Cdd:COG3469 150 TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT--------GPPTPGLPKHVLV 218
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
183-623 |
3.24e-14 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 78.66 E-value: 3.24e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 183 PPLAMCHQPAPPelfetlcfpidPASSAPPKAThrmtitsltgrpqvtsdTLASSSPPQGTSDTPASSSPPqvtsatsaS 262
Cdd:pfam03154 171 PPVLQAQSGAAS-----------PPSPPPPGTT-----------------QAATAGPTPSAPSVPPQGSPA--------T 214
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 263 SSPPQGTSDTPASSSPPQvtsatsasssppQGTSDTPASSSPPQvtsatsasSSPPQGTSDTPASSSPPQvtsatsasss 342
Cdd:pfam03154 215 SQPPNQTQSTAAPHTLIQ------------QTPTLHPQRLPSPH--------PPLQPMTQPPPPSQVSPQ---------- 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 343 ppqgtSDTPASSSPPQGTLDTPSSSSPPQgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPP 422
Cdd:pfam03154 265 -----PLPQPSLHGQMPPMPHSLQTGPSH--MQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgFSSPtqvttatlvSSSPPqvtsdTPASSSPPQV 502
Cdd:pfam03154 338 QPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSP-FQMN---------SNLPP-----PPALKPLSSL 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 503 TSDTPASSSPP--QVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETP-------ASSSPTNMTSDTPA 573
Cdd:pfam03154 403 STHHPPSAHPPplQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqhpfvPGGPPPITPPSGPP 482
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 574 SSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT-------EVTRPESTIPAGRS 623
Cdd:pfam03154 483 TSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQikeealdEAEEPESPPPPPRS 539
|
|
| SPT5 |
COG5164 |
Transcription elongation factor SPT5 [Transcription]; |
320-632 |
9.81e-14 |
|
Transcription elongation factor SPT5 [Transcription];
Pssm-ID: 444063 [Multi-domain] Cd Length: 495 Bit Score: 76.22 E-value: 9.81e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 320 GTSDTPA----SSSPPQvtsatsasssppQGTSDTPASSSPPQGTLDTPSSSSPPQGT-SDTPA----SSSPPQGT-SET 389
Cdd:COG5164 24 QGSTKPAqnqgSTRPAG------------NTGGTRPAQNQGSTTPAGNTGGTRPAGNQgATGPAqnqgGTTPAQNQgGTR 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 390 PASNSPPQGTSETPGFSSPPQVTTATLV-----SSSPPQVTSETPASssPTQVTSETPASSSPTQVTsdTPASNSPPQGT 464
Cdd:COG5164 92 PAGNTGGTTPAGDGGATGPPDDGGATGPpddggSTTPPSGGSTTPPG--DGGSTPPGPGSTGPGGST--TPPGDGGSTTP 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 465 SDTPGFSSPTQVTTATlvssSPPqvtsdTPASSSPPQVTSDTPAS--SSPPQVTSETPASSSPPQVTSDTSASispPQVI 542
Cdd:COG5164 168 PGPGGSTTPPDDGGST----TPP-----NKGETGTDIPTGGTPRQgpDGPVKKDDKNGKGNPPDDRGGKTGPK---DQRP 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 543 SDTPASSSPPQVTSETPASSSPTNmTSDTPASSSPTNMTSDTPASSS--PTNMTSDTPASSSPPWPVITEVTRPESTIPA 620
Cdd:COG5164 236 KTNPIERRGPERPEAAALPAELTA-LEAENRAANPEPATKTIPETTTvkDLATVLGKKGSDLVTNLMKKGKGTNINAALD 314
|
330
....*....|..
gi 115583681 621 GRSLANITSKAQ 632
Cdd:COG5164 315 FETAATIALEGN 326
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
325-631 |
1.07e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 77.67 E-value: 1.07e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 325 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPqgtldTPSSSSPPQGTSDTPASSSPPQ------GTSET-------PA 391
Cdd:PHA03247 2478 PVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPP-----APSRLAPAILPDEPVGEPVHPRmltwirGLEELasddagdPP 2552
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 392 SNSPPQGTSETPGFSSP----------PQVTTATLVSSSPPQVTS-ETPASSSPTQVTSETPASSSPTQVTSDTPASNSP 460
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPpprpaprpsePAVTSRARRPDAPPQSARpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 461 PQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTPASSSPPQVTSETPAsssPPQVTSDTSASISPPQ 540
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL-----GRAAQASSPPQRPRRRAA---RPTVGSLTSLADPPPP 2704
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 541 visDTPASSSPPQVTSETPASSSPTNMTSDTPASSS-----PTNMTSDTPASSS--PTNMTSDTPASSSPPW-PVITevT 612
Cdd:PHA03247 2705 ---PPTPEPAPHALVSATPLPPGPAAARQASPALPAapappAVPAGPATPGGPArpARPPTTAGPPAPAPPAaPAAG--P 2779
|
330
....*....|....*....
gi 115583681 613 RPESTIPAGRSLANITSKA 631
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESL 2798
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
194-637 |
2.57e-13 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 75.85 E-value: 2.57e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 194 PELFETLCFPIDP-ASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTpassSPPQVTSATSASSSPPQGT--- 269
Cdd:COG5665 168 PVAVVVTTMIAVPsAPAAPPNAVDYSVLVPIAAQDPAASVSTPQAFNASATSGR----SQHIVQAAKRVGVEWWGDPsll 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPpqvTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT-- 347
Cdd:COG5665 244 ATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQ---LTTSNTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSvl 320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 348 --SDTPASSSPPqgtldTPSSSSPPQGTSDTPASSSPpqgtsetpasnsppqgtsetpgfSSPPQVTTATLVSSSPPQVT 425
Cdd:COG5665 321 inSDSPTSEDPA-----TASVPTTEETTAFTTPSSVP-----------------------STPAEKDTPATDLATPVSPT 372
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SetPASSSPTQVTSETPASSSPTQVTSDTPASNSPP---QGTSDTPGFSSPTQvtTATLVSSSPPQvtsdTPASSSPPQV 502
Cdd:COG5665 373 P--PETSVDKKVSPDSATSSTKSEKEGGTASSPMPPniaIGAKDDVDATDPSQ--EAKEYTKNAPM----TPEADSAPES 444
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 503 TSDTPAS---SSPPQVTSET---------PASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:COG5665 445 SVRTEASpsaGSDLEPENTTlrdpapnaiPPPEDPSTIGRLSSGDklaneTGPPVIRRDSTPSSTADQSIVGVLAFGLDQ 524
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 566 NMTSdtpASSSPTNMTSDTPASSSPTNMTSDTpaSSSPPWPvITEVTRPESTIpAGRSLANITSKAQEDSPL 637
Cdd:COG5665 525 RTQA---EISVEAASRSNPLLNSQVKSFPLGK--RSEGAKG-KTQTDRGISNA-LVNASALITNLKSAARRS 589
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
414-727 |
3.41e-13 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 74.61 E-value: 3.41e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 414 ATLVSSSPPQVTSETPASS-SPTQVTSETpassSPTQVTSDTPASNsppQGTSDTPGFSSPTQVTtatlvSSSPPQVTSD 492
Cdd:pfam17823 62 AATAAPAPVTLTKGTSAAHlNSTEVTAEH----TPHGTDLSEPATR---EGAADGAASRALAAAA-----SSSPSSAAQS 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 493 TPASSS-PPQVTSDTP---ASSSPPQVTSETP--ASSSPPQVTSDT-SASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:pfam17823 130 LPAAIAaLPSEAFSAPraaACRANASAAPRAAiaAASAPHAASPAPrTAASSTTAASSTTAASSAPTTAASSAPATLTPA 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 566 NMTSD------TPASSSPTNM--TSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQE---- 633
Cdd:pfam17823 210 RGISTaatatgHPAAGTALAAvgNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHmpsd 289
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 634 ---DSPLGVISTHPQ-MSFQSSTSQALDETAGERVPTIPDFQAHSEFQKACAilqrlrdflPTSPTSAQKNNSWSSQTPA 709
Cdd:pfam17823 290 tmaRNPAAPMGAQAQgPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVA---------STNLAVVTTTKAQAKEPSA 360
|
330
....*....|....*...
gi 115583681 710 VSCPFQPLGRLTTTEKSS 727
Cdd:pfam17823 361 SPVPVLHTSMIPEVEATS 378
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
351-605 |
3.87e-13 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 75.27 E-value: 3.87e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 351 PASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQ---VTSE 427
Cdd:PRK07003 362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAtadRGDD 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 428 TPASSSPTQVTSETPASSSPT-QVTSDTPASNSPPQG--TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK07003 442 AADGDAPVPAKANARASADSRcDERDAQPPADSGSASapASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 505 DTPASSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISD------TPASSSPPQVTSETPASSSPtnmTSDTPA 573
Cdd:PRK07003 522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALdvlrnAGMRVSSDrgaraaAAAKPAAAPAAAPKPAAPRV---AVQVPT 598
|
250 260 270
....*....|....*....|....*....|..
gi 115583681 574 SSSPTNMTSDTPASSSPTNMTSDTPaSSSPPW 605
Cdd:PRK07003 599 PRARAATGDAPPNGAARAEQAAESR-GAPPPW 629
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
209-603 |
1.49e-12 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 72.80 E-value: 1.49e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 209 SAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSD--TPASSSPPQVTSATSASSSppQGTSdTPASSSP-----PQV 281
Cdd:pfam03546 2 PATPGKAGPAATQAKAGKPEEDSE---SSSEEESDSEeeTPAAKTPLQAKPSGKTPQV--RAAS-APAKESPrkgapPVP 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 282 TSATSASSSPPQ-GTSDTPASSSPPQVTSATSASSSPPQGTSdtPASSSP----PQVTSATSASS-SPPQGTSDTPASSS 355
Cdd:pfam03546 76 PGKTGPAAAQAQaGKPEEDSESSSEESDSDGETPAAATLTTS--PAQVKPlgknSQVRPASTVGKgPSGKGANPAPPGKA 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 356 PPQGTL--------DTPSSSSPPQGTSDTPASSS---PPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:pfam03546 154 GSAAPLvqvgkkeeDSESSSEESDSEGEAPPAATqakPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSE 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 425 TSETPASSSPTQVTSETPASSSPTQVTSDTPAsnSPPQGTSDTPgfsSPTQVTTATLVSSSPPQV-TSDTPASSSPPQVT 503
Cdd:pfam03546 234 SSEESSDSEEEAPAAATPAQAKPALKTPQTKA--SPRKGTPITP---TSAKVPPVRVGTPAPWKAgTVTSPACASSPAVA 308
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 SDT--PASSSPPQVTSETPASSSPP----QVTS-----DTSASISPPQVISDTPASSSPP--------QVTSETPASSSP 564
Cdd:pfam03546 309 RGAqrPEEDSSSSEESESEEETAPAaavgQAKSvgkglQGKAASAPTKGPSGQGTAPVPPgktgpavaQVKAEAQEDSES 388
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 115583681 565 TNMTSD------TPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546 389 SEEESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAP 433
|
|
| DUF2967 |
pfam11179 |
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ... |
353-753 |
2.53e-12 |
|
Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.
Pssm-ID: 402654 [Multi-domain] Cd Length: 954 Bit Score: 72.76 E-value: 2.53e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 353 SSSPPQGTLDTPSSSsPPQGTSDTPASSSPPQGTSET--PASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQVTSetpa 430
Cdd:pfam11179 15 SSAPPHAALAGPITA-APTGAAAAAATSTAAASAASStiTAPGAGPGGTPTSR--SRGAQAMTASLAHAAQGNANA---- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 431 ssspTQVTSETPASSSPTQVTSDTPAsnsppqgtsdtpGFSSptqVTTATLVSSSPPQVTSDTpaSSSPPQvtsdTPASS 510
Cdd:pfam11179 88 ----NKSTRNNSNSSNNNGKPKPLAA------------CYMS---TRSAAMMALALGQQSGEK--KDKKPA----AGKAA 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 511 SPPQVTSETPASSSPPQvTSDTSASISPPQVISDTPASSSPPQV-----TSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:pfam11179 143 SPAQSQSQSQSQNASPH-TNNRAVSMTRPAATRRLPNAAAMSNVnaansTCTATATSLPSNRARSKPSTPTATRAAAQLN 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 586 AS---SSPTNMT-SDTPASSSPPWPVITEVTRpeSTIPAGRSLAN-ITSKAQEDSPLGVISTHPQMS-FQSSTSQAL--- 656
Cdd:pfam11179 222 GMgifSGGSNSSgSDNDGFSASGSSAATALRR--LYFKSGRSIKNkINASTSSSTPLNGLPLNAVSNaFHNSVGGATamh 299
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 657 DETAGERVP----------TIPDFQAHSEFQKACAILQRL--RDFLPTSPTSAQKNNSWSSQtPAVSCPFQPLgrLTTTE 724
Cdd:pfam11179 300 AMGTAGGVPklvvmgtssaSIPDTTINTSTDSACTLITNVthTDTSETCDSLDLGDNSGPSE-PLFSSLEEPL--LTAIH 376
|
410 420
....*....|....*....|....*....
gi 115583681 725 ksshqmaqqdmeqhpMDGAHNAFGISAGG 753
Cdd:pfam11179 377 ---------------IDSEHEGFGGMAGG 390
|
|
| LH2 |
smart00308 |
Lipoxygenase homology 2 (beta barrel) domain; |
1130-1235 |
4.99e-12 |
|
Lipoxygenase homology 2 (beta barrel) domain;
Pssm-ID: 214608 [Multi-domain] Cd Length: 105 Bit Score: 64.20 E-value: 4.99e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1130 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNsgDSPSWY 1209
Cdd:smart00308 2 KYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEH--RHPEWF 79
|
90 100
....*....|....*....|....*.
gi 115583681 1210 VSQVIVSDMTTRKKWHFQCNCWLAVD 1235
Cdd:smart00308 80 LKSITVKDLPTGGKYHFPCNSWVYPD 105
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
147-513 |
5.47e-12 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 71.74 E-value: 5.47e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 147 PPQGASIWRNEFGPGPLLP--MKRRGAETERHMIPGNGPPlAMCHQPAPPELFETLCFPIDPASSAPPKATHRMTITSLT 224
Cdd:PHA03307 79 APANESRSTPTWSLSTLAPasPAREGSPTPPGPSSPDPPP-PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 225 GRPQVTSDT-------LASSSPPQG--TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP---------------PQ 280
Cdd:PHA03307 158 SPAAVASDAassrqaaLPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPapapgrsaaddagasSS 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 281 VTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassSPPQGTSDTPASSSPPQGT 360
Cdd:PHA03307 238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS----------SSPRERSPSPSPSSPGSGP 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 361 LDTPSS-----SSPPQGTSDTPASSSPPqgtSETPASnSPPQGTSETPGFSSPPqvttatlvSSSPPQVTSETPASSSPT 435
Cdd:PHA03307 308 APSSPRassssSSSRESSSSSTSSSSES---SRGAAV-SPGPSPSRSPSPSRPP--------PPADPSSPRKRPRPSRAP 375
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtsdTPASSSPP 513
Cdd:PHA03307 376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP------WPGSPPPP 447
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
232-643 |
1.33e-11 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 70.01 E-value: 1.33e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 232 DTLASSSPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSAT 311
Cdd:PRK07764 379 ERLERRLGVAGGAGAPAAAAPSA--------------AAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 312 SASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPA-SSSPPQ------ 384
Cdd:PRK07764 445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATlRERWPEilaavp 524
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 385 -------GTSETPASNSPPQGTSETPGFSSPP------QVTTATLVSSSPPQVTSET----------PASSSPTQVTSET 441
Cdd:PRK07764 525 krsrktwAILLPEATVLGVRGDTLVLGFSTGGlarrfaSPGNAEVLVTALAEELGGDwqveavvgpaPGAAGGEGPPAPA 604
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07764 605 SSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 522 SSSPPQVtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP---------ASSSPTN 592
Cdd:PRK07764 685 PAPAAPA---APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPddppdpagaPAQPPPP 761
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 593 MTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH 643
Cdd:PRK07764 762 PAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEE 812
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
442-612 |
1.40e-11 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 69.60 E-value: 1.40e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvTSDTPASSSPPQVTSETP- 520
Cdd:pfam17823 48 PRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREG---AADGAASRALAAAASSSPs 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 521 --ASSSPPQVTSDTSASISPPQvisdTPASSSPPQVTSETP--ASSSPTNMTSDT-PASSSPTNMTSDTPASSSPTNMTS 595
Cdd:pfam17823 125 saAQSLPAAIAALPSEAFSAPR----AAACRANASAAPRAAiaAASAPHAASPAPrTAASSTTAASSTTAASSAPTTAAS 200
|
170
....*....|....*..
gi 115583681 596 DTPASSSPPWPVITEVT 612
Cdd:pfam17823 201 SAPATLTPARGISTAAT 217
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
364-621 |
2.65e-11 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 68.80 E-value: 2.65e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 364 PSSSSPPQ--GTSDTPASSSPPQGTSETPASNS---PPQGTSETPGFSSPpqvtTATLVSSSPPqvTSETPASssptqvT 438
Cdd:PLN03209 324 PSQRVPPKesDAADGPKPVPTKPVTPEAPSPPIeeePPQPKAVVPRPLSP----YTAYEDLKPP--TSPIPTP------P 391
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 439 SETPASSSPTQVTS--DTPASNSPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtSDTPASSSPPQVT 516
Cdd:PLN03209 392 SSSPASSKSVDAVAkpAEPDVVPSPGSASNVPE-VEPAQVEAKKTRPLSPYARYEDLKPPTSP----SPTAPTGVSPSVS 466
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 517 SETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA-SSSPTNMTS 595
Cdd:PLN03209 467 STSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSApPTALADEQH 546
|
250 260
....*....|....*....|....*...
gi 115583681 596 DTPASSSP--PWPVITEVTRPESTIPAG 621
Cdd:PLN03209 547 HAQPKPRPlsPYTMYEDLKPPTSPTPSP 574
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
352-604 |
2.94e-11 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 67.89 E-value: 2.94e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPAS 431
Cdd:pfam13254 58 PGLSPTKLSREGSPESTSRPSSSHSEATIVRHSKDDERPSTPDEGFVKPALPRHSRSSSALSNTGSEEDSPS-LPTSPPS 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 432 SSPTQVT---SETPAS---------SSPTQvtSDTPASNSPP------------QGTSDTPGFSSPTQVTTATLVSSSPP 487
Cdd:pfam13254 137 PSKTMDPkrwSPTKSSwlesalnrpESPKP--KAQPSQPAQPawmkelnkirqsRASVDLGRPNSFKEVTPVGLMRSPAP 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 488 QVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTS--ASISPPQVISDTPASS-----SPPQVTSETPA 560
Cdd:pfam13254 215 GGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPppKTKELPKDSEEPAAPSksaeaSTEKKEPDTES 294
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 115583681 561 SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP 604
Cdd:pfam13254 295 SPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPP 338
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
294-676 |
3.63e-11 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 68.86 E-value: 3.63e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 294 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGT 373
Cdd:PRK07764 401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP 480
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 374 SDTPASSSPPQGTSETPASNSPPQGTSETPGF-SSPPQV----------------TTATLVSSSPPQVT--SETPAS--- 431
Cdd:PRK07764 481 APAPPAAPAPAAAPAAPAAPAAPAGADDAATLrERWPEIlaavpkrsrktwaillPEATVLGVRGDTLVlgFSTGGLarr 560
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 432 -SSP-------TQVTSETPASSSPTqVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:PRK07764 561 fASPgnaevlvTALAEELGGDWQVE-AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEA 639
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 SDTPASSS-PPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPtnmTSDTPASSSPTNMTS 582
Cdd:PRK07764 640 SAAPAPGVaAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP---APAPAATPPAGQADD 716
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 583 DTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPlgviSTHPQMSFQSSTSQALDETAGE 662
Cdd:PRK07764 717 PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAP----AAAPPPSPPSEEEEMAEDDAPS 792
|
410
....*....|....
gi 115583681 663 RVPtiPDFQAHSEF 676
Cdd:PRK07764 793 MDD--EDRRDAEEV 804
|
|
| SPT5 |
COG5164 |
Transcription elongation factor SPT5 [Transcription]; |
203-656 |
3.72e-11 |
|
Transcription elongation factor SPT5 [Transcription];
Pssm-ID: 444063 [Multi-domain] Cd Length: 495 Bit Score: 68.13 E-value: 3.72e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPA----SSSPPQvtsatsasssppQGTSDTPA---- 274
Cdd:COG5164 3 LYGPGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAqnqgSTTPAG------------NTGGTRPAgnqg 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 275 SSSPPQvtsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASS 354
Cdd:COG5164 71 ATGPAQnqg--------gtTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGS 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 355 SPPQGTLDTPSSSSPPQGtsDTPASSSPPQGTSETPA----SNSPPQGTSETPGfssPPQVTTATLVSSSPPQVTSETPA 430
Cdd:COG5164 143 TPPGPGSTGPGGSTTPPG--DGGSTTPPGPGGSTTPPddggSTTPPNKGETGTD---IPTGGTPRQGPDGPVKKDDKNGK 217
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 431 SSSPTQVTSETPAsSSPTQVTSdtPASNSPPQGTSDTPGFSSPTQvttatlvSSSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:COG5164 218 GNPPDDRGGKTGP-KDQRPKTN--PIERRGPERPEAAALPAELTA-------LEAENRAANPEPATKTIPETTTVKDLAT 287
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 511 SPPQVTSETPASS------SPPQVTSDTsasiSPPQVISDTpassspPQVTSETPASSSPT-NMTSDTPASSSPTNMTSD 583
Cdd:COG5164 288 VLGKKGSDLVTNLmkkgkgTNINAALDF----ETAATIALE------GNVITEKEIEADIMeTVTTEEQETDSLLEETPP 357
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 584 TPasssPTNMTSDTPASSSPPWPVITEVTRPESTIP-----AGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQAL 656
Cdd:COG5164 358 VP----VVMGHVDHGKTSLLDAIRHSDVTDGEVGTIsqhigAYTVQIAGTPITFLDTPGFESFTAMAMRVAQITDIAI 431
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
417-595 |
5.74e-11 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 67.00 E-value: 5.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 417 VSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfssptqvTTATLVSSSPPQVTSDTPAS 496
Cdd:pfam05539 157 LRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTS 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 497 SSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPASSS 576
Cdd:pfam05539 228 SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRR---KTPPATSNRRSPHSTATPPPTTKRQETGRPTPR 304
|
170 180
....*....|....*....|....*
gi 115583681 577 PTNMT--SDTPASSSPT----NMTS 595
Cdd:pfam05539 305 PTATTqsGSSPPHSSPPgvqaNPTT 329
|
|
| DUF2967 |
pfam11179 |
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ... |
236-668 |
5.79e-11 |
|
Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.
Pssm-ID: 402654 [Multi-domain] Cd Length: 954 Bit Score: 68.14 E-value: 5.79e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 236 SSSPPQGTSDTPASSsPPQVTSATSASSSPPQGTSDTPASSspPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASS 315
Cdd:pfam11179 15 SSAPPHAALAGPITA-APTGAAAAAATSTAAASAASSTITA--PGAGPGGTPTSRSRGAQAMTASLAHAAQGNANANKST 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 316 SPPQGTSDTPASSSPPQ---VTSATSASSSPPQGTSDTPASSSPPQgtldTPSSSSPPQGTSDTPASSSPPQgTSETPAS 392
Cdd:pfam11179 92 RNNSNSSNNNGKPKPLAacyMSTRSAAMMALALGQQSGEKKDKKPA----AGKAASPAQSQSQSQSQNASPH-TNNRAVS 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 393 NSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTsetpASSSPTQVTSDTPASNSPPQGTSDTPGFS- 471
Cdd:pfam11179 167 MTRPAATRRLPNAAAMSNVNAANSTCTATATSLPSNRARSKPSTPT----ATRAAAQLNGMGIFSGGSNSSGSDNDGFSa 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 472 --SPTQVTTATLVSSSPPQVTSDTPASSsppqvTSDTPASSSPPQVTSET--PASSSPPQVTSDTSASISPPQVISDTpA 547
Cdd:pfam11179 243 sgSSAATALRRLYFKSGRSIKNKINAST-----SSSTPLNGLPLNAVSNAfhNSVGGATAMHAMGTAGGVPKLVVMGT-S 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 548 SSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNmtsdtPASSSPPWPVITEVT-----RPESTIPAGR 622
Cdd:pfam11179 317 SASIPDTTINTSTDSACTLITNVTHTDTSETCDSLDLGDNSGPSE-----PLFSSLEEPLLTAIHidsehEGFGGMAGGR 391
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 115583681 623 SLANITSKAQ-EDSPLGVISTHPQMSFQSST-SQ--ALDETAGERVPTIP 668
Cdd:pfam11179 392 GGANGRGATElELTSCSRYPPRPDMNLQDSTeSQesCLSILTGEPSSTTP 441
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
183-624 |
1.75e-10 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 66.63 E-value: 1.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 183 PPLAMCHQPAPPELFETLCFPID-PASSAPPKAThrmtITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSA 261
Cdd:PHA03378 529 PPQPRAGRRAPCVYTEDLDIESDePASTEPVHDQ----LLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQ 604
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 262 SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDT-------PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 334
Cdd:PHA03378 605 TPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPItfnvlvfPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTM 684
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 335 SATSASSSPPQGTSDTPASSSPPQGtldTPSSSSPPQGtsdTPASSSPPQGtseTPASNSPPQGtseTPGFSSPPQvttA 414
Cdd:PHA03378 685 LPIQWAPGTMQPPPRAPTPMRPPAA---PPGRAQRPAA---ATGRARPPAA---APGRARPPAA---APGRARPPA---A 749
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 415 TLVSSSPPQvtsetpASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdTPGFSSPTQVTTATLVSSSPPQVTSDTP 494
Cdd:PHA03378 750 APGRARPPA------AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRG---APTPQPPPQAGPTSMQLMPRAAPGQQGP 820
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 495 ASSSPPQ-----VTSDTPASSSPPQVTSETPASSSP-PQvtSDTSASISPPQVIsdTPASSSPPQVtsetpasssPTNMT 568
Cdd:PHA03378 821 TKQILRQlltggVKRGRPSLKKPAALERQAAAGPTPsPG--SGTSDKIVQAPVF--YPPVLQPIQV---------MRQLG 887
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 569 SDTPASSSptnmtsdtPASSSPTNMTSDTPASSSPPWPVITEVTR------PESTIPAGRSL 624
Cdd:PHA03378 888 SVRAAAAS--------TVTQAPTEYTGERRGVGPMHPTDIPPSKRaktdayVESQPPHGGQS 941
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
417-716 |
1.93e-10 |
|
Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 66.23 E-value: 1.93e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 417 VSSSPPqvTSETPASSSPTQVTSETP-ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQV-TSDTP 494
Cdd:pfam04388 276 PTASPY--TDQQSSYGSSTSTPSSTPrLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGMTTPPTSPGMVpTTPSE 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 495 ASSSPPQVTSDtpaSSSPPQVTSE-----TPASSSPPQVTSDTSASISPPQVISDTPASSSPPQvTSETPASSSP----- 564
Cdd:pfam04388 354 LSPSSSHLSSR---GSSPPEAAGEatpetTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPR-KDGRSQSSFPplskq 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 565 --TNMTSDTPASSSPTNMTSDTpasSSPTNMTSDTpaSSSPPWPVITEVTRPESTipagRSLANITSKAQEdsplgviST 642
Cdd:pfam04388 430 apTNPNSRGLLEPPGDKSSVTL---SELPDFIKDL--ALSSEDSVEGAEEEAAIS----QELSEITTEKNE-------TD 493
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 643 HPQMSFQSSTSQALDETAG-ERVPTIPDFQAHSEFQKACailqrlrDFLPTSPTSAQKNNSWSSQTPAVSCPFQP 716
Cdd:pfam04388 494 CSRGGLDMPFSRTMESLAGsQRSRNRIASYCSSTSQSDS-------HGPATTPESKPSALAEDGLRRTKSCSFKQ 561
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
438-608 |
2.28e-10 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 63.00 E-value: 2.28e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 438 TSETPASSSPTQVTSDTPAsnsppqgTSDTPGFSSPTQVTTATLVSSSPPqVTSDTPASSSPPQVTSdTPASSSPPQVTS 517
Cdd:PHA03255 25 TSSGSSTASAGNVTGTTAV-------TTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 518 ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDT 597
Cdd:PHA03255 96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQ 175
|
170
....*....|...
gi 115583681 598 PASSS--PPWPVI 608
Cdd:PHA03255 176 PSLSYglPLWTLV 188
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
420-634 |
5.56e-10 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 64.56 E-value: 5.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 420 SPPQ-VTSETPASSSPTQVTSETPASssptqvtsdtPASNSPPQGTSDTPGFSSPTqvttATLVSSSPPQVTSDTPASSS 498
Cdd:PLN03209 329 PPKEsDAADGPKPVPTKPVTPEAPSP----------PIEEEPPQPKAVVPRPLSPY----TAYEDLKPPTSPIPTPPSSS 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 499 PPQVTS-------DTPASSSPPQVTSETPASSsPPQVTSDTSASISPPQVISD--TPASSSP-PQVTSETPASSSPT-NM 567
Cdd:PLN03209 395 PASSKSvdavakpAEPDVVPSPGSASNVPEVE-PAQVEAKKTRPLSPYARYEDlkPPTSPSPtAPTGVSPSVSSTSSvPA 473
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 568 TSDTPASSSPTNMTSDTPASSSPTNMTS-----DTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQED 634
Cdd:PLN03209 474 VPDTAPATAATDAAAPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQ 545
|
|
| Ion_trans |
pfam00520 |
Ion transport protein; This family contains sodium, potassium and calcium ion channels. This ... |
1932-2123 |
6.56e-10 |
|
Ion transport protein; This family contains sodium, potassium and calcium ion channels. Members of this family have 6 transmembrane helices, in which the last two helices flank a loop that determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms a tetramer in the membrane.
Pssm-ID: 459842 [Multi-domain] Cd Length: 238 Bit Score: 61.51 E-value: 6.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1932 RLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLhkkmqqyhcdrdrfisfyeaLRVnsavthLRGFLLLfatvRVWDLL 2011
Cdd:pfam00520 60 KKRYFRSPWNILDFVVVLPSLISLVLSSVGSLSG--------------------LRV------LRLLRLL----RLLRLI 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 2012 RHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFG----------WSISDYQSFFRSIVTVVGLL----MGT 2077
Cdd:pfam00520 110 RRLEGLRTLVNSLIRSLKSLGNLLLLLLLFLFIFAIIGYQLFGgklktwenpdNGRTNFDNFPNAFLWLFQTMttegWGD 189
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 115583681 2078 SKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAFGKERK 2123
Cdd:pfam00520 190 IMYDTIDGKGEFWAYIYFVSFIILGGFLLLNLFIAVIIDNFQELTE 235
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
364-577 |
1.03e-09 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 63.14 E-value: 1.03e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfssppqvTTATLVSSSPPQVTSETPASSSPTQVTSETPA 443
Cdd:pfam05539 169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTSSNPEPQTEPPPS 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 444 SSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVT-------TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQvT 516
Cdd:pfam05539 240 QRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTppatsnrRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH-S 318
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 517 SETPASSSPPQVTSDTSASISPPQVIS-----DTPASSSPPQVTSETPASSSPTNMTSDTPASSSP 577
Cdd:pfam05539 319 SPPGVQANPTTQNLVDCKELDPPKPNSicygvGIYNEALPRGCDIVVPLCSTYTIMCMDTYYSKPF 384
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
345-514 |
1.46e-09 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 62.76 E-value: 1.46e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSPPQGTLDTPSSSSPPQG----TSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSS 420
Cdd:pfam05539 176 KTTSWPTEVSHPTYPSQVTPQSQPATQGhqtaTANQRLSSTEPVGTQGTTTSSNPEPQT--EPPPSQRGPSGSPQHPPST 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 421 PPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:pfam05539 254 TSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET-----GRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
|
170
....*....|....
gi 115583681 501 QVTSDTPASSSPPQ 514
Cdd:pfam05539 329 TQNLVDCKELDPPK 342
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
360-596 |
2.42e-09 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 62.50 E-value: 2.42e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 360 TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNsppqgTSETPgfssppQVTTATLVSSSPPQVTSETPASSSP-TQVT 438
Cdd:PRK08581 14 TLVLPTLTSPTAYADDPQKDSTAKTTSHDSKKSN-----DDETS------KDTSSKDTDKADNNNTSNQDNNDKKfSTID 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 439 SETPASS---SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvTSDTPASSSPPQVTSDTPASSSPPqv 515
Cdd:PRK08581 83 SSTSDSNniiDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD-YEQPRNSEKSTNDSNKNSDSSIKN-- 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 516 tSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK08581 160 -DTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSE 238
|
.
gi 115583681 596 D 596
Cdd:PRK08581 239 D 239
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
377-593 |
3.33e-09 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 60.72 E-value: 3.33e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 377 PASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQV-TSETPASSSPTQVTSDTP 455
Cdd:PRK10905 23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQGqTPVATDGQQRVEVQGDLN 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 456 ASNSPPQGTSDTPGFSS----PTQVTT-----------ATLVSSSPPQVTSDTPA------SSSPPQVTSDTPASSSPPQ 514
Cdd:PRK10905 103 NALTQPQNQQQLNNVAVnstlPTEPATvapvrngnasrQTAKTQTAERPATTRPArkqaviEPKKPQATAKTEPKPVAQT 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 515 VTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--NMTSDTPASSSptNMTSDTPASSSPTN 592
Cdd:PRK10905 183 PKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTagNVGSLKSAPSS--HYTLQLSSSSNYDN 260
|
.
gi 115583681 593 M 593
Cdd:PRK10905 261 L 261
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
414-665 |
3.49e-09 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 62.20 E-value: 3.49e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 414 ATLVSSSPPQVTSETPASSSPTQVTsETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdT 493
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAA-PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP-A 449
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPqvtsdTPASSSPPQVTS-ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK12323 450 PAPAPAA-----APAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 573 ASSSPTNMTSDTPASSSptnmtsdTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHP------QM 646
Cdd:PRK12323 525 SIPDPATADPDDAFETL-------APAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPvrglaqQL 597
|
250
....*....|....*....
gi 115583681 647 SFQSSTSQALDETAGERVP 665
Cdd:PRK12323 598 ARQSELAGVEGDTVRLRVP 616
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
347-676 |
3.92e-09 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 62.32 E-value: 3.92e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQGTLDTPSSSSPPQGTS------DTPassSPPQGTSE-TPAS----NSP-PQGT----SETPgfSSPPQ 410
Cdd:TIGR00927 76 SSDPPKSSSEMEGEMLAPQATVGRDEATpsiameNTP---SPPRRTAKiTPTTpknnYSPtAAGTervkEDTP--ATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 411 VTTATLVSSSPPQVTSETPA------SSSPTQVTSE----TPaSSSPTQVTSDTPAS-NSPPQGTSDTPGFSSPTQVTTA 479
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKvrkyTP-SPLGRMVNSYAPSTfMTMPRSHGITPRTTVKDSEITA 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 480 T---LVSSSPPQV---TSDTP----ASSSPPQVTSDTPAS--SSPPQVTsETPASSSPPQVTSDTSA---------SISP 538
Cdd:TIGR00927 230 TykmLETNPSKRTagkTTPTPlkgmTDNTPTFLTREVETDllTSPRSVV-EKNTLTTPRRVESNSSTnhwglvgknNLTT 308
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 539 PQ--VISDTPASSSPpQVTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTsdtpaSSSPPWPVITEVTRPES 616
Cdd:TIGR00927 309 PQgtVLEHTPATSEG-QVTISIMTGSSPA----ETKASTAAWKIRNPLSRTSAPAVRI-----ASATFRGLEKNPSTAPS 378
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 115583681 617 TIPAGRSLANITSKAQE---DSPLGVISTHPQMSFQSST-SQALDETAGERVPTIPDFQAHSEF 676
Cdd:TIGR00927 379 TPATPRVRAVLTTQVHHcvvVKPAPAVPTTPSPSLTTALfPEAPSPSPSALPPGQPDLHPKAEY 442
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
464-622 |
8.22e-09 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 58.38 E-value: 8.22e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 464 TSDTPGFSSPTQVTTATlvsssppQVTSDTPASSSPPQVTSDTPASSSPPqVTSETPASSSPPQVTSdTSASISPPQVIS 543
Cdd:PHA03255 25 TSSGSSTASAGNVTGTT-------AVTTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 544 DTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGR 622
Cdd:PHA03255 96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
437-658 |
8.92e-09 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 59.10 E-value: 8.92e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 437 VTSETPAS--SSPTQVTSDTPASNSPPQGTSDTPG----------FSSPTQVTTATLVSSSPPQVTSDTPASSSP-PQVT 503
Cdd:PHA02682 17 VLADTSSSlfTKCPQATIPAPAAPCPPDADVDPLDkysvkeagryYQSRLKANSACMQRPSGQSPLAPSPACAAPaPACP 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 SDTPASSSPpQVTSETPASSSPPQvtsdTSASISPPQVISDT--PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:PHA02682 97 ACAPAAPAP-AVTCPAPAPACPPA----TAPTCPPPAVCPAParPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHNQL 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 582 S--DTPASSSPTNMTSdtPASSsppwPVItEVTRPESTIPAGRS--------LANITSKAQEDSPLGVISTHPQMSFQSS 651
Cdd:PHA02682 172 PppDYPAASCPTIETA--PAAS----PVL-EPRIPDKIIDADNDdkdlikkeLADIADSVRDLNAESLSLTRDIENAKST 244
|
....*..
gi 115583681 652 TSQALDE 658
Cdd:PHA02682 245 TQAAIDD 251
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
425-588 |
1.01e-08 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 57.99 E-value: 1.01e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 425 TSETPASSSPTQVTSETpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS-DTPASSSPPQVT 503
Cdd:PHA03255 25 TSSGSSTASAGNVTGTT-AVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPvPTTSNASTINVT 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 SDTPASSSppqVTSETPASSSPPQVTSDTSasisppQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD 583
Cdd:PHA03255 104 TKVTAQNI---TATEAGTGTSTGVTSNVTT------RSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174
|
....*
gi 115583681 584 TPASS 588
Cdd:PHA03255 175 QPSLS 179
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
402-697 |
1.10e-08 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 60.57 E-value: 1.10e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 402 TPGFSSPpQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPgFSSPTQVTTAT- 480
Cdd:PRK08581 17 LPTLTSP-TAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKD-----TDKADNNNTSNQDNNDKK-FSTIDSSTSDSn 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 481 ---------LVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS-ETPASSS-PPQVTSDTSAS-ISPPQvisdTPAS 548
Cdd:PRK08581 90 niidfiyknLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDyEQPRNSEkSTNDSNKNSDSsIKNDT----DTQS 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 549 SSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSppwpvitevtrpESTIPAGRSLANIT 628
Cdd:PRK08581 166 SKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSK------------DNQSMSDSALDSIL 233
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 629 SKAQEDsplgviSTHPQMSFQSSTSQALDETAGERVPTIPdfqAHSEFQKACAILQRLRDFLPTSPTSA 697
Cdd:PRK08581 234 DQYSED------AKKTQKDYASQSKKDKTETSNTKNPQLP---TQDELKHKSKPAQSFENDVNQSNTRS 293
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
203-654 |
1.26e-08 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 60.45 E-value: 1.26e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQ---VTSATSASSSPPQGTSDTPASSSPP 279
Cdd:PHA03377 450 PERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPPSRRRRgacVVYDDDIIEVIDVETTEEEESVTQP 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 280 QVTSATSASSSPPQGTSD--------TPASSSPPQVTSATSASSSPPQGTSDTPASSspPQVTSATSASSSPPQGTSDTP 351
Cdd:PHA03377 530 AKPHRKVQDGFQRSGRRQkratppkvSPSDRGPPKASPPVMAPPSTGPRVMATPSTG--PRDMAPPSTGPRQQAKCKDGP 607
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQgtlDTPSSSSP-----------------PQGTSDTPAS------SSPPQGTSETPASNSPPQGTSETPGFSSP 408
Cdd:PHA03377 608 PASGPHE---KQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSfwemraGRDGSGIQQEPSSRRQPATQSTPPRPSWL 684
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 409 PQVTTATLVSSSPPQVTSETPASS-SPTQVTS--ETPASSSPTQVT--SDTPASNSPPQGTSDTPGFSSP--TQVTTATL 481
Cdd:PHA03377 685 PSVFVLPSVDAGRAQPSEESHLSSmSPTQPISheEQPRYEDPDDPLdlSLHPDQAPPPSHQAPYSGHEEPqaQQAPYPGY 764
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 482 VSSSPPQV--------------TSDTPASSSPPQVTSDTPASSSPPQVTSETPAsSSPPQVTSDTSASISPPQviSDTPA 547
Cdd:PHA03377 765 WEPRPPQApylgyqepqaqgvqVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPG-HGHPQGPWAPRPPHLPPQ--WDGSA 841
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 548 SSSPPQVTSETPASSSPTNMTSDTpaSSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPvitevTR-PESTIPAGRSLAn 626
Cdd:PHA03377 842 GHGQDQVSQFPHLQSETGPPRLQL--SQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIP-----TRfPPPPMPLQDSMA- 913
|
490 500
....*....|....*....|....*...
gi 115583681 627 itskAQEDSPLgviSTHPQMSFQSSTSQ 654
Cdd:PHA03377 914 ----VGCDSSG---TACPSMPFASDYSQ 934
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
203-502 |
1.56e-08 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 59.59 E-value: 1.56e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 282
Cdd:pfam17823 153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVG 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 283 SATSASssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQgtld 362
Cdd:pfam17823 233 NSSPAA-----GTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDP-HARRLSPAKHMPSDTMARNPAAPMGAQ---- 302
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 363 tpSSSSPPQGTSDTPASSSPPQGTSEtpasnspPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPASSSPTQVTSETP 442
Cdd:pfam17823 303 --AQGPIIQVSTDQPVHNTAGEPTPS-------PSNTTLEPNTPKSVASTNLAVVTTTKAQ-AKEPSASPVPVLHTSMIP 372
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 443 --ASSSPTQVTSDTPasnsPPQGTSDTPGFSSPTQVTT-ATLVSSSppqvTSDTPASSSPPQV 502
Cdd:pfam17823 373 evEATSPTTQPSPLL----PTQGAAGPGILLAPEQVATeATAGTAS----AGPTPRSSGDPKT 427
|
|
| PLAT_plant_stress |
cd01754 |
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ... |
1131-1235 |
1.58e-08 |
|
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight-stranded beta-barrel and its proposed function is to mediate interaction with lipids or membrane-bound proteins.
Pssm-ID: 238852 Cd Length: 129 Bit Score: 55.24 E-value: 1.58e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1131 YLIQVYTGYRRRAATTAKVVITLYGSEGH-------SEPHHLCDPEKTVFERGALDVF------LLSTGSWLgdlhglRL 1197
Cdd:cd01754 3 YTIYVQTGSIWKAGTDSRISLQIYDADGPglrianlEAWGGLMGAGHDYFERGNLDRFsgrgpcLPSPPCWM------NL 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 115583681 1198 WHDNSGDSPSWYVSQVIVsdmtTRKKWHFQCNC-------WLAVD 1235
Cdd:cd01754 77 TSDGTGNHPGWYVNYVEV----TQAGQHAPCMQhlfaveqWLATD 117
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
382-577 |
1.59e-08 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 60.25 E-value: 1.59e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 382 PPQGTSETPASNSPPQGTSETPGfsSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTSDTPASNSPP 461
Cdd:PRK07003 360 PAVTGGGAPGGGVPARVAGAVPA--PGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 462 Q-GTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE-TPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003 434 AtADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEpAPRAAAPSAATPAAVPDARAP 513
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 115583681 540 QVIS--DTPASSSPPqvtseTPASSSPTnmtsdtPASSSP 577
Cdd:PRK07003 514 AAASreDAPAAAAPP-----APEARPPT------PAAAAP 542
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
269-487 |
1.84e-08 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 59.38 E-value: 1.84e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 269 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSasssppqGTS 348
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTA-------ATS 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 DTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPqgtseTPGFSSPPQVTTATLVSSSP-PQVTSE 427
Cdd:COG3469 74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGS-----VTSTTSSTAGSTTTSGASATsSAGSTT 148
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF-----SSPTQVTTATLVSSSPP 487
Cdd:COG3469 149 TTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASgattpSATTTATTTGPPTPGLP 213
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
167-654 |
3.38e-08 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 59.30 E-value: 3.38e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 167 KRRGAETERHMIP-----GNGPPLAMCHQPAPPElfetlcfpIDPASSAPPKATHRMTITSLTGRPQVTSdtLASSSPPQ 241
Cdd:PHA03377 541 QRSGRRQKRATPPkvspsDRGPPKASPPVMAPPS--------TGPRVMATPSTGPRDMAPPSTGPRQQAK--CKDGPPAS 610
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 242 GTSD-TPASSSP----PQVTSATSASSSPPQGTSDTP-------ASSSPPQVTSATSASSSPPQgTSDTPASSSPPQVTS 309
Cdd:PHA03377 611 GPHEkQPPSSAPrdmaPSVVRMFLRERLLEQSTGPKPksfwemrAGRDGSGIQQEPSSRRQPAT-QSTPPRPSWLPSVFV 689
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 310 ATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLdtPSSSSPPQGTSDTPASSSPPQGTSET 389
Cdd:PHA03377 690 LPSVDAGRAQPSEESHLSSMSPTQPISHEEQPRYEDPDDPLDLSLHPDQAPP--PSHQAPYSGHEEPQAQQAPYPGYWEP 767
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 390 PASNSPPQGTSETPGfssppqvtTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PHA03377 768 RPPQAPYLGYQEPQA--------QGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDG 839
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 470 FSSPTQvttaTLVSSSPPqVTSDTpassSPPqvtsdTPASSSPPQVT-SETPASSSPPqvtsdtsasisppqvisdtPAS 548
Cdd:PHA03377 840 SAGHGQ----DQVSQFPH-LQSET----GPP-----RLQLSQVPQLPySQTLVSSSAP-------------------SWS 886
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 549 SSPPqvtsETPASSSPTNM-TSDTPASSSPTnMTSDTPASSSPtNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANI 627
Cdd:PHA03377 887 SPQP----RAPIRPIPTRFpPPPMPLQDSMA-VGCDSSGTACP-SMPFASDYSQGAFTPLDINAQTPKRPRVEESSHGPA 960
|
490 500
....*....|....*....|....*...
gi 115583681 628 TSKAQEDSPLGVISTHPQMS-FQSSTSQ 654
Cdd:PHA03377 961 RCSQATTEAQEILSDNSEISvFPKDAKQ 988
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
413-575 |
4.45e-08 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 56.07 E-value: 4.45e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 413 TATLVSSSPPQVTSETPASSSpTQVTSETPASSSPTQVTSDTPASNSPPQGTsdTPGFSSPTQVTTATLVSSSPPQVTSD 492
Cdd:PHA03255 20 TSLIWTSSGSSTASAGNVTGT-TAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTSN 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 493 TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PHA03255 97 ASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQP 176
|
...
gi 115583681 573 ASS 575
Cdd:PHA03255 177 SLS 179
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
443-595 |
4.53e-08 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 55.35 E-value: 4.53e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 443 ASSSPTQVTSDTPASNSppqgtSDTPGFSSPTQVTTATLVSSSPPqvtsdTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:pfam09595 31 ASLILIGESNKEAALII-----TDIIDININKQHPEQEHHENPPL-----NEAAKEAPSESEDAPDIDPNNQHPSQDRSE 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 523 SSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS----PTNMTSDTPAS----SSPTNMTSDTPASSSPTNMT 594
Cdd:pfam09595 101 APPLEPAAKTKPSEHEPANPPDASNRLSPPDASTAAIREARtfrkPSTGKRNNPSSaqsdQSPPRANHEAIGRANPFAMS 180
|
.
gi 115583681 595 S 595
Cdd:pfam09595 181 S 181
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
406-569 |
4.93e-08 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 54.96 E-value: 4.93e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 406 SSPPQVTT--ATLVSSSPPQVTsetPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:pfam09595 33 LILIGESNkeAALIITDIIDIN---INKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAK 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPpqvTSDTPASssppqvTSDTPASSSPPQVTSETPASSsppqvTSDTSASISPPQVISDTPASSSPPQVTSETPASSS 563
Cdd:pfam09595 110 TKP---SEHEPAN------PPDASNRLSPPDASTAAIREA-----RTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRAN 175
|
....*.
gi 115583681 564 PTNMTS 569
Cdd:pfam09595 176 PFAMSS 181
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
366-517 |
6.19e-08 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 54.96 E-value: 6.19e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 366 SSSPPqGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:pfam09595 32 SLILI-GESNKEAALIITDIIDININKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKT 110
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 446 SPTQVT----SDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS 517
Cdd:pfam09595 111 KPSEHEpanpPDASNRLSPPDAST-----AAIREARTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
245-644 |
7.27e-08 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 57.81 E-value: 7.27e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 245 DTPASSSPPQVTSATSA------SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPP 318
Cdd:PRK14949 369 DDPAEISLPEGQTPSALaaavqaPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDD 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 319 QgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPA---SSSPPQGTSETPASNSP 395
Cdd:PRK14949 449 E-SELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVddtSASNNSAADNTVDDNYS 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 396 PQGTSETPG--------------------------FSSPPQVTTATLVSSSPPQVTSET-PASSSPTQVTSETPASSSP- 447
Cdd:PRK14949 528 AEDTLESNGldegdyaqdsapldayqddyvafsseSYNALSDDEQHSANVQSAQSAAEAqPSSQSLSPISAVTTAAASLa 607
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 448 ----------------TQVTSDTPASNSPPQGTSDTPGFSSPtqvttatlvsSSPPQVTSDTPASSSPPQVTSdtpASSS 511
Cdd:PRK14949 608 dddildavlaardsllSDLDALSPKEGDGKKSSADRKPKTPP----------SRAPPASLSKPASSPDASQTS---ASFD 674
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 512 PPQVTSETPASSSPPQVTSDTSASISPPQV-ISDTPASSSPPQVtseTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 590
Cdd:PRK14949 675 LDPDFELATHQSVPEAALASGSAPAPPPVPdPYDRPPWEEAPEV---ASANDGPNNAAEGNLSESVEDASNSELQAVEQQ 751
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 115583681 591 TNMTSDTPASSSPPwpviTEVTRPESTIPAGRSLANITSKaqedSPLGVISTHP 644
Cdd:PRK14949 752 ATHQPQVQAEAQSP----ASTTALTQTSSEVQDTELNLVL----LSSGSITGHP 797
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
349-662 |
1.06e-07 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 57.39 E-value: 1.06e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 DTPASSSPPQGtldTPSSSSPPQGTSDTPASSS--PPQGTSETPASNSPPQGTSET-----PGFS---SPPQVTTATLVS 418
Cdd:PTZ00449 503 DSDKHDEPPEG---PEASGLPPKAPGDKEGEEGehEDSKESDEPKEGGKPGETKEGevgkkPGPAkehKPSKIPTLSKKP 579
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 419 SSP-----PQVTSETPASSSPtqVTSETPASssptqvtsdtPASNSPPQgTSDTPgfSSPTQVTTATLVSSSPPQVTSDT 493
Cdd:PTZ00449 580 EFPkdpkhPKDPEEPKKPKRP--RSAQRPTR----------PKSPKLPE-LLDIP--KSPKRPESPKSPKRPPPPQRPSS 644
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPQV--TSDTPASSSPPQVTS---------ETPASSSPPQVTS---DTSASISPPQVISDTPAS--SSPPQVTSE 557
Cdd:PTZ00449 645 PERPEGPKIikSPKPPKSPKPPFDPKfkekfyddyLDAAAKSKETKTTvvlDESFESILKETLPETPGTpfTTPRPLPPK 724
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 558 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMtSDTPASSSPPWPVITEVTRPESTIPAG---RSLANITSKAQ-E 633
Cdd:PTZ00449 725 LPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFF-HETPADTPLPDILAEEFKEEDIHAETGepdEAMKRPDSPSEhE 803
|
330 340
....*....|....*....|....*....
gi 115583681 634 DSPLGvisTHPQMSFQSSTSQALDETAGE 662
Cdd:PTZ00449 804 DKPPG---DHPSLPKKRHRLDGLALSTTD 829
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
346-569 |
1.17e-07 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 57.11 E-value: 1.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 346 GTSDTPASSSPPQGTlDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvt 425
Cdd:PRK08581 59 DTDKADNNNTSNQDN-NDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD-- 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SETPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSS-PTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK08581 136 YEQPRNSEKSTNDSNKNSDSSIKNDTDT---QSSKQDKADNQKAPSSnNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTA 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 505 DTPASSSPPQVTSETPASSSPPQVTSD------------------TSASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK08581 213 NQKSSSKDNQSMSDSALDSILDQYSEDakktqkdyasqskkdkteTSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTR 292
|
...
gi 115583681 567 MTS 569
Cdd:PRK08581 293 STS 295
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
352-606 |
1.76e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.87 E-value: 1.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSssPPQGTSDTPASSSPPQGTSETPAsnsPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PHA03247 243 VISHPLRGDIAAPAP--PPVVGEGADRAPETARGATGPPP---PPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPP 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 432 SSPTQVTSETPASSSPTQVTSDTPASNSP-PQGTSDT--PGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD--- 505
Cdd:PHA03247 318 PAPAGDAEEEDDEDGAMEVVSPLPRPRQHyPLGFPKRrrPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPfar 397
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPASSSPPQVTSETPASSSPPqvtsdtsasiSPPQVISDTPASSSPPQVTSETPASSSPTNmtsdTPASSSPTNMTSDT- 584
Cdd:PHA03247 398 GPGGDDQTRPAAPVPASVPTP----------APTPVPASAPPPPATPLPSAEPGSDDGPAP----PPERQPPAPATEPAp 463
|
250 260
....*....|....*....|..
gi 115583681 585 PASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03247 464 DDPDDATRKALDALRERRPPEP 485
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
409-615 |
1.93e-07 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 56.59 E-value: 1.93e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 409 PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT----PASNSPPQGTSDTPGFSSPTQVTTATLVSs 484
Cdd:PRK10811 848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEvveePVVVAEPQPEEVVVVETTHPEVIAAPVTE- 926
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 485 sPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP-QVISDTPASSSPPQVTSETPASSS 563
Cdd:PRK10811 927 -QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAaPVVAEVAAEVETVTAVEPEVAPAQ 1005
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 115583681 564 PTNMTSDTPASSSPtnMTSdTPAsssptnmtsdtPASSSPPwPVITEVTRPE 615
Cdd:PRK10811 1006 VPEATVEHNHATAP--MTR-APA-----------PEYVPEA-PRHSDWQRPT 1042
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
352-578 |
2.02e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 56.26 E-value: 2.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PRK08691 363 AASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAA---AMPSEGKTAGPVSNQENNDVPPWEDA 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 432 SSPTQvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpasSSPPQVTSDTPASSS 511
Cdd:PRK08691 440 PDEAQ-TAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPN----DEAVETETFAHEAPA 514
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 512 PPQVTSETPASSSPPQvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK08691 515 EPFYGYGFPDNDCPPE----DGAEIPPPDWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFST 577
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
240-589 |
2.11e-07 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 56.46 E-value: 2.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:NF033609 558 PEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA 637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:NF033609 638 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 717
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 400 SETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTT 478
Cdd:NF033609 718 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD 797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:NF033609 798 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNS 877
|
330 340 350
....*....|....*....|....*....|..
gi 115583681 559 PASSSPTNMTSDTPASSSPTNMT-SDTPASSS 589
Cdd:NF033609 878 PKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
349-601 |
2.72e-07 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 56.07 E-value: 2.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 DTPASSSPPqgtlDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:NF033609 540 DKPVVPEQP----DEPGEIEPiPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASD 615
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 428 TPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609 616 SDSASDSDSASDSDSASDSDSASDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 692
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPA 586
Cdd:NF033609 693 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDS 772
|
250
....*....|....*.
gi 115583681 587 -SSSPTNMTSDTPASS 601
Cdd:NF033609 773 dSDSDSDSDSDSDSDS 788
|
|
| DUF2967 |
pfam11179 |
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ... |
324-680 |
2.82e-07 |
|
Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.
Pssm-ID: 402654 [Multi-domain] Cd Length: 954 Bit Score: 56.20 E-value: 2.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 324 TPAS--SSPPQVTSATSASSSPPQGTSDTPAsSSPPQGTLDTPSSSSppQGTSDTPAS-SSPPQGTSETPASNSPPQGTS 400
Cdd:pfam11179 22 ALAGpiTAAPTGAAAAAATSTAAASAASSTI-TAPGAGPGGTPTSRS--RGAQAMTASlAHAAQGNANANKSTRNNSNSS 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 401 ETPGFSSPP-----QVTTATLVSSSPPQVTSETpASSSPTQVTSETPASSSP-TQVTSDTPASN------SPPQGTSDTP 468
Cdd:pfam11179 99 NNNGKPKPLaacymSTRSAAMMALALGQQSGEK-KDKKPAAGKAASPAQSQSqSQSQNASPHTNnravsmTRPAATRRLP 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 469 GFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVT----------------------SD-----TPASSSP--------- 512
Cdd:pfam11179 178 NAAAMSNVNAANSTCTATATSLPSNRARSKPSTPTatraaaqlngmgifsggsnssgSDndgfsASGSSAAtalrrlyfk 257
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 513 ----------PQVTSETPASSSPPQ-VTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:pfam11179 258 sgrsiknkinASTSSSTPLNGLPLNaVSNAFHNSVGGATAMHAMGTAGGVPKLVVMGTSSASIPDTTINTSTDSACTLIT 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 582 SDTPASSSPTNMTSDTPASSSPPWPVITEVTRPestipagrSLANITSKAQEDSPLGVISTHPQMSFQSSTSqaLDETAG 661
Cdd:pfam11179 338 NVTHTDTSETCDSLDLGDNSGPSEPLFSSLEEP--------LLTAIHIDSEHEGFGGMAGGRGGANGRGATE--LELTSC 407
|
410 420
....*....|....*....|.
gi 115583681 662 ERVPTIPD--FQAHSEFQKAC 680
Cdd:pfam11179 408 SRYPPRPDmnLQDSTESQESC 428
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
295-624 |
2.91e-07 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 56.16 E-value: 2.91e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 295 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPssSSPPQGTS 374
Cdd:TIGR00927 76 SSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTP--ATPSRALN 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSSPPQGTSETPA------SNSPPQGTSETPGFSSPPQvttATLVSSSPPQVTSETPASSSPTQVTS--------- 439
Cdd:TIGR00927 154 HYISTSGRQRVKSYTPKprgevkSSSPTQTREKVRKYTPSPL---GRMVNSYAPSTFMTMPRSHGITPRTTvkdseitat 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 440 ----ETPASSSPTQVTSDTP----ASNSPPQGTSD--TPGFSSPTQVTTATLVsSSPPQVTSDtpaSSSPPQVTSDTPAS 509
Cdd:TIGR00927 231 ykmlETNPSKRTAGKTTPTPlkgmTDNTPTFLTREveTDLLTSPRSVVEKNTL-TTPRRVESN---SSTNHWGLVGKNNL 306
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 510 SSPPQVTSETPASSSPPQVTSDTSASISPpqviSDTPASSSPPQVTSETPASSSPTNMTSDTP-----ASSSPTNMTSDT 584
Cdd:TIGR00927 307 TTPQGTVLEHTPATSEGQVTISIMTGSSP----AETKASTAAWKIRNPLSRTSAPAVRIASATfrgleKNPSTAPSTPAT 382
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 115583681 585 P-ASSSPTN------MTSDTPASSSPPWPVITEVTRPESTIPAGRSL 624
Cdd:TIGR00927 383 PrVRAVLTTqvhhcvVVKPAPAVPTTPSPSLTTALFPEAPSPSPSAL 429
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
490-669 |
3.01e-07 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 53.75 E-value: 3.01e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 490 TSDTPASSSPPQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPqvISDTPASSSPPQVTSETPASSSPTNMTS 569
Cdd:PHA03255 25 TSSGSSTASAGNVTGTTA-------VTTPSPSASGPSTNQSTTLTTTSAP--ITTTAILSTNTTTVTSTGTTVTPVPTTS 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 570 DTPASSSPTNMTSDTPASSSptnmtsdTPASSSPpwPVITEV-TRPESTIPAGRSLANITSKAQEDSplgvisthpqmsf 648
Cdd:PHA03255 96 NASTINVTTKVTAQNITATE-------AGTGTST--GVTSNVtTRSSSTTSATTRITNATTLAPTLS------------- 153
|
170 180
....*....|....*....|.
gi 115583681 649 qSSTSQALDETAGErVPTIPD 669
Cdd:PHA03255 154 -SKGTSNATKTTAE-LPTVPD 172
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
361-656 |
3.36e-07 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 55.52 E-value: 3.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 361 LDTPSSSSPPQgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQvTTATLVSSSPPQVTSETPASSSPTQVTSE 440
Cdd:COG5099 69 KITSSSSSRRK--PSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNK-SNSALSSTQQGNANSSVTLSSSTASSMFN 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 441 TPASSSPTQVTSDTPASNSPP--QGTSDTPGFSSPTQvttaTLVSSSPPQVTSDTpaSSSPPQVTSDTPASSSPpqvTSE 518
Cdd:COG5099 146 SNKLPLPNPNHSNSATTNQSGssFINTPASSSSQPLT----NLVVSSIKRFPYLT--SLSPFFNYLIDPSSDSA---TAS 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 519 TPASSSPPQVTSDTSASISPPQVISDTPASSSPpQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTP 598
Cdd:COG5099 217 ADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSV-ENNIILNSSSSINELTSIYGSVPSIRNLRGLNSALVSFLNVSSSSL 295
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 599 ASSSPPwpvITEVtrPESTIPAGRSLANITSK------AQEDSPLGVISTHPQMS-----FQSSTSQAL 656
Cdd:COG5099 296 AFSALN---GKEV--SPTGSPSTRSFARVLPKsspnnlLTEILTTGVNPPQSLPSllnpvFLSTSTGFS 359
|
|
| Caprin-1_C |
pfam12287 |
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ... |
416-651 |
3.40e-07 |
|
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C-terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C-terminal region of caprin-1 contains RGG motifs which are characteristic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.
Pssm-ID: 463522 [Multi-domain] Cd Length: 320 Bit Score: 54.41 E-value: 3.40e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 416 LVSSSPPQVTSETPASSSPtqvtsetPASssptqvtsdtPASNSPPQgtSDTPGFSSPTQVttatlvssspPQVTSDTPA 495
Cdd:pfam12287 30 IVSAQPPSQSPDLSQMVCP-------PAS----------PEQRLSQQ--SDVLQQPEQTQV----------SPVSPSSNA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 496 SSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPpqvisdtpasSSPPQVTSETPASSSPTNMTSdTPASS 575
Cdd:pfam12287 81 CASSGSEYQFHTSEPPQPEAIDPIQSSMSLPSELAPPSPPLSP----------ASQPQVFQSKPASSSGINVNA-APFQS 149
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 576 SPT--NMTSDTP-ASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHpqMSFQSS 651
Cdd:pfam12287 150 MQTvfNVNAPVPpRNEQELKESSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTVVGAYHPDGTIQVSNGH--LAFYPA 226
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
363-532 |
3.59e-07 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 53.37 E-value: 3.59e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 363 TPSSSSPPQGTSDTPASSSppqgTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PHA03255 25 TSSGSSTASAGNVTGTTAV----TTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTI 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 443 ASSspTQVTSDTPASNSppQGTSDTPGFSSptQVTTATlvSSSPPQVTSDTPASSSPPQVTSDTpaSSSPPQVTSETPAS 522
Cdd:PHA03255 101 NVT--TKVTAQNITATE--AGTGTSTGVTS--NVTTRS--SSTTSATTRITNATTLAPTLSSKG--TSNATKTTAELPTV 170
|
170
....*....|
gi 115583681 523 SSPPQVTSDT 532
Cdd:PHA03255 171 PDERQPSLSY 180
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
439-631 |
3.62e-07 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 55.69 E-value: 3.62e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 439 SETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQvtSDTPASSSPPQVTSE 518
Cdd:NF033609 33 SSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQ--QETTQSASTNATTEE 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 519 TPASSsppQVTSDTSASISPPQVI--SDTPASSSPPQVTSETpaSSSPTNMTSDTpasSSPTN--------MTSDTPASS 588
Cdd:NF033609 111 TPVTG---EATTTATNQANTPATTqsSNTNAEELVNQTSNET--TSNDTNTVSSV---NSPQNstnaenvsTTQDTSTEA 182
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 115583681 589 SPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:NF033609 183 TPSNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADA 225
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
473-606 |
3.78e-07 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 55.05 E-value: 3.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 473 PTQVTTAtlvsssppQVTSDTPASSSPPQVTSDTPASSSPPQ----VTSETPASSSPPQVTSDTSASISPPQVISDTPAS 548
Cdd:pfam05539 169 KTAVTTS--------KTTSWPTEVSHPTYPSQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQ 240
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 549 SSPPQVTSETPASSSPtnmTSDTPASSSPTNMTSDTPASSSPTNmtsdTPASSSPPWP 606
Cdd:pfam05539 241 RGPSGSPQHPPSTTSQ---DQSTTGDGQEHTQRRKTPPATSNRR----SPHSTATPPP 291
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
270-604 |
4.11e-07 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 55.69 E-value: 4.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 270 SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtsatsasssppqgtSDTPASSSPPQVTSATSASSSPPQGTS 348
Cdd:NF033609 561 SDSdPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSA------------------SDSDSASDSDSASDSDSASDSDSASDS 622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 DTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609 623 DSASDSDSASDS----DSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 698
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSSPTQVTSDTpASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpaSSSPPQVTSDTPA 508
Cdd:NF033609 699 DSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 774
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 509 SSSPPQVTSETPASSSPPQVTSDT-SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:NF033609 775 DSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 854
|
330 340
....*....|....*....|.
gi 115583681 588 SSPTNMTSDTPASSS----PP 604
Cdd:NF033609 855 DSESDSNSDSESGSNnnvvPP 875
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
347-465 |
5.20e-07 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 54.34 E-value: 5.20e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQGTldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVS--SSPPQV 424
Cdd:PRK12799 296 HGTVPVAAVTPSSA--VTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVAlpAAEPVN 373
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 425 TSETPASSSPTQVTSE---TPASSSPTQVTSDTPASNSPPQGTS 465
Cdd:PRK12799 374 MQPQPMSTTETQQSSTgniTSTANGPTTSLPAAPASNIPVSPTS 417
|
|
| PRK13914 |
PRK13914 |
invasion associated endopeptidase; |
417-602 |
5.75e-07 |
|
invasion associated endopeptidase;
Pssm-ID: 237555 [Multi-domain] Cd Length: 481 Bit Score: 54.42 E-value: 5.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 417 VSSSPPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATL--------------- 481
Cdd:PRK13914 143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVksgdtiwalsvkygv 219
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 482 ----------VSSSPPQVTSD----TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA 547
Cdd:PRK13914 220 svqdimswnnLSSSSIYVGQKlaikQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPT 299
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 548 SSS----PPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSS 602
Cdd:PRK13914 300 EAAkpapAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSS 358
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
423-582 |
5.79e-07 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 54.71 E-value: 5.79e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN------SPPQGTSDTPG-FSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PLN02217 508 EVQNTGPGAAITKRVTWPGIKKLSDEEILKFTPAQYiqgdawIPGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFSSD 587
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 496 SSSppqvTSDTPASSSPPQVTSETPASSSpPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNM-TSDTPAS 574
Cdd:PLN02217 588 SPS----TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIkVASTESS 662
|
....*...
gi 115583681 575 SSPTNMTS 582
Cdd:PLN02217 663 VSMVSMST 670
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
347-511 |
6.59e-07 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 52.60 E-value: 6.59e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSppqgtldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:PHA03255 25 TSSGSSTAS-------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTS 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSD--TPASSSPPQVTS 504
Cdd:PHA03255 96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTT---SATTRITNATTLAPTLSSKGTSnaTKTTAELPTVPD 172
|
....*..
gi 115583681 505 DTPASSS 511
Cdd:PHA03255 173 ERQPSLS 179
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
477-607 |
7.81e-07 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 53.95 E-value: 7.81e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 477 TTATLVSSSPPqvTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799 296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 557 ETPASSSPTNMTSDTPASSSPTNMtsdTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PRK12799 369 AEPVNMQPQPMSTTETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPT 416
|
|
| GPS |
pfam01825 |
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ... |
1020-1058 |
8.50e-07 |
|
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.
Pssm-ID: 460350 Cd Length: 44 Bit Score: 47.30 E-value: 8.50e-07
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 115583681 1020 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1058
Cdd:pfam01825 2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
|
|
| SP2_N |
cd22540 |
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ... |
351-619 |
8.51e-07 |
|
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. 
The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.
Pssm-ID: 411776 [Multi-domain] Cd Length: 511 Bit Score: 54.16 E-value: 8.51e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 351 PASSSPPQGTLDTPSSSS------PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSS---- 420
Cdd:cd22540 66 PLPLGPGKNSIGFLSAKGniiqlqGSQLSSSAPGGQQVFAIQNPTMIIKGSQTRSSTNQQYQISPQIQAAGQINNSgqiq 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 421 -----------PPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASN-------SPPQGTSDTP-----GFSSPTQVT 477
Cdd:cd22540 146 iipgtnqaiitPVQVLQQPQQAHKPVPIK---PAPLQTSNTNSASLQVPgnviklqSGGNVALTLPvnnlvGTQDGATQL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 478 TATLVSSSPPQVTSDTPASSSPPQVTS--------------------------DTPASSSPPQV--------TSETPASS 523
Cdd:cd22540 223 QLAAAPSKPSKKIRKKSAQAAQPAVTVaeqvetvliettadniiqagnnllivQSPGTGQPAVLqqvqvlqpKQEQQVVQ 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 524 SPP------QVTSDTSASI--SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:cd22540 303 IPQqalrvvQAASATLPTVpqKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGT 382
|
330 340
....*....|....*....|....
gi 115583681 596 DTPASSSPpwpvitevTRPESTIP 619
Cdd:cd22540 383 GTSKPNYN--------VRKERTLP 398
|
|
| PAP1 |
pfam08601 |
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene ... |
346-536 |
8.61e-07 |
|
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene transcription in response to H2O2. This region is cysteine-rich. Alkylation of cysteine residues following treatment with a cysteine-alkylating agent can mask the accessibility of the nuclear exporter Crm1, triggering nuclear accumulation and Pap1-dependent transcriptional expression.
Pssm-ID: 369990 Cd Length: 363 Bit Score: 53.71 E-value: 8.61e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 346 GTSDTPASSSPPQGTLDTPSSSSPPqgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:pfam08601 31 QLSKAKQNTAKPGVRSDSRSPSPNA---STSTPDSQPPPSASSSTTPNQGSNGLNAFTGEDNNNYSNSAANPGATRGSTA 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SETPASSSPTQVTSETPASS-SPTQvTSDTPASNSPPQGTSDTPGFSSPTQ--VTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:pfam08601 108 SSARSQSSPYSFGSGTSTSSdSPSS-SSSSHQGQLSSCGTSPEPSTQSPGGqkSVETMIGEEQCAHGTIDGEKSFCAKLG 186
|
170 180 190
....*....|....*....|....*....|....
gi 115583681 503 TSDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:pfam08601 187 MACGNINNPIPAAMSKSNSLSNTPGHASNDSNGL 220
|
|
| CLECT |
cd00037 |
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ... |
37-142 |
9.33e-07 |
|
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. 
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.
Pssm-ID: 153057 [Multi-domain] Cd Length: 116 Bit Score: 49.54 E-value: 9.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037 1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037 81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
377-665 |
1.18e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.84 E-value: 1.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 377 PASSSPPQGTS---ETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPtQVTSETPASSSPTQVTSD 453
Cdd:PRK07764 365 PSASDDERGLLarlERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAPQPAPAPAPAPAPPS 443
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 454 TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQ------------------- 514
Cdd:PRK07764 444 PAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAgaddaatlrerwpeilaav 523
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 515 ---VTSETPASSSPPQVTS----------DTSAS---ISPPQ---------------------VISDTPASSSPPQVTSE 557
Cdd:PRK07764 524 pkrSRKTWAILLPEATVLGvrgdtlvlgfSTGGLarrFASPGnaevlvtalaeelggdwqveaVVGPAPGAAGGEGPPAP 603
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 558 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPL 637
Cdd:PRK07764 604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
|
330 340
....*....|....*....|....*...
gi 115583681 638 GVISTHPQMSFQSSTSQALDETAGERVP 665
Cdd:PRK07764 684 APAPAAPAAPAGAAPAQPAPAPAATPPA 711
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
373-559 |
1.23e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 51.83 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 373 TSDTPASSSPpQGTSETPASNSPpqgtseTPGFSSPPQVTTATLVSSSPPqVTSETPASSSPTQVTSeTPASSSPTQVTS 452
Cdd:PHA03255 25 TSSGSSTASA-GNVTGTTAVTTP------SPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 453 DTpasnsppqgtsdtpgfSSPTQVTTATLVSSSppqvTSDTPASSSPPQVTSDTPASSSPPQVTSE-TPASSSPPQVTSD 531
Cdd:PHA03255 96 NA----------------STINVTTKVTAQNIT----ATEAGTGTSTGVTSNVTTRSSSTTSATTRiTNATTLAPTLSSK 155
|
170 180
....*....|....*....|....*...
gi 115583681 532 TSASISPPQVISDTPASSSPPQVTSETP 559
Cdd:PHA03255 156 GTSNATKTTAELPTVPDERQPSLSYGLP 183
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
429-713 |
1.55e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 1.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtsdTPGFSSPTQVTTATLVSSSPP----------QVTSDTPASSS 498
Cdd:PHA03247 2478 PVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPP-----APSRLAPAILPDEPVGEPVHPrmltwirgleELASDDAGDPP 2552
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 499 PPqvtsdtpassSPPQVTSETPASSSPPqvtSDTSASISPPQVISDTPASSSPPQvtSETPasSSPTNMTSDTPASSSPt 578
Cdd:PHA03247 2553 PP----------LPPAAPPAAPDRSVPP---PRPAPRPSEPAVTSRARRPDAPPQ--SARP--RAPVDDRGDPRGPAPP- 2614
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 579 nmtsdtpaSSSPtnmtsdtPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQALDE 658
Cdd:PHA03247 2615 --------SPLP-------PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 659 TAGERVPTIPDfqahsefqkACAILQRLRDFLPTSPTSAQKNNSWSSQTPAVSCP 713
Cdd:PHA03247 2680 PQRPRRRAARP---------TVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGP 2725
|
|
| Caprin-1_C |
pfam12287 |
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ... |
362-498 |
1.57e-06 |
|
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C-terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C-terminal region of caprin-1 contains RGG motifs which are characteristic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.
Pssm-ID: 463522 [Multi-domain] Cd Length: 320 Bit Score: 52.49 E-value: 1.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 362 DTPS-----SSSPPQGTSDTPASSSPPQgtsetpasnSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQ 436
Cdd:pfam12287 23 DKPSdsaivSAQPPSQSPDLSQMVCPPA---------SPEQRLSQQSDVLQQPEQTQVSPVSPSSNACASSGSEYQFHTS 93
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 437 VTSETPASSSPtqvtsdtPASNSPPqgtsdtpgfSSPTQvTTATLVSSSPPQVTSDTPASSS 498
Cdd:pfam12287 94 EPPQPEAIDPI-------QSSMSLP---------SELAP-PSPPLSPASQPQVFQSKPASSS 138
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
410-632 |
1.70e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.53 E-value: 1.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 410 QVTTATLVSSSPPQVTSetpASSSPTQVTSETpasssptQVTSDTPASNSPPQGTS-DTPGfssPTQVTTATLVSSSPPQ 488
Cdd:PHA03378 518 QRVMATLLPPSPPQPRA---GRRAPCVYTEDL-------DIESDEPASTEPVHDQLlPAPG---LGPLQIQPLTSPTTSQ 584
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP---QVIS-DTPASSSPPQVTSETPASSSP 564
Cdd:PHA03378 585 LASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmQPITfNVLVFPTPHQPPQVEITPYKP 664
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 565 T-NMTSDTPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQ 632
Cdd:PHA03378 665 TwTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR 735
|
|
| ARG80 |
COG5068 |
Regulator of arginine metabolism and related MADS box-containing transcription factors ... |
347-604 |
2.48e-06 |
|
Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];
Pssm-ID: 227400 [Multi-domain] Cd Length: 412 Bit Score: 52.33 E-value: 2.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQGTLDTPSSSSpPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpgfSSPPQVTTATLVSSSPPQVts 426
Cdd:COG5068 163 PSDSSEEPSSSASFSVDPNDNN-PMGSFQHNGSPQTNFIPLQNPQTQQYQQH-------SSRKDHPTVPHSNTNNGRP-- 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 etPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQG-TSDTPGFSSPTQVTTATLVsSSPPQVTSDTPASSSPPQVTSD 505
Cdd:COG5068 233 --PAKFMIPELHSSHSTLDLPSDFISD---SGFPNQSsTSIFPLDSAIIQITPPHLP-NNPPQENRHELYSNDSSMVSET 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:COG5068 307 PPPKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLHTEASTAPSSTIPADP 386
|
250
....*....|....*....
gi 115583681 586 ASSSPTNMTSDTPASSSPP 604
Cdd:COG5068 387 LKNAAQTNSGTRNNNFSDN 405
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
408-739 |
2.62e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 52.75 E-value: 2.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 408 PPQVTTATLVSSSPPQVTSE----TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:PHA03377 399 PVQQRPVMFVSRVPWRKPRTlpwpTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGPSDQPSVPVEPAHLTPVEHTTVIL 478
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPPQVTSDTPASSSPPQV---------------------TSDTPASSSPPQVTSETPASSSppQVTSDTSASISPPQVI 542
Cdd:PHA03377 479 HQPPQSPPTVAIKPAPPPSrrrrgacvvydddiievidveTTEEEESVTQPAKPHRKVQDGF--QRSGRRQKRATPPKVS 556
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 543 -SDT-PASSSPPQV---TSETPASSSPTNMTSDT-PASSSPTNMT--SDTPASSSPTNMtsdTPASSSPPW---PVITEV 611
Cdd:PHA03377 557 pSDRgPPKASPPVMappSTGPRVMATPSTGPRDMaPPSTGPRQQAkcKDGPPASGPHEK---QPPSSAPRDmapSVVRMF 633
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 612 TR----PESTIPAGRSLANItsKAQEDSPLgviSTHPQMSFQSSTSQALDETAgERVPTIPDFQAHSEFQKACAILQRLR 687
Cdd:PHA03377 634 LRerllEQSTGPKPKSFWEM--RAGRDGSG---IQQEPSSRRQPATQSTPPRP-SWLPSVFVLPSVDAGRAQPSEESHLS 707
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 115583681 688 DFLPTSPTSAQKNNSWSSQTPAVSCPFQPLgrltTTEKSSHQMAQQDME--QHP 739
Cdd:PHA03377 708 SMSPTQPISHEEQPRYEDPDDPLDLSLHPD----QAPPPSHQAPYSGHEepQAQ 757
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
353-499 |
2.69e-06 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 52.09 E-value: 2.69e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSS---PPQVTTATLVSSSPPQVT--SE 427
Cdd:pfam13254 210 RSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPPPKTkelPKDSEEPAAPSKSAEASTekKE 289
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 115583681 428 TPASSSP--TQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSsppQVTSDTPASSSP 499
Cdd:pfam13254 290 PDTESSPetSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPKDFRANLRSR---EVPKDKSKKDEP 360
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
496-577 |
2.84e-06 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 51.57 E-value: 2.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 496 SSSPPQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK10856 168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
|
....
gi 115583681 574 SSSP 577
Cdd:PRK10856 248 AADP 251
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
356-586 |
3.14e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 52.57 E-value: 3.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 356 PPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12323 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 436 QVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12323 445 GGAPAPAPAPAAAPAAAARPAA-AGPRPVAAAAAAAPARAAPAAA----PAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 516 TSETPASSSPPQVTSDTSASISPPQvisdtPASSSPPQVTSETPASSSPTnmTSDTPASSSPTNMTSDTPA 586
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPA-----PAAAPAPRAAAATEPVVAPR--PPRASASGLPDMFDGDWPA 583
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
211-586 |
3.63e-06 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 51.99 E-value: 3.63e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 211 PPKATHRMTITSLTgRPQVTSDTLASSSPPQGTSDT----PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATS 286
Cdd:COG5180 187 EPRDALKDSPEKLD-RPKVEVKDEAQEEPPDLTGGAdhprPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERR 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 287 ASSSPpqgtsDTPASSSPPQvtsatSASSSPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSS 366
Cdd:COG5180 266 RAAIG-----DTPAAEPPGL-----PVLEAGSEPQSDAPEAETARP--------------IDVKGVASAPPATRPVRPPG 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 367 SSPPQGTSDTPASSSPPQGTSETpASNSPPQGTSETPGFSSPPqvtTATLVSSSPpqvtsetPASSSPTQVTSETPASSS 446
Cdd:COG5180 322 GARDPGTPRPGQPTERPAGVPEA-ASDAGQPPSAYPPAEEAVP---GKPLEQGAP-------RPGSSGGDGAPFQPPNGA 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 447 PtQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAtlVSSSPPQvtsdTPASSSPPQVTSDTPASS----SPPQVTSETPAS 522
Cdd:COG5180 391 P-QPGLGRRGAPGPPMGAGDLVQAALDGGGRET--ASLGGAA----GGAGQGPKADFVPGDAESvsgpAGLADQAGAAAS 463
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 523 SSP-PQVTSDTSASISPPQVISDTPASSSPPqvTSETPASSSPtnmTSDTPASSSPTNMTSDTPA 586
Cdd:COG5180 464 TAMaDFVAPVTDATPVDVADVLGVRPDAILG--GNVAPASGLD---AETRIIEAEGAPATEDFVA 523
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
190-576 |
3.68e-06 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 52.22 E-value: 3.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 190 QPAPPELFEtlcfPIDPASSAPPKathrmtitSLTGRPQVTSDTlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT 269
Cdd:NF033609 547 QPDEPGEIE----PIPEDSDSDPG--------SDSGSDSSNSDS-GSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSA 613
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSD 349
Cdd:NF033609 614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 350 TPASSSPpqgtlDTPS-SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609 694 SDSDSDS-----DSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 768
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTP 507
Cdd:NF033609 769 DSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-----SDSD 843
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisDTPASSSPPQVTSETPASSSPTNMT-SDTPASSS 576
Cdd:NF033609 844 SDSDSDSDSDSDSESDSNSDSESGSNNNVVPP----NSPKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
|
|
| DUF2967 |
pfam11179 |
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ... |
483-754 |
3.85e-06 |
|
Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.
Pssm-ID: 402654 [Multi-domain] Cd Length: 954 Bit Score: 52.34 E-value: 3.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 483 SSSPPQVTSDTPASSsPPQVTSDTPASSSPPQVTSET--PASSSPPQVTSDTSASISPPQVISDTPAS---SSPPQVTSE 557
Cdd:pfam11179 15 SSAPPHAALAGPITA-APTGAAAAAATSTAAASAASStiTAPGAGPGGTPTSRSRGAQAMTASLAHAAqgnANANKSTRN 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 558 TPASSSPTNMTSDTPASSSPT----------NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANI 627
Cdd:pfam11179 94 NSNSSNNNGKPKPLAACYMSTrsaammalalGQQSGEKKDKKPAAGKAASPAQSQSQSQSQNASPHTNNRAVSMTRPAAT 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 628 TSKAqedSPLGVISTHPQMSFQSSTSQALDETAGERVPTIPdfqahsEFQKACAILQRLRDFLPTSPTSAQKNNSWSSQT 707
Cdd:pfam11179 174 RRLP---NAAAMSNVNAANSTCTATATSLPSNRARSKPSTP------TATRAAAQLNGMGIFSGGSNSSGSDNDGFSASG 244
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 708 PAVSCPFQPL----GRLTTTEKSSHQMAQQDMEQHPMDGAHNAFGISAGGS 754
Cdd:pfam11179 245 SSAATALRRLyfksGRSIKNKINASTSSSTPLNGLPLNAVSNAFHNSVGGA 295
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
239-551 |
4.18e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 52.16 E-value: 4.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 239 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassspp 318
Cdd:PRK07003 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAP------------- 426
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 319 qgtsdtPASSSPPQvtsatsasssPPQGTSDTPASSSPPQGTLDTPSS-SSPPQGTSDTPASSSPPQGtseTPASNSPPQ 397
Cdd:PRK07003 427 ------PAAPAPPA----------TADRGDDAADGDAPVPAKANARASaDSRCDERDAQPPADSGSAS---APASDAPPD 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 398 GTSEtpgfSSPPqvttATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQV- 476
Cdd:PRK07003 488 AAFE----PAPR----AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPA---ARAGGAAAALDVl 556
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 477 TTATLVSSSPPQVTSDTPASSSPPQVTSDTPAsssPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSP 551
Cdd:PRK07003 557 RNAGMRVSSDRGARAAAAAKPAAAPAAAPKPA---APRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
319-547 |
4.26e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 52.19 E-value: 4.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 319 QGTSDTPASSSPPQVTSATSASSSPPQGTSdTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG 398
Cdd:PRK12323 368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAP-APAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 399 TSETPgfSSPPQVTTATlvsSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPtqvTT 478
Cdd:PRK12323 447 APAPA--PAPAAAPAAA---ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDA---AP 518
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqvtsDTSASISPPQVISDTPA 547
Cdd:PRK12323 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP----RASASGLPDMFDGDWPA 583
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
363-587 |
4.48e-06 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 51.43 E-value: 4.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 363 TPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:COG5651 165 TPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANL-----GLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAG 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 443 ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:COG5651 240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAA 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 523 SSPP-QVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:COG5651 320 GATGaGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
485-636 |
4.87e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 52.02 E-value: 4.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE---TPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPAS 561
Cdd:PRK08691 379 SPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAaamPSEGKTAGPVSNQENNDVPPWE---DAPDEAQTAAGTAQTSAK 455
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 562 SSPTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSP-PWPVITEVTRPESTIPAgrslANITSKAQEDSP 636
Cdd:PRK08691 456 SIQTASEAETPPENQVSkNKAADNETDAPLSEVPSENPIQATPnDEAVETETFAHEAPAEP----FYGYGFPDNDCP 528
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
211-606 |
6.07e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.60 E-value: 6.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 211 PP---KATHRMTITSLTGRPQVTSDTLASssPPQGTSDTPassSPPQVTSATSASSSPPQGTSDTPASS-SPPQVTSATS 286
Cdd:PHA03379 379 PPiflRRLHRLLLMRAGKLTERAREALEK--ASEPTYGTP---RPPVEKPRPEVPQSLETATSHGSAQVpEPPPVHDLEP 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 287 ASSSPPQGTSDTPASSSPPqvtsatsasssppqgtsdtpassSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS 366
Cdd:PHA03379 454 GPLHDQHSMAPCPVAQLPP-----------------------GPLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWE 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 367 SSPPQGTSDTPASSSP----------PQGTSETPASNSPP----QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PHA03379 511 ASLSQVPGVAFAPVMPqpmpvepvpvPTVALERPVCPAPPliamQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRD 590
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 433 SPTQVTSET-----PASSSPTQVTSDTPAS-----NSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS---DTPASSSP 499
Cdd:PHA03379 591 RLARLRAEAqpyqaSVEVQPPQLTQVSPQQpmeypLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDlplQQPISQGA 670
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 500 PQVTSDTPASSSPPqVTSETPASSSPPqVTSDTSASISPPQVISDTPAssSPPQVtseTPASSSPTNMTSDTPASSSPTN 579
Cdd:PHA03379 671 PLAPLRASMGPVPP-VPATQPQYFDIP-LTEPINQGASAAHFLPQQPM--EGPLV---PERWMFQGATLSQSVRPGVAQS 743
|
410 420
....*....|....*....|....*..
gi 115583681 580 MTSDTPASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03379 744 QYFDLPLTQPINHGAPAAHFLHQPPME 770
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
455-625 |
6.27e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 51.77 E-value: 6.27e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 455 PASNSPPQGTSDTPGfssPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPqvtsETPASSSPPQVTSD--T 532
Cdd:PRK07003 368 PGGGVPARVAGAVPA---PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRA----EAPPAAPAPPATADrgD 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTN-MTSDTPASSSPTNMTSDTPASSSPPW--PVIT 609
Cdd:PRK07003 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDaAFEPAPRAAAPSAATPAAVPDARAPAaaSRED 520
|
170
....*....|....*.
gi 115583681 610 EVTRPESTIPAGRSLA 625
Cdd:PRK07003 521 APAAAAPPAPEARPPT 536
|
|
| PLN03223 |
PLN03223 |
Polycystin cation channel protein; Provisional |
1995-2150 |
6.72e-06 |
|
Polycystin cation channel protein; Provisional
Pssm-ID: 215637 [Multi-domain] Cd Length: 1634 Bit Score: 51.87 E-value: 6.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1995 LRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSISDYQSFFRSIVTVVGLL 2074
Cdd:PLN03223 1294 LSGINIILLLGRILKLMDFQPRLGVITRTLWLAGADLMHFFVIFGMVFVGYAFIGHVIFGNASVHFSDMTDSINSLFENL 1373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 2075 MG-----TSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAFGkERKACEKEAT-----LTDMLLQKLSSLLG 2144
Cdd:PLN03223 1374 LGdityfNEDLKNLTGLQFVVGMIYFYSYNIFVFMILFNFLLAIICDAFG-EVKANAAETVsvhteLFPMLRDKWRSMFK 1452
|
....*.
gi 115583681 2145 IRLHQN 2150
Cdd:PLN03223 1453 GWFYKN 1458
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
371-484 |
7.55e-06 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 51.24 E-value: 7.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 371 QGTSDTPASSSP--PQGTSETPAS-NSPPQGTSeTPGFSSPPQVTTATLV--SSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:PLN02217 545 QGDAWIPGKGVPyiPGLFAGNPGStNSTPTGSA-ASSNTTFSSDSPSTVVapSTSPPAGHLGSPPATPSKIVSPSTSPPA 623
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 446 SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PLN02217 624 SHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESS 662
|
|
| GPS |
smart00303 |
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ... |
1019-1058 |
9.48e-06 |
|
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.
Pssm-ID: 197639 Cd Length: 49 Bit Score: 44.69 E-value: 9.48e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 115583681 1019 TQCYFWDRYNRTWKSDGCQVGPKS-TIlkTQCLCDHLTFFS 1058
Cdd:smart00303 3 PICVFWDESSGEWSTRGCELLETNgTH--TTCSCNHLTTFA 41
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
351-448 |
1.07e-05 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 50.34 E-value: 1.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 351 PASSSPPQGTLDTPSSSSPPQ-GTSD--TPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:PHA03291 176 PLGEGSADGSCDPALPLSAPRlGPADvfVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG 255
|
90 100
....*....|....*....|.
gi 115583681 428 TPASSSPTqVTSETPASSSPT 448
Cdd:PHA03291 256 TPAPPTPG-GGEAPPANATPA 275
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
494-674 |
1.17e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.94 E-value: 1.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PHA03307 22 PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPP---TGPPPGPGTEAPANESRSTPTWSLSTLAPA 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 574 SSSPTNmtSDTPASSSPTNMTSDTPASSSP---PWPVITEVTRPE-STIPAGRSLANITSKAQEDSPLGVISTHPQMSFQ 649
Cdd:PHA03307 99 SPAREG--SPTPPGPSSPDPPPPTPPPASPppsPAPDLSEMLRPVgSPGPPPAASPPAAGASPAAVASDAASSRQAALPL 176
|
170 180
....*....|....*....|....*..
gi 115583681 650 SSTSQALD--ETAGERVPTIPDFQAHS 674
Cdd:PHA03307 177 SSPEETARapSSPPAEPPPSTPPAAAS 203
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
321-469 |
1.32e-05 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 50.05 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 321 TSDTPASSSPPQ----VTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqgTSDTPASSSPPQGTSETPASNSPP 396
Cdd:pfam05539 191 SQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPS--GSPQHPPSTTSQDQSTTGDGQEHT 268
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 397 QGTSETPGFSSPPQV-TTATLVSSSPPQVTSE-TPASSSPTQVTSeTPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:pfam05539 269 QRRKTPPATSNRRSPhSTATPPPTTKRQETGRpTPRPTATTQSGS-SPPHSSPPGVQANPTTQNLVDCKELDPPK 342
|
|
| PRK13914 |
PRK13914 |
invasion associated endopeptidase; |
352-588 |
1.42e-05 |
|
invasion associated endopeptidase;
Pssm-ID: 237555 [Multi-domain] Cd Length: 481 Bit Score: 50.19 E-value: 1.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSSSPPQGTsdtPASSSPPQGTSETPASNSPPQG--TSETPGFSSppQVTTATLVSSSPPQVTSeTP 429
Cdd:PRK13914 143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVaeTKETPVVDQ--NATTHAVKSGDTIWALS-VK 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 430 ASSSPTQVTSETPASSSPTQVTSD----TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:PRK13914 217 YGVSVQDIMSWNNLSSSSIYVGQKlaikQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPK 296
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPASSSPPqvtseTPAssspPQVTSDTSASISPPQVISDTPASSSPPQVTSETpaSSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:PRK13914 297 APTEAAKP-----APA----PSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTN--TNSNTNTNSNTNANQGSSNNNSNSS 365
|
...
gi 115583681 586 ASS 588
Cdd:PRK13914 366 ASA 368
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
395-532 |
1.44e-05 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 50.47 E-value: 1.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 395 PPQGTSETPG-FSSPPQVTTATLVSSSPPQVTSETpaSSSPTqvTSETPASSSPTQVTSDTPASnsppqgtsdtpgfssP 473
Cdd:PLN02217 551 PGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFS--SDSPS--TVVAPSTSPPAGHLGSPPAT---------------P 611
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 474 TQVTTAtlvSSSPPQ----VTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PLN02217 612 SKIVSP---STSPPAshlgSPSTTPSSPESSIKVASTE-TASPESSIKVASTESSVSMVSMST 670
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
477-595 |
1.53e-05 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 49.71 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 477 TTATLVSSSPPQVTSDTPASSSPPqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799 307 SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMST 381
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 557 ETPASSSPTNMtsdTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK12799 382 TETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPTS 417
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
224-422 |
1.53e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.14 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 115583681 382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469 172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
403-558 |
1.55e-05 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 50.09 E-value: 1.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 403 PGFSSPPQVTTATLVSSSPPQVTSETPASSspTQVTSETPASSSPTQ---VTSDTPASNSPPQGTSdTPGFSSPTQVTTA 479
Cdd:PLN02217 514 PGAAITKRVTWPGIKKLSDEEILKFTPAQY--IQGDAWIPGKGVPYIpglFAGNPGSTNSTPTGSA-ASSNTTFSSDSPS 590
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 480 TLV--SSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqVTSDTSASISPPQVISDTPASSSPPQVTSE 557
Cdd:PLN02217 591 TVVapSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESS-IKVASTETASPESSIKVASTESSVSMVSMS 669
|
.
gi 115583681 558 T 558
Cdd:PLN02217 670 T 670
|
|
| SP4_N |
cd22536 |
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ... |
345-733 |
1.78e-05 |
|
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene for multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains of the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.
Pssm-ID: 411773 [Multi-domain] Cd Length: 623 Bit Score: 49.92 E-value: 1.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSEtpgFSSPPQVTTatlVSSSPPQ 423
Cdd:cd22536 90 QGVSAATSSAAPSSSN----NGSTSPTKVKAGNSNASAPGQFQVIQVQNMQnPSGSVQ---YQVIPQIQT---VEGQQIQ 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 424 VTSETPASSSPTQVTSE-TPASS--SPTQVTSDTPASNSPPQGTSDT-------PGFSSPTQVTTatlVSSSPPQVTSDT 493
Cdd:cd22536 160 ISPANATALQDLQGQIQlIPAGNnqAILTTPNRTASGNIIAQNLANQtvpvqirPGVSIPLQLQT---IPGAQAQVVTTL 236
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSppQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMT----- 568
Cdd:cd22536 237 PINIG--GVTLALP-------VINNVAAGGGSGQLVQPSDGGVSNGNQLVSTPITTASVSTMPESPSSSTTCTTTastsl 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 569 --SDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVI-TEVTRPESTIPAGRSLANITSKAQE--------DSPL 637
Cdd:cd22536 308 tsSDTLVSSAETGQYASTAASSERTEEEPQTSAAESEAQSSSqLQSNGLQNVQDQSNSLQQVQIVGQPilqqiqiqQPQQ 387
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 638 GVISTHPQMSFQSSTSQALDETAGERVPTIpDFQAhseFQKACAILQRlrdfLPTSPTSAQKnnSWssQTPAVscpfQPL 717
Cdd:cd22536 388 QIIQAIQPQSFQLQSGQTIQTIQQQPLQNV-QLQA---VQSPTQVLIR----APTLTPSGQI--SW--QTVQV----QNI 451
|
410
....*....|....*.
gi 115583681 718 GRLTTTEKSSHQMAQQ 733
Cdd:cd22536 452 QSLSNLQVQNAGLPQQ 467
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
476-616 |
1.79e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 49.81 E-value: 1.79e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 476 VTTATLVSSSPPQVTSDTPASSSPPQvtsDTPASSSPPQVTSETPASSSPPQvtsdtsASISPPQVISDTPASSSPPqvt 555
Cdd:PRK14950 355 VIEALLVPVPAPQPAKPTAAAPSPVR---PTPAPSTRPKAAAAANIPPKEPV------RETATPPPVPPRPVAPPVP--- 422
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 556 seTPASSSPTNMTSDTPASSSPtnmTSDTPASSSPTNMTSDTPASS----SPPWPVITEVTRPES 616
Cdd:PRK14950 423 --HTPESAPKLTRAAIPVDEKP---KYTPPAPPKEEEKALIADGDVleqlEAIWKQILRDVPPRS 482
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
468-628 |
1.80e-05 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 50.09 E-value: 1.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 468 PGFSSPTQVTTATLVSSSPPQVTSDTPASssppQVTSDT--PASSSP--PQVTSETPASSsppqvTSDTSASISPpqviS 543
Cdd:PLN02217 514 PGAAITKRVTWPGIKKLSDEEILKFTPAQ----YIQGDAwiPGKGVPyiPGLFAGNPGST-----NSTPTGSAAS----S 580
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 544 DTPASSSPPQvTSETPASSSPTNMTSDTPASSSpTNMTSDTPASSSPTNMTSDTPAS-SSPPWPVITEVTRPESTIPAGR 622
Cdd:PLN02217 581 NTTFSSDSPS-TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSpESSIKVASTETASPESSIKVAS 658
|
....*.
gi 115583681 623 SLANIT 628
Cdd:PLN02217 659 TESSVS 664
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
474-620 |
1.82e-05 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 50.04 E-value: 1.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSP----------PQVTSETPASSSPPQVTSDTSASIS-----P 538
Cdd:PRK10811 848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPvveavaevveEPVVVAEPQPEEVVVVETTHPEVIAapvteQ 927
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 539 PQVISDTPASssppqVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTI 618
Cdd:PRK10811 928 PQVITESDVA-----VAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVA 1002
|
..
gi 115583681 619 PA 620
Cdd:PRK10811 1003 PA 1004
|
|
| Mating_C |
pfam12737 |
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ... |
361-632 |
1.98e-05 |
|
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered an innovation that could have evolved only once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.
Pssm-ID: 372279 [Multi-domain] Cd Length: 412 Bit Score: 49.60 E-value: 1.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 361 LDTPSSSSPPQGTSDTPASSSpPQGTSETPASNSPpqgtseTPGFSSPPQVTTATLVSSSPPQVTSEtpassspTQVTSE 440
Cdd:pfam12737 123 LDSPSSSSSPEKCLPSPAPSE-QEALSEISAACGP------TPSTLTPLNVAPSLTPSKKRKRCLSD-------GFDGPK 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 441 TPASSSPT---QVTSDT-PASNSPPQGTSDTPGFSSPtqvtTATLVSSSPPQVTSDTPASSSP-----------PQVTSD 505
Cdd:pfam12737 189 RPPNKRVQprpQTVSDPfPTSTSIPEWDEWLQNHMSP----SLTLHGDIPPPVSVEAPDSNTPldieifnfpyhPDLTPS 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPASSSPPQVTSETPASSSPP-------QVTSDTSASISPPQVISDTPAS----SSPPQVTSETPASSSPTNMTSDTPAS 574
Cdd:pfam12737 265 PAPSLSDSVIEVATPTTESDYmcngtlrQTFSWFEFDFPELIQPTNTPASnnelSLPFDPSTDIVVSRTILPLLDWRSQS 344
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 575 SSPTNMTSDTPA-----SSSPTnmTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQ 632
Cdd:pfam12737 345 FLSQTFASPPHSilrsnSSSPD--VSAFALDLTPAFTPITYSLSESEKEAKRRELEELEARLQ 405
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
226-466 |
1.99e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.98 E-value: 1.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 226 RPQVTSDTLASSSPPQGTSDTPASSSPPQVTSAtsasssppqGTSDTPASSSPPqvtsatsasssppqgtSDTPASSSPP 305
Cdd:PRK07764 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARP---------AAPAAPAAPAAP----------------APAGAAAAPA 637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 306 QVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQgtlDTPSSSSPPQGTSDTPASSSPPQG 385
Cdd:PRK07764 638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAP---AAPAGAAPAQPAPAPAATPPAGQA 714
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 386 TSETPASNSPPQGTSETPGFSS-----PPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSP 460
Cdd:PRK07764 715 DDPAAQPPQAAQGASAPSPAADdpvplPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMD 794
|
....*.
gi 115583681 461 PQGTSD 466
Cdd:PRK07764 795 DEDRRD 800
|
|
| DUF612 |
pfam04747 |
Protein of unknown function, DUF612; This family includes several uncharacterized proteins ... |
270-575 |
2.09e-05 |
|
Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.
Pssm-ID: 282585 [Multi-domain] Cd Length: 511 Bit Score: 49.68 E-value: 2.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 270 SDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSppqgTSDTPASSSPPQvtSATSASSSPPQGTSD 349
Cdd:pfam04747 198 TNTPAEPAE-QVQEITGKKNKKNKKKSESEATAAPASVEQVVEQPKV----VTEEPHQQAAPQ--EKKNKKNKRKSESEN 270
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 350 TPASS-SPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQgtSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:pfam04747 271 VPAASeTPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPK--SKKPTADDNMDFLDFVTAKEEPKDEPAET 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASssPTQVTSETPASSSPTQVTSdTPASNSPPQGTSDTPGfSSPTQVTTATLVSS-SPPQVT----SDTPASSSPPQVT 503
Cdd:pfam04747 349 PAA--PVEEVVENVVENVVEKSTT-PPATENKKKNKKDKKK-SESEKVTEQPVESApAPPQVEqvveTTPPASENKKKNK 424
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 504 SDTPASSSPPQVtsETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS 575
Cdd:pfam04747 425 KDKKKSESEKAV--EEPVQAAPSSKKPTADDNMDFLDFVTAKPDKSESVEEHIAAPMIVEPAHADEETAAAA 494
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
356-555 |
2.15e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 49.54 E-value: 2.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 356 PPQGTLDTPSSSSPPQG----------TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVttatlvssSPPQVT 425
Cdd:PLN03209 382 PPTSPIPTPPSSSPASSksvdavakpaEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDL--------KPPTSP 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SETPASSSPTQVTSETPASSSPtqvtsDTPASNSPPQGTSDTPGFSSPtqvTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:PLN03209 454 SPTAPTGVSPSVSSTSSVPAVP-----DTAPATAATDAAAPPPANMRP---LSPYAVYDDLKPPTSPSPAAPVGKVAPSS 525
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 115583681 506 TPAssSPPQVTSETPASSSPPQVTSDTSAS-ISPPQVISDT--PASSSPPQVT 555
Cdd:PLN03209 526 TNE--VVKVGNSAPPTALADEQHHAQPKPRpLSPYTMYEDLkpPTSPTPSPVL 576
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
348-636 |
2.41e-05 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 48.32 E-value: 2.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 348 SDTPAS--SSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGT-------SETPASNSPPQGTSetPGFSSPPqvttatlVS 418
Cdd:PHA02682 19 ADTSSSlfTKCPQATIPAPAAPCPPDADVDPLDKYSVKEAGryyqsrlKANSACMQRPSGQS--PLAPSPA-------CA 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 419 SSPPQVTSETPASSSPTqVTSETPASSSPTqvtsdTPASNSPPQGTSDTPGfssptqvttatlvsssppqvtsdTPASSS 498
Cdd:PHA02682 90 APAPACPACAPAAPAPA-VTCPAPAPACPP-----ATAPTCPPPAVCPAPA-----------------------RPAPAC 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 499 PPQVTSDTPAsssPPQVTSEtPASSSPPQVTSDtsaSISPPQVisdtPASSSPpqvTSETPASSSPT------------- 565
Cdd:PHA02682 141 PPSTRQCPPA---PPLPTPK-PAPAAKPIFLHN---QLPPPDY----PAASCP---TIETAPAASPVlepripdkiidad 206
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 566 NMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSppwpviteVTrpESTIPAGRSLANITSKAQEDSP 636
Cdd:PHA02682 207 NDDKDLIKKELADIADSVRDLNAESLSLTRDIENAKS--------TT--QAAIDDLRRLLTGGGVARRDTP 267
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
487-642 |
2.83e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 49.46 E-value: 2.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 487 PQVTSDT-PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK07003 360 PAVTGGGaPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 566 N--MTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVIST 642
Cdd:PRK07003 440 DdaADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASR 518
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
508-599 |
2.83e-05 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 49.47 E-value: 2.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT-PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PRK11907 18 LTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEART 97
|
90
....*....|...
gi 115583681 587 SSSPTnmTSDTPA 599
Cdd:PRK11907 98 VTPAA--TETSKP 108
|
|
| PRK13335 |
PRK13335 |
superantigen-like protein SSL3; Reviewed; |
413-520 |
2.92e-05 |
|
superantigen-like protein SSL3; Reviewed;
Pssm-ID: 139494 [Multi-domain] Cd Length: 356 Bit Score: 48.58 E-value: 2.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 413 TATLVSSSPPQVTSETPASSSPTQV--TSETPASSSPTQVTSDTPASNsppQGTSDTPGFSSPTQVTTATLVSSSPPQVT 490
Cdd:PRK13335 55 TAGANSATTQAANTRQERTPKLEKApnTNEEKTSASKIEKISQPKQEE---QKSLNISATPAPKQEQSQTTTESTTPKTK 131
|
90 100 110
....*....|....*....|....*....|....*
gi 115583681 491 SDTPASSSPPQ----VTSDTPASSSPPQVTSE-TP 520
Cdd:PRK13335 132 VTTPPSTNTPQpmqsTKSDTPQSPTIKQAQTDmTP 166
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
472-564 |
3.03e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 49.04 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 472 SPTQVTTATLVSSSPPQVTsDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsisPPQVISDTPASSSP 551
Cdd:PRK14950 362 PVPAPQPAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH---TPESAPKLTRAAIP 437
|
90
....*....|...
gi 115583681 552 PQVTSETPASSSP 564
Cdd:PRK14950 438 VDEKPKYTPPAPP 450
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
534-633 |
3.16e-05 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 49.47 E-value: 3.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 534 ASISPPQVISDTPASSSPPQvTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTR 613
Cdd:PRK11907 18 LTASNPKLAQAEEIVTTTPA-TSTEAEQTTPV----ESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTT 92
|
90 100
....*....|....*....|
gi 115583681 614 PEStiPAGRSLANITSKAQE 633
Cdd:PRK11907 93 SEA--RTVTPAATETSKPVE 110
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
436-638 |
3.21e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 49.15 E-value: 3.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvtSDTPASsspPQVTSDTPASSSPPQV 515
Cdd:PLN03209 304 EVIAETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPI--EEEPPQ---PKAVVPRPLSPYTAYE 378
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 516 TSETPASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQV------TSET-----------------PASSSPTNM 567
Cdd:PLN03209 379 DLKPPTSPIPTPPSSSPASSksvdaVAKPAEPDVVPSPGSASNVpevepaQVEAkktrplspyaryedlkpPTSPSPTAP 458
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 568 TSDTPASSSPT--NMTSDTPASSSPTNMTSDTPASSSP--PWPVITEVTRPESTIPAGRSLANITSKAQEDSPLG 638
Cdd:PLN03209 459 TGVSPSVSSTSsvPAVPDTAPATAATDAAAPPPANMRPlsPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVG 533
|
|
| KAR9 |
pfam08580 |
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ... |
322-589 |
3.29e-05 |
|
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.
Pssm-ID: 430088 [Multi-domain] Cd Length: 684 Bit Score: 49.06 E-value: 3.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 322 SDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG-TS 400
Cdd:pfam08580 427 NKTPGSSPPSS--------------VIMTPVNKGSKTPSSRRGSSFDFGSSSERVINSKLRRESKLPQIASTLKQTKrPS 492
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 401 ETPGFSSPPQVTtatlvSSSPPQ-VTSETPA-SSSPTQVTSETPASSSPTQVTSDTPASNSPPQ-GTSDTPGFSSPTQVT 477
Cdd:pfam08580 493 KIPRASPNHSGF-----LSTPSNtATSETPTpALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHnFKPLTLTTPSPTPSR 567
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 478 TATLVSSSPPQVTSDTPASSSPpqVTSDTPASSSPPQVTSETPASSSPPQvtsdtsasiSPPQVISDTPASSSP-PQVTS 556
Cdd:pfam08580 568 SSRSSSTLPPVSPLSRDKSRSP--APTCRSVSRASRRRASRKPTRIGSPN---------SRTSLLDEPPYPKLTlSKGLP 636
|
250 260 270
....*....|....*....|....*....|...
gi 115583681 557 ETPASSSPTNMTSDTPASSSPTNMTSDTPASSS 589
Cdd:pfam08580 637 RTPRNRQSYAGTSPSRSVSVSSGLGPQTRPGTS 669
|
|
| KAR9 |
pfam08580 |
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ... |
414-621 |
3.37e-05 |
|
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.
Pssm-ID: 430088 [Multi-domain] Cd Length: 684 Bit Score: 49.06 E-value: 3.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQV-----TSDTPASNSP-----PQGTSDTPGFSSPTQVTTATLVS 483
Cdd:pfam08580 422 ATLVANKTPGSSPPSSVIMTPVNKGSKTPSSRRGSSFdfgssSERVINSKLRresklPQIASTLKQTKRPSKIPRASPNH 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPPQVTSDTPASSSPpqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS 563
Cdd:pfam08580 502 SGFLSTPSNTATSETP------TPALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPSRSSRSSSTL 575
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 564 PTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAG 621
Cdd:pfam08580 576 PPVSPLSRDKSRSPApTCRSVSRASRRRASRKPTRIGSPNSRTSLLDEPPYPKLTLSKG 634
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
426-565 |
3.96e-05 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 48.41 E-value: 3.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG--FSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQV 502
Cdd:PTZ00436 208 AAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAkaAAPPAKAAAPPAKAAAPPAKAAAPPAkAAAPPAK 287
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 503 TSDTPASSSppqvTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PTZ00436 288 AAAPPAKAA----AAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
411-549 |
3.98e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 48.65 E-value: 3.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 411 VTTATLVSSSPPQvtSETPASSSPTQVTsETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVT 490
Cdd:PRK14950 355 VIEALLVPVPAPQ--PAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV---PHTPESA 428
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 491 SDTPASSSPPQVtsdTPASSSPPQVTSETPASSSPPQVTSDTSASIspPQVISDTPASS 549
Cdd:PRK14950 429 PKLTRAAIPVDE---KPKYTPPAPPKEEEKALIADGDVLEQLEAIW--KQILRDVPPRS 482
|
|
| PRK12727 |
PRK12727 |
flagellar biosynthesis protein FlhF; |
358-566 |
4.04e-05 |
|
flagellar biosynthesis protein FlhF;
Pssm-ID: 237182 [Multi-domain] Cd Length: 559 Bit Score: 48.83 E-value: 4.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 358 QGTLDTPSSSSP--PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12727 52 QRALETARSDTPatAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALRQPVSVPRQAPAAAPVR 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 436 QVTSETPAssspTQVTSDTPASNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12727 132 AASIPSPA----AQALAHAAAVRTAPRQEHALS--AVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAY 205
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 115583681 516 TSETPASSSPPQVTSDTS--ASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK12727 206 AQDDDEQLDDDGFDLDDAlpQILPPAALPPIVVAPAAPAALAAVAAAAPAPQN 258
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
225-461 |
4.04e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 49.08 E-value: 4.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 225 GRPQVTSDTLASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQGTSDTPASSSppQVTSATSASSSPPQGTSDTPASSS 303
Cdd:PRK07003 383 PGARAAAAVGASAVPAVTAVTGAAGAALaPKAAAAAAATRAEAPPAAPAPPATA--DRGDDAADGDAPVPAKANARASAD 460
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 304 PPqvTSATSASSSPPQGTSDTPASSSPPQVTSATSasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPP 383
Cdd:PRK07003 461 SR--CDERDAQPPADSGSASAPASDAPPDAAFEPA--------PRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAP 530
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 384 QGTSETPASNSPP---------------------QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PRK07003 531 EARPPTPAAAAPAaraggaaaaldvlrnagmrvsSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPP 610
|
250
....*....|....*....
gi 115583681 443 ASSSPTQVTSDTPASnSPP 461
Cdd:PRK07003 611 NGAARAEQAAESRGA-PPP 628
|
|
| PHA02732 |
PHA02732 |
hypothetical protein; Provisional |
355-609 |
4.62e-05 |
|
hypothetical protein; Provisional
Pssm-ID: 165099 [Multi-domain] Cd Length: 1467 Bit Score: 48.98 E-value: 4.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 355 SPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSETPGFS-SPPQVTTATLVSSSPPQ----VTSET 428
Cdd:PHA02732 1074 SPSYIFLNSWASSYVAPGFLGSPYALPYFMNQTSALVGNTAlPKGLNVFSGYMfGAGTVASAFLYMNSTPQspvlALLLA 1153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSSPTQVTSDTP------ASNSPPQG----TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PHA02732 1154 PYISYKFNALSLGFSITADAAIFSLFGipapqlLSSYIPTGsvlyQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASS 1233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 499 PPQVTSDTPASSSPPQVTSETpASSSPP--QVTSDTSASISPPQVISdtPASSSPP--QVTSETPASSSPTNMTS-DTPA 573
Cdd:PHA02732 1234 PPAATTPTPPPSSSSSSSAQS-ISTSPGqiQIVLNGSTTIHINFLFF--PALSTPKigQILAMPIVNSSGAFISLyVNSA 1310
|
250 260 270
....*....|....*....|....*....|....*.
gi 115583681 574 SSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT 609
Cdd:PHA02732 1311 ISANFNVTIEYVFSNGTVIKRFTDEPGQIFPLPLIN 1346
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
405-521 |
4.79e-05 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 48.70 E-value: 4.79e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 405 FSSPPQVTTATLVSSSPPQVTSETPASssptqvtseTPASSSPTQVTSDTPASNSppqGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PRK11907 6 FSKSAVALTLALLTASNPKLAQAEEIV---------TTTPATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPS 73
|
90 100 110
....*....|....*....|....*....|....*..
gi 115583681 485 SPPQVTSDTPASSSPPQVTSDTPasSSPPQVTSETPA 521
Cdd:PRK11907 74 SSETAETSDPTSEATDTTTSEAR--TVTPAATETSKP 108
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
235-668 |
4.79e-05 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 48.90 E-value: 4.79e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 235 ASSSPPQ-GTSDTPASSSPP----QVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQG--------TSDTPAS 301
Cdd:PHA03377 446 AQSTPERpGPSDQPSVPVEPahltPVEHTTVILHQPPQSPPTVAIKPAPPPSRRRRGACVVYDDDiievidveTTEEEES 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 302 SSPPQVTSATSASSSPPQGT------------SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLD-TPSSS 367
Cdd:PHA03377 526 VTQPAKPHRKVQDGFQRSGRrqkratppkvspSDRgPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQqAKCKD 605
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 368 SPPQGTSD--TPASSSP-----------------PQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvtset 428
Cdd:PHA03377 606 GPPASGPHekQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQSTPPR----- 680
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSS-PTQVTSDTPASNSPPQGTSDTPGFSSPtqvTTATLVSSSPPQVTSdtPASSSPPQVTSDTP 507
Cdd:PHA03377 681 PSWLPSVFVLPSVDAGRAqPSEESHLSSMSPTQPISHEEQPRYEDP---DDPLDLSLHPDQAPP--PSHQAPYSGHEEPQ 755
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 508 ASSSPPQVTSETPASSSPPQVTSDTSASISPpqvISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:PHA03377 756 AQQAPYPGYWEPRPPQAPYLGYQEPQAQGVQ---VSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPH 832
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 588 SSPTNMTSDTPA------------SSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH---PQMSFQSST 652
Cdd:PHA03377 833 LPPQWDGSAGHGqdqvsqfphlqsETGPPRLQLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPTRfppPPMPLQDSM 912
|
490
....*....|....*.
gi 115583681 653 SQALDeTAGERVPTIP 668
Cdd:PHA03377 913 AVGCD-SSGTACPSMP 927
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
448-585 |
5.34e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 48.23 E-value: 5.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 448 TQVTSDTPASNSPPQGTSdtPGFS---SPTQVTTATLVSSSPPQvTSDTPASSSPPQVTsdtPASSSPPQVtSETPASSS 524
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSAT---QPAGTPPTV-SVDPPAAV 435
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 525 PPQVTSDTSASISPPQVISDTPASSSppQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:PRK14971 436 PVNPPSTAPQAVRPAQFKEEKKIPVS--KVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
470-573 |
5.39e-05 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 48.70 E-value: 5.39e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvTSDTPASSSPPQVTSET---PASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PRK11907 6 FSKSAVALTLALLTASNPKLAQAEEIVTTTP--ATSTEAEQTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDP 83
|
90 100
....*....|....*....|....*..
gi 115583681 547 ASSSPPQVTSETPASSSPTnmTSDTPA 573
Cdd:PRK11907 84 TSEATDTTTSEARTVTPAA--TETSKP 108
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
293-521 |
5.97e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.33 E-value: 5.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 293 QGTSDTPASSSPPQVTSATSASSSPpqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQG 372
Cdd:PRK12323 368 SGGGAGPATAAAAPVAQPAPAAAAP---AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 373 TSDTPASSSPPQgtseTPASNSPPQgtsetpgfSSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12323 445 GGAPAPAPAPAA----APAAAARPA--------AAGPRPVAAAAAAAPARA----APAAAPAPADDDPPPWEELPPEFAS 508
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 453 DTPASNSPPQGTSDTPGFSSP-TQVTTATLVSSSPPQVTSDTPASSSP-----PQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK12323 509 PAPAQPDAAPAGWVAESIPDPaTADPDDAFETLAPAPAAAPAPRAAAAtepvvAPRPPRASASGLPDMFDGDWPA 583
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
373-504 |
6.19e-05 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 47.79 E-value: 6.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 373 TSDTPASSSPPqgTSETPASNSPPQGTSETPGfssppQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12799 296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPS-----PAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 115583681 453 DTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK12799 369 AEPVNMQPQPMSTTETQQSSTGNITSTA---NGPTTSLPAAPASNIPVSPTS 417
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
347-580 |
6.53e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.15 E-value: 6.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQgTLDTPSSSSPPQgtsdTPASSSPPQGTsETPASNSPPQgtSETPGFSspPQVTTATLVSSSppqvts 426
Cdd:PTZ00449 617 LLDIPKSPKRPE-SPKSPKRPPPPQ----RPSSPERPEGP-KIIKSPKPPK--SPKPPFD--PKFKEKFYDDYL------ 680
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 eTPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASS---SPPQVT 503
Cdd:PTZ00449 681 -DAAAKSKETKTTVVLDESFESILKETLPETPGTP---FTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIeffTPPEEE 756
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 S----DTPASSSPPQVTSETPASsspPQVTSDTSASISP------PQVISDTPASSSP--PQ--------VTSETPASSS 563
Cdd:PTZ00449 757 RtffhETPADTPLPDILAEEFKE---EDIHAETGEPDEAmkrpdsPSEHEDKPPGDHPslPKkrhrldglALSTTDLESD 833
|
250
....*....|....*..
gi 115583681 564 PTNMTSDtpASSSPTNM 580
Cdd:PTZ00449 834 AGRIAKD--ASGKIVKL 848
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
239-716 |
6.58e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.15 E-value: 6.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 239 PPQGTSDTPASSSPPQvtsatsasssppqGtsdtPASSSPPQVTSATSASSSPPQGTSDtpaSSSPPQVTSATSASSSPP 318
Cdd:PTZ00449 497 APIEEEDSDKHDEPPE-------------G----PEASGLPPKAPGDKEGEEGEHEDSK---ESDEPKEGGKPGETKEGE 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 319 QGTSDTPASSSPPqvtsatsasssppqgtSDTPASSSPPQGTLDTPSSSSPpqgtsDTPASSSPPQGTSETPASNSP--P 396
Cdd:PTZ00449 557 VGKKPGPAKEHKP----------------SKIPTLSKKPEFPKDPKHPKDP-----EEPKKPKRPRSAQRPTRPKSPklP 615
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 397 QgTSETPGFSSPPQVTTAtlvSSSPPqvtsetpassSPTQVTSETPASSSPTQVTSDTPASNSPP----------QGTSD 466
Cdd:PTZ00449 616 E-LLDIPKSPKRPESPKS---PKRPP----------PPQRPSSPERPEGPKIIKSPKPPKSPKPPfdpkfkekfyDDYLD 681
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 467 TPGFSSPTqVTTATLVSSSPPQVTSDTPASSSPPQVTSDT--PASSSPPQVTSETPASSSPPQvtSDTSASISPPQ---- 540
Cdd:PTZ00449 682 AAAKSKET-KTTVVLDESFESILKETLPETPGTPFTTPRPlpPKLPRDEEFPFEPIGDPDAEQ--PDDIEFFTPPEeert 758
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 541 VISDTPASSSPPQVTSETPASSsptNMTSDTPASSSPTNMtsdtpaSSSPTNMtSDTPASSSPPWPVITE------VTRP 614
Cdd:PTZ00449 759 FFHETPADTPLPDILAEEFKEE---DIHAETGEPDEAMKR------PDSPSEH-EDKPPGDHPSLPKKRHrldglaLSTT 828
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 615 ESTIPAGRSLANITSKaqedsplgVISTHPQMSFQSSTSQALDETAGERVPTI---------PDFQAHSEFQKACAILQR 685
Cdd:PTZ00449 829 DLESDAGRIAKDASGK--------IVKLKRSKSFDDLTTVEEAEEMGAEARKIvvdddgteaDDEDTHPPEEKHKSEVRR 900
|
490 500 510
....*....|....*....|....*....|.
gi 115583681 686 LRdflPTSPTSAQKNNSWSSQTPAVSCPFQP 716
Cdd:PTZ00449 901 RR---PPKKPSKPKKPSKPKKPKKPDSAFIP 928
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
471-591 |
6.74e-05 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 47.64 E-value: 6.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 471 SSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PTZ00436 220 AAPAKAAAAPAKAAAPPAKAAAAPAkAAAAPAKAAAPPAkAAAPPAKAAAPPakAAAPPAKAAAPPAKAAAPPAKAAAAP 299
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 115583681 547 A-SSSPPQVTSETPA-SSSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PTZ00436 300 AkAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
|
|
| PRK13042 |
PRK13042 |
superantigen-like protein SSL4; Reviewed; |
481-577 |
7.96e-05 |
|
superantigen-like protein SSL4; Reviewed;
Pssm-ID: 183854 [Multi-domain] Cd Length: 291 Bit Score: 46.93 E-value: 7.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 481 LVSSSPPQVTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSspPQVTSDTSASISPPQvisDTPASSSPPQVTSETPa 560
Cdd:PRK13042 15 LLTTGVITTTTQAANATTPSSTKVEAP-QSTPPSTKVEAPQSK--PNATTPPSTKVEAPQ---QTPNATTPSSTKVETP- 87
|
90
....*....|....*..
gi 115583681 561 sSSPTnmTSDTPASSSP 577
Cdd:PRK13042 88 -QSPT--TKQVPTEINP 101
|
|
| SAP130_C |
pfam16014 |
Histone deacetylase complex subunit SAP130 C-terminus; |
415-590 |
8.20e-05 |
|
Histone deacetylase complex subunit SAP130 C-terminus;
Pssm-ID: 464973 [Multi-domain] Cd Length: 371 Bit Score: 47.24 E-value: 8.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 415 TLVSSSPPQVTSETPA-----SSSPTQVTSETPASSSPTQVT---SDTPASNSPPqgTSDTPGFSSPTQVTTATlVSSSP 486
Cdd:pfam16014 1 ALGSSPRPSILRKKPAtegakPKPDIHVAVAPPVTVAVEALPgqnSEQQTASASP--PSQHPAQAIPTILAPAA-PPSQP 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 487 PQVTSDTPASS--SPPQVTSDTPASSSPPQvtsetPASSSPPQ-VTSDTSASISPPQVISDTPASSSPPQVTSETPASSs 563
Cdd:pfam16014 78 SVVLSTLPAAMavTPPIPASMANVVAPPTQ-----PAASSTAAcAVSSVLPEIKIKQEAEPMDTSQSVPPLTPTSISPA- 151
|
170 180
....*....|....*....|....*..
gi 115583681 564 ptnMTSDTPASSSPtnmTSDTPASSSP 590
Cdd:pfam16014 152 ---LTSLANNLSVP---AGDLLPGASP 172
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
457-551 |
8.62e-05 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 46.94 E-value: 8.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 457 SNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:PRK10856 159 GQSVPLDTSTTT--DPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAP 236
|
90
....*....|....*
gi 115583681 537 SPPQVISDTPASSSP 551
Cdd:PRK10856 237 LPTDQAGVSTPAADP 251
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
521-635 |
8.66e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 45.72 E-value: 8.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 521 ASSSPPQVTSDTSASISPpQVISDTPASSSPPQVTSETP-----ASSSPT--------NMTSDTPASSSPTNMTSDTPAS 587
Cdd:pfam09595 31 ASLILIGESNKEAALIIT-DIIDININKQHPEQEHHENPplneaAKEAPSesedapdiDPNNQHPSQDRSEAPPLEPAAK 109
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 115583681 588 SSPT----NMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDS 635
Cdd:pfam09595 110 TKPSehepANPPDASNRLSPPDASTAAIREARTFRKPSTGKRNNPSSAQSDQ 161
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
321-609 |
8.89e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.46 E-value: 8.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 321 TSDTPA-------SSSPPQVTSATSASSSPPQGTSDTPASSSPPQgtLDTPSSSSPPQgtSDTPASSSPPQgtSETPASN 393
Cdd:NF033839 280 TQDTPKepgnkkpSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQ--LEKPKPEVKPQ--PEKPKPEVKPQ--LETPKPE 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 394 SPPQGTSETPGFSSPPQvttaTLVSSSPPQVTSETPasssptQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFsSP 473
Cdd:NF033839 354 VKPQPEKPKPEVKPQPE----KPKPEVKPQPETPKP------EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEV-KP 422
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 474 TQVTTATLVSSSPPqvtsdTPASSSPPQvtSDTPASSSPPQvtSETPASSSPPQVtsdtsasisppqvisDTPASSSPPQ 553
Cdd:NF033839 423 QPEKPKPEVKPQPE-----KPKPEVKPQ--PEKPKPEVKPQ--PETPKPEVKPQP---------------EKPKPEVKPQ 478
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 554 VTSETPASSSPtnmTSDTPASSSPTNMTSDTPAS--SSPTNMTSDTPASSSPPWPVIT 609
Cdd:NF033839 479 PEKPKPDNSKP---QADDKKPSTPNNLSKDKQPSnqASTNEKATNKPKKSLPSTGSIS 533
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
555-603 |
9.20e-05 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 45.77 E-value: 9.20e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 115583681 555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
353-472 |
9.33e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 45.33 E-value: 9.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 353 SSSPPQGTLDTPSSSSPPQGTSDtPASSSPPQGTSETPASNSppqGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:pfam09595 66 ENPPLNEAAKEAPSESEDAPDID-PNNQHPSQDRSEAPPLEP---AAKTKPSEHEPANPPDASNRLSPPDASTAAIREAR 141
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 115583681 433 SPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS 472
Cdd:pfam09595 142 TFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
451-538 |
9.74e-05 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 46.94 E-value: 9.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 451 TSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK10856 164 LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243
|
....*...
gi 115583681 531 DTSASISP 538
Cdd:PRK10856 244 VSTPAADP 251
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
205-585 |
9.80e-05 |
|
Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein pfam03542. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin pfam03542. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 47.74 E-value: 9.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 205 DPASSAPPKATHRMTITSLTGrpqvTSDTLASSSPPQGTSDTPASSSPPQVTsatsasssppqGTSDTPASS-----SPP 279
Cdd:pfam04388 288 YGSSTSTPSSTPRLQLSSSSG----TSPPYLSPPSIRLKTDSFPLWSPSSVC-----------GMTTPPTSPgmvptTPS 352
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 280 QVTSATSASSSPPQGTSD---------TPASSSPPQvtsatsasssppqgTSDTPASSSPPQVTSatsasssppqgtsdt 350
Cdd:pfam04388 353 ELSPSSSHLSSRGSSPPEaageatpetTPAKDSPYL--------------KQPPPLSDSHVHRAL--------------- 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 351 PASSSPpqgtldtpssSSPPQgTSDTPASSSPPQgTSETPASNSppqgtseTPGFSSPP-QVTTATLvsSSPPQVTSETP 429
Cdd:pfam04388 404 PASSQP----------SSPPR-KDGRSQSSFPPL-SKQAPTNPN-------SRGLLEPPgDKSSVTL--SELPDFIKDLA 462
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 430 ASSSPTQVTSETPAS-----SSPTQVTSDTPASNsppqgtsdtPGFSSPTQVTTATLVSSsppQVTSDTPASSSPPQVTS 504
Cdd:pfam04388 463 LSSEDSVEGAEEEAAisqelSEITTEKNETDCSR---------GGLDMPFSRTMESLAGS---QRSRNRIASYCSSTSQS 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 505 DTPASSSPPQVTSETPASSSPPQVTSDTSASISPP--QVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam04388 531 DSHGPATTPESKPSALAEDGLRRTKSCSFKQSFTPieQPIESSDDCPTDEQDGENGLETSILTPSPCKIPSRQKVSTQSG 610
|
...
gi 115583681 583 DTP 585
Cdd:pfam04388 611 QPL 613
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
361-450 |
1.00e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 46.94 E-value: 1.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 361 LDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856 164 LDTSTTTDPaTTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243
|
90
....*....|.
gi 115583681 440 ETPASSSPTQV 450
Cdd:PRK10856 244 VSTPAADPNAL 254
|
|
| PLN02983 |
PLN02983 |
biotin carboxyl carrier protein of acetyl-CoA carboxylase |
365-554 |
1.00e-04 |
|
biotin carboxyl carrier protein of acetyl-CoA carboxylase
Pssm-ID: 215533 [Multi-domain] Cd Length: 274 Bit Score: 46.37 E-value: 1.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 365 SSSSPpqgtsdTPASSSPPQGTSETPASNSppQGTSETPGFSSPPQVTTATLVSSSPPQVTS-ETPASSSPTQVTSETPA 443
Cdd:PLN02983 3 SLSVP------CAKTAAAAANVGSRLSRSS--FRLQPKPNISFPSKGPNPKRSAVPKVKAQLnEVAVDGSSNSAKSDDPK 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 444 SSSPTQVTSDTPASNSPPQGTSDTPGFSSP--TQVTT------------------------------------ATLVSSS 485
Cdd:PLN02983 75 SEVAPSEPKDEPPSNSSSKPNLPDEESISEfmTQVSSlvklvdsrdivelqlkqldcelvirkkealpqppppAPVVMMQ 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 486 PPQVTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQ--VISDTPASSSPPQV 554
Cdd:PLN02983 155 PPPPHAMPPASPPAAQPAPSAPASSPPP-----TPASPPPAKAPKSSHPPLKSPMagTFYRSPAPGEPPFV 220
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
482-620 |
1.02e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 47.55 E-value: 1.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 482 VSSSPPQVTSDTPASssppQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK07994 367 EPEVPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSE 442
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 562 SSPTNMTsdTPASSSPTNMTSDTPASSSPTNMTSDTPA----SSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994 443 PAAASRA--RPVNSALERLASVRPAPSALEKAPAKKEAyrwkATNPVEVKKEPVATPKALKKA 503
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
427-588 |
1.10e-04 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 47.28 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 ETPASSSPTQVTSETPASSSPTQVTS---DTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSppqvtsDTPASSSPPQVT 503
Cdd:PRK13108 280 EAPGALRGSEYVVDEALEREPAELAAaavASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVT------DEVAAESVVQVA 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 504 SDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSS--PTNMT 581
Cdd:PRK13108 354 DRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPakPDELA 433
|
....*..
gi 115583681 582 SDTPASS 588
Cdd:PRK13108 434 VAGPGDD 440
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
353-550 |
1.10e-04 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 47.48 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PRK08581 129 LNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPS---SNNTKPSTSNKQPNSPKPTQPNQSNSQ 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 433 SPTQVTSETPASSSPTQVTSDTPASNSPPQgtsdtpgFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSP 512
Cdd:PRK08581 206 PASDDTANQKSSSKDNQSMSDSALDSILDQ-------YSEDAKKTQKDYASQSKKDKTE-TSNTKNPQLPTQDELKHKSK 277
|
170 180 190
....*....|....*....|....*....|....*...
gi 115583681 513 PQVTSETPAssspPQVTSDTSASISPPQVISDTPASSS 550
Cdd:PRK08581 278 PAQSFENDV----NQSNTRSTSLFETGPSLSNNDDSGS 311
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
483-604 |
1.14e-04 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 45.33 E-value: 1.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 483 SSSPPQVTSDTpASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASS 562
Cdd:pfam09595 32 SLILIGESNKE-AALIITDIIDININKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKT 110
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 115583681 563 SPT----NMTSDTPASSSPTNMTSDTPASSS----PTNMTSDTPAS----SSPP 604
Cdd:pfam09595 111 KPSehepANPPDASNRLSPPDASTAAIREARtfrkPSTGKRNNPSSaqsdQSPP 164
|
|
| PLN03131 |
PLN03131 |
hypothetical protein; Provisional |
353-609 |
1.30e-04 |
|
hypothetical protein; Provisional
Pssm-ID: 178677 [Multi-domain] Cd Length: 705 Bit Score: 47.08 E-value: 1.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 353 SSSPPQGTLDTPSSSSPPQGTSdtpaSSSPPQGTSETPA-SNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PLN03131 335 AGSGSHASLDHFKAPVAPEAAA----PMAPPIDLFQLPAtSPAPPVDLFEIPPLDPAPAINAYQPPQTSLPSSIDLFGGI 410
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 432 SSPTQVTS---ETPASSSPTqvtSDTPASNSPPQGTSDTPGFS--SPTQVTTATLVSSSPPQVTSDTPASSSPP-QVTSD 505
Cdd:PLN03131 411 TQQQSINSldeKSPELSIPK---NEGWATFDGIQPIASTPGNEnlTPFSIGPSMAGSANFDQVPSLDKGMQWPPfQNSSD 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT-NMTSDTPASSSPT------ 578
Cdd:PLN03131 488 EESASGPAPWLGDLHNVEAPDNTSAQNWNAFEFDDSVAGIPLEGIKQSSEPQTAANMPPTaDQLIGCKALEDFNkdgikr 567
|
250 260 270
....*....|....*....|....*....|....
gi 115583681 579 ---NMTSDTPASSSPTNMTSDtPASSSPPWPVIT 609
Cdd:PLN03131 568 tapHGQGELPGLDEPSDILAE-PSYTPPAHPIME 600
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
522-603 |
1.32e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 46.56 E-value: 1.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 522 SSSPPQVTSDTSASISPPQVISDTPASSSP--PQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPA 599
Cdd:PRK10856 168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
|
....
gi 115583681 600 SSSP 603
Cdd:PRK10856 248 AADP 251
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
468-591 |
1.33e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 47.02 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 468 PGFSSPTQVT--TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTsdtsASISPPQVISDT 545
Cdd:PRK14951 366 PAAAAEAAAPaeKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP----AAAAPAAAPAAA 441
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 115583681 546 PASSSPPQVTSETPAS--SSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PRK14951 442 PAAVALAPAPPAQAAPetVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
460-636 |
1.37e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 47.15 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 460 PPQGTSDTPGFSSPTQVTTAtlVSSSPPQVTSDTPASSSPPQvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003 360 PAVTGGGAPGGGVPARVAGA--VPAPGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 540 Q---VISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTnmTSDTPASSSPTNMTSDtPASSSPPWPVITEVTRPES 616
Cdd:PRK07003 434 AtadRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSG--SASAPASDAPPDAAFE-PAPRAAAPSAATPAAVPDA 510
|
170 180
....*....|....*....|
gi 115583681 617 TIPAGRSLANITSKAQEDSP 636
Cdd:PRK07003 511 RAPAAASREDAPAAAAPPAP 530
|
|
| PLAT_RAB6IP1 |
cd01757 |
PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains ... |
1189-1241 |
1.37e-04 |
|
PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains consist of an eight-stranded beta-barrel. In Rab6IP1 this domain may participate in lipid-mediated modulation of Rab6IP1's function via its generally proposed function of mediating interaction with lipids or membrane-bound proteins.
Pssm-ID: 238855 Cd Length: 114 Bit Score: 43.30 E-value: 1.37e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1189 LGDLHGLRLWHDNSGDSPSWYVSQVIVSDMTTRKKWHFQCNCWL-------AVDLGNCER 1241
Cdd:cd01757 52 LGKLTTVQIGHDNSGLLAKWLVEYVMVRNEITGHTYKFPCGRWLgegvddgNGEDGSLER 111
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
322-539 |
1.41e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 46.42 E-value: 1.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 322 SDTPASSSPPQVTSATSASSSPPQGTSDTPAS--------SSPPQGTLDTPSSSSPPQGTsdtpASSSPPQGTSETPASN 393
Cdd:COG5651 163 ALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNpgfanlglTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFAGTGAAA 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 394 SPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:COG5651 239 GAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA 318
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:COG5651 319 AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
362-476 |
1.41e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 46.56 E-value: 1.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 362 DTPSSSSPPQGTSDTP--ASSSPPQGTSETPASNSPPQGTSetpgfSSPPQVTTATlvsssPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856 148 DQSSAELSQNSGQSVPldTSTTTDPATTPAPAAPVDTTPTN-----SQTPAVATAP-----APAVDPQQNAVVAPSQANV 217
|
90 100 110
....*....|....*....|....*....|....*..
gi 115583681 440 ETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PRK10856 218 DTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNAL 254
|
|
| PLAT_LOX |
cd01753 |
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they ... |
1131-1232 |
1.46e-04 |
|
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.
Pssm-ID: 238851 Cd Length: 113 Bit Score: 43.07 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 1131 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPhHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWYV 1210
Cdd:cd01753 3 YKVTVATGSSLFAGTDDYIYLTLVGTAGESEK-QLLDRPGYDFERGAVDEYKVKVPEDLGELLLVRLRKRKYLLFDAWFC 81
|
90 100
....*....|....*....|..
gi 115583681 1211 SQVIVSDmTTRKKWHFQCNCWL 1232
Cdd:cd01753 82 NYITVTG-PGGDEYHFPCYRWI 102
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
427-578 |
1.50e-04 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 46.48 E-value: 1.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 ETPASSSPTQVTSETPASSSPTQVTSdtpASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPA-SSSPPQVT 503
Cdd:PTZ00436 191 EDAAAAAAAKQKAAAKKAAAPSGKKS---AKAAAPAKAAAAPAKAAapPAKAAAAPAKAAAAPAKAAAPPAkAAAPPAKA 267
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 504 SDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTPA-SSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PTZ00436 268 AAPPAkAAAPPAKAAAPPakAAAPPAKAAAAPAKAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
267-500 |
1.59e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 46.42 E-value: 1.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 267 QGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsATSASSSPPQG 346
Cdd:COG5651 155 AAASAAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGP----IGLNSGPGNTG 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:COG5651 231 FAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGL 310
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 115583681 427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:COG5651 311 GAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
345-530 |
1.63e-04 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 46.21 E-value: 1.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQgtsetPASNSPPQGTS--ETPGFssppqvttatlVSSSPP 422
Cdd:PRK11901 91 NQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQ-----AAPPQTPNGQQriELPGN-----------ISDALS 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 423 QVTSETPASSSPTQvtseTPASSSPTqvtsdTPASNSPPQGTSdtPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PRK11901 155 QQQGQVNAASQNAQ----GNTSTLPT-----APATVAPSKGAK--VPATAETHPTPPQKPATKKPAVNHHKTATVAVPPA 223
|
170 180
....*....|....*....|....*....
gi 115583681 503 TSDTPASSSPPQ-VTSETPASSSPPQVTS 530
Cdd:PRK11901 224 TSGKPKSGAASArALSSAPASHYTLQLSS 252
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
430-603 |
1.76e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 46.91 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 430 ASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgfssptQVTTAtlVSSSPPQVT-SDTPasSSPPQVTSDTPA 508
Cdd:TIGR00927 55 SSQQPIKLASRDLSNDEMMMVSSDPPKSSSEMEGEMLAP------QATVG--RDEATPSIAmENTP--SPPRRTAKITPT 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 509 SSSppqvTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPA------SSSPTNMTSDTPA-SSSPTNMT 581
Cdd:TIGR00927 125 TPK----NNYSPTAAGTERVKEDTPA--TPSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKVRKyTPSPLGRM 198
|
170 180
....*....|....*....|..
gi 115583681 582 SDTPASSspTNMTSDTPASSSP 603
Cdd:TIGR00927 199 VNSYAPS--TFMTMPRSHGITP 218
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
295-461 |
1.87e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 45.28 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 295 TSDTPASSSppqvtsatsassspPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSppqGTS 374
Cdd:PHA03255 25 TSSGSSTAS--------------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST---GTT 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSSpPQGTSETPASNSPPQGTSET-PGFSSPPQVTtatlvSSSPPQVTSETPASSSPTQVTSETPASSS-----PT 448
Cdd:PHA03255 88 VTPVPTT-SNASTINVTTKVTAQNITATeAGTGTSTGVT-----SNVTTRSSSTTSATTRITNATTLAPTLSSkgtsnAT 161
|
170
....*....|...
gi 115583681 449 QVTSDTPasnSPP 461
Cdd:PHA03255 162 KTTAELP---TVP 171
|
|
| PRK12495 |
PRK12495 |
hypothetical protein; Provisional |
447-563 |
1.88e-04 |
|
hypothetical protein; Provisional
Pssm-ID: 183558 [Multi-domain] Cd Length: 226 Bit Score: 45.24 E-value: 1.88e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 447 PT---QVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV-TSDTPASSSPPQVTSETPAS 522
Cdd:PRK12495 62 PTcqqPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTsATDEAATDPPATAAARDGPT 141
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 523 SSPPQVTSDTSASISP---PQVISDTPASSSPPQVTSETPASSS 563
Cdd:PRK12495 142 PDPTAQPATPDERRSPrqrPPVSGEPPTPSTPDAHVAGTLQAAR 185
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
483-564 |
1.95e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 45.79 E-value: 1.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 483 SSSPPQVTSDTPASSSPPQVTSDTPASSSP--PQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPA 560
Cdd:PRK10856 168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
|
....
gi 115583681 561 SSSP 564
Cdd:PRK10856 248 AADP 251
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
208-572 |
1.96e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 46.91 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 208 SSAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSDTPassSPPQVTSATSASSSPpqgTSDTPASSSPPQVtsatsa 287
Cdd:TIGR00927 76 SSDPPKSSSEMEGEMLAPQATVGRD---EATPSIAMENTP---SPPRRTAKITPTTPK---NNYSPTAAGTERV------ 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 288 sssppqgTSDTPASssPPQVTSATSASSSPPQGTSDTPA------SSSPPQVTSatsasssppQGTSDTPASSSPPQGTL 361
Cdd:TIGR00927 141 -------KEDTPAT--PSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTRE---------KVRKYTPSPLGRMVNSY 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 362 DTPSSSSPPQGTSDTPASSsppQGTSETPASNS--PPQGTSETPGFSSPpqVTTATLVSSSPPQVTS--ETPASSSPTQV 437
Cdd:TIGR00927 203 APSTFMTMPRSHGITPRTT---VKDSEITATYKmlETNPSKRTAGKTTP--TPLKGMTDNTPTFLTRevETDLLTSPRSV 277
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 438 TsETPASSSPTQVTSDTPAS-------NSP--PQGT--SDTPGfSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSD 505
Cdd:TIGR00927 278 V-EKNTLTTPRRVESNSSTNhwglvgkNNLttPQGTvlEHTPA-TSEGQVTISIMTGSSPAETKASTAAwKIRNPLSRTS 355
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 506 TPA-------------SSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPaSSSPTNMTSDTP 572
Cdd:TIGR00927 356 APAvriasatfrglekNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAP-SPSPSALPPGQP 434
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
542-603 |
2.05e-04 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 44.61 E-value: 2.05e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 542 ISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
375-554 |
2.12e-04 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 46.10 E-value: 2.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSSPPQGTSETPASnsPPQGTSETPGfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTqvtsdT 454
Cdd:PTZ00436 192 DAAAAAAAKQKAAAKKAA--APSGKKSAKA-AAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAA-----P 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 455 PASNSPPQGTSDTPgfssPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA-SSSPPQVTSETPASSSppqvtsdt 532
Cdd:PTZ00436 264 PAKAAAPPAKAAAP----PAKAAAPPAKAAAPPAKAAAAPAkAAAAPAKAAAAPAkAAAPPAKAAAPPAKAA-------- 331
|
170 180
....*....|....*....|..
gi 115583681 533 sasiSPPQVISDTPASSSPPQV 554
Cdd:PTZ00436 332 ----TPPAKAAAPPAKAAAAPV 349
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
507-619 |
2.24e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 46.19 E-value: 2.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 507 PASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA----SSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam05539 169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAnqrlSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQ 248
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 115583681 583 DTPASSSP-TNMTSD----TPASSSPPWPVITEVTRPESTIP 619
Cdd:pfam05539 249 HPPSTTSQdQSTTGDgqehTQRRKTPPATSNRRSPHSTATPP 290
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
147-433 |
2.38e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.30 E-value: 2.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 147 PPQGASIWRNEF-GPGPLLPmkRRGAETERHMIPGNGPPLAMCHQPAPPElfETlcfPIDPASSAPPKATHRMTiTSLTG 225
Cdd:pfam03154 294 PPQPFPLTPQSSqSQVPPGP--SPAAPGQSQQRIHTPPSQSQLQSQQPPR--EQ---PLPPAPLSMPHIKPPPT-TPIPQ 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 226 RPQVTSDT----LASSSPPQGTSDTPAsssPPQVTSATSAsssppqgTSDTPASSSPPQVTSATsasssppQGTSDTPAS 301
Cdd:pfam03154 366 LPNPQSHKhpphLSGPSPFQMNSNLPP---PPALKPLSSL-------STHHPPSAHPPPLQLMP-------QSQQLPPPP 428
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 302 SSPPQVTsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSS 381
Cdd:pfam03154 429 AQPPVLT----------QSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSA 498
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 382 PPQGTSETPASNS---PP-QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:pfam03154 499 SVSSSGPVPAAVScplPPvQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
429-561 |
2.52e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 46.25 E-value: 2.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSEtpaSSSPTQVTSDTPASNSPPQgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPA 508
Cdd:PRK14951 366 PAAAAEAAAPAE---KKTPARPEAAAPAAAPVAQ--------AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAP 434
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 509 SSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK14951 435 AAAPAAAPAAVALAPAPPAQAAPETVAIpvrvaPEPAVASAAPAPAAAPAAARLTPTE 492
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
321-497 |
2.56e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.89 E-value: 2.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 321 TSDTPASSSPPQVTSATSASSSPPQGtSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSppqGTSETPASnsppqgts 400
Cdd:PHA03255 25 TSSGSSTASAGNVTGTTAVTTPSPSA-SGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST---GTTVTPVP-------- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 401 eTPGFSSPPQVTTatlvsssppQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:PHA03255 93 -TTSNASTINVTT---------KVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATK 162
|
170
....*....|....*..
gi 115583681 481 LVSSSPPQVTSDTPASS 497
Cdd:PHA03255 163 TTAELPTVPDERQPSLS 179
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
364-499 |
2.67e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 46.21 E-value: 2.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPqvTSETPASSSPTQVTSETPa 443
Cdd:PRK14959 367 PVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPA--PSAAPSPRVPWDDAPPAP- 443
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 444 sssptqvtsdtPASNSPPQGTSDTPGFSSPT--QVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK14959 444 -----------PRSGIPPRPAPRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHTP 490
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
425-556 |
2.68e-04 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 45.86 E-value: 2.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVts 504
Cdd:PRK12799 296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQS----ATTTQASAVALSSAGVLPSDVTLPGTVALPAA-- 369
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 115583681 505 dTPASSSPPQVTSETPASSSPPQVTSDTSasiSPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799 370 -EPVNMQPQPMSTTETQQSSTGNITSTAN---GPTTSLPAAPASNIPVSPTS 417
|
|
| PHA03193 |
PHA03193 |
tegument protein VP11/12; Provisional |
375-511 |
2.73e-04 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 177555 Cd Length: 594 Bit Score: 46.25 E-value: 2.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSS---PPQG--TSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS------SSPTQVTSETPA 443
Cdd:PHA03193 440 DSPFQRKramPEDGgeIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTSNEMkgdaecPAAQDAAAILPA 519
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 444 SSsptQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PHA03193 520 SF---QIENGGAADGSGLAIPAAM---CDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSK 581
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
487-594 |
2.90e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.92 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 487 PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVisdTPASSSPPQVtSETPASSSPTN 566
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSA---TQPAGTPPTV-SVDPPAAVPVN 438
|
90 100
....*....|....*....|....*...
gi 115583681 567 MTSDTPASSSPTNMTSDTPASSSPTNMT 594
Cdd:PRK14971 439 PPSTAPQAVRPAQFKEEKKIPVSKVSSL 466
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
350-435 |
2.96e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 45.72 E-value: 2.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 350 TPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnspPQGTSETPGFSSPPqvttatlvsssPPQVTSETP 429
Cdd:PHA03291 204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQ---AGTTPEAEGTPAPP-----------TPGGGEAPP 269
|
....*.
gi 115583681 430 ASSSPT 435
Cdd:PHA03291 270 ANATPA 275
|
|
| AF-4 |
pfam05110 |
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ... |
178-606 |
2.96e-04 |
|
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.
Pssm-ID: 461550 [Multi-domain] Cd Length: 514 Bit Score: 45.89 E-value: 2.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 178 IPGNGPPLAMCHQPAPPelfetlCFPIDPASSAPPKATHrmtitsltgrpqvtsdtlaSSSPPQGTsdtPASSSPPqvts 257
Cdd:pfam05110 75 IPKNSVPQTPQEKPDQP------FFPDKTSGLPPSFHTS-------------------SHSQPMGP---PSSSSPS---- 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 258 atsasSSPPQGTSDTPASSSPPQvtsatsasssPPQGTSDTPASSSPPQvtsatSASSSPPQGTSDTPASSSPPQVTSAT 337
Cdd:pfam05110 123 -----VSSSQSQKKSQARTEPAH----------GGHSSSGSQSSQRSQG-----QSRSKGGQESHSSSHHKRQERREDLF 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 338 SASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPpqGTSETPASNSPPQGTSETPGFSSPpqVTTATLV 417
Cdd:pfam05110 183 SCASLSHSLEELSPLLSSLSSPVKPLSPSHSRQHTGSKAQNSSDH--HGKEYSHSKSPRDSEAGSHGPESP--STSLLAS 258
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 418 SSSPPQVTSETPASSSPTQVTSETPASSSP----TQVTSDTPASNSPPQG----------------TSDTPGFSSPTQVT 477
Cdd:pfam05110 259 SSQLSSQTFPPSLPSKTSAMQQKPTAYVRPmdgqDQAPSESPELKPSPEDyhgqsygklsdlkanaKAKLSKLKIPSQPL 338
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 478 TATLVS--------------SSPPQVTS-DTPASSSP---PQVTSDTPASSSPPQ------VTSETPASSSPPQVT---- 529
Cdd:pfam05110 339 EQSLSNdvhcveeilkemthSWPPPLTAiHTPSTAEPskfPFPTKESQHVTSGYQnqkqydAPSKTLPTSQQGTSMledd 418
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 530 ---------SDTSASISPPQviSDTPASSSPPQVTSETPASSSPTNMTSdtpASSSPTNMTSDTPASSSPTNMTSDTPAS 600
Cdd:pfam05110 419 lklsssedsDDDQAPEKPPP--SSAPPSAPQSQPNSVASAHSSSGESGS---SSDSESSSESDSESESSSSDSEANEPPR 493
|
....*.
gi 115583681 601 SSPPWP 606
Cdd:pfam05110 494 SATPEP 499
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
435-572 |
3.06e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.92 E-value: 3.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 435 TQVTSETPASSSPTQVTSDT---PASNSPPQGTSDtpgfssPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPSAAAA------ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 115583681 512 PPqvtsetPASSSPPQVTSDTSAS---ISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK14971 437 VN------PPSTAPQAVRPAQFKEekkIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
431-525 |
3.09e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 45.40 E-value: 3.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 431 SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdtpgfSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:PRK10856 168 TTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP-------AVDPQQNAVV----APSQANVDTAATPAPAAPATPDGAAP 236
|
90
....*....|....*
gi 115583681 511 SPPQVTSETPASSSP 525
Cdd:PRK10856 237 LPTDQAGVSTPAADP 251
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
366-456 |
3.24e-04 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 46.00 E-value: 3.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 366 SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSET-PGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PRK11907 19 TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEARTV 98
|
90
....*....|..
gi 115583681 445 SSPTqvTSDTPA 456
Cdd:PRK11907 99 TPAA--TETSKP 108
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
377-513 |
3.36e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 45.86 E-value: 3.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 377 PASSSPPQGTSETPASNSPPQGTsetPGFSSPPQVTTATLVSSSPPQVTSET---PASSSPTQVTSETPASSSPTQVTSD 453
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 454 TPASNSPPQGTSDTPGFSSPtqvttatlvsssPPQVTSDTPASSSPPQVTSDTPASSSPP 513
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAI------------PVRVAPEPAVASAAPAPAAAPAAARLTP 490
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
455-573 |
3.99e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 45.63 E-value: 3.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 455 PASNSPPQGTSDTPGfsspTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSdtSA 534
Cdd:PRK07994 366 PEPEVPPQSAAPAAS----AQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK--AK 439
|
90 100 110
....*....|....*....|....*....|....*....
gi 115583681 535 SISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK07994 440 KSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA 478
|
|
| PHA03132 |
PHA03132 |
thymidine kinase; Provisional |
354-473 |
4.09e-04 |
|
thymidine kinase; Provisional
Pssm-ID: 222997 [Multi-domain] Cd Length: 580 Bit Score: 45.52 E-value: 4.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 354 SSPPQGTLDTPSSSSPPQGTSDTPasSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PHA03132 65 GVATSTIYTVPRPPRGPEQTLDKP--DSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLYQVNFPVILAPIGES 142
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 115583681 434 PTqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:PHA03132 143 NS--SSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKH 180
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
399-530 |
4.12e-04 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 45.09 E-value: 4.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 399 TSETPGFSSPPqvTTATLVSSSPPQVTSETPASS---SPTQVTSETPASSSPTQVTSDTPASnsppqgtSDTPGFSSPTQ 475
Cdd:PRK12799 296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSPAvipSSVTTQSATTTQASAVALSSAGVLP-------SDVTLPGTVAL 366
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 476 VTTATLVSSSPPQVTSDTPASSSppqvTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK12799 367 PAAEPVNMQPQPMSTTETQQSST----GNITSTANGPTTSLPAAPASNIPVSPTS 417
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
469-647 |
4.50e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.25 E-value: 4.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 469 GFSSPTQVTTATLVSSSPPQV--TSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPP--------QVTSDTSASIS- 537
Cdd:PRK12323 370 GGAGPATAAAAPVAQPAPAAAapAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApealaaarQASARGPGGAPa 449
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 538 PPQVISDTPASSSPPQVTS-ETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWP--VITEVTRP 614
Cdd:PRK12323 450 PAPAPAAAPAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAgwVAESIPDP 529
|
170 180 190
....*....|....*....|....*....|...
gi 115583681 615 ESTIPAGRSLANITSKAQEDSPLGVISTHPQMS 647
Cdd:PRK12323 530 ATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
204-410 |
4.79e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 45.36 E-value: 4.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 204 IDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvts 283
Cdd:PRK07764 588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD--- 664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 284 atSASSSPPQGTSDTPASSSPPQVtsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDT 363
Cdd:PRK07764 665 --GGDGWPAKAGGAAPAAPPPAPA-----------PAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 115583681 364 PSSSS---PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ 410
Cdd:PRK07764 732 SPAADdpvPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
460-577 |
4.84e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 44.95 E-value: 4.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 460 PPQGT-SDTPGFSSPTQvttatlvSSSPPQVTSDTPaSSSPPQVTsdTPASSSPPQVTSETPASSSPPQVTSDTSASISP 538
Cdd:PHA03291 167 PAEGTlAAPPLGEGSAD-------GSCDPALPLSAP-RLGPADVF--VPATPRPTPRTTASPETTPTPSTTTSPPSTTIP 236
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 115583681 539 PQVISDTPASSSPPQVTSETPASSSP-TNMTSDTPASSSP 577
Cdd:PHA03291 237 APSTTIAAPQAGTTPEAEGTPAPPTPgGGEAPPANATPAP 276
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
442-578 |
4.89e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 45.09 E-value: 4.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 442 PASSSPTQVTSDTPASNSPPQGTsdtPGFSSPTQVTTATLVSSSPPQVTSdtpASSSPPqvtsdTPASSSPPQVTSETPA 521
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAAS---APAAPP-----AAAPPAPVAAPAAAAP 434
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 522 SSSPPQVTSdtSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK14951 435 AAAPAAAPA--AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
378-468 |
4.99e-04 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 45.23 E-value: 4.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 378 ASSSPPQGTSETPASNSPPQGTSETPgfSSPPQVTTAT---LVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:PRK11907 18 LTASNPKLAQAEEIVTTTPATSTEAE--QTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEA 95
|
90
....*....|....
gi 115583681 455 PAsnSPPQGTSDTP 468
Cdd:PRK11907 96 RT--VTPAATETSK 107
|
|
| PRK13042 |
PRK13042 |
superantigen-like protein SSL4; Reviewed; |
428-518 |
5.03e-04 |
|
superantigen-like protein SSL4; Reviewed;
Pssm-ID: 183854 [Multi-domain] Cd Length: 291 Bit Score: 44.62 E-value: 5.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPpqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:PRK13042 18 TGVITTTTQAANATTPSSTKVEAPQSTPPSTKV----------EAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETP 87
|
90
....*....|.
gi 115583681 508 ASSSPPQVTSE 518
Cdd:PRK13042 88 QSPTTKQVPTE 98
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
390-530 |
5.07e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 45.24 E-value: 5.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 390 PASNSPPQGTSetpgfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PRK07994 366 PEPEVPPQSAA-----PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 470 fSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASS--SPPQVTSETPASSSPPQVTS 530
Cdd:PRK07994 441 -SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkATNPVEVKKEPVATPKALKK 502
|
|
| CytochromB561_N |
pfam09786 |
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ... |
350-558 |
5.50e-04 |
|
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.
Pssm-ID: 462899 Cd Length: 579 Bit Score: 45.20 E-value: 5.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 350 TPASSSPPQGTLDTPSssspPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFS--SPPQVTTATLVSSSPPQVTSE 427
Cdd:pfam09786 129 PPKSKSSPQSPSPVLV----PLHQSVSPSSSESRKGGDKSPAGSGKKLRSFSTSSKSpaSPSVYLRGSPVPLNSSPLPSD 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSpPQVTSDTP 507
Cdd:pfam09786 205 RNYENSVQSSPEIDSAVSTPWSRKRATIGKEIRTEKMLERFLAEVDEKITESAFGKASPSNVSGSANRSGS-TRSTPLRS 283
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 508 ASSSPPQVTSETPASSSPpqvtSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:pfam09786 284 VRMSPGSQKFTTPPKKGE----GDLPSPMSMEENIEAFENLGIYPQIEQWR 330
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
357-501 |
5.54e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.15 E-value: 5.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 357 PQGTLDTPSSSSPPQGTSDT---PASSSPPQgtseTPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQVtSETPASSS 433
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPS----AAAAASPSPSQSSAA--AQPSAPQSATQPAGTPPTV-SVDPPAAV 435
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 434 PTQVTSETPASSSPTQVTSDTPasNSPPQGTSDTPGFSSPTQVTTAtlvsssppQVTSDTPASSSPPQ 501
Cdd:PRK14971 436 PVNPPSTAPQAVRPAQFKEEKK--IPVSKVSSLGPSTLRPIQEKAE--------QATGNIKEAPTGTQ 493
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
396-540 |
5.64e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.15 E-value: 5.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 396 PQGTSETPGFSSPPQVTT-ATLVSSSPPQVTSETPASSSPTQvTSETPASSSPTQVtsdTPASNSPPQGTSDTPgfsSPT 474
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKpVFTQPAAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPP---AAV 435
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 475 QVTTAtlvSSSPPQVtsdTPASSSPPQVTSDTPASSSPPQVTSetPASSSPPQVTSDTSASISPPQ 540
Cdd:PRK14971 436 PVNPP---STAPQAV---RPAQFKEEKKIPVSKVSSLGPSTLR--PIQEKAEQATGNIKEAPTGTQ 493
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
346-460 |
7.10e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 44.67 E-value: 7.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 346 GTSDTPASSSPPQGTLDTPSSSSPpQGTSdtPASS-SPPQGTSETPASNSPPqgTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:PRK14959 382 SGSAAEGPASGGAATIPTPGTQGP-QGTA--PAAGmTPSSAAPATPAPSAAP--SPRVPWDDAPPAPPRSGIPPRPAPRM 456
|
90 100 110
....*....|....*....|....*....|....*.
gi 115583681 425 TSETPASSSPTQVTSEtpASSSPTQVTSDTPASNSP 460
Cdd:PRK14959 457 PEASPVPGAPDSVASA--SDAPPTLGDPSDTAEHTP 490
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
354-502 |
7.23e-04 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 44.17 E-value: 7.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 354 SSPPQGTlDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPgfssPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PTZ00436 208 AAAPSGK-KSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP----PAKAAAPPAKAAAPPAKAAAPPAKAA 282
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 434 ptqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PTZ00436 283 ----APPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAapPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
269-454 |
7.34e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 43.35 E-value: 7.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 269 TSDTPASSSppqvtsatsassspPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqVTSATSASSSPPQGTS 348
Cdd:PHA03255 25 TSSGSSTAS--------------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTT---AILSTNTTTVTSTGTT 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 DTPASSSPPQGTLDTPSSSsppqgTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTset 428
Cdd:PHA03255 88 VTPVPTTSNASTINVTTKV-----TAQNITATEAGTGTSTGVTSNVTTRSSSTT---SATTRITNATTLAPTLSSKG--- 156
|
170 180
....*....|....*....|....*.
gi 115583681 429 paSSSPTQVTSETPASSSPTQVTSDT 454
Cdd:PHA03255 157 --TSNATKTTAELPTVPDERQPSLSY 180
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
383-514 |
7.66e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 44.77 E-value: 7.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 383 PQGTSETPASNSPPQGTSetPGFS---SPPQVTTATLVSSSPPQvTSETPASSSPTQVtseTPASSSPTQVTSDTP-ASN 458
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPPaAVP 436
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 459 SPPQGT---SDTPGFSSPTQVTTATLVSSSPPQVTSdtPASSSPPQVTSDTPASSSPPQ 514
Cdd:PRK14971 437 VNPPSTapqAVRPAQFKEEKKIPVSKVSSLGPSTLR--PIQEKAEQATGNIKEAPTGTQ 493
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
295-526 |
8.14e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 44.11 E-value: 8.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 295 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtSATSASSSPPQGTSDTPASSSPPQGTldtpSSSSPPQGTS 374
Cdd:COG5651 162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPG-----FANLGLTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFA 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:COG5651 233 GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGA 312
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:COG5651 313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAA-AAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
500-602 |
8.20e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 44.38 E-value: 8.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 500 PQVTSDTPASSSPPQVTSET---PASSSPPQVTSDTSASispPQVISDTPASSSPPQVTseTPASSSPTNMTSdtPASSS 576
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPSAAAAASPS---PSQSSAAAQPSAPQSAT--QPAGTPPTVSVD--PPAAV 435
|
90 100
....*....|....*....|....*.
gi 115583681 577 PTNMTSDTPASSSPTNMTSDTPASSS 602
Cdd:PRK14971 436 PVNPPSTAPQAVRPAQFKEEKKIPVS 461
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
443-552 |
8.81e-04 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 44.46 E-value: 8.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 443 ASSSPTQVTSDTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSppqVTSDTPASSSPPQVTSDTPASSSPpqvtsETPAS 522
Cdd:PRK11907 18 LTASNPKLAQAEEIVTTTP---------ATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPSSS-----ETAET 80
|
90 100 110
....*....|....*....|....*....|
gi 115583681 523 SSPPQVTSDTSASISPPQVISDTpaSSSPP 552
Cdd:PRK11907 81 SDPTSEATDTTTSEARTVTPAAT--ETSKP 108
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
404-526 |
9.91e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 44.32 E-value: 9.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 404 GFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:PRK14951 369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALA 448
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 115583681 484 SSPPQVTSDTPAsSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:PRK14951 449 PAPPAQAAPETV-AIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
487-620 |
1.03e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.09 E-value: 1.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 487 PQVTSDTPASssPPQVTSDTPASssppQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK07994 361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG 434
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 567 mtSDTPASSSPTNMTSDTPASSSPTNMTSDTP-ASSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994 435 --ATKAKKSEPAAASRARPVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPV 487
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
434-560 |
1.03e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.09 E-value: 1.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 434 PTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS----DTPAS 509
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRaqgaTKAKK 440
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 510 SSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQ-------VTSETPA 560
Cdd:PRK07994 441 SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATnpvevkkEPVATPK 498
|
|
| PHA03193 |
PHA03193 |
tegument protein VP11/12; Provisional |
347-476 |
1.11e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 177555 Cd Length: 594 Bit Score: 43.94 E-value: 1.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSsppQGTLDTPSSSsppqGTSDTPASSSPPQGTSETPASNSPPQGTsETPGFSSPPQVTTATLVSSSppQVTS 426
Cdd:PHA03193 456 IHEALANN---GQAIFPECFS----GDLPPIAQALLSADELPNDTTASTSNEM-KGDAECPAAQDAAAILPASF--QIEN 525
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 115583681 427 ETPASSSPTQVTSetpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PHA03193 526 GGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKAT 572
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
375-512 |
1.12e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 43.79 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 375 DTPASSSPPQGTSETPASNSPpqgtSETPGFSSPPQvttatlvSSSPPQVTSETPaSSSPTQVTseTPASSSPTQVTSDT 454
Cdd:PHA03291 152 GATNASLFPLGLAAFPAEGTL----AAPPLGEGSAD-------GSCDPALPLSAP-RLGPADVF--VPATPRPTPRTTAS 217
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPAsSSPPQVTSDTPASSSP 512
Cdd:PHA03291 218 PETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA-PPTPGGGEAPPANATP 274
|
|
| PHA03193 |
PHA03193 |
tegument protein VP11/12; Provisional |
433-583 |
1.13e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 177555 Cd Length: 594 Bit Score: 43.94 E-value: 1.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 433 SPTQVTSETPASSSPTqvtSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSD-----TPASSSPPQ-VTSDT 506
Cdd:PHA03193 441 SPFQRKRAMPEDGGEI---HEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTsnemkGDAECPAAQdAAAIL 517
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 507 PASSsppQVTSETPASSSPPQVTSDtsaSISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT------NM 580
Cdd:PHA03193 518 PASF---QIENGGAADGSGLAIPAA---MCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeilrrlRM 591
|
...
gi 115583681 581 TSD 583
Cdd:PHA03193 592 ASD 594
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
485-609 |
1.14e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.93 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPASSSP 564
Cdd:PRK14951 372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAA--PPAPVAAPAAAAPAAAPAAAPAAVALAP 449
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 115583681 565 TNMTSDTPASSSPTNMTSDTPASSSPtnmtSDTPASSSPPWPVIT 609
Cdd:PRK14951 450 APPAQAAPETVAIPVRVAPEPAVASA----APAPAAAPAAARLTP 490
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
206-449 |
1.14e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 43.91 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 206 PASSAPPKATHRMTitsltgRPQvtsdtlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP------- 278
Cdd:pfam03546 246 PAAATPAQAKPALK------TPQ------TKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAvargaqr 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 279 PQVTSATSASSSPPQGTSDTPASSSPPQV-------TSATSASSSPPQGTSDTPASSSPP---QVTSATSASSSPPQGTS 348
Cdd:pfam03546 314 PEEDSSSSEESESEEETAPAAAVGQAKSVgkglqgkAASAPTKGPSGQGTAPVPPGKTGPavaQVKAEAQEDSESSEEES 393
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 349 D------TPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-------PQGTSETPASNSPPQGTSETPGFSSPPQ----- 410
Cdd:pfam03546 394 DseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAPgkvvaaaAQAKQGSPAKVKPPARTPQNSAISVRGQasvpa 473
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 411 ----VTTATLVSSSPPQVTSETPASSS----------PTQV-----TSETPASSSPTQ 449
Cdd:pfam03546 474 vgkaVATAAQAQKGPVGGPQEEDSESSeeesdseeeaPAQAkpsgkTPQVRAASAPAK 531
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
363-565 |
1.19e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 44.05 E-value: 1.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 363 TPSSSSPPQGTSDTPASSSP--PQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPPQVTSETPASSSPTqvtse 440
Cdd:PRK14086 89 DPSAGEPAPPPPHARRTSEPelPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTA--RPAYPAYQQRPEPGAWPRA----- 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 441 tPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPpqvtsetP 520
Cdd:PRK14086 162 -ADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPP-------P 233
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 115583681 521 ASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK14086 234 GAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPT 278
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
345-444 |
1.23e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 43.92 E-value: 1.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSP---------PQGTLDTPSSSSPPQ---GTSDTPASSSPPQGTSETPASNSPPQGTSE--TPGFSSP-- 408
Cdd:PLN02217 545 QGDAWIPGKGVPyipglfagnPGSTNSTPTGSAASSnttFSSDSPSTVVAPSTSPPAGHLGSPPATPSKivSPSTSPPas 624
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 115583681 409 ----PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PLN02217 625 hlgsPSTTPSSPESSIKVASTETASPESSIKVASTESSVS 664
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
419-512 |
1.30e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 43.48 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 419 SSPPQVTSETPASSSPTQVtseTPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PRK10856 161 SVPLDTSTTTDPATTPAPA---APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPL 237
|
90
....*....|....
gi 115583681 499 PPQVTSDTPASSSP 512
Cdd:PRK10856 238 PTDQAGVSTPAADP 251
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
471-600 |
1.31e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 43.95 E-value: 1.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 471 SSPTQVTTATLVSS-----SPPQVTSDTPASSSP-PQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVI 542
Cdd:PHA03269 21 NLNTNIPIPELHTSaatqkPDPAPAPHQAASRAPdPAVAPTSAASRKPDLAQAPTPAASEKfdPAPAPHQAASRAPDPAV 100
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 543 SDTPASSSPPQvtsetpASSSPTNMTSDTPASS-SPTNMTSDTPassSPTNMTSDTPAS 600
Cdd:PHA03269 101 APQLAAAPKPD------AAEAFTSAAQAHEAPAdAGTSAASKKP---DPAAHTQHSPPP 150
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
425-486 |
1.31e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 42.30 E-value: 1.31e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSP 486
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
|
|
| PHA03193 |
PHA03193 |
tegument protein VP11/12; Provisional |
409-570 |
1.42e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 177555 Cd Length: 594 Bit Score: 43.55 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 409 PQVTTATLVSSSPPqvTSETPASSSPTQVTSETPASSSPtqvtsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQ 488
Cdd:PHA03193 442 PFQRKRAMPEDGGE--IHEALANNGQAIFPECFSGDLPP-------IAQALLSADELPNDTTASTSNEMKGDAECPAAQD 512
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 489 VTSDTPASSsppQVTSDTPASSSPPQVTSetpASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--- 565
Cdd:PHA03193 513 AAAILPASF---QIENGGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeil 586
|
....*...
gi 115583681 566 ---NMTSD 570
Cdd:PHA03193 587 rrlRMASD 594
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
476-627 |
1.45e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 43.56 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 476 VTTATLVSSSppqvTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQV-ISDTPASSSP--P 552
Cdd:PHA03269 9 IITIACINLI----IANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLaQAPTPAASEKfdP 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 553 QVTSETPASSSPTNMTSDTPAsSSPTNMTSDTPASSS---PTNMTSDTPASSSPPWPVITEVTRPESTIpAGRSLANI 627
Cdd:PHA03269 85 APAPHQAASRAPDPAVAPQLA-AAPKPDAAEAFTSAAqahEAPADAGTSAASKKPDPAAHTQHSPPPFA-YTRSMEHI 160
|
|
| NupH_GANP |
pfam16768 |
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ... |
371-589 |
1.53e-03 |
|
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.
Pssm-ID: 435572 [Multi-domain] Cd Length: 292 Bit Score: 42.97 E-value: 1.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 371 QGTSDTPASSSPPQGTSETPAS---NSPP------QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSptqVTSET 441
Cdd:pfam16768 9 QQPSAFSTSSSPSTGTFQAKPPfrfGQPSlfgqnnTLSGKNSGFSQVSSFPTTSGVSHSSSGQTLGFTQTSG---VGLFS 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 442 PASSSPTQVTSDTPASNSPPQgtsdTPGFS--SPTQV----------TTATLVSSS----------PPQVTSDTP---AS 496
Cdd:pfam16768 86 GLEHTPSFVATSGPSSSSVPS----NPGFSfkSPTNLgafpststfgPESGEVASSgfgktefsfkPPENAVFRPifgAE 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 497 SSP----PQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASispPQVISDTPASSSPPQvTSETPASSSPT---NMTS 569
Cdd:pfam16768 162 SEPektqSQITSGFFTFSHPVSSGPGGLAPFSFSQVTSSSATS---SNFTFSKPVSSNNSS-SAFAPALSSQNveeEKRG 237
|
250 260
....*....|....*....|
gi 115583681 570 DTPASSSPTNMTSDTPASSS 589
Cdd:pfam16768 238 PKSFFGSSNSSFTSFPNSSS 257
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
484-655 |
1.59e-03 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 43.13 E-value: 1.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 484 SSPPQVTSDTPASSSPPQVTS----DTPASSSPPQVTSETPASSSPPQVTSD--TSASISPPQVISDTPASSSPPQVTSE 557
Cdd:PRK11901 55 GSALKSPTEHESQQSSNNAGAekniDLSGSSSLSSGNQSSPSAANNTSDGHDasGVKNTAPPQDISAPPISPTPTQAAPP 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 558 TPASSS-----PTNMTS----------------DTPASSSPTnmtsdTPASSSPTNMTSDTPASSSPPWPVITE-----V 611
Cdd:PRK11901 135 QTPNGQqrielPGNISDalsqqqgqvnaasqnaQGNTSTLPT-----APATVAPSKGAKVPATAETHPTPPQKPatkkpA 209
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 115583681 612 TRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQ---SSTSQA 655
Cdd:PRK11901 210 VNHHKTATVAVPPATSGKPKSGAASARALSSAPASHYTlqlSSASRS 256
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
418-499 |
1.60e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 43.09 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 418 SSSPPQVTSETPA--SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PRK10856 168 TTTDPATTPAPAApvDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
|
....
gi 115583681 496 SSSP 499
Cdd:PRK10856 248 AADP 251
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
391-485 |
1.85e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 42.71 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 391 ASNSPPQGTSE--TPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT--QVTSETPASSSPTQVTSDTPASNSPPQGTSD 466
Cdd:PRK10856 152 AELSQNSGQSVplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPapAVDPQQNAVVAPSQANVDTAATPAPAAPATP 231
|
90 100
....*....|....*....|
gi 115583681 467 TPGFSSPT-QVTTATLVSSS 485
Cdd:PRK10856 232 DGAAPLPTdQAGVSTPAADP 251
|
|
| Caprin-1_C |
pfam12287 |
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ... |
322-553 |
1.86e-03 |
|
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteristic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.
Pssm-ID: 463522 [Multi-domain] Cd Length: 320 Bit Score: 42.86 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 322 SDTPASSSP--PQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS---SSPPQGTSDTPASSSPPQGTSETPASNSPP 396
Cdd:pfam12287 32 SAQPPSQSPdlSQMVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSnacASSGSEYQFHTSEPPQPEAIDPIQSSMSLP 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 397 qgtsetpgfSSPPQvTTATLVSSSPPQVTSETPASSSPTQVtsetpaSSSPTQVTSDTPASNSP-PQGTSDTPGFSSPTQ 475
Cdd:pfam12287 112 ---------SELAP-PSPPLSPASQPQVFQSKPASSSGINV------NAAPFQSMQTVFNVNAPvPPRNEQELKESSQYS 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 476 VTTATLVSSSPPQvtsdtpassSPPQvtSDTPASssppQVTSETPASSSPPQVTSDTSASIS--PPQvisdTPASSSPPQ 553
Cdd:pfam12287 176 SGYNQSFSSQSTQ---------TVPQ--CQLPSE----QLEQTVVGAYHPDGTIQVSNGHLAfyPAQ----TNGFPRPPQ 236
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
414-552 |
1.88e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 43.32 E-value: 1.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdt 493
Cdd:PRK07994 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVP---PPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK-- 437
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAS-ISPPQVISDTPASSSPP 552
Cdd:PRK07994 438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYrWKATNPVEVKKEPVATP 497
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
397-662 |
1.91e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.46 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 397 QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS---SPTQVTSET-PASSSPTQVTSDTPA-SNSPPQGTSDTPGFS 471
Cdd:NF033849 247 VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTrgwSHTQSTSESeSTGQSSSVGTSESQShGTTEGTSTTDSSSHS 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 472 SPTQVTTATLVSSSpPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDT-------SASISPPQVISD 544
Cdd:NF033849 327 QSSSYNVSSGTGVS-SSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSsgvsggfSGGIAGGGVTSE 405
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 545 TPASSsppQVTSET-PASSSPTNMTSDTPASSSPTNMTSDTPASS-SPTNMTSDTpASSSPPWPVITEVTRPESTipaGR 622
Cdd:NF033849 406 GLGAS---QGGSEGwGSGDSVQSVSQSYGSSSSTGTSSGHSDSSShSTSSGQADS-VSQGTSWSEGTGTSQGQSV---GT 478
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 115583681 623 SLANITSKAQEDSPLGVISTHPQMSFQSSTSQALDETAGE 662
Cdd:NF033849 479 SESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGT 518
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
494-650 |
1.91e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.16 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPQVTSdtpASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT---PASSSPPQVTSETPASSSPTNMTSD 570
Cdd:PRK14951 366 PAAAAEAAAPA---EKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 571 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT--EVTRPESTIPAGRSLANITSKAQEDSPLGVISthpQMSF 648
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAapAAARLTPTEEGDVWHATVQQLAAAEAITALAR---ELAL 519
|
..
gi 115583681 649 QS 650
Cdd:PRK14951 520 QS 521
|
|
| PRK14948 |
PRK14948 |
DNA polymerase III subunit gamma/tau; |
325-532 |
1.98e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237862 [Multi-domain] Cd Length: 620 Bit Score: 43.41 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 325 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPG 404
Cdd:PRK14948 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEE 443
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 405 -----FSSPPQVTT-------ATLVSSSPPQVT------------SETP--------ASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK14948 444 lwqqiLAKLELPSTrmllsqqAELVSLDSNRAViavspnwlgmvqSRKPlleqafakVLGRSIKLNLESQSGSASNTAKT 523
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 453 DTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvtsdtPASSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PRK14948 524 PPPPQKSPPPPAP-TPPLPQPTATAPPPTPPPPPPTATQASSNAPAQI------PADSSPPPPIPEEPTPSPTKDSSPEE 596
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
513-666 |
2.13e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 43.23 E-value: 2.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 513 PQVTSETPASSSPPQVTSD-TSASISPPQVISDTPASSSPPQvTSETPASSSPTNMTsdTPASSSPTNMTSdtPASSSPT 591
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPvFTQPAAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSAT--QPAGTPPTVSVD--PPAAVPV 437
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 592 NMTSDTPASSSPPwpviteVTRPESTIPagrslaniTSKAqedSPLGVISTHPQmsfQSSTSQALDETAGERVPT 666
Cdd:PRK14971 438 NPPSTAPQAVRPA------QFKEEKKIP--------VSKV---SSLGPSTLRPI---QEKAEQATGNIKEAPTGT 492
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
489-623 |
2.15e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 43.02 E-value: 2.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSS-PPQVTSdTSASISPPQVIsdTPASSSPPQVTSETPASSSPTNM 567
Cdd:PHA03291 150 VEGATNASLFPLGLAAFPAEGTLAAPPLGEGSADGScDPALPL-SAPRLGPADVF--VPATPRPTPRTTASPETTPTPST 226
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 115583681 568 TSDTPASSSPTNMTSDTPASSSPTNMTSDTPAsssPPWPVITEvTRPESTIPAGRS 623
Cdd:PHA03291 227 TTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA---PPTPGGGE-APPANATPAPEA 278
|
|
| PLN02983 |
PLN02983 |
biotin carboxyl carrier protein of acetyl-CoA carboxylase |
430-604 |
2.18e-03 |
|
biotin carboxyl carrier protein of acetyl-CoA carboxylase
Pssm-ID: 215533 [Multi-domain] Cd Length: 274 Bit Score: 42.13 E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 430 ASSSPTQVTSETPASSSPTQvTSDTPASNSPPQgtsdtPGFSSPTQ----------VTTATL----VSSSPPQVTSDTPA 495
Cdd:PLN02983 1 MASLSVPCAKTAAAAANVGS-RLSRSSFRLQPK-----PNISFPSKgpnpkrsavpKVKAQLnevaVDGSSNSAKSDDPK 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 496 SSSPPQVTSDTPA--SSSPPQVTSETPASSSPPQVTS-----DtSASISPPQ--------VISDTPASSSPPqvtseTPA 560
Cdd:PLN02983 75 SEVAPSEPKDEPPsnSSSKPNLPDEESISEFMTQVSSlvklvD-SRDIVELQlkqldcelVIRKKEALPQPP-----PPA 148
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 561 S---SSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPA----SSSPP 604
Cdd:PLN02983 149 PvvmMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAkapkSSHPP 199
|
|
| CytochromB561_N |
pfam09786 |
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ... |
417-577 |
2.28e-03 |
|
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.
Pssm-ID: 462899 Cd Length: 579 Bit Score: 42.89 E-value: 2.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 417 VSSSPPQVTSETPASSSPTQVTS---ETPASSSPTQVTSDTPASNSPPQGTSDTPGfssptqvttatlvssspPQVTSDT 493
Cdd:pfam09786 89 VQSKSPSKGTKTPSRLTNQQLGLlglKPNDSSFVTTHRKKPPKSKSSPQSPSPVLV-----------------PLHQSVS 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 494 PASSSPPQVTSDTPASSSPPQVTSETPASSSppqvtsdtsasISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:pfam09786 152 PSSSESRKGGDKSPAGSGKKLRSFSTSSKSP-----------ASPSVYLRGSPVPLNSSPLPSDRNYENSVQSSPEIDSA 220
|
....
gi 115583681 574 SSSP 577
Cdd:pfam09786 221 VSTP 224
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
529-590 |
2.33e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 41.53 E-value: 2.33e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 529 TSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 590
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
438-499 |
2.67e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 41.15 E-value: 2.67e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 438 TSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
|
|
| PAP1 |
pfam08601 |
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene ... |
272-470 |
2.74e-03 |
|
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene transcription in response to H2O2. This region is cysteine rich. Alkylation of cysteine residues following treatment with a cysteine alkylating agent can mask the accessibility of the nuclear exporter Crm1, triggering nuclear accumulation and Pap1 dependent transcriptional expression.
Pssm-ID: 369990 Cd Length: 363 Bit Score: 42.54 E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 272 TPASSSPPqvtsatsasssPPQGTSDTPASSSPPQVTSATSASSsppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTP 351
Cdd:pfam08601 53 PNASTSTP-----------DSQPPPSASSSTTPNQGSNGLNAFT----GEDNNNYSNSAANPGATRGSTASSARSQSSPY 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 352 ASSSPPQGTLDTPSSSSPPQGTsdtPASSSppqGTSETPASNSPPQGTSetpgfssppqvTTATLVSSSPPQVTSETPAS 431
Cdd:pfam08601 118 SFGSGTSTSSDSPSSSSSSHQG---QLSSC---GTSPEPSTQSPGGQKS-----------VETMIGEEQCAHGTIDGEKS 180
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 115583681 432 S-SPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF 470
Cdd:pfam08601 181 FcAKLGMACGNINNPIPAAMSKSNSLSNTPGHASNDSNGL 220
|
|
| CTD |
smart01104 |
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ... |
426-535 |
2.74e-03 |
|
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.
Pssm-ID: 215026 [Multi-domain] Cd Length: 121 Bit Score: 39.81 E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:smart01104 12 SKTPAWGSRTPGTAAGGAPTARGGSGSRTPAWGGAGSRTP-AWGGAGPTGSRTPAWGGASAWGNKSSEGSASSWAAGPGG 90
|
90 100 110
....*....|....*....|....*....|..
gi 115583681 506 TPASSSP--PQVTSeTPASSSPPQVTSDTSAS 535
Cdd:smart01104 91 AYGAPTPgyGGTPS-AYGPATPGGGAMAGSAS 121
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
482-596 |
3.01e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differs between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 41.15 E-value: 3.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 482 VSSSPPQ--VTSDTPASSSPPQVTSDTPASSSPpqvtseTPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETP 559
Cdd:cd21441 35 VKAEPPEdsLSTDHFQTQTEPVDLSINKARTSP------TAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSS 108
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 560 ASSSPTNMTSDTPASSSPT-------NMTSDTPaSSSPTNMTSD 596
Cdd:cd21441 109 ASSVPTVLTPGPLVASASGvggqqflHIIHPVP-PSSPMNLQSN 151
|
|
| PHA03249 |
PHA03249 |
DNA packaging tegument protein UL25; Provisional |
429-561 |
3.02e-03 |
|
DNA packaging tegument protein UL25; Provisional
Pssm-ID: 223023 Cd Length: 653 Bit Score: 42.69 E-value: 3.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTsDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS--PPQVTSDT 506
Cdd:PHA03249 33 PRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSET-DSGSEAEAERGRRAGMGGRNKATKPSRRNKTTQcrPTSLALAT 111
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583681 507 PASSSPPQVTSETPASSSPPQVTSDTS-------ASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PHA03249 112 AATMPATPSSGKSPKVSSPPSIPSLSEedegaerNSGGDDSSHTDNESTQSQPEADDEPDLA 173
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
155-601 |
3.06e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 42.81 E-value: 3.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 155 RNEFGPGPLLPMKRRGAETERHMIPGNGPPLAMCHQPAPPELFetlcFPIDPASSAPPKATHRMTITSLTGRpQVTSDTL 234
Cdd:COG5099 1 PNSDTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTPNSF----SPIPSKASSSATFTLNLPINNSVNH-KITSSSS 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 235 ASSSPPqgtsdTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtsatsas 314
Cdd:COG5099 76 SRRKPS-----GSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSST----------- 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 315 sspPQGTSDTPASSSPPQVT--SATSASSSPPQGTSDTPASSSPPQGtldtPSSSSPPQGTSDTpaSSSPPQGTSETPAS 392
Cdd:COG5099 140 ---ASSMFNSNKLPLPNPNHsnSATTNQSGSSFINTPASSSSQPLTN----LVVSSIKRFPYLT--SLSPFFNYLIDPSS 210
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 393 NSPpqgTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSpTQVTSETPASSSPTQVTSDTPASNSP---PQGTSDTPG 469
Cdd:COG5099 211 DSA---TASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQS-VENNIILNSSSSINELTSIYGSVPSIrnlRGLNSALVS 286
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDtpaSSSPPQVTSETPASSSPPQVTSdtsaSISPPQVISdtPASS 549
Cdd:COG5099 287 FLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPK---SSPNNLLTEILTTGVNPPQSLP----SLLNPVFLS--TSTG 357
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 115583681 550 SPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASS 601
Cdd:COG5099 358 FSLTNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSESTRNILGNISPN 409
|
|
| Mating_C |
pfam12737 |
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ... |
167-534 |
3.29e-03 |
|
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.
Pssm-ID: 372279 [Multi-domain] Cd Length: 412 Bit Score: 42.28 E-value: 3.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 167 KRRGAETERHMIPGNGPPLAMChqpAPPELFETLCFPIDPASSAP-----PKATHRMTITSLT---------GRPQVTSD 232
Cdd:pfam12737 50 KKRKRQAERSMRDALAYPSPER---SPASSPERNLSPQVDVCQLTirqnnLNLKRRSSSSSDVdssnaerchKRPRLDSP 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 233 tlASSSPPQGTSDTPASSSPPQVTSATSAsssppqgTSDTPASSSPPQVTSATSASSSPPQGTSD-TPASSSPPqvtsaT 311
Cdd:pfam12737 127 --SSSSSPEKCLPSPAPSEQEALSEISAA-------CGPTPSTLTPLNVAPSLTPSKKRKRCLSDgFDGPKRPP-----N 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 312 SASSSPPQGTSDT-PASSSPPQVtsatsasssppqgtSDTPASSSPPQGTLdtpSSSSPPQGTSDTPASSSPpqgtSETP 390
Cdd:pfam12737 193 KRVQPRPQTVSDPfPTSTSIPEW--------------DEWLQNHMSPSLTL---HGDIPPPVSVEAPDSNTP----LDIE 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 391 ASNSPPQgtsetpgfsspPQVTTATLVSSSPPQVTSETPASSSPtqVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF 470
Cdd:pfam12737 252 IFNFPYH-----------PDLTPSPAPSLSDSVIEVATPTTESD--YMCNGTLRQTFSWFEFDFPELIQPTNTPASNNEL 318
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 471 SSPTQVTTATLVSSSPPQV------TSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSA 534
Cdd:pfam12737 319 SLPFDPSTDIVVSRTILPLldwrsqSFLSQTFASPPHSILRSNSSSPDVSAFALDLTPAFTPITYSLSES 388
|
|
| NupH_GANP |
pfam16768 |
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ... |
354-524 |
3.33e-03 |
|
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.
Pssm-ID: 435572 [Multi-domain] Cd Length: 292 Bit Score: 41.82 E-value: 3.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 354 SSPPQGTLDTPSSSSPPQGTSDTPA---SSSPPQGTSETPASNSPPQGTSETPGFSSPPqvTTATLVSSSPPQVTSETPA 430
Cdd:pfam16768 56 SSFPTTSGVSHSSSGQTLGFTQTSGvglFSGLEHTPSFVATSGPSSSSVPSNPGFSFKS--PTNLGAFPSTSTFGPESGE 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 431 SSSPTQVTSETPASSSPTQVTSDTPASNSPP-----QGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqVTSD 505
Cdd:pfam16768 134 VASSGFGKTEFSFKPPENAVFRPIFGAESEPektqsQITSGFFTFSHPVSSGPGGLAPFSFSQVTSSSATSSN---FTFS 210
|
170
....*....|....*....
gi 115583681 506 TPASSSPPQvTSETPASSS 524
Cdd:pfam16768 211 KPVSSNNSS-SAFAPALSS 228
|
|
| SAP130_C |
pfam16014 |
Histone deacetylase complex subunit SAP130 C-terminus; |
480-665 |
3.50e-03 |
|
Histone deacetylase complex subunit SAP130 C-terminus;
Pssm-ID: 464973 [Multi-domain] Cd Length: 371 Bit Score: 42.23 E-value: 3.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 480 TLVSSSPPQVTSDTPAS-----------SSPPQVTSDT---PASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisdt 545
Cdd:pfam16014 1 ALGSSPRPSILRKKPATegakpkpdihvAVAPPVTVAVealPGQNSEQQTASASPPSQHPAQAIPTILAPAAPP------ 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 546 pasSSPPQVTSETPASSSptnMTSDTPASSSPTNMTSDTPASSSPTNMtsdtPASSSPPWPVITEVTRPESTipagrsla 625
Cdd:pfam16014 75 ---SQPSVVLSTLPAAMA---VTPPIPASMANVVAPPTQPAASSTAAC----AVSSVLPEIKIKQEAEPMDT-------- 136
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 115583681 626 nitskAQEDSPLGVISTHPQMSFQSSTsqaLDETAGERVP 665
Cdd:pfam16014 137 -----SQSVPPLTPTSISPALTSLANN---LSVPAGDLLP 168
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
346-437 |
3.54e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 41.94 E-value: 3.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 346 GTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK10856 167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS----QANVDTAATPAPAAPATPDGAAPLPTDQA 242
|
90
....*....|..
gi 115583681 426 SETPASSSPTQV 437
Cdd:PRK10856 243 GVSTPAADPNAL 254
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
347-501 |
3.99e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.16 E-value: 3.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 347 TSDTPASSSPPQG----TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPP 422
Cdd:PRK07994 367 EPEVPPQSAAPAAsaqaTAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKA--KKSEPA 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN---SPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK07994 445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkaTNPVEVKKEPV-ATPKALKKALEHEKTPELAAKLAAEAIER 523
|
..
gi 115583681 500 PQ 501
Cdd:PRK07994 524 DP 525
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
345-510 |
4.50e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.16 E-value: 4.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPAssSPPQGTLDTPSSssppQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:PRK07994 362 AAPLPEPE--VPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 425 TSetPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFS-SPTQVTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:PRK07994 436 TK--AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALEHEKTPELA 513
|
....*..
gi 115583681 504 SDTPASS 510
Cdd:PRK07994 514 AKLAAEA 520
|
|
| KLF10_11_N |
cd21974 |
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ... |
363-528 |
4.63e-03 |
|
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.
Pssm-ID: 409243 [Multi-domain] Cd Length: 229 Bit Score: 41.07 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 363 TPSSSSPpqgtSDTPASSSPPQGTSETPASNSPPQgtsetpgfsSPPQV-TTATLVSSSPPQVTSETPASSSPTQVTSET 441
Cdd:cd21974 32 TPSSDSS----DEDDAPESPKDFHSLSSLCMTPPY---------SPPFFeASHSPSVASLHPPSAASSQPPPEPESSEPP 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 442 PASSSPTQVTSdtpasnsppqgtsdtpgfssptqVT--TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSET 519
Cdd:cd21974 99 AASPQRAQATS-----------------------VIrhTADPVPVSPPPVLCQMLPVSSSSGVIVAFLKAPQQPSPQPQK 155
|
....*....
gi 115583681 520 PASSSPPQV 528
Cdd:cd21974 156 PALPQPQVV 164
|
|
| TYA |
pfam01021 |
Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a ... |
423-543 |
4.65e-03 |
|
Ty transposon capsid protein; Ty are yeast transposons. A 5.7 kb transcript codes for p3, a fusion protein of TYA and TYB. The TYA protein is analogous to the gag protein of retroviruses. TYA is cleaved to form a 46 kDa protein which can form mature virion-like particles. This entry corresponds to the capsid protein from Ty1 and Ty2 transposons.
Pssm-ID: 425992 Cd Length: 384 Bit Score: 41.87 E-value: 4.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 423 QVTSETPASSSPTQVtSETPASSSPTqvtsdtPASNSPPQ-GTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSP-- 499
Cdd:pfam01021 33 KANSQQTTTPGSSAV-PENHHHASPQ------PASVPPPQnGPYSQQCMMTPNQANPSGWPFYGHPSMMPYTPYQMSPmy 105
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 500 ---------PQVTSD--TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVIS 543
Cdd:pfam01021 106 fppgpqsqfPQYPSSvgTPLSTPSPESGNTFTDSSSAKSDMTSTNKYVRPPPILT 160
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
162-451 |
4.69e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 41.84 E-value: 4.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 162 PLLPMKRRGAETERHMIPGNGPPLAMCHQPAPPELFETLCfPIDPASSAPPKATHRMTitsltgRPQVTSDTLASSSPPQ 241
Cdd:PLN03209 312 PLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEA-PSPPIEEEPPQPKAVVP------RPLSPYTAYEDLKPPT 384
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 242 GTSDTPASSSPPQVTSATSASSSPPQGTSdtPASSSPPQVTSATSASSSPPQGTSDTPASS----SPPQvtsatsasssp 317
Cdd:PLN03209 385 SPIPTPPSSSPASSKSVDAVAKPAEPDVV--PSPGSASNVPEVEPAQVEAKKTRPLSPYARyedlKPPT----------- 451
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 318 pqGTSDTPASSSPPQVTSATSASssppqGTSDTPASSSPPQGTLDTPSSSSPpqgTSDTPASSSPPQGTSETPASNSPPQ 397
Cdd:PLN03209 452 --SPSPTAPTGVSPSVSSTSSVP-----AVPDTAPATAATDAAAPPPANMRP---LSPYAVYDDLKPPTSPSPAAPVGKV 521
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 115583681 398 GTSETPgfSSPPQVTTATLVSSSPPQVTSE---TPASSSPTQVTSETPASSSPTQVT 451
Cdd:PLN03209 522 APSSTN--EVVKVGNSAPPTALADEQHHAQpkpRPLSPYTMYEDLKPPTSPTPSPVL 576
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
460-525 |
4.73e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differs between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 40.38 E-value: 4.73e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 115583681 460 PPQGTSDTPGFSSPTQV---------TTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSP 525
Cdd:cd21441 39 PPEDSLSTDHFQTQTEPvdlsinkarTSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
351-459 |
4.83e-03 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 41.47 E-value: 4.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 351 PA-SSSPPQGTLDTPS-SSSPPQGTSDTPA-SSSPPQGTSETPA-SNSPPQGTSETP--GFSSPPQVTTATLVSSSPPQV 424
Cdd:PTZ00436 229 PAkAAAPPAKAAAAPAkAAAAPAKAAAPPAkAAAPPAKAAAPPAkAAAPPAKAAAPPakAAAPPAKAAAAPAKAAAAPAK 308
|
90 100 110
....*....|....*....|....*....|....*.
gi 115583681 425 TSETPA-SSSPTQVTSETPASSSPTQVTSDTPASNS 459
Cdd:PTZ00436 309 AAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKA 344
|
|
| ARG80 |
COG5068 |
Regulator of arginine metabolism and related MADS box-containing transcription factors ... |
443-644 |
5.09e-03 |
|
Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];
Pssm-ID: 227400 [Multi-domain] Cd Length: 412 Bit Score: 41.54 E-value: 5.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 443 ASSSPTQVTSdTPASNSPPQGTSDTPGF-SSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQvtsetpa 521
Cdd:COG5068 116 SVLTGTEVLL-LVISENGLVHTFTTPKLeSVVKSLEGKSLIQSPCSNAPSDSSEEPSSSASFSVDPNDNNPMG------- 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 522 ssSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTN------MTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:COG5068 188 --SFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDHPTVPHSNTNNGrppakfMIPELHSSHSTLDLPSDFISDSGFPNQSS 265
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 596 dtpASSSPPWPVITEVTRPES-----TIPAGRSLANITSKAQE-----DSPLGVISTHP 644
Cdd:COG5068 266 ---TSIFPLDSAIIQITPPHLpnnppQENRHELYSNDSSMVSEtpppkNLPNGSPNQSP 321
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
321-419 |
5.14e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 42.00 E-value: 5.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 321 TSDTPASSSPPqvtsatsassSPPQGTSDTPA-----SSSPPQGTLDTPSSSSPPQGTSDTpassSPPQGT----SETPA 391
Cdd:PLN02217 569 TNSTPTGSAAS----------SNTTFSSDSPStvvapSTSPPAGHLGSPPATPSKIVSPST----SPPASHlgspSTTPS 634
|
90 100
....*....|....*....|....*...
gi 115583681 392 SNSPPQGTSETPGFSSPPQVTTATLVSS 419
Cdd:PLN02217 635 SPESSIKVASTETASPESSIKVASTESS 662
|
|
| PHA03193 |
PHA03193 |
tegument protein VP11/12; Provisional |
492-604 |
5.46e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 177555 Cd Length: 594 Bit Score: 42.01 E-value: 5.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 492 DTPASSSPPQV-----TSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISD-----TPASSSPPQ-VTSETPA 560
Cdd:PHA03193 440 DSPFQRKRAMPedggeIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTsnemkGDAECPAAQdAAAILPA 519
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 561 SSSPTNmtsDTPASSSPTNMTSDtpaSSSPTNMTSDTPASSSPP 604
Cdd:PHA03193 520 SFQIEN---GGAADGSGLAIPAA---MCDATAVESPSTVAETPP 557
|
|
| PRK10672 |
PRK10672 |
endolytic peptidoglycan transglycosylase RlpA; |
359-448 |
5.54e-03 |
|
endolytic peptidoglycan transglycosylase RlpA;
Pssm-ID: 236733 [Multi-domain] Cd Length: 361 Bit Score: 41.59 E-value: 5.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 359 GTLDTPSSSSPPQGTSDT-PASSSPPQGTSETPAsnspPQGTSetpGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQV 437
Cdd:PRK10672 202 GGMGTPSVQPAPAPQGDVlPVSNSTLKSEDPTGA----PVTSS---GFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAA 274
|
90
....*....|.
gi 115583681 438 TSETPASSSPT 448
Cdd:PRK10672 275 AAPQAAATSSS 285
|
|
| PHA03201 |
PHA03201 |
uracil DNA glycosylase; Provisional |
455-552 |
5.60e-03 |
|
uracil DNA glycosylase; Provisional
Pssm-ID: 165468 Cd Length: 318 Bit Score: 41.42 E-value: 5.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPpqvTSDTPASSSPPQVTSDTPASSSPPQvtsetpasSSPPQVTSDTSA 534
Cdd:PHA03201 4 ARSRSPSPPRRPSPPRPTPPRSPDASPEETPP---SPPGPGAEPPPGRAAGPAAPRRRPR--------GCPAGVTFSSSA 72
|
90
....*....|....*...
gi 115583681 535 SISPPQVISDTPASSSPP 552
Cdd:PHA03201 73 PPRPPLGLDDAPAATPPP 90
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
149-447 |
5.85e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 41.76 E-value: 5.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 149 QGASIWRNEFGPGP---------LLPM-KRRGAETERHMIPGNGPPLAMCHQPAPPELFetlcfPIDPASSAPPKATHRM 218
Cdd:PRK07003 329 QIATVGRGELGLAPdeyagftmtLLRMlAFEPAVTGGGAPGGGVPARVAGAVPAPGARA-----AAAVGASAVPAVTAVT 403
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 219 TITSLTGRPQVtsdtlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT-SDTPASSSPPQVTSATSASSSPPQGTSD 297
Cdd:PRK07003 404 GAAGAALAPKA-----AAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVpAKANARASADSRCDERDAQPPADSGSAS 478
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 298 TPASSSPPQVTSATSasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTP 377
Cdd:PRK07003 479 APASDAPPDAAFEPA--------PRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAA 550
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 378 AS-----------SSPPQGTSETPASNSPPQGTSETPgfsSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSS 446
Cdd:PRK07003 551 AAldvlrnagmrvSSDRGARAAAAAKPAAAPAAAPKP---AAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPP 627
|
.
gi 115583681 447 P 447
Cdd:PRK07003 628 P 628
|
|
| PHA03132 |
PHA03132 |
thymidine kinase; Provisional |
357-606 |
6.08e-03 |
|
thymidine kinase; Provisional
Pssm-ID: 222997 [Multi-domain] Cd Length: 580 Bit Score: 41.67 E-value: 6.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 357 PQGTLDTPSSSSPPQGTSDTPASSSPpqgTSETPASNSPPqgtSETPGFSSPPQVTTATLvsSSPPQVTSETPASSSPTQ 436
Cdd:PHA03132 21 PEGSRDENFDAERDDFLTPLGSTSEA---TSEDDDDLYPP---RETGSGGGVATSTIYTV--PRPPRGPEQTLDKPDSLP 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 437 VTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPtqvttatlvsssPPQVTSDTPASSSPPqvtSDTPASSSPPqvt 516
Cdd:PHA03132 93 ASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLYQ------------VNFPVILAPIGESNS---SSEELSEEEE--- 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 517 SETPASSSPPQVTSDTSasisppqvISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSdtpASSSPTN-MTS 595
Cdd:PHA03132 155 HSRPPPSESLKVKNGGK--------VYPKGFSKHKTHKRSEFSGLTKKAARKRKGSFVFKPSQLKE---LSGSLKNlLHL 223
|
250
....*....|.
gi 115583681 596 DTPASSSPPWP 606
Cdd:PHA03132 224 DDSAETDPATR 234
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
270-768 |
6.21e-03 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 41.96 E-value: 6.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVtsatsassSPPQGTSDTPASSSPPQ----VTSATSASSSPPQ 345
Cdd:PHA03377 447 QSTPERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQS--------PPTVAIKPAPPPSRRRRgacvVYDDDIIEVIDVE 518
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 346 GTSDTPASSSP------PQGTLD-----TPSSSSPPQGTSDT-PASSSPPqgTSETPASNSPPQGTSET-PGFSSPPQVT 412
Cdd:PHA03377 519 TTEEEESVTQPakphrkVQDGFQrsgrrQKRATPPKVSPSDRgPPKASPP--VMAPPSTGPRVMATPSTgPRDMAPPSTG 596
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 413 TATL--VSSSPPQVTSE--TPASSSP-----------------TQVTSETPASSSPTQVTSDTPASNSPP---QGTSDTP 468
Cdd:PHA03377 597 PRQQakCKDGPPASGPHekQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSFWEMRAGRDGSGIQQEPssrRQPATQS 676
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 469 GFSSPTQVTTATLVSSSP---PQVTSDTPASS-SPPQVTS----------DTPASSSPPQVTSETPASSSPPQVTSDTSA 534
Cdd:PHA03377 677 TPPRPSWLPSVFVLPSVDagrAQPSEESHLSSmSPTQPISheeqpryedpDDPLDLSLHPDQAPPPSHQAPYSGHEEPQA 756
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 535 SISPPQVISDTPASSSPPQVTSETPA-----SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPvit 609
Cdd:PHA03377 757 QQAPYPGYWEPRPPQAPYLGYQEPQAqgvqvSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHL--- 833
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 610 evtRPESTIPAGRSLANITS----KAQEDSPLGVISTHPQMSFqsstSQALDETAGervPTIPDFQAHsefqkacAILQR 685
Cdd:PHA03377 834 ---PPQWDGSAGHGQDQVSQfphlQSETGPPRLQLSQVPQLPY----SQTLVSSSA---PSWSSPQPR-------APIRP 896
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 686 LRDFLPTSPTSAQKNNSWSSQTPAVSCPFQPLGrlttTEKSSHQMAQQDME-QHPM-----DGAHNAFGISAGGSEIQSD 759
Cdd:PHA03377 897 IPTRFPPPPMPLQDSMAVGCDSSGTACPSMPFA----SDYSQGAFTPLDINaQTPKrprveESSHGPARCSQATTEAQEI 972
|
....*....
gi 115583681 760 IQLRSEFEV 768
Cdd:PHA03377 973 LSDNSEISV 981
|
|
| KLF10_11_N |
cd21974 |
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ... |
490-613 |
6.29e-03 |
|
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.
Pssm-ID: 409243 [Multi-domain] Cd Length: 229 Bit Score: 40.69 E-value: 6.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 490 TSDTPASSSPPQVTSDTPASS--------SPPQVtsETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQ------V- 554
Cdd:cd21974 34 SSDSSDEDDAPESPKDFHSLSslcmtppySPPFF--EASHSPSVASLHPPSAASSQPPPEPESSEPPAASPQraqatsVi 111
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 115583681 555 --TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTR 613
Cdd:cd21974 112 rhTADPVPVSPPPVLCQMLPVSSSSGVIVAFLKAPQQPSPQPQKPALPQPQVVLVGGQVPQ 172
|
|
| PRK13335 |
PRK13335 |
superantigen-like protein SSL3; Reviewed; |
347-425 |
6.35e-03 |
|
superantigen-like protein SSL3; Reviewed;
Pssm-ID: 139494 [Multi-domain] Cd Length: 356 Bit Score: 41.27 E-value: 6.35e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583681 347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSspPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK13335 89 ASKIEKISQPKQEEQKSLNISATPAPKQEQSQTT--TESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
247-497 |
6.48e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 41.08 E-value: 6.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 247 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQ---VTSATSASSSPPQGtsD 323
Cdd:PRK10905 23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQgqtPVATDGQQRVEVQG--D 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 324 TPASSSPPQvtsatsassspPQGTSDTPASSSppqgTLDT-PSSSSPPQG---TSDTPASSSPPQGTSETPASNsppQGT 399
Cdd:PRK10905 101 LNNALTQPQ-----------NQQQLNNVAVNS----TLPTePATVAPVRNgnaSRQTAKTQTAERPATTRPARK---QAV 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 400 SEtpgfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTA 479
Cdd:PRK10905 163 IE----PKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVG 238
|
250
....*....|....*...
gi 115583681 480 TLVSSSPPQVTSDTPASS 497
Cdd:PRK10905 239 SLKSAPSSHYTLQLSSSS 256
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
456-635 |
6.70e-03 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 41.21 E-value: 6.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 456 ASNSPPQGTSDTpgfsSPTQVTTATLV----SSSPPQVTSDTPASSSPPQVTSD--TPASSSPPQVTSETPASSSP---- 525
Cdd:PRK11901 57 ALKSPTEHESQQ----SSNNAGAEKNIdlsgSSSLSSGNQSSPSAANNTSDGHDasGVKNTAPPQDISAPPISPTPtqaa 132
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 526 PQVTSDTSASISPPQVISD-----------------TPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASS 588
Cdd:PRK11901 133 PPQTPNGQQRIELPGNISDalsqqqgqvnaasqnaqGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH 212
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 115583681 589 SPTNMTSDTPASSSPPWPVITEVTRPESTiPAGRSLANITSKAQEDS 635
Cdd:PRK11901 213 HKTATVAVPPATSGKPKSGAASARALSSA-PASHYTLQLSSASRSDT 258
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
225-395 |
6.72e-03 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 41.31 E-value: 6.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 225 GRP----QVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsaTSASSSPPQGTSDTPA 300
Cdd:pfam13254 195 GRPnsfkEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPA-----PTSASEPPPKTKELPK 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 301 SSSPPQVTSATSASSSPPQgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQgtsDTPASS 380
Cdd:pfam13254 270 DSEEPAAPSKSAEASTEKK-EPDTESSPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPK---DFRANL 345
|
170
....*....|....*
gi 115583681 381 SPPQGTSETPASNSP 395
Cdd:pfam13254 346 RSREVPKDKSKKDEP 360
|
|
| PHA03201 |
PHA03201 |
uracil DNA glycosylase; Provisional |
390-513 |
7.80e-03 |
|
uracil DNA glycosylase; Provisional
Pssm-ID: 165468 Cd Length: 318 Bit Score: 40.65 E-value: 7.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 390 PASNSPPQGTSETPGFSSPPQVTTATlvssspPQVTSETPASssptqvtsetPASSSPTQVTSDTPASNSPPQGtsdtpg 469
Cdd:PHA03201 4 ARSRSPSPPRRPSPPRPTPPRSPDAS------PEETPPSPPG----------PGAEPPPGRAAGPAAPRRRPRG------ 61
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 115583681 470 fssptqvttatlvssSPPQVTSDTPASSSPPQVTSDTPASSSPP 513
Cdd:PHA03201 62 ---------------CPAGVTFSSSAPPRPPLGLDDAPAATPPP 90
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
502-629 |
8.08e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 41.25 E-value: 8.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 502 VTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQV-TSETPASSSPTNmtsdtPASSSPTNM 580
Cdd:PHA03269 18 IIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLaQAPTPAASEKFD-----PAPAPHQAA 92
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 115583681 581 TSDTPASSSPTNMTSDTPASSSPPwpvITEVTRPESTIPAGRSLANITS 629
Cdd:PHA03269 93 SRAPDPAVAPQLAAAPKPDAAEAF---TSAAQAHEAPADAGTSAASKKP 138
|
|
| Mating_C |
pfam12737 |
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ... |
441-716 |
8.26e-03 |
|
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be an innovation that could have evolved only once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.
Pssm-ID: 372279 [Multi-domain] Cd Length: 412 Bit Score: 41.13 E-value: 8.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 441 TPASSSPTQVTSDTPASNSPpqGTSDTPGFSSptqvttatlvSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETP 520
Cdd:pfam12737 72 SPASSPERNLSPQVDVCQLT--IRQNNLNLKR----------RSSSSSDVDSSNAERCHKRPRLDSPSSSSSPEKCLPSP 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 521 ASSSPPQVtSDTSASISPpqvisdTPASSSPPQVTSETPASSSPTNMTSD-TPASSSPTN--------MTSDT-PASSSP 590
Cdd:pfam12737 140 APSEQEAL-SEISAACGP------TPSTLTPLNVAPSLTPSKKRKRCLSDgFDGPKRPPNkrvqprpqTVSDPfPTSTSI 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 591 TN--------MTSDTPASSSPPWPVitEVTRPESTIPAGRSLANITSKAQED-SPLGVISTHPQMSFQSSTSQALDETAG 661
Cdd:pfam12737 213 PEwdewlqnhMSPSLTLHGDIPPPV--SVEAPDSNTPLDIEIFNFPYHPDLTpSPAPSLSDSVIEVATPTTESDYMCNGT 290
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 115583681 662 ERvPTIPDFQAH--SEFQKACAILQRLRDFLPTSP-TSAQKNNSWSSQTPAVSCPFQP 716
Cdd:pfam12737 291 LR-QTFSWFEFDfpELIQPTNTPASNNELSLPFDPsTDIVVSRTILPLLDWRSQSFLS 347
|
|
| PHA03201 |
PHA03201 |
uracil DNA glycosylase; Provisional |
507-606 |
8.37e-03 |
|
uracil DNA glycosylase; Provisional
Pssm-ID: 165468 Cd Length: 318 Bit Score: 40.65 E-value: 8.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 507 PASSSPPQVTSETPASSSPPQvTSDTSASISPPqvisdtpassSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PHA03201 4 ARSRSPSPPRRPSPPRPTPPR-SPDASPEETPP----------SPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSSSA 72
|
90 100
....*....|....*....|..
gi 115583681 587 SSSPTNMTSDTPASSSPP--WP 606
Cdd:PHA03201 73 PPRPPLGLDDAPAATPPPldWT 94
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
147-456 |
8.84e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 41.21 E-value: 8.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 147 PPQGASiwrnEFGPGPLLPMKRRGAETER-----------HMIPGNGPPLAMCHQPAP---------PELFETLCFPIDP 206
Cdd:PTZ00449 510 PPEGPE----ASGLPPKAPGDKEGEEGEHedskesdepkeGGKPGETKEGEVGKKPGPakehkpskiPTLSKKPEFPKDP 585
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 207 ASSAPPKATHRmtitslTGRPQVTSDTLASSSP--PQgTSDTPASSSPPQvTSATSASSSPPQgtsdTPASSSPPQVTSA 284
Cdd:PTZ00449 586 KHPKDPEEPKK------PKRPRSAQRPTRPKSPklPE-LLDIPKSPKRPE-SPKSPKRPPPPQ----RPSSPERPEGPKI 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 285 TSasssppqgTSDTPASSSPP-------QVTSATSASSSPPQGTSDTPASSSPPQvtSATSASSSPPQGTSDTPASSSPP 357
Cdd:PTZ00449 654 IK--------SPKPPKSPKPPfdpkfkeKFYDDYLDAAAKSKETKTTVVLDESFE--SILKETLPETPGTPFTTPRPLPP 723
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 358 QGTLDtPSSSSPPQGTSDTPaSSSPPQgtsetpaSNSPPQGTS----ETPGFSSPPQVTTATLvssSPPQVTSETPASSS 433
Cdd:PTZ00449 724 KLPRD-EEFPFEPIGDPDAE-QPDDIE-------FFTPPEEERtffhETPADTPLPDILAEEF---KEEDIHAETGEPDE 791
|
330 340
....*....|....*....|...
gi 115583681 434 PTQvTSETPASSSPTQvTSDTPA 456
Cdd:PTZ00449 792 AMK-RPDSPSEHEDKP-PGDHPS 812
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
370-521 |
8.87e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 41.00 E-value: 8.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 370 PQGTSDTPASssPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSetpASSSPTQ 449
Cdd:PRK07994 361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLA---ARQQLQR 431
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 115583681 450 VTSDTPASNSPPQGTSDTPgfssPTQVTTATLVSSSP-PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07994 432 AQGATKAKKSEPAAASRAR----PVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKAL 500
|
|
| CLECT |
smart00034 |
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ... |
35-142 |
8.90e-03 |
|
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.
Pssm-ID: 214480 [Multi-domain] Cd Length: 124 Bit Score: 38.35 E-value: 8.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 35 GNSCYQLNRLFCDFQEADNYCHAQRGRLA-------HTWnpkLRGFLKSFLNEETVW-------------WVRGNLTLPG 94
Cdd:smart00034 9 GGKCYKFSTEKKTWEDAQAFCQSLGGHLAsihseaeNDF---VASLLKNSGSSDYYWiglsdpdsngswqWSDGSGPVSY 85
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 115583681 95 SHPGINQTggddvlrNQKPGECPSVVTHSNavfsRWNL--CIEKHHFICQ 142
Cdd:smart00034 86 SNWAPGEP-------NNSSGDCVVLSTSGG----KWNDvsCTSKLPFVCE 124
|
|
| KLF12_N |
cd21441 |
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ... |
399-451 |
9.08e-03 |
|
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.
Pssm-ID: 410608 [Multi-domain] Cd Length: 197 Bit Score: 39.61 E-value: 9.08e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 115583681 399 TSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVT 451
Cdd:cd21441 65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLT 117
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
324-445 |
9.24e-03 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 40.23 E-value: 9.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 324 TPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSdtPASSSPPQGTSETPAsnsPPQGTSEtP 403
Cdd:PHA02682 85 SPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPAR--PAPACPPSTRQCPPA---PPLPTPK-P 158
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 115583681 404 GFSSPPQVTTATLvssSPPQVtsetPASSSPTQVTSetPASS 445
Cdd:PHA02682 159 APAAKPIFLHNQL---PPPDY----PAASCPTIETA--PAAS 191
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
345-549 |
9.51e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 40.99 E-value: 9.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 345 QGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSP--- 421
Cdd:COG3266 171 QGTLQALGAVAALLGLRKAEEALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAEVLTARLVLLlli 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 422 PQVTSETPASSSptQVTSETPASSSPTQVTSDTPASNSPPQGTSdtpgfssPTQVTTATLVSSSPPQVTSDTPASSSPPQ 501
Cdd:COG3266 251 IGSALKAPSQAS--SASAPATTSLGEQQEVSLPPAVAAQPAAAA-------AAQPSAVALPAAPAAAAAAAAPAEAAAPQ 321
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 115583681 502 VTSDTPASSSPPQVTS---ETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:COG3266 322 PTAAKPVVTETAAPAApapEAAAAAAAPAAPAVAKKLAADEQWLASQPASH 372
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
377-609 |
9.79e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.17 E-value: 9.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 377 PASSSPPQGTSETPASNSPPQGTSETPGFSSPPqVTTATLVSSSPPQVtSETPASSSPTQVTSETPASSSPTQVTSDTPA 456
Cdd:pfam09770 107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKP-VRTGYEKYKEPEPI-PDLQVDASLWGVAPKKAAAPAPAPQPAAQPA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 457 SNSPP------------QGTSDTPGfsSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSS 524
Cdd:pfam09770 185 SLPAPsrkmmsleeveaAMRAQAKK--PAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHP 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583681 525 PPQVTSDTSASISPPQVISDTPASSSPPQVtseTPASSSPT------NMTSDTPAsSSPTNMTSdtPASSSPTNMTSDTP 598
Cdd:pfam09770 263 VTILQRPQSPQPDPAQPSIQPQAQQFHQQP---PPVPVQPTqilqnpNRLSAARV-GYPQNPQP--GVQPAPAHQAHRQQ 336
|
250
....*....|.
gi 115583681 599 ASSSPPWPVIT 609
Cdd:pfam09770 337 GSFGRQAPIIT 347
|
|
|