NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039789982|ref|XP_017168315|]
View 

polycystin-1-like protein 3 isoform X5 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
1131-1249 1.31e-55

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


:

Pssm-ID: 238850  Cd Length: 120  Bit Score: 189.41  E-value: 1.31e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1210
Cdd:cd01752      2 LYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPEKPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSWY 81
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1211 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1249
Cdd:cd01752     82 LSRVIVRDLQTGKKWFFLCNDWLSVEEGDGTVERTFPVA 120
Polycystin_dom super family cl48672
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
1709-1894 1.09e-35

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


The actual alignment was detected with superfamily member pfam20519:

Pssm-ID: 466668  Cd Length: 199  Bit Score: 135.24  E-value: 1.09e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1709 FSEIKTVEDFYPWANGTLLPNLYGD-----YRGFITDGNSFLLGNVLIRQTRIPNDIFFPGSLHKQMKSPPQHQ-----E 1778
Cdd:pfam20519    2 LLTVTDLDDIWDWLSSVLLPALHSNktpsgLPGSFIAYESLLLGVPRLRQLRVRNSSCLVHDKFVREINECHAGysppsE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1779 DRENYGAGWVPPDTNITKvdSIWHYQNQESLGGYPIQGELATYSGGGYVVRLGRNHSAATRVLQHLEQRRWLDHCTKALF 1858
Cdd:pfam20519   82 DRKLYSALPYKPVHYGSK--YWFIYTPPGLLMGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVF 159
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1039789982 1859 VEFTVFNANVNLLCAVTLILESSGVGTFLTSLQLDS 1894
Cdd:pfam20519  160 VDFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQS 195
PKD_channel super family cl37568
Polycystin cation channel; This family contains the cation channel region from group II of ...
1899-2119 4.33e-34

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


The actual alignment was detected with superfamily member pfam08016:

Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 131.63  E-value: 4.33e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1899 QSSERGFAWIVSQVVYYLLVCYYAFIQGCRLKRQRLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLHKKMQQYHCDRD 1978
Cdd:pfam08016    2 YVTNRSLFILLCEIVFVVFFLYFVVEEILKIRKHRPSYLRSVWNLLDLAIVILSVVLIVLNIYRDFLADRLIKSVEASPV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1979 RFISFYEALRVNSAVTHLRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSI 2058
Cdd:pfam08016   82 TFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLFGTQA 161
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982 2059 SDYQSFFRSIVTVVGLLMGTSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAF 2119
Cdd:pfam08016  162 PNFSNFVKSILTLFRTILGDFGYNEIFSGNRVLGPLLFLTFVFLVIFILLNLFLAIINDSY 222
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
347-705 2.40e-26

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 118.10  E-value: 2.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQgTLDTPSSSSPPQGTSDTpaSSSPPQGTSE-----TPASNSPPQGT-SETPGFSSPPQ-VTTATLVSS 419
Cdd:pfam05109  445 TTGLPSSTHVPT-NLTAPASTGPTVSTADV--TSPTPAGTTSgaspvTPSPSPRDNGTeSKAPDMTSPTSaVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  420 SP-PQVTSETPASSSPT-QVTSETPASSSPT-QVTSDTPASNSP-PQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:pfam05109  522 SPtPAVTTPTPNATSPTlGKTSPTSAVTTPTpNATSPTPAVTTPtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQ 601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQV---TSDTPASSSPPQvTSETPASSSPPQVTSDTSASIS-PPQVISDTPASSSPPQVTSE-------------- 557
Cdd:pfam05109  602 ANTTNHTlggTSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSlRPSSISETLSPSTSDNSTSHmplltsahptggen 680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 ----TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPT------NMTSDTPA--SSSPPWPVITEVTRPESTIPAGRslA 625
Cdd:pfam05109  681 itqvTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPknATSPQAPSGQKTAVPTVTSTGGK--A 758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  626 NITSKAQEDSPLGV-ISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEFQKACAILQRLRDFLPTSPTSAQKNNSW 704
Cdd:pfam05109  759 NSTTGGKHTTGHGArTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRFSNL 838

                   .
gi 1039789982  705 S 705
Cdd:pfam05109  839 S 839
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
147-513 4.78e-12

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 71.74  E-value: 4.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASIWRNEFGPGPLLP--MKRRGAETERHMIPGNGPPlAMCHQPAPPELFETLCFPIDPASSAPPKATHRMTITSLT 224
Cdd:PHA03307    79 APANESRSTPTWSLSTLAPasPAREGSPTPPGPSSPDPPP-PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  225 GRPQVTSDT-------LASSSPPQG--TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP---------------PQ 280
Cdd:PHA03307   158 SPAAVASDAassrqaaLPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPapapgrsaaddagasSS 237
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  281 VTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassSPPQGTSDTPASSSPPQGT 360
Cdd:PHA03307   238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS----------SSPRERSPSPSPSSPGSGP 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSS-----SSPPQGTSDTPASSSPPqgtSETPASnSPPQGTSETPGFSSPPqvttatlvSSSPPQVTSETPASSSPT 435
Cdd:PHA03307   308 APSSPRassssSSSRESSSSSTSSSSES---SRGAAV-SPGPSPSRSPSPSRPP--------PPADPSSPRKRPRPSRAP 375
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtsdTPASSSPP 513
Cdd:PHA03307   376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP------WPGSPPPP 447
GPS pfam01825
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
1021-1059 1.06e-06

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for auto-proteolysis, so is thus named, GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. In most, if not all, cell-adhesion GPCRs these undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


:

Pssm-ID: 460350  Cd Length: 44  Bit Score: 47.30  E-value: 1.06e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1039789982 1021 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1059
Cdd:pfam01825    2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
37-142 1.41e-06

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


:

Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 49.16  E-value: 1.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037     81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
 
Name Accession Description Interval E-value
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
1131-1249 1.31e-55

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 189.41  E-value: 1.31e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1210
Cdd:cd01752      2 LYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPEKPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSWY 81
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1211 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1249
Cdd:cd01752     82 LSRVIVRDLQTGKKWFFLCNDWLSVEEGDGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
1709-1894 1.09e-35

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 135.24  E-value: 1.09e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1709 FSEIKTVEDFYPWANGTLLPNLYGD-----YRGFITDGNSFLLGNVLIRQTRIPNDIFFPGSLHKQMKSPPQHQ-----E 1778
Cdd:pfam20519    2 LLTVTDLDDIWDWLSSVLLPALHSNktpsgLPGSFIAYESLLLGVPRLRQLRVRNSSCLVHDKFVREINECHAGysppsE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1779 DRENYGAGWVPPDTNITKvdSIWHYQNQESLGGYPIQGELATYSGGGYVVRLGRNHSAATRVLQHLEQRRWLDHCTKALF 1858
Cdd:pfam20519   82 DRKLYSALPYKPVHYGSK--YWFIYTPPGLLMGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVF 159
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1039789982 1859 VEFTVFNANVNLLCAVTLILESSGVGTFLTSLQLDS 1894
Cdd:pfam20519  160 VDFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQS 195
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
1899-2119 4.33e-34

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 131.63  E-value: 4.33e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1899 QSSERGFAWIVSQVVYYLLVCYYAFIQGCRLKRQRLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLHKKMQQYHCDRD 1978
Cdd:pfam08016    2 YVTNRSLFILLCEIVFVVFFLYFVVEEILKIRKHRPSYLRSVWNLLDLAIVILSVVLIVLNIYRDFLADRLIKSVEASPV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1979 RFISFYEALRVNSAVTHLRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSI 2058
Cdd:pfam08016   82 TFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLFGTQA 161
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982 2059 SDYQSFFRSIVTVVGLLMGTSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAF 2119
Cdd:pfam08016  162 PNFSNFVKSILTLFRTILGDFGYNEIFSGNRVLGPLLFLTFVFLVIFILLNLFLAIINDSY 222
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
347-705 2.40e-26

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 118.10  E-value: 2.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQgTLDTPSSSSPPQGTSDTpaSSSPPQGTSE-----TPASNSPPQGT-SETPGFSSPPQ-VTTATLVSS 419
Cdd:pfam05109  445 TTGLPSSTHVPT-NLTAPASTGPTVSTADV--TSPTPAGTTSgaspvTPSPSPRDNGTeSKAPDMTSPTSaVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  420 SP-PQVTSETPASSSPT-QVTSETPASSSPT-QVTSDTPASNSP-PQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:pfam05109  522 SPtPAVTTPTPNATSPTlGKTSPTSAVTTPTpNATSPTPAVTTPtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQ 601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQV---TSDTPASSSPPQvTSETPASSSPPQVTSDTSASIS-PPQVISDTPASSSPPQVTSE-------------- 557
Cdd:pfam05109  602 ANTTNHTlggTSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSlRPSSISETLSPSTSDNSTSHmplltsahptggen 680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 ----TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPT------NMTSDTPA--SSSPPWPVITEVTRPESTIPAGRslA 625
Cdd:pfam05109  681 itqvTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPknATSPQAPSGQKTAVPTVTSTGGK--A 758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  626 NITSKAQEDSPLGV-ISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEFQKACAILQRLRDFLPTSPTSAQKNNSW 704
Cdd:pfam05109  759 NSTTGGKHTTGHGArTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRFSNL 838

                   .
gi 1039789982  705 S 705
Cdd:pfam05109  839 S 839
PHA03247 PHA03247
large tegument protein UL36; Provisional
159-661 8.03e-24

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 110.80  E-value: 8.03e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  159 GPGPLLPMKRRGAETERHMipgngPPlamcHQPAPPelfetlcfPIDPAS-------SAPPKATHRMTITSLTGRPQVTS 231
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSV-----PP----PRPAPR--------PSEPAVtsrarrpDAPPQSARPRAPVDDRGDPRGPA 2612
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  232 DtlASSSPPQGTS-DTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSppqgtSDTPASSSPPQVTSA 310
Cdd:PHA03247  2613 P--PSPLPPDTHApDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL-----GRAAQASSPPQRPRR 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  311 TSASSSPPQGTS--DTPASSSPPQvtsatsassspPQGTSDTPASSSPPQGTLDTPSSSSPPQgtsdTPASSSPPQGTSe 388
Cdd:PHA03247  2686 RAARPTVGSLTSlaDPPPPPPTPE-----------PAPHALVSATPLPPGPAAARQASPALPA----APAPPAVPAGPA- 2749
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  389 TPASNSPPQgtseTPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSP-PQGTSDT 467
Cdd:PHA03247  2750 TPGGPARPA----RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAlPPAASPA 2825
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  468 PGFSSPTQVTTATLVSSSPPQVTSDTPASS----------SPPQVTSDTPASSSPPQVTS-ETPASSSPPQVTSDTSASI 536
Cdd:PHA03247  2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvapggdvrrrPPSRSPAAKPAAPARPPVRRlARPAVSRSTESFALPPDQP 2905
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  537 SPPQvisdTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPtnmtsdtpaSSSPPWPVITEVTRPES 616
Cdd:PHA03247  2906 ERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP---------SGAVPQPWLGALVPGRV 2972
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*
gi 1039789982  617 TIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETA 661
Cdd:PHA03247  2973 AVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEET 3017
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
1132-1247 6.16e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.58  E-value: 6.16e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1132 YLIQVYTGYRRRAATTAKVVITLYGSEGHS--EPHHLCDPEktvFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSW 1209
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESaqLEITLDNPD---FERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEW 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1210 YVSQVIV-SDMTTRKKWHFQCNCWLAVDLGNcERDRVFT 1247
Cdd:pfam01477   78 FLKSITVeVPGETGGKYTFPCNSWVYGSKKY-KETRVFF 115
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
373-609 4.93e-15

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 80.57  E-value: 4.93e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  453 DTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPpqVTSDT 532
Cdd:COG3469     81 TATAAAAAA---------TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSA--GSTTT 149
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSsptnmtsdTPASSSPPWPVIT 609
Cdd:COG3469    150 TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT--------GPPTPGLPKHVLV 218
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
147-513 4.78e-12

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 71.74  E-value: 4.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASIWRNEFGPGPLLP--MKRRGAETERHMIPGNGPPlAMCHQPAPPELFETLCFPIDPASSAPPKATHRMTITSLT 224
Cdd:PHA03307    79 APANESRSTPTWSLSTLAPasPAREGSPTPPGPSSPDPPP-PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  225 GRPQVTSDT-------LASSSPPQG--TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP---------------PQ 280
Cdd:PHA03307   158 SPAAVASDAassrqaaLPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPapapgrsaaddagasSS 237
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  281 VTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassSPPQGTSDTPASSSPPQGT 360
Cdd:PHA03307   238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS----------SSPRERSPSPSPSSPGSGP 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSS-----SSPPQGTSDTPASSSPPqgtSETPASnSPPQGTSETPGFSSPPqvttatlvSSSPPQVTSETPASSSPT 435
Cdd:PHA03307   308 APSSPRassssSSSRESSSSSTSSSSES---SRGAAV-SPGPSPSRSPSPSRPP--------PPADPSSPRKRPRPSRAP 375
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtsdTPASSSPP 513
Cdd:PHA03307   376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP------WPGSPPPP 447
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
1131-1236 4.96e-12

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 64.20  E-value: 4.96e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNsgDSPSWY 1210
Cdd:smart00308    2 KYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEH--RHPEWF 79
                            90       100
                    ....*....|....*....|....*.
gi 1039789982  1211 VSQVIVSDMTTRKKWHFQCNCWLAVD 1236
Cdd:smart00308   80 LKSITVKDLPTGGKYHFPCNSWVYPD 105
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
347-677 2.75e-10

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 66.17  E-value: 2.75e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQGTLDTPSSSSPPQGTS------DTPassSPPQGTSE-TPAS----NSP-PQGT----SETPgfSSPPQ 410
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRDEATpsiameNTP---SPPRRTAKiTPTTpknnYSPtAAGTervkEDTP--ATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  411 VTTATLVSSSPPQVTSETPA------SSSPTQVTSE----TPaSSSPTQVTSDTPAS-NSPPQGTSDTPGFSSPTQVTTA 479
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKvrkyTP-SPLGRMVNSYAPSTfMTMPRSHGITPRTTVKDSEITA 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  480 T---LVSSSPPQV---TSDTP----ASSSPPQVTSDTPAS--SSPPQVTsETPASSSPPQVTSDTSA---------SISP 538
Cdd:TIGR00927  230 TykmLETNPSKRTagkTTPTPlkgmTDNTPTFLTREVETDllTSPRSVV-EKNTLTTPRRVESNSSTnhwglvgknNLTT 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  539 PQ--VISDTPASSSPpQVTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTsdtpaSSSPPWPVITEVTRPES 616
Cdd:TIGR00927  309 PQgtVLEHTPATSEG-QVTISIMTGSSPA----ETKASTAAWKIRNPLSRTSAPAVRI-----ASATFRGLEKNPSTAPS 378
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  617 TIPAGRSLANITSKAQE---DSPLGVISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEF 677
Cdd:TIGR00927  379 TPATPRVRAVLTTQVHHcvvVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDLHPKAEY 442
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
349-709 1.84e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.84  E-value: 1.84e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPqgtlDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:NF033609   540 DKPVVPEQP----DEPGEIEPiPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASD 615
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  428 TPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609   616 SDSASDSDSASDSDSASDSDSASDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPA 586
Cdd:NF033609   693 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDS 772
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  587 -SSSPTNMTSDTPASSSPPwpviTEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETAGERV 665
Cdd:NF033609   773 dSDSDSDSDSDSDSDSDSD----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 848
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1039789982  666 PTIPDFQAHSEFQKACAILQRLRDFLPTSP----TSAQKNNSWSSQTP 709
Cdd:NF033609   849 DSDSDSDSESDSNSDSESGSNNNVVPPNSPkngtNASNKNEAKDSKEP 896
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
240-589 1.97e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 1.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:NF033609   558 PEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA 637
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:NF033609   638 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  400 SETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTT 478
Cdd:NF033609   718 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD 797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:NF033609   798 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNS 877
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1039789982  559 PASSSPTNMTSDTPASSSPTNMT-SDTPASSS 589
Cdd:NF033609   878 PKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
439-631 2.30e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 2.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  439 SETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQvtSDTPASSSPPQVTSE 518
Cdd:NF033609    33 SSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQ--QETTQSASTNATTEE 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  519 TPASSsppQVTSDTSASISPPQVI--SDTPASSSPPQVTSETpaSSSPTNMTSDTpasSSPTN--------MTSDTPASS 588
Cdd:NF033609   111 TPVTG---EATTTATNQANTPATTqsSNTNAEELVNQTSNET--TSNDTNTVSSV---NSPQNstnaenvsTTQDTSTEA 182
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1039789982  589 SPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:NF033609   183 TPSNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADA 225
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
270-604 3.06e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.07  E-value: 3.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  270 SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtsatsasssppqgtSDTPASSSPPQVTSATSASSSPPQGTS 348
Cdd:NF033609   561 SDSdPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSA------------------SDSDSASDSDSASDSDSASDSDSASDS 622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609   623 DSASDSDSASDS----DSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTpASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpaSSSPPQVTSDTPA 508
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  509 SSSPPQVTSETPASSSPPQVTSDT-SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:NF033609   775 DSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 854
                          330       340
                   ....*....|....*....|.
gi 1039789982  588 SSPTNMTSDTPASSS----PP 604
Cdd:NF033609   855 DSESDSNSDSESGSNnnvvPP 875
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
351-619 8.23e-07

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 54.16  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PASSSPPQGTLDTPSSSS------PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSS---- 420
Cdd:cd22540     66 PLPLGPGKNSIGFLSAKGniiqlqGSQLSSSAPGGQQVFAIQNPTMIIKGSQTRSSTNQQYQISPQIQAAGQINNSgqiq 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  421 -----------PPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASN-------SPPQGTSDTP-----GFSSPTQVT 477
Cdd:cd22540    146 iipgtnqaiitPVQVLQQPQQAHKPVPIK---PAPLQTSNTNSASLQVPgnviklqSGGNVALTLPvnnlvGTQDGATQL 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  478 TATLVSSSPPQVTSDTPASSSPPQVTS--------------------------DTPASSSPPQV--------TSETPASS 523
Cdd:cd22540    223 QLAAAPSKPSKKIRKKSAQAAQPAVTVaeqvetvliettadniiqagnnllivQSPGTGQPAVLqqvqvlqpKQEQQVVQ 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  524 SPP------QVTSDTSASI--SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:cd22540    303 IPQqalrvvQAASATLPTVpqKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGT 382
                          330       340
                   ....*....|....*....|....
gi 1039789982  596 DTPASSSPpwpvitevTRPESTIP 619
Cdd:cd22540    383 GTSKPNYN--------VRKERTLP 398
GPS pfam01825
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
1021-1059 1.06e-06

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for auto-proteolysis, so is thus named, GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. In most, if not all, cell-adhesion GPCRs these undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


Pssm-ID: 460350  Cd Length: 44  Bit Score: 47.30  E-value: 1.06e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1039789982 1021 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1059
Cdd:pfam01825    2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
37-142 1.41e-06

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 49.16  E-value: 1.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037     81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
190-576 2.95e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 2.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  190 QPAPPELFEtlcfPIDPASSAPPKathrmtitSLTGRPQVTSDTlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT 269
Cdd:NF033609   547 QPDEPGEIE----PIPEDSDSDPG--------SDSGSDSSNSDS-GSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSA 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSD 349
Cdd:NF033609   614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  350 TPASSSPpqgtlDTPS-SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609   694 SDSDSDS-----DSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTP 507
Cdd:NF033609   769 DSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-----SDSD 843
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisDTPASSSPPQVTSETPASSSPTNMT-SDTPASSS 576
Cdd:NF033609   844 SDSDSDSDSDSDSESDSNSDSESGSNNNVVPP----NSPKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1020-1059 1.10e-05

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 44.30  E-value: 1.10e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1039789982  1020 TQCYFWDRYNRTWKSDGCQVGPKS-TIlkTQCLCDHLTFFS 1059
Cdd:smart00303    3 PICVFWDESSGEWSTRGCELLETNgTH--TTCSCNHLTTFA 41
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
224-422 1.61e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039789982  382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
PLN03223 PLN03223
Polycystin cation channel protein; Provisional
1996-2132 5.75e-05

Polycystin cation channel protein; Provisional


Pssm-ID: 215637 [Multi-domain]  Cd Length: 1634  Bit Score: 48.79  E-value: 5.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1996 LRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSISDYQSFFRSIVTVVGLL 2075
Cdd:PLN03223  1294 LSGINIILLLGRILKLMDFQPRLGVITRTLWLAGADLMHFFVIFGMVFVGYAFIGHVIFGNASVHFSDMTDSINSLFENL 1373
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982 2076 MG-----TSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAFG--KERKACEVSNQT 2132
Cdd:PLN03223  1374 LGdityfNEDLKNLTGLQFVVGMIYFYSYNIFVFMILFNFLLAIICDAFGevKANAAETVSVHT 1437
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
321-609 7.38e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 7.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  321 TSDTPA-------SSSPPQVTSATSASSSPPQGTSDTPASSSPPQgtLDTPSSSSPPQgtSDTPASSSPPQgtSETPASN 393
Cdd:NF033839   280 TQDTPKepgnkkpSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQ--LEKPKPEVKPQ--PEKPKPEVKPQ--LETPKPE 353
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  394 SPPQGTSETPGFSSPPQvttaTLVSSSPPQVTSETPasssptQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFsSP 473
Cdd:NF033839   354 VKPQPEKPKPEVKPQPE----KPKPEVKPQPETPKP------EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEV-KP 422
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  474 TQVTTATLVSSSPPqvtsdTPASSSPPQvtSDTPASSSPPQvtSETPASSSPPQVtsdtsasisppqvisDTPASSSPPQ 553
Cdd:NF033839   423 QPEKPKPEVKPQPE-----KPKPEVKPQ--PEKPKPEVKPQ--PETPKPEVKPQP---------------EKPKPEVKPQ 478
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  554 VTSETPASSSPtnmTSDTPASSSPTNMTSDTPAS--SSPTNMTSDTPASSSPPWPVIT 609
Cdd:NF033839   479 PEKPKPDNSKP---QADDKKPSTPNNLSKDKQPSnqASTNEKATNKPKKSLPSTGSIS 533
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
147-433 2.38e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 2.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASIWRNEF-GPGPLLPmkRRGAETERHMIPGNGPPLAMCHQPAPPElfETlcfPIDPASSAPPKATHRMTiTSLTG 225
Cdd:pfam03154  294 PPQPFPLTPQSSqSQVPPGP--SPAAPGQSQQRIHTPPSQSQLQSQQPPR--EQ---PLPPAPLSMPHIKPPPT-TPIPQ 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  226 RPQVTSDT----LASSSPPQGTSDTPAsssPPQVTSATSAsssppqgTSDTPASSSPPQVTSATsasssppQGTSDTPAS 301
Cdd:pfam03154  366 LPNPQSHKhpphLSGPSPFQMNSNLPP---PPALKPLSSL-------STHHPPSAHPPPLQLMP-------QSQQLPPPP 428
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  302 SSPPQVTsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSS 381
Cdd:pfam03154  429 AQPPVLT----------QSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  382 PPQGTSETPASNS---PP-QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:pfam03154  499 SVSSSGPVPAAVScplPPvQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
426-535 2.57e-03

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 39.81  E-value: 2.57e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:smart01104   12 SKTPAWGSRTPGTAAGGAPTARGGSGSRTPAWGGAGSRTP-AWGGAGPTGSRTPAWGGASAWGNKSSEGSASSWAAGPGG 90
                            90       100       110
                    ....*....|....*....|....*....|..
gi 1039789982   506 TPASSSP--PQVTSeTPASSSPPQVTSDTSAS 535
Cdd:smart01104   91 AYGAPTPgyGGTPS-AYGPATPGGGAMAGSAS 121
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
397-602 6.70e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.91  E-value: 6.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  397 QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS---SPTQVTSET-PASSSPTQVTSDTPA-SNSPPQGTSDTPGFS 471
Cdd:NF033849   247 VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTrgwSHTQSTSESeSTGQSSSVGTSESQShGTTEGTSTTDSSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  472 SPTQVTTATLVSSSpPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSdTSASISppqviSDTPASSSP 551
Cdd:NF033849   327 QSSSYNVSSGTGVS-SSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRS-SSSGVS-----GGFSGGIAG 399
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  552 PQVTSETPASSSPTN---MTSDTPASSSPTNMTS-DTPASSSPTNMTSDTPASSS 602
Cdd:NF033849   400 GGVTSEGLGASQGGSegwGSGDSVQSVSQSYGSSsSTGTSSGHSDSSSHSTSSGQ 454
 
Name Accession Description Interval E-value
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
1131-1249 1.31e-55

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 189.41  E-value: 1.31e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1210
Cdd:cd01752      2 LYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPEKPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSWY 81
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1211 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1249
Cdd:cd01752     82 LSRVIVRDLQTGKKWFFLCNDWLSVEEGDGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
1709-1894 1.09e-35

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 135.24  E-value: 1.09e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1709 FSEIKTVEDFYPWANGTLLPNLYGD-----YRGFITDGNSFLLGNVLIRQTRIPNDIFFPGSLHKQMKSPPQHQ-----E 1778
Cdd:pfam20519    2 LLTVTDLDDIWDWLSSVLLPALHSNktpsgLPGSFIAYESLLLGVPRLRQLRVRNSSCLVHDKFVREINECHAGysppsE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1779 DRENYGAGWVPPDTNITKvdSIWHYQNQESLGGYPIQGELATYSGGGYVVRLGRNHSAATRVLQHLEQRRWLDHCTKALF 1858
Cdd:pfam20519   82 DRKLYSALPYKPVHYGSK--YWFIYTPPGLLMGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVF 159
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1039789982 1859 VEFTVFNANVNLLCAVTLILESSGVGTFLTSLQLDS 1894
Cdd:pfam20519  160 VDFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQS 195
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
1899-2119 4.33e-34

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 131.63  E-value: 4.33e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1899 QSSERGFAWIVSQVVYYLLVCYYAFIQGCRLKRQRLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLHKKMQQYHCDRD 1978
Cdd:pfam08016    2 YVTNRSLFILLCEIVFVVFFLYFVVEEILKIRKHRPSYLRSVWNLLDLAIVILSVVLIVLNIYRDFLADRLIKSVEASPV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1979 RFISFYEALRVNSAVTHLRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSI 2058
Cdd:pfam08016   82 TFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLFGTQA 161
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982 2059 SDYQSFFRSIVTVVGLLMGTSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAF 2119
Cdd:pfam08016  162 PNFSNFVKSILTLFRTILGDFGYNEIFSGNRVLGPLLFLTFVFLVIFILLNLFLAIINDSY 222
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
1132-1249 5.79e-29

PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 113.03  E-value: 5.79e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1132 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTV-FERGALDVFLLSTGSwLGDLHGLRLWHDNSGDSPSWY 1210
Cdd:cd01756      3 YEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKKSNNKNkFERGQTDKFTVEAVD-LGKLKKIRIGHDNSGLGAGWF 81
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1211 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1249
Cdd:cd01756     82 LDKVEIREPGTGDEYTFPCNRWLDKDEDDGQIVRELYPS 120
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
347-705 2.40e-26

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 118.10  E-value: 2.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQgTLDTPSSSSPPQGTSDTpaSSSPPQGTSE-----TPASNSPPQGT-SETPGFSSPPQ-VTTATLVSS 419
Cdd:pfam05109  445 TTGLPSSTHVPT-NLTAPASTGPTVSTADV--TSPTPAGTTSgaspvTPSPSPRDNGTeSKAPDMTSPTSaVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  420 SP-PQVTSETPASSSPT-QVTSETPASSSPT-QVTSDTPASNSP-PQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:pfam05109  522 SPtPAVTTPTPNATSPTlGKTSPTSAVTTPTpNATSPTPAVTTPtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQ 601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQV---TSDTPASSSPPQvTSETPASSSPPQVTSDTSASIS-PPQVISDTPASSSPPQVTSE-------------- 557
Cdd:pfam05109  602 ANTTNHTlggTSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSlRPSSISETLSPSTSDNSTSHmplltsahptggen 680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 ----TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPT------NMTSDTPA--SSSPPWPVITEVTRPESTIPAGRslA 625
Cdd:pfam05109  681 itqvTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPknATSPQAPSGQKTAVPTVTSTGGK--A 758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  626 NITSKAQEDSPLGV-ISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEFQKACAILQRLRDFLPTSPTSAQKNNSW 704
Cdd:pfam05109  759 NSTTGGKHTTGHGArTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRFSNL 838

                   .
gi 1039789982  705 S 705
Cdd:pfam05109  839 S 839
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
325-652 8.72e-26

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 116.55  E-value: 8.72e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  325 PASSSPpqVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-PQGTSETPASNSP-PQGTSET 402
Cdd:pfam05109  461 PASTGP--TVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPtPNATSPTPAVTTPtPNATSPT 538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  403 PGFSSPPQ-VTTATLVSSSP-PQVTSETPASSSPT--------QVTSETPASSSPTqVTSDTPASNSPPQ---GTSDTPG 469
Cdd:pfam05109  539 LGKTSPTSaVTTPTPNATSPtPAVTTPTPNATIPTlgktsptsAVTTPTPNATSPT-VGETSPQANTTNHtlgGTSSTPV 617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTS 697
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  550 SPPQVTSETPASSSPTNmtSDTPASSSPTNMTSDTPasssPTNMTS-DTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:pfam05109  698 SPAPRPGTTSQASGPGN--SSTSTKPGEVNVTKGTP----PKNATSpQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHG 771
                          330       340
                   ....*....|....*....|....*.
gi 1039789982  629 SKAQED--SPLGVISTHPQMSFQSST 652
Cdd:pfam05109  772 ARTSTEptTDYGGDSTTPRTRYNATT 797
PHA03247 PHA03247
large tegument protein UL36; Provisional
159-661 8.03e-24

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 110.80  E-value: 8.03e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  159 GPGPLLPMKRRGAETERHMipgngPPlamcHQPAPPelfetlcfPIDPAS-------SAPPKATHRMTITSLTGRPQVTS 231
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSV-----PP----PRPAPR--------PSEPAVtsrarrpDAPPQSARPRAPVDDRGDPRGPA 2612
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  232 DtlASSSPPQGTS-DTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSppqgtSDTPASSSPPQVTSA 310
Cdd:PHA03247  2613 P--PSPLPPDTHApDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL-----GRAAQASSPPQRPRR 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  311 TSASSSPPQGTS--DTPASSSPPQvtsatsassspPQGTSDTPASSSPPQGTLDTPSSSSPPQgtsdTPASSSPPQGTSe 388
Cdd:PHA03247  2686 RAARPTVGSLTSlaDPPPPPPTPE-----------PAPHALVSATPLPPGPAAARQASPALPA----APAPPAVPAGPA- 2749
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  389 TPASNSPPQgtseTPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSP-PQGTSDT 467
Cdd:PHA03247  2750 TPGGPARPA----RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAlPPAASPA 2825
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  468 PGFSSPTQVTTATLVSSSPPQVTSDTPASS----------SPPQVTSDTPASSSPPQVTS-ETPASSSPPQVTSDTSASI 536
Cdd:PHA03247  2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvapggdvrrrPPSRSPAAKPAAPARPPVRRlARPAVSRSTESFALPPDQP 2905
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  537 SPPQvisdTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPtnmtsdtpaSSSPPWPVITEVTRPES 616
Cdd:PHA03247  2906 ERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP---------SGAVPQPWLGALVPGRV 2972
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*
gi 1039789982  617 TIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETA 661
Cdd:PHA03247  2973 AVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEET 3017
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
1132-1247 6.16e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.58  E-value: 6.16e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1132 YLIQVYTGYRRRAATTAKVVITLYGSEGHS--EPHHLCDPEktvFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSW 1209
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESaqLEITLDNPD---FERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEW 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982 1210 YVSQVIV-SDMTTRKKWHFQCNCWLAVDLGNcERDRVFT 1247
Cdd:pfam01477   78 FLKSITVeVPGETGGKYTFPCNSWVYGSKKY-KETRVFF 115
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
345-670 4.16e-21

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 99.65  E-value: 4.16e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSPPqgtldTPSSSSPPQGTSDTPASSS-PPQGTSETPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQ 423
Cdd:pfam17823  104 EGAADGAASRALA-----AAASSSPSSAAQSLPAAIAaLPSEAFSAPRAAACRANASAAP--RAAIAAASAPHAASPAPR 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  424 V--------TSETPASSSPTQVTSETPASSSPTQVTSD------TPAS--------NSPPQGTSDTPGFSSPTQVTTATL 481
Cdd:pfam17823  177 TaassttaaSSTTAASSAPTTAASSAPATLTPARGISTaatatgHPAAgtalaavgNSSPAAGTVTAAVGTVTPAALATL 256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  482 VSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS---ET 558
Cdd:pfam17823  257 AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTlepNT 336
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  559 PASSSPTNMTSDT---------PASSSPTNMTSDTP--ASSSPTNMTSDTP---ASSSPPWPVITEVTRPESTipAGRSL 624
Cdd:pfam17823  337 PKSVASTNLAVVTttkaqakepSASPVPVLHTSMIPevEATSPTTQPSPLLptqGAAGPGILLAPEQVATEAT--AGTAS 414
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1039789982  625 ANITSKAQedsplGVISTHPQMSFQSSTSQQALDETAGERVPTIPD 670
Cdd:pfam17823  415 AGPTPRSS-----GDPKTLAMASCQLSTQGQYLVVTTDPLTPALVD 455
PHA03247 PHA03247
large tegument protein UL36; Provisional
142-629 1.62e-20

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 100.01  E-value: 1.62e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  142 QAAAFPPQGASIWRNEFGPGPllpmkrRGAETERHMIPGNGPPLAMCHQPAPPELF---ETLCFPIDPASSAPPKATHRM 218
Cdd:PHA03247  2613 PPSPLPPDTHAPDPPPPSPSP------AANEPDPHPPPTVPPPERPRDDPAPGRVSrprRARRLGRAAQASSPPQRPRRR 2686
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  219 TItsltgRPQVTSDTLASSSPPQGTsdTPASSSPPQVTSATSASSSPPQGTSD--TPASSSPPQVTSATSA----SSSPP 292
Cdd:PHA03247  2687 AA-----RPTVGSLTSLADPPPPPP--TPEPAPHALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATpggpARPAR 2759
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  293 QGTSDTPASSSPPQVtsatsasssppqgtsdtPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqg 372
Cdd:PHA03247  2760 PPTTAGPPAPAPPAA-----------------PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP-- 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 tSDTPASSSPPQgTSETPASNSPPQG---TSETPGFSSPPqvtTATLVSSSPPQVTSETPASSSPTQVTS-ETPASSSPT 448
Cdd:PHA03247  2821 -AASPAGPLPPP-TSAQPTAPPPPPGpppPSLPLGGSVAP---GGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRST 2895
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  449 QVTSDTPASNSPPQgtsdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQV 528
Cdd:PHA03247  2896 ESFALPPDQPERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR 2971
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  529 TSDTSASISPPQVISDTPASSSPPQVTSETP-----ASSSPTNMTSDTPASSSPTNM--TSDTPASSSPTNMTSDTPASS 601
Cdd:PHA03247  2972 VAVPRFRVPQPAPSREAPASSTPPLTGHSLSrvsswASSLALHEETDPPPVSLKQTLwpPDDTEDSDADSLFDSDSERSD 3051
                          490       500       510
                   ....*....|....*....|....*....|....
gi 1039789982  602 S------PPWPVITEVTRPESTIPAGRSLANITS 629
Cdd:PHA03247  3052 LealdplPPEPHDPFAHEPDPATPEAGARESPSS 3085
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
179-604 2.51e-20

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 99.09  E-value: 2.51e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  179 PGNGPPLAMCHQPAPPELFETLCFPIDPASSAPPKATHrmtitSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSA 258
Cdd:PHA03307    67 PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG-----SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  259 TSASSSPPQGTSDTPASSSPPQvtsatsasssppqGTSDTPASSSPPQvtsatsasssppQGTSDTPASSSPPQVTSATS 338
Cdd:PHA03307   142 GSPGPPPAASPPAAGASPAAVA-------------SDAASSRQAALPL------------SSPEETARAPSSPPAEPPPS 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  339 ASSSPPQGTSdtPASSSPPQGTLDTPSSSSPPQGTSDTPASSSppqGTSETPASNSPPQGTSETPGFSSPPQVTTATLVS 418
Cdd:PHA03307   197 TPPAAASPRP--PRRSSPISASASSPAPAPGRSAADDAGASSS---DSSSSESSGCGWGPENECPLPRPAPITLPTRIWE 271
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  419 SSPPQVTSETPASSSPtqVTSETPASSSPTQVTSDTPASNSPPQGTSDtpGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PHA03307   272 ASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGPAPSSPRASSS--SSSSRESSSSSTSSSSESSRGAAVSPGPSP 347
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  499 PPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PHA03307   348 SRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFY 427
                          410       420
                   ....*....|....*....|....*.
gi 1039789982  579 NMTSDTPASSSPtnmtsdTPASSSPP 604
Cdd:PHA03307   428 ARYPLLTPSGEP------WPGSPPPP 447
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
237-636 8.32e-20

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 97.55  E-value: 8.32e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  237 SSPPQGTSDTPASSSPPQVtsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGtsdtPASSSPPQVTSATSASSS 316
Cdd:PHA03307    25 PATPGDAADDLLSGSQGQL------------VSDSAELAAVTVVAGAAACDRFEPPTG----PPPGPGTEAPANESRSTP 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  317 PPQGTSDTPASSSPPQvtsatsaSSSPPQGTSDTPA-SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP 395
Cdd:PHA03307    89 TWSLSTLAPASPAREG-------SPTPPGPSSPDPPpPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  396 PQGTSETPGFSSPP-QVTTATLVSSSPPQvtSETPASSSPTQVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPT 474
Cdd:PHA03307   162 VASDAASSRQAALPlSSPEETARAPSSPP--AEPPPSTPPAAASPRPPRRSSPISASASSPAP-APGRSAADDAGASSSD 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  475 QVTTATLVSSSPPQvtSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPqVISDTPASSSPPQV 554
Cdd:PHA03307   239 SSSSESSGCGWGPE--NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS-SPGSGPAPSSPRAS 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP--------WPVITEVTRPESTIPAGRSLAN 626
Cdd:PHA03307   316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPrkrprpsrAPSSPAASAGRPTRRRARAAVA 395
                          410
                   ....*....|
gi 1039789982  627 ITSKAQEDSP 636
Cdd:PHA03307   396 GRARRRDATG 405
PHA03247 PHA03247
large tegument protein UL36; Provisional
144-607 1.75e-19

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 96.55  E-value: 1.75e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  144 AAFPPQGasiWRNEFGPGPLLPMkrrgAETERHMIPGNGPplamchQPAPPELFetlcfPIDPASSAPPKATHRMTITSL 223
Cdd:PHA03247  2676 ASSPPQR---PRRRAARPTVGSL----TSLADPPPPPPTP------EPAPHALV-----SATPLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSasssppqgtsdtPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA------------PAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  304 PPQVTSATSASSSPpqgTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGtldtPSSSSPPQGTSDTP----AS 379
Cdd:PHA03247  2806 DPPAAVLAPAAALP---PAASPAGPLPPP--------------TSAQPTAPPPPPG----PPPPSLPLGGSVAPggdvRR 2864
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  380 SSPPQGTSETPASNS-PPQGTSETPGFSSPPQvttaTLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN 458
Cdd:PHA03247  2865 RPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  459 SPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISP 538
Cdd:PHA03247  2941 PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPP 3020
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  539 PQVISDT---PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PHA03247  3021 PVSLKQTlwpPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPL 3092
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
171-619 1.77e-19

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 95.75  E-value: 1.77e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  171 AETERHMIPGNGPPLAMCHQPAppelFETLCFPIDPASSAPPKATH---RMTITSLTGRPQVTSDTlaSSSPPQGTSDTP 247
Cdd:pfam05109  412 ATTTTHKVIFSKAPESTTTSPT----LNTTGFAAPNTTTGLPSSTHvptNLTAPASTGPTVSTADV--TSPTPAGTTSGA 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  248 ASSSPPQVTSATSASSSPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPAS 327
Cdd:pfam05109  486 SPVTPSPSPRDNGTESKAPDMTSPTSAVTTP------------TPNATSPTPAVTTP------------TPNATSPTLGK 541
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  328 SSPpqvtsATSASSSPPQGTSDTPASSSP-PQGTLDTPSSSSPPQG-TSDTPASSSPPQGTSETPA--SNSPPQGTSETP 403
Cdd:pfam05109  542 TSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTVGETSPQAntTNHTLGGTSSTP 616
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  404 GFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPGFSSPTQVTTAtlvS 483
Cdd:pfam05109  617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPS-----TSDNSTSHMPLLTSAHPTGGENITQVTPA---S 688
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPPQVTSDTPASSSPPQVTSDTPASSSppqvTSETPASSSPPQVTSDTSAsiSPPQVISDTpaSSSPPQVTSETPASSS 563
Cdd:pfam05109  689 TSTHHVSTSSPAPRPGTTSQASGPGNSS----TSTKPGEVNVTKGTPPKNA--TSPQAPSGQ--KTAVPTVTSTGGKANS 760
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  564 PTNMTSDT----PASSSP-TNMTSDTPASSSPTNMTSDTPASSS----PPWPVIT-EVTRPESTIP 619
Cdd:pfam05109  761 TTGGKHTTghgaRTSTEPtTDYGGDSTTPRTRYNATTYLPPSTSsklrPRWTFTSpPVTTAQATVP 826
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
1131-1233 5.87e-19

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight stranded beta-barrel. The domain can be found in various domain architectures, in case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats.The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 84.31  E-value: 5.87e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHhLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1210
Cdd:cd00113      2 RYTVTIKTGDKKGAGTDSNISLALYGENGNSSDI-PILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWY 80
                           90       100
                   ....*....|....*....|...
gi 1039789982 1211 VSQVIVSDMTTRKKWHFQCNCWL 1233
Cdd:cd00113     81 CESITVQALGTKKVYTFPVNRWV 103
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
192-622 3.84e-18

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 91.77  E-value: 3.84e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  192 APPELFETLCFPIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPqvtsATSASSSPPQGTSD 271
Cdd:PHA03307    53 VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSP----DPPPPTPPPASPPP 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  272 TPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvtsatsasssppqGTSDTPASSSPPQvtsatsasSSPPQgTSDTP 351
Cdd:PHA03307   129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA-------------SDAASSRQAALPL--------SSPEE-TARAP 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ-----VTTATLVSSSPPQVTS 426
Cdd:PHA03307   187 SSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSgcgwgPENECPLPRPAPITLP 266
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 ETPASSSPTQVTSETPASSSPtqVTSDTPASNSPPQGTSDTPGFSSPtqvttATLVSSSPPQVTSDTPASSSPPQVTSDT 506
Cdd:PHA03307   267 TRIWEASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGPAPSS-----PRASSSSSSSRESSSSSTSSSSESSRGA 339
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  507 PASSSPPQVTSETPASSSPPQVTSDTSASISP---PQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD 583
Cdd:PHA03307   340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPsraPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1039789982  584 TPASSSPTNMTSDTPASSSpPWPvitEVTRPestiPAGR 622
Cdd:PHA03307   420 GAASGAFYARYPLLTPSGE-PWP---GSPPP----PPGR 450
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
207-576 1.70e-17

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 88.48  E-value: 1.70e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  207 ASSAPP------KATHRMTITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP 278
Cdd:pfam17823   62 AATAAPapvtltKGTSAAHLNSTEVTAEHTPHGTDLSEPAtrEGAADGAASRAL-----AAAASSSPSSAAQSLPAAIAA 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  279 PQVTSATSASSSPPQgtsdTPASSSPpqvtsatsasSSPPQGTSDTPASSSPPQvtsatSASSSPPQGTSDTPASSSPPQ 358
Cdd:pfam17823  137 LPSEAFSAPRAAACR----ANASAAP----------RAAIAAASAPHAASPAPR-----TAASSTTAASSTTAASSAPTT 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  359 GTLDTPSSSSPPQGTSDTPASSSPPQ---GTSETPASNSPPQGTSETPGFSSPPQV-----------TTATLVSSSPPQV 424
Cdd:pfam17823  198 AASSAPATLTPARGISTAATATGHPAagtALAAVGNSSPAAGTVTAAVGTVTPAALatlaaaagtvaSAAGTINMGDPHA 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  425 TSETPASSSPTQVTSETPASSS------PT-QVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASS 497
Cdd:pfam17823  278 RRLSPAKHMPSDTMARNPAAPMgaqaqgPIiQVSTDQPVHNTAGEPTP-SPSNTTLEPNTPKSVASTNLAVVTTTKAQAK 356
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  498 SPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisdtpassSPPQVTSE-TP--ASSSPTNMTS---DT 571
Cdd:pfam17823  357 EPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILL---------APEQVATEaTAgtASAGPTPRSSgdpKT 427

                   ....*
gi 1039789982  572 PASSS 576
Cdd:pfam17823  428 LAMAS 432
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
182-564 4.52e-17

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 88.05  E-value: 4.52e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  182 GPPLAMCHQPAP-PELFETLCFPIDPASSAPPKATHrmtitslTGRPQVTSDTLASSSP-PQGTSDTPASSSPpqvtsat 259
Cdd:pfam05109  465 GPTVSTADVTSPtPAGTTSGASPVTPSPSPRDNGTE-------SKAPDMTSPTSAVTTPtPNATSPTPAVTTP------- 530
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  260 sasssPPQGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDTPASSSPP--QVTS 335
Cdd:pfam05109  531 -----TPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTvgETSP 600
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  336 ATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSP-----PQ 410
Cdd:pfam05109  601 QANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAhptggEN 680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  411 VTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPT------QVTSDTPASN-SPPQGTSDTPGfSSPTQVTTATLVS 483
Cdd:pfam05109  681 ITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPKNaTSPQAPSGQKT-AVPTVTSTGGKAN 759
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPPQVTSDTPASSSPPQVTSDTPASSSPPQvTSETPASSSPPQvtsdTSASISPPQVISDTPASSSppQVTSETPASSS 563
Cdd:pfam05109  760 STTGGKHTTGHGARTSTEPTTDYGGDSTTPR-TRYNATTYLPPS----TSSKLRPRWTFTSPPVTTA--QATVPVPPTSQ 832

                   .
gi 1039789982  564 P 564
Cdd:pfam05109  833 P 833
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
179-576 7.79e-17

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 87.51  E-value: 7.79e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  179 PGNGPPLAMChQPAPPELFETLCFPIDPASSAPPKATHRMT--ITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPPQ 254
Cdd:pfam03154  186 PPPPGTTQAA-TAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTPTLHPQRLPSPHPPlqPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  255 VTSATSASssppqgtSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvt 334
Cdd:pfam03154  265 PLPQPSLH-------GQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ-- 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  335 satsasssppqgtSDTPASSSP-PQGTLDTPSSSSPPQgtsdTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTT 413
Cdd:pfam03154  336 -------------SQQPPREQPlPPAPLSMPHIKPPPT----TPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSP--TQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS 491
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPvlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  492 DTPASSSPPQVTSDTPASSSPPQVTSETPASssppqvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDT 571
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSSASVSSSGPVPAA---------VSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNT 549

                   ....*
gi 1039789982  572 PASSS 576
Cdd:pfam03154  550 PSHAS 554
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
206-577 8.33e-17

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 87.28  E-value: 8.33e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  206 PASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDtpaSSSPPQVTSATSASSSPPQGTSDTPASSSPpqvtsat 285
Cdd:pfam05109  461 PASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTE---SKAPDMTSPTSAVTTPTPNATSPTPAVTTP------- 530
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  286 sasssPPQGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDTPASSSPPQGTlDT 363
Cdd:pfam05109  531 -----TPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTVGE-TS 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSSPPQ---GTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETP--ASSSPT--- 435
Cdd:pfam05109  600 PQANTTNHtlgGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllTSAHPTgge 679
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS-PTQVTTATLVSSSPPQVTSDTPASS----SPPQVTSDTPASS 510
Cdd:pfam05109  680 NITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSStSTKPGEVNVTKGTPPKNATSPQAPSgqktAVPTVTSTGGKAN 759
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  511 SP---PQVTSETPASSSPPqvTSDTSASISPPQVISDT-----PASSSP--PQVTSETPASSSpTNMTSDTPASSSP 577
Cdd:pfam05109  760 STtggKHTTGHGARTSTEP--TTDYGGDSTTPRTRYNAttylpPSTSSKlrPRWTFTSPPVTT-AQATVPVPPTSQP 833
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
203-603 4.73e-16

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 83.97  E-value: 4.73e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTlaSSSPPQGTSDTPA----SSSPPQV-----TSATSASSSPPQGTSDTP 273
Cdd:pfam03546   67 PRKGAPPVPPGKTGPAAAQAQAGKPEEDSES--SSEESDSDGETPAaatlTTSPAQVkplgkNSQVRPASTVGKGPSGKG 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  274 ASSSPPQVTSATSASSSPPQGTSDTPASS--------SPPQVTSATSASSSPPQGTSDTP---ASSSPPQVTSATSASSS 342
Cdd:pfam03546  145 ANPAPPGKAGSAAPLVQVGKKEEDSESSSeesdsegeAPPAATQAKPSGKILQVRPASGPakgAAPAPPQKAGPVATQVK 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  343 PPQGTSDT-------------PASSSPPQG--TLDTPSS-SSPPQGTSDTPASSSPPQGTSETPASNSppQGTSETPGFS 406
Cdd:pfam03546  225 AERSKEDSesseessdseeeaPAAATPAQAkpALKTPQTkASPRKGTPITPTSAKVPPVRVGTPAPWK--AGTVTSPACA 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  407 SPPQVTTAT----LVSSSPPQVTSETPASSSPTQVTSET-------PASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQ 475
Cdd:pfam03546  303 SSPAVARGAqrpeEDSSSSEESESEEETAPAAAVGQAKSvgkglqgKAASAPTKGPSGQGTAPVPPGKTG-----PAVAQ 377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  476 VTTATLVSSSPPQVTSD------TPASSSPPQVTSDTPASSSPPQVTSETPASSSP-------PQVTSDTSASISPPQVI 542
Cdd:pfam03546  378 VKAEAQEDSESSEEESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAPgkvvaaaAQAKQGSPAKVKPPART 457
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  543 SDTPASSSPPQ---------VTSETPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546  458 PQNSAISVRGQasvpavgkaVATAAQAQKGPVGGPQEEDSESSEEESDSEeeAPAQAKPSGKTPQVRAASAP 529
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
211-608 1.52e-15

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 83.28  E-value: 1.52e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  211 PPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTpASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSS 290
Cdd:pfam03154   94 PERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDE-GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPP 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  291 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSD-----TPASSSPPQVTSATSASSSPPQGTSDT-----PASSSPPQG- 359
Cdd:pfam03154  173 VLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSvppqgSPATSQPPNQTQSTAAPHTLIQQTPTLhpqrlPSPHPPLQPm 252
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  360 TLDTPSSSSPPQGTSDT-------PASSS------------PPQG---TSETPASNSPPQGTSETPGFSSPPQVTTATLV 417
Cdd:pfam03154  253 TQPPPPSQVSPQPLPQPslhgqmpPMPHSlqtgpshmqhpvPPQPfplTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQS 332
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  418 SSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgFSSPTQVTT-------ATLVSSSPP--- 487
Cdd:pfam03154  333 QLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSP-FQMNSNLPPppalkplSSLSTHHPPsah 411
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  488 ----QVTSDTPASSSPP-------QVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:pfam03154  412 ppplQLMPQSQQLPPPPaqppvltQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPG 491
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  557 ETPASSSPTNMTSDTPASSS----PTNMTSDTPASSSPTNMTSDTPASSSPPWPVI 608
Cdd:pfam03154  492 IQPPSSASVSSSGPVPAAVScplpPVQIKEEALDEAEEPESPPPPPRSPSPEPTVV 547
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
373-609 4.93e-15

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 80.57  E-value: 4.93e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  453 DTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPpqVTSDT 532
Cdd:COG3469     81 TATAAAAAA---------TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSA--GSTTT 149
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSsptnmtsdTPASSSPPWPVIT 609
Cdd:COG3469    150 TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT--------GPPTPGLPKHVLV 218
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
323-635 5.54e-15

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 80.50  E-value: 5.54e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  323 DTPASSSPPQVTSATSASSSppQGTSdTPASSSPPQGTLDTPSSSSPPQGTS--------DTPASSSPPQGTSETPASNS 394
Cdd:pfam03546   37 ETPAAKTPLQAKPSGKTPQV--RAAS-APAKESPRKGAPPVPPGKTGPAAAQaqagkpeeDSESSSEESDSDGETPAAAT 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  395 PPQGTSETPGFSSPPQVTTATLVSSSP------PQVTSETPASSSPTQVTSETPASSSPTQvTSDTPASNSPPQGTSDTP 468
Cdd:pfam03546  114 LTTSPAQVKPLGKNSQVRPASTVGKGPsgkganPAPPGKAGSAAPLVQVGKKEEDSESSSE-ESDSEGEAPPAATQAKPS 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  469 GFSSPTQVTT--ATLVSSSPPQVTSdtPASSsppQVTSDTP---ASSSPPQVTSE-------TPASSSPPQVTSDTSAsi 536
Cdd:pfam03546  193 GKILQVRPASgpAKGAAPAPPQKAG--PVAT---QVKAERSkedSESSEESSDSEeeapaaaTPAQAKPALKTPQTKA-- 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  537 SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT------- 609
Cdd:pfam03546  266 SPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAAAVGQaksvgkg 345
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1039789982  610 ---------------EVTRPESTIPAGRSLANITSKAQEDS 635
Cdd:pfam03546  346 lqgkaasaptkgpsgQGTAPVPPGKTGPAVAQVKAEAQEDS 386
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
183-658 1.45e-14

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 79.81  E-value: 1.45e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  183 PPLAMCHQPAPPelfetlcfpidPASSAPPKAThrmtitsltgrpqvtsdTLASSSPPQGTSDTPASSSPPqvtsatsaS 262
Cdd:pfam03154  171 PPVLQAQSGAAS-----------PPSPPPPGTT-----------------QAATAGPTPSAPSVPPQGSPA--------T 214
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  263 SSPPQGTSDTPASSSPPQvtsatsasssppQGTSDTPASSSPPQvtsatsasSSPPQGTSDTPASSSPPQvtsatsasss 342
Cdd:pfam03154  215 SQPPNQTQSTAAPHTLIQ------------QTPTLHPQRLPSPH--------PPLQPMTQPPPPSQVSPQ---------- 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  343 ppqgtSDTPASSSPPQGTLDTPSSSSPPQgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPP 422
Cdd:pfam03154  265 -----PLPQPSLHGQMPPMPHSLQTGPSH--MQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgFSSPtqvttatlvSSSPPqvtsdTPASSSPPQV 502
Cdd:pfam03154  338 QPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSP-FQMN---------SNLPP-----PPALKPLSSL 402
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  503 TSDTPASSSPP--QVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETP-------ASSSPTNMTSDTPA 573
Cdd:pfam03154  403 STHHPPSAHPPplQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqhpfvPGGPPPITPPSGPP 482
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  574 SSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT-------EVTRPESTIPAGRSlanitskaqeDSPLGVISTHPQM 646
Cdd:pfam03154  483 TSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQikeealdEAEEPESPPPPPRS----------PSPEPTVVNTPSH 552
                          490
                   ....*....|..
gi 1039789982  647 SFQSSTSQQALD 658
Cdd:pfam03154  553 ASQSARFYKHLD 564
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
320-674 2.70e-14

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 78.15  E-value: 2.70e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  320 GTSDTPA----SSSPPQvtsatsasssppQGTSDTPASSSPPQGTLDTPSSSSPPQGT-SDTPA----SSSPPQGT-SET 389
Cdd:COG5164     24 QGSTKPAqnqgSTRPAG------------NTGGTRPAQNQGSTTPAGNTGGTRPAGNQgATGPAqnqgGTTPAQNQgGTR 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  390 PASNSPPQGTSETPGFSSPPQVTTATLV-----SSSPPQVTSETPASssPTQVTSETPASSSPTQVTsdTPASNSPPQGT 464
Cdd:COG5164     92 PAGNTGGTTPAGDGGATGPPDDGGATGPpddggSTTPPSGGSTTPPG--DGGSTPPGPGSTGPGGST--TPPGDGGSTTP 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  465 SDTPGFSSPTQVTTATlvssSPPQvtsdtPASSSPPQVTSDTPAS--SSPPQVTSETPASSSPPQVTSDTSASispPQVI 542
Cdd:COG5164    168 PGPGGSTTPPDDGGST----TPPN-----KGETGTDIPTGGTPRQgpDGPVKKDDKNGKGNPPDDRGGKTGPK---DQRP 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  543 SDTPASSSPPQVTSETPASSSPTNmTSDTPASSSPTNMTSDTPASSS--PTNMTSDTPASSSPPWPVITEVTRPESTIPA 620
Cdd:COG5164    236 KTNPIERRGPERPEAAALPAELTA-LEAENRAANPEPATKTIPETTTvkDLATVLGKKGSDLVTNLMKKGKGTNINAALD 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  621 GRSLANITSKAQedsPLGVISTHPQMSFQSSTSQQALDETAGER--VPTIPDFQAH 674
Cdd:COG5164    315 FETAATIALEGN---VITEKEIEADIMETVTTEEQETDSLLEETppVPVVMGHVDH 367
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
186-637 1.24e-13

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 76.63  E-value: 1.24e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  186 AMCHQPAPPELFETLCFPIDPASSAPpkATHRMTITSLTgrPQVTSDTLASSSPP--------QGTSD----TPASSSPP 253
Cdd:COG5665    178 AVPSAPAAPPNAVDYSVLVPIAAQDP--AASVSTPQAFN--ASATSGRSQHIVQAakrvgvewWGDPSllatPPATPATE 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  254 QVTSATSASSSPPQGTSDTPASSSPpqvtsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQV 333
Cdd:COG5665    254 EKSSQQPKSQPTSPSGGTTPPSTNQ--------------LTTSNTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSV 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  334 TSAtsasssppqgtSDTPASSSPPqgtldTPSSSSPPQGTSDTPASSSPpqgtsetpasnsppqgtsetpgfSSPPQVTT 413
Cdd:COG5665    320 LIN-----------SDSPTSEDPA-----TASVPTTEETTAFTTPSSVP-----------------------STPAEKDT 360
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  414 ATLVSSSPPQVTSetPASSSPTQVTSETPASSSPTQVTSDTPASNSPP---QGTSDTPGFSSPTQvtTATLVSSSPPQvt 490
Cdd:COG5665    361 PATDLATPVSPTP--PETSVDKKVSPDSATSSTKSEKEGGTASSPMPPniaIGAKDDVDATDPSQ--EAKEYTKNAPM-- 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  491 sdTPASSSPPQVTSDTPAS---SSPPQVTSET---------PASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQ 553
Cdd:COG5665    435 --TPEADSAPESSVRTEASpsaGSDLEPENTTlrdpapnaiPPPEDPSTIGRLSSGDklaneTGPPVIRRDSTPSSTADQ 512
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  554 VTSETPASSSPTNMTSdtpASSSPTNMTSDTPASSSPTNMTSDTpaSSSPPWPvITEVTRPESTIpAGRSLANITSKAQE 633
Cdd:COG5665    513 SIVGVLAFGLDQRTQA---EISVEAASRSNPLLNSQVKSFPLGK--RSEGAKG-KTQTDRGISNA-LVNASALITNLKSA 585

                   ....
gi 1039789982  634 DSPL 637
Cdd:COG5665    586 ARRS 589
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
351-605 3.08e-13

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 75.66  E-value: 3.08e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQ---VTSE 427
Cdd:PRK07003   362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAtadRGDD 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  428 TPASSSPTQVTSETPASSSPT-QVTSDTPASNSPPQG--TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK07003   442 AADGDAPVPAKANARASADSRcDERDAQPPADSGSASapASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  505 DTPASSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISD------TPASSSPPQVTSETPASSSPtnmTSDTPA 573
Cdd:PRK07003   522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALdvlrnAGMRVSSDrgaraaAAAKPAAAPAAAPKPAAPRV---AVQVPT 598
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1039789982  574 SSSPTNMTSDTPASSSPTNMTSDTPaSSSPPW 605
Cdd:PRK07003   599 PRARAATGDAPPNGAARAEQAAESR-GAPPPW 629
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
209-603 5.85e-13

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 73.95  E-value: 5.85e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  209 SAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSD--TPASSSPPQVTSATSASSSppQGTSdTPASSSP-----PQV 281
Cdd:pfam03546    2 PATPGKAGPAATQAKAGKPEEDSE---SSSEEESDSEeeTPAAKTPLQAKPSGKTPQV--RAAS-APAKESPrkgapPVP 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  282 TSATSASSSPPQ-GTSDTPASSSPPQVTSATSASSSPPQGTSdtPASSSP----PQVTSATSASS-SPPQGTSDTPASSS 355
Cdd:pfam03546   76 PGKTGPAAAQAQaGKPEEDSESSSEESDSDGETPAAATLTTS--PAQVKPlgknSQVRPASTVGKgPSGKGANPAPPGKA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  356 PPQGTL------DTPSSSSPPQGTSDTPAsssPPQGTSETPASNSPPQGTSETP---GFSSPPQVT--TATLVSSSPPQV 424
Cdd:pfam03546  154 GSAAPLvqvgkkEEDSESSSEESDSEGEA---PPAATQAKPSGKILQVRPASGPakgAAPAPPQKAgpVATQVKAERSKE 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  425 TSETPASSSPTQvtSETPASSSPTQV-----TSDTPAsnSPPQGTSDTPgfsSPTQVTTATLVSSSPPQV-TSDTPASSS 498
Cdd:pfam03546  231 DSESSEESSDSE--EEAPAAATPAQAkpalkTPQTKA--SPRKGTPITP---TSAKVPPVRVGTPAPWKAgTVTSPACAS 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  499 PPQVTSDT--PASSSPPQVTSETPASSSPP----QVTS-----DTSASISPPQVISDTPASSSPP--------QVTSETP 559
Cdd:pfam03546  304 SPAVARGAqrPEEDSSSSEESESEEETAPAaavgQAKSvgkglQGKAASAPTKGPSGQGTAPVPPgktgpavaQVKAEAQ 383
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039789982  560 ASSSPTNMTSD------TPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546  384 EDSESSEEESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAP 433
DUF2967 pfam11179
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ...
353-754 1.73e-12

Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.


Pssm-ID: 402654 [Multi-domain]  Cd Length: 954  Bit Score: 73.14  E-value: 1.73e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  353 SSSPPQGTLDTPSSSsPPQGTSDTPASSSPPQGTSET--PASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQVTSetpa 430
Cdd:pfam11179   15 SSAPPHAALAGPITA-APTGAAAAAATSTAAASAASStiTAPGAGPGGTPTSR--SRGAQAMTASLAHAAQGNANA---- 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  431 ssspTQVTSETPASSSPTQVTSDTPAsnsppqgtsdtpGFSSptqVTTATLVSSSPPQVTSDTpaSSSPPQvtsdTPASS 510
Cdd:pfam11179   88 ----NKSTRNNSNSSNNNGKPKPLAA------------CYMS---TRSAAMMALALGQQSGEK--KDKKPA----AGKAA 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  511 SPPQVTSETPASSSPPQvTSDTSASISPPQVISDTPASSSPPQV-----TSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:pfam11179  143 SPAQSQSQSQSQNASPH-TNNRAVSMTRPAATRRLPNAAAMSNVnaansTCTATATSLPSNRARSKPSTPTATRAAAQLN 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  586 AS---SSPTNMT-SDTPASSSPPWPVITEVTRpeSTIPAGRSLAN-ITSKAQEDSPLGVISTHP-----QMSFQSSTSQQ 655
Cdd:pfam11179  222 GMgifSGGSNSSgSDNDGFSASGSSAATALRR--LYFKSGRSIKNkINASTSSSTPLNGLPLNAvsnafHNSVGGATAMH 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  656 ALDETAGerVP----------TIPDFQAHSEFQKACAILQRL--RDFLPTSPTSAQKNNSWSSQtPAVSCPFQPLgrLTT 723
Cdd:pfam11179  300 AMGTAGG--VPklvvmgtssaSIPDTTINTSTDSACTLITNVthTDTSETCDSLDLGDNSGPSE-PLFSSLEEPL--LTA 374
                          410       420       430
                   ....*....|....*....|....*....|.
gi 1039789982  724 TEksshqmaqqdmeqhpMDGAHNAFGISAGG 754
Cdd:pfam11179  375 IH---------------IDSEHEGFGGMAGG 390
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
147-513 4.78e-12

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 71.74  E-value: 4.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASIWRNEFGPGPLLP--MKRRGAETERHMIPGNGPPlAMCHQPAPPELFETLCFPIDPASSAPPKATHRMTITSLT 224
Cdd:PHA03307    79 APANESRSTPTWSLSTLAPasPAREGSPTPPGPSSPDPPP-PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  225 GRPQVTSDT-------LASSSPPQG--TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP---------------PQ 280
Cdd:PHA03307   158 SPAAVASDAassrqaaLPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPapapgrsaaddagasSS 237
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  281 VTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassSPPQGTSDTPASSSPPQGT 360
Cdd:PHA03307   238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS----------SSPRERSPSPSPSSPGSGP 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSS-----SSPPQGTSDTPASSSPPqgtSETPASnSPPQGTSETPGFSSPPqvttatlvSSSPPQVTSETPASSSPT 435
Cdd:PHA03307   308 APSSPRassssSSSRESSSSSTSSSSES---SRGAAV-SPGPSPSRSPSPSRPP--------PPADPSSPRKRPRPSRAP 375
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtsdTPASSSPP 513
Cdd:PHA03307   376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP------WPGSPPPP 447
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
1131-1236 4.96e-12

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 64.20  E-value: 4.96e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  1131 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNsgDSPSWY 1210
Cdd:smart00308    2 KYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEH--RHPEWF 79
                            90       100
                    ....*....|....*....|....*.
gi 1039789982  1211 VSQVIVSDMTTRKKWHFQCNCWLAVD 1236
Cdd:smart00308   80 LKSITVKDLPTGGKYHFPCNSWVYPD 105
DUF2967 pfam11179
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ...
236-669 7.61e-12

Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.


Pssm-ID: 402654 [Multi-domain]  Cd Length: 954  Bit Score: 71.22  E-value: 7.61e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  236 SSSPPQGTSDTPASSsPPQVTSATSASSsppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASS 315
Cdd:pfam11179   15 SSAPPHAALAGPITA-APTGAAAAAATS-----TAAASAASSTITAPGAGPGGTPTSRSRGAQAMTASLAHAAQGNANAN 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  316 SPPQGTSDTPASSSPPQ------VTSATSASSSPPQGTSDTPASSSPPQgtldTPSSSSPPQGTSDTPASSSPPQgTSET 389
Cdd:pfam11179   89 KSTRNNSNSSNNNGKPKplaacyMSTRSAAMMALALGQQSGEKKDKKPA----AGKAASPAQSQSQSQSQNASPH-TNNR 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  390 PASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTsetpASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:pfam11179  164 AVSMTRPAATRRLPNAAAMSNVNAANSTCTATATSLPSNRARSKPSTPT----ATRAAAQLNGMGIFSGGSNSSGSDNDG 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  470 FS---SPTQVTTATLVSSSPPQVTSDTPASSsppqvTSDTPASSSPPQVTSET--PASSSPPQVTSDTSASISPPQVISD 544
Cdd:pfam11179  240 FSasgSSAATALRRLYFKSGRSIKNKINAST-----SSSTPLNGLPLNAVSNAfhNSVGGATAMHAMGTAGGVPKLVVMG 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  545 TpASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNmtsdtPASSSPPWPVITEVT-----RPESTIP 619
Cdd:pfam11179  315 T-SSASIPDTTINTSTDSACTLITNVTHTDTSETCDSLDLGDNSGPSE-----PLFSSLEEPLLTAIHidsehEGFGGMA 388
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  620 AGRSLANITSKAQ-EDSPLGVISTHPQMSFQSSTSQQ--ALDETAGERVPTIP 669
Cdd:pfam11179  389 GGRGGANGRGATElELTSCSRYPPRPDMNLQDSTESQesCLSILTGEPSSTTP 441
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
442-612 1.06e-11

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 69.99  E-value: 1.06e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvTSDTPASSSPPQVTSETP- 520
Cdd:pfam17823   48 PRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREG---AADGAASRALAAAASSSPs 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  521 --ASSSPPQVTSDTSASISPPQvisdTPASSSPPQVTSETP--ASSSPTNMTSDT-PASSSPTNMTSDTPASSSPTNMTS 595
Cdd:pfam17823  125 saAQSLPAAIAALPSEAFSAPR----AAACRANASAAPRAAiaAASAPHAASPAPrTAASSTTAASSTTAASSAPTTAAS 200
                          170
                   ....*....|....*..
gi 1039789982  596 DTPASSSPPWPVITEVT 612
Cdd:pfam17823  201 SAPATLTPARGISTAAT 217
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
232-643 1.21e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 70.40  E-value: 1.21e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  232 DTLASSSPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSAT 311
Cdd:PRK07764   379 ERLERRLGVAGGAGAPAAAAPSA--------------AAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  312 SASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPA-SSSPPQ------ 384
Cdd:PRK07764   445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATlRERWPEilaavp 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  385 -------GTSETPASNSPPQGTSETPGFSSPP------QVTTATLVSSSPPQVTSET----------PASSSPTQVTSET 441
Cdd:PRK07764   525 krsrktwAILLPEATVLGVRGDTLVLGFSTGGlarrfaSPGNAEVLVTALAEELGGDwqveavvgpaPGAAGGEGPPAPA 604
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07764   605 SSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  522 SSSPPQVtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP---------ASSSPTN 592
Cdd:PRK07764   685 PAPAAPA---APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPddppdpagaPAQPPPP 761
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  593 MTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH 643
Cdd:PRK07764   762 PAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEE 812
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
352-604 2.07e-11

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 68.27  E-value: 2.07e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPAS 431
Cdd:pfam13254   58 PGLSPTKLSREGSPESTSRPSSSHSEATIVRHSKDDERPSTPDEGFVKPALPRHSRSSSALSNTGSEEDSPS-LPTSPPS 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  432 SSPTQVT---SETPAS---------SSPTQvtSDTPASNSPP------------QGTSDTPGFSSPTQVTTATLVSSSPP 487
Cdd:pfam13254  137 PSKTMDPkrwSPTKSSwlesalnrpESPKP--KAQPSQPAQPawmkelnkirqsRASVDLGRPNSFKEVTPVGLMRSPAP 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  488 QVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTS--ASISPPQVISDTPASS-----SPPQVTSETPA 560
Cdd:pfam13254  215 GGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPppKTKELPKDSEEPAAPSksaeaSTEKKEPDTES 294
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  561 SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP 604
Cdd:pfam13254  295 SPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPP 338
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
364-621 2.26e-11

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 69.18  E-value: 2.26e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSSPPQGT--SDTPASSSPPQGTSETPASNS---PPQGTSETPGFSSPpqvtTATLVSSSPPqvTSETPASssptqvT 438
Cdd:PLN03209   324 PSQRVPPKESdaADGPKPVPTKPVTPEAPSPPIeeePPQPKAVVPRPLSP----YTAYEDLKPP--TSPIPTP------P 391
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  439 SETPASSSPTQVTS--DTPASNSPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtSDTPASSSPPQVT 516
Cdd:PLN03209   392 SSSPASSKSVDAVAkpAEPDVVPSPGSASNVPE-VEPAQVEAKKTRPLSPYARYEDLKPPTSP----SPTAPTGVSPSVS 466
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  517 SETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA-SSSPTNMTS 595
Cdd:PLN03209   467 STSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSApPTALADEQH 546
                          250       260
                   ....*....|....*....|....*...
gi 1039789982  596 DTPASSSP--PWPVITEVTRPESTIPAG 621
Cdd:PLN03209   547 HAQPKPRPlsPYTMYEDLKPPTSPTPSP 574
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
203-665 3.18e-11

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 68.13  E-value: 3.18e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPA----SSSPPQvtsatsasssppQGTSDTPA---- 274
Cdd:COG5164      3 LYGPGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAqnqgSTTPAG------------NTGGTRPAgnqg 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  275 SSSPPQvtsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASS 354
Cdd:COG5164     71 ATGPAQnqg--------gtTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGS 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  355 SPPQGTLDTPSSSSPPQGtsDTPASSSPPQGTSETPA----SNSPPQGTSETPGfssPPQVTTATLVSSSPPQVTSETPA 430
Cdd:COG5164    143 TPPGPGSTGPGGSTTPPG--DGGSTTPPGPGGSTTPPddggSTTPPNKGETGTD---IPTGGTPRQGPDGPVKKDDKNGK 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  431 SSSPTQVTSETPAsSSPTQVTSdtPASNSPPQGTSDTPGFSSPTQvttatlvSSSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:COG5164    218 GNPPDDRGGKTGP-KDQRPKTN--PIERRGPERPEAAALPAELTA-------LEAENRAANPEPATKTIPETTTVKDLAT 287
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  511 SPPQVTSETPASS------SPPQVTSDTsasiSPPQVISDTpassspPQVTSETPASSSPT-NMTSDTPASSSPTNMTSD 583
Cdd:COG5164    288 VLGKKGSDLVTNLmkkgkgTNINAALDF----ETAATIALE------GNVITEKEIEADIMeTVTTEEQETDSLLEETPP 357
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  584 TPasssPTNMTSDTPASSSPPWPVITEVTRPESTIP-----AGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALD 658
Cdd:COG5164    358 VP----VVMGHVDHGKTSLLDAIRHSDVTDGEVGTIsqhigAYTVQIAGTPITFLDTPGFESFTAMAMRVAQITDIAILV 433

                   ....*..
gi 1039789982  659 ETAGERV 665
Cdd:COG5164    434 VAADDGD 440
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
417-595 5.80e-11

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 67.00  E-value: 5.80e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  417 VSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfssptqvTTATLVSSSPPQVTSDTPAS 496
Cdd:pfam05539  157 LRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTS 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  497 SSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPASSS 576
Cdd:pfam05539  228 SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRR---KTPPATSNRRSPHSTATPPPTTKRQETGRPTPR 304
                          170       180
                   ....*....|....*....|....*
gi 1039789982  577 PTNMT--SDTPASSSPT----NMTS 595
Cdd:pfam05539  305 PTATTqsGSSPPHSSPPgvqaNPTT 329
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
417-717 6.18e-11

Hamartin protein; This family includes the hamartin protein which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein pfam03542. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin pfam03542. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 67.78  E-value: 6.18e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  417 VSSSPPqvTSETPASSSPTQVTSETP-ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQV-TSDTP 494
Cdd:pfam04388  276 PTASPY--TDQQSSYGSSTSTPSSTPrLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGMTTPPTSPGMVpTTPSE 353
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  495 ASSSPPQVTSDtpaSSSPPQVTSE-----TPASSSPPQVTSDTSASISPPQVISDTPASSSPPQvTSETPASSSP----- 564
Cdd:pfam04388  354 LSPSSSHLSSR---GSSPPEAAGEatpetTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPR-KDGRSQSSFPplskq 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  565 --TNMTSDTPASSSPTNMTSDTpasSSPTNMTSDTpaSSSPPWPVITEVTRPESTipagRSLANITSKAQEdsplgviST 642
Cdd:pfam04388  430 apTNPNSRGLLEPPGDKSSVTL---SELPDFIKDL--ALSSEDSVEGAEEEAAIS----QELSEITTEKNE-------TD 493
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  643 HPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEFQKACailqrlrDFLPTSPTSAQKNNSWSSQTPAVSCPFQP 717
Cdd:pfam04388  494 CSRGGLDMPFSRTMESLAGSQRSRNRIASYCSSTSQSDS-------HGPATTPESKPSALAEDGLRRTKSCSFKQ 561
PHA03378 PHA03378
EBNA-3B; Provisional
183-624 1.71e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 66.63  E-value: 1.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  183 PPLAMCHQPAPPELFETLCFPID-PASSAPPKAThrmtITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSA 261
Cdd:PHA03378   529 PPQPRAGRRAPCVYTEDLDIESDePASTEPVHDQ----LLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQ 604
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  262 SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDT-------PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 334
Cdd:PHA03378   605 TPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPItfnvlvfPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTM 684
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  335 SATSASSSPPQGTSDTPASSSPPQGtldTPSSSSPPQGtsdTPASSSPPQGtseTPASNSPPQGtseTPGFSSPPQvttA 414
Cdd:PHA03378   685 LPIQWAPGTMQPPPRAPTPMRPPAA---PPGRAQRPAA---ATGRARPPAA---APGRARPPAA---APGRARPPA---A 749
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  415 TLVSSSPPQvtsetpASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdTPGFSSPTQVTTATLVSSSPPQVTSDTP 494
Cdd:PHA03378   750 APGRARPPA------AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRG---APTPQPPPQAGPTSMQLMPRAAPGQQGP 820
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  495 ASSSPPQ-----VTSDTPASSSPPQVTSETPASSSP-PQvtSDTSASISPPQVIsdTPASSSPPQVtsetpasssPTNMT 568
Cdd:PHA03378   821 TKQILRQlltggVKRGRPSLKKPAALERQAAAGPTPsPG--SGTSDKIVQAPVF--YPPVLQPIQV---------MRQLG 887
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  569 SDTPASSSptnmtsdtPASSSPTNMTSDTPASSSPPWPVITEVTR------PESTIPAGRSL 624
Cdd:PHA03378   888 SVRAAAAS--------TVTQAPTEYTGERRGVGPMHPTDIPPSKRaktdayVESQPPHGGQS 941
PHA03255 PHA03255
BDLF3; Provisional
438-608 1.82e-10

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 63.38  E-value: 1.82e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  438 TSETPASSSPTQVTSDTPAsnsppqgTSDTPGFSSPTQVTTATLVSSSPPqVTSDTPASSSPPQVTSdTPASSSPPQVTS 517
Cdd:PHA03255    25 TSSGSSTASAGNVTGTTAV-------TTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  518 ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDT 597
Cdd:PHA03255    96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQ 175
                          170
                   ....*....|...
gi 1039789982  598 PASSS--PPWPVI 608
Cdd:PHA03255   176 PSLSYglPLWTLV 188
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
347-677 2.75e-10

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 66.17  E-value: 2.75e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQGTLDTPSSSSPPQGTS------DTPassSPPQGTSE-TPAS----NSP-PQGT----SETPgfSSPPQ 410
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRDEATpsiameNTP---SPPRRTAKiTPTTpknnYSPtAAGTervkEDTP--ATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  411 VTTATLVSSSPPQVTSETPA------SSSPTQVTSE----TPaSSSPTQVTSDTPAS-NSPPQGTSDTPGFSSPTQVTTA 479
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKvrkyTP-SPLGRMVNSYAPSTfMTMPRSHGITPRTTVKDSEITA 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  480 T---LVSSSPPQV---TSDTP----ASSSPPQVTSDTPAS--SSPPQVTsETPASSSPPQVTSDTSA---------SISP 538
Cdd:TIGR00927  230 TykmLETNPSKRTagkTTPTPlkgmTDNTPTFLTREVETDllTSPRSVV-EKNTLTTPRRVESNSSTnhwglvgknNLTT 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  539 PQ--VISDTPASSSPpQVTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTsdtpaSSSPPWPVITEVTRPES 616
Cdd:TIGR00927  309 PQgtVLEHTPATSEG-QVTISIMTGSSPA----ETKASTAAWKIRNPLSRTSAPAVRI-----ASATFRGLEKNPSTAPS 378
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  617 TIPAGRSLANITSKAQE---DSPLGVISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEF 677
Cdd:TIGR00927  379 TPATPRVRAVLTTQVHHcvvVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDLHPKAEY 442
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
420-634 4.65e-10

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 64.56  E-value: 4.65e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  420 SPPQ-VTSETPASSSPTQVTSETPASssptqvtsdtPASNSPPQGTSDTPGFSSPTqvttATLVSSSPPQVTSDTPASSS 498
Cdd:PLN03209   329 PPKEsDAADGPKPVPTKPVTPEAPSP----------PIEEEPPQPKAVVPRPLSPY----TAYEDLKPPTSPIPTPPSSS 394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  499 PPQVTS-------DTPASSSPPQVTSETPASSsPPQVTSDTSASISPPQVISD--TPASSSP-PQVTSETPASSSPT-NM 567
Cdd:PLN03209   395 PASSKSvdavakpAEPDVVPSPGSASNVPEVE-PAQVEAKKTRPLSPYARYEDlkPPTSPSPtAPTGVSPSVSSTSSvPA 473
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  568 TSDTPASSSPTNMTSDTPASSSPTNMTS-----DTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQED 634
Cdd:PLN03209   474 VPDTAPATAATDAAAPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQ 545
Ion_trans pfam00520
Ion transport protein; This family contains sodium, potassium and calcium ion channels. This ...
1933-2124 9.73e-10

Ion transport protein; This family contains sodium, potassium and calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane.


Pssm-ID: 459842 [Multi-domain]  Cd Length: 238  Bit Score: 61.13  E-value: 9.73e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1933 RLAFFTRKRNLLDTSIVLISFSILGLSMQSLSLLhkkmqqyhcdrdrfisfyeaLRVnsavthLRGFLLLfatvRVWDLL 2012
Cdd:pfam00520   60 KKRYFRSPWNILDFVVVLPSLISLVLSSVGSLSG--------------------LRV------LRLLRLL----RLLRLI 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 2013 RHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFG----------WSISDYQSFFRSIVTVVGLL----MGT 2078
Cdd:pfam00520  110 RRLEGLRTLVNSLIRSLKSLGNLLLLLLLFLFIFAIIGYQLFGgklktwenpdNGRTNFDNFPNAFLWLFQTMttegWGD 189
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1039789982 2079 SKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAFGKERK 2124
Cdd:pfam00520  190 IMYDTIDGKGEFWAYIYFVSFIILGGFLLLNLFIAVIIDNFQELTE 235
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
364-577 1.07e-09

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 63.14  E-value: 1.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfssppqvTTATLVSSSPPQVTSETPASSSPTQVTSETPA 443
Cdd:pfam05539  169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTSSNPEPQTEPPPS 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  444 SSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVT-------TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQvT 516
Cdd:pfam05539  240 QRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTppatsnrRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH-S 318
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  517 SETPASSSPPQVTSDTSASISPPQVIS-----DTPASSSPPQVTSETPASSSPTNMTSDTPASSSP 577
Cdd:pfam05539  319 SPPGVQANPTTQNLVDCKELDPPKPNSicygvGIYNEALPRGCDIVVPLCSTYTIMCMDTYYSKPF 384
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
345-514 1.49e-09

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 62.37  E-value: 1.49e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSPPQGTLDTPSSSSPPQG----TSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSS 420
Cdd:pfam05539  176 KTTSWPTEVSHPTYPSQVTPQSQPATQGhqtaTANQRLSSTEPVGTQGTTTSSNPEPQT--EPPPSQRGPSGSPQHPPST 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  421 PPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:pfam05539  254 TSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET-----GRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
                          170
                   ....*....|....
gi 1039789982  501 QVTSDTPASSSPPQ 514
Cdd:pfam05539  329 TQNLVDCKELDPPK 342
PRK08581 PRK08581
amidase domain-containing protein;
360-596 2.17e-09

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 62.50  E-value: 2.17e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  360 TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNsppqgTSETPgfssppQVTTATLVSSSPPQVTSETPASSSP-TQVT 438
Cdd:PRK08581    14 TLVLPTLTSPTAYADDPQKDSTAKTTSHDSKKSN-----DDETS------KDTSSKDTDKADNNNTSNQDNNDKKfSTID 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  439 SETPASS---SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvTSDTPASSSPPQVTSDTPASSSPPqv 515
Cdd:PRK08581    83 SSTSDSNniiDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD-YEQPRNSEKSTNDSNKNSDSSIKN-- 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  516 tSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK08581   160 -DTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSE 238

                   .
gi 1039789982  596 D 596
Cdd:PRK08581   239 D 239
PRK10905 PRK10905
cell division protein DamX; Validated
377-593 2.72e-09

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 61.11  E-value: 2.72e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  377 PASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQV-TSETPASSSPTQVTSDTP 455
Cdd:PRK10905    23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQGqTPVATDGQQRVEVQGDLN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  456 ASNSPPQGTSDTPGFSS----PTQVTT-----------ATLVSSSPPQVTSDTPA------SSSPPQVTSDTPASSSPPQ 514
Cdd:PRK10905   103 NALTQPQNQQQLNNVAVnstlPTEPATvapvrngnasrQTAKTQTAERPATTRPArkqaviEPKKPQATAKTEPKPVAQT 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  515 VTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--NMTSDTPASSSptNMTSDTPASSSPTN 592
Cdd:PRK10905   183 PKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTagNVGSLKSAPSS--HYTLQLSSSSNYDN 260

                   .
gi 1039789982  593 M 593
Cdd:PRK10905   261 L 261
PHA03377 PHA03377
EBNA-3C; Provisional
203-657 6.51e-09

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 61.61  E-value: 6.51e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQ---VTSATSASSSPPQGTSDTPASSSPP 279
Cdd:PHA03377   450 PERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPPSRRRRgacVVYDDDIIEVIDVETTEEEESVTQP 529
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  280 QVTSATSASSSPPQGTSD--------TPASSSPPQVTSATSASSSPPQGTSDTPASSspPQVTSATSASSSPPQGTSDTP 351
Cdd:PHA03377   530 AKPHRKVQDGFQRSGRRQkratppkvSPSDRGPPKASPPVMAPPSTGPRVMATPSTG--PRDMAPPSTGPRQQAKCKDGP 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQgtlDTPSSSSP-----------------PQGTSDTPAS------SSPPQGTSETPASNSPPQGTSETPGFSSP 408
Cdd:PHA03377   608 PASGPHE---KQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSfwemraGRDGSGIQQEPSSRRQPATQSTPPRPSWL 684
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  409 PQVTTATLVSSSPPQVTSETPASS-SPTQVTS--ETPASSSPTQVT--SDTPASNSPPQGTSDTPGFSSP--TQVTTATL 481
Cdd:PHA03377   685 PSVFVLPSVDAGRAQPSEESHLSSmSPTQPISheEQPRYEDPDDPLdlSLHPDQAPPPSHQAPYSGHEEPqaQQAPYPGY 764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  482 VSSSPPQV--------------TSDTPASSSPPQVTSDTPASSSPPQVTSETPAsSSPPQVTSDTSASISPPQviSDTPA 547
Cdd:PHA03377   765 WEPRPPQApylgyqepqaqgvqVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPG-HGHPQGPWAPRPPHLPPQ--WDGSA 841
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  548 SSSPPQVTSETPASSSPTNMTSDTpaSSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPvitevTR-PESTIPAGRSLAn 626
Cdd:PHA03377   842 GHGQDQVSQFPHLQSETGPPRLQL--SQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIP-----TRfPPPPMPLQDSMA- 913
                          490       500       510
                   ....*....|....*....|....*....|.
gi 1039789982  627 itskAQEDSPLgviSTHPQMSFQSSTSQQAL 657
Cdd:PHA03377   914 ----VGCDSSG---TACPSMPFASDYSQGAF 937
PHA03255 PHA03255
BDLF3; Provisional
464-622 6.78e-09

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 58.76  E-value: 6.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  464 TSDTPGFSSPTQVTTATlvsssppQVTSDTPASSSPPQVTSDTPASSSPPqVTSETPASSSPPQVTSdTSASISPPQVIS 543
Cdd:PHA03255    25 TSSGSSTASAGNVTGTT-------AVTTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  544 DTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGR 622
Cdd:PHA03255    96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174
PHA03255 PHA03255
BDLF3; Provisional
425-588 8.24e-09

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 58.38  E-value: 8.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  425 TSETPASSSPTQVTSETpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS-DTPASSSPPQVT 503
Cdd:PHA03255    25 TSSGSSTASAGNVTGTT-AVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPvPTTSNASTINVT 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  504 SDTPASSSppqVTSETPASSSPPQVTSDTSasisppQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD 583
Cdd:PHA03255   104 TKVTAQNI---TATEAGTGTSTGVTSNVTT------RSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174

                   ....*
gi 1039789982  584 TPASS 588
Cdd:PHA03255   175 QPSLS 179
PHA02682 PHA02682
ORF080 virion core protein; Provisional
437-659 8.93e-09

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 59.10  E-value: 8.93e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  437 VTSETPAS--SSPTQVTSDTPASNSPPQGTSDTPG----------FSSPTQVTTATLVSSSPPQVTSDTPASSSP-PQVT 503
Cdd:PHA02682    17 VLADTSSSlfTKCPQATIPAPAAPCPPDADVDPLDkysvkeagryYQSRLKANSACMQRPSGQSPLAPSPACAAPaPACP 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  504 SDTPASSSPpQVTSETPASSSPPQvtsdTSASISPPQVISDT--PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:PHA02682    97 ACAPAAPAP-AVTCPAPAPACPPA----TAPTCPPPAVCPAParPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHNQL 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  582 S--DTPASSSPTNMTSdtPASSsppwPVItEVTRPESTIPAGRSLANITSKAQEDSPLGV-------ISTHPQMSFQSST 652
Cdd:PHA02682   172 PppDYPAASCPTIETA--PAAS----PVL-EPRIPDKIIDADNDDKDLIKKELADIADSVrdlnaesLSLTRDIENAKST 244

                   ....*..
gi 1039789982  653 SQQALDE 659
Cdd:PHA02682   245 TQAAIDD 251
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
479-668 8.94e-09

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 60.36  E-value: 8.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  479 ATLVSSSPPQVTSDTPASS-SPPQVTSDTpassSPPQVTSETPASSSPpqvTSDTSASISPPqvisdTPASSSPPQVTSE 557
Cdd:pfam17823   62 AATAAPAPVTLTKGTSAAHlNSTEVTAEH----TPHGTDLSEPATREG---AADGAASRALA-----AAASSSPSSAAQS 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 TPASSS-PTNMTSDTPASSSPTnmTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAqedsP 636
Cdd:pfam17823  130 LPAAIAaLPSEAFSAPRAAACR--ANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSA----P 203
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1039789982  637 LGVISTHPQMSFQSSTSQQALDeTAGERVPTI 668
Cdd:pfam17823  204 ATLTPARGISTAATATGHPAAG-TALAAVGNS 234
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
382-577 1.29e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 60.25  E-value: 1.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  382 PPQGTSETPASNSPPQGTSETPGfsSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTSDTPASNSPP 461
Cdd:PRK07003   360 PAVTGGGAPGGGVPARVAGAVPA--PGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  462 Q-GTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE-TPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003   434 AtADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEpAPRAAAPSAATPAAVPDARAP 513
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1039789982  540 QVIS--DTPASSSPPqvtseTPASSSPTnmtsdtPASSSP 577
Cdd:PRK07003   514 AAASreDAPAAAAPP-----APEARPPT------PAAAAP 542
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
203-502 1.33e-08

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 59.97  E-value: 1.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 282
Cdd:pfam17823  153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVG 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  283 SATSASssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQgtld 362
Cdd:pfam17823  233 NSSPAA-----GTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDP-HARRLSPAKHMPSDTMARNPAAPMGAQ---- 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  363 tpSSSSPPQGTSDTPASSSPPQGTSEtpasnspPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPASSSPTQVTSETP 442
Cdd:pfam17823  303 --AQGPIIQVSTDQPVHNTAGEPTPS-------PSNTTLEPNTPKSVASTNLAVVTTTKAQ-AKEPSASPVPVLHTSMIP 372
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  443 --ASSSPTQVTSDTPasnsPPQGTSDTPGFSSPTQVTT-ATLVSSSppqvTSDTPASSSPPQV 502
Cdd:pfam17823  373 evEATSPTTQPSPLL----PTQGAAGPGILLAPEQVATeATAGTAS----AGPTPRSSGDPKT 427
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
1132-1236 1.57e-08

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 55.24  E-value: 1.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1132 YLIQVYTGYRRRAATTAKVVITLYGSEGH-------SEPHHLCDPEKTVFERGALDVF------LLSTGSWLgdlhglRL 1198
Cdd:cd01754      3 YTIYVQTGSIWKAGTDSRISLQIYDADGPglrianlEAWGGLMGAGHDYFERGNLDRFsgrgpcLPSPPCWM------NL 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1039789982 1199 WHDNSGDSPSWYVSQVIVsdmtTRKKWHFQCNC-------WLAVD 1236
Cdd:cd01754     77 TSDGTGNHPGWYVNYVEV----TQAGQHAPCMQhlfaveqWLATD 117
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
269-487 1.89e-08

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 59.38  E-value: 1.89e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  269 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSasssppqGTS 348
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTA-------ATS 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPqgtseTPGFSSPPQVTTATLVSSSP-PQVTSE 427
Cdd:COG3469     74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGS-----VTSTTSSTAGSTTTSGASATsSAGSTT 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF-----SSPTQVTTATLVSSSPP 487
Cdd:COG3469    149 TTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASgattpSATTTATTTGPPTPGLP 213
PHA03377 PHA03377
EBNA-3C; Provisional
167-658 2.20e-08

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 59.68  E-value: 2.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  167 KRRGAETERHMIP-----GNGPPLAMCHQPAPPElfetlcfpIDPASSAPPKATHRMTITSLTGRPQVTSdtLASSSPPQ 241
Cdd:PHA03377   541 QRSGRRQKRATPPkvspsDRGPPKASPPVMAPPS--------TGPRVMATPSTGPRDMAPPSTGPRQQAK--CKDGPPAS 610
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  242 GTSD-TPASSSP----PQVTSATSASSSPPQGTSDTP-------ASSSPPQVTSATSASSSPPQgTSDTPASSSPPQVTS 309
Cdd:PHA03377   611 GPHEkQPPSSAPrdmaPSVVRMFLRERLLEQSTGPKPksfwemrAGRDGSGIQQEPSSRRQPAT-QSTPPRPSWLPSVFV 689
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  310 ATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLdtPSSSSPPQGTSDTPASSSPPQGTSET 389
Cdd:PHA03377   690 LPSVDAGRAQPSEESHLSSMSPTQPISHEEQPRYEDPDDPLDLSLHPDQAPP--PSHQAPYSGHEEPQAQQAPYPGYWEP 767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  390 PASNSPPQGTSETPGfssppqvtTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PHA03377   768 RPPQAPYLGYQEPQA--------QGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDG 839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  470 FSSPTQvttaTLVSSSPPqVTSDTpassSPPqvtsdTPASSSPPQVT-SETPASSSPPqvtsdtsasisppqvisdtPAS 548
Cdd:PHA03377   840 SAGHGQ----DQVSQFPH-LQSET----GPP-----RLQLSQVPQLPySQTLVSSSAP-------------------SWS 886
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  549 SSPPqvtsETPASSSPTNM-TSDTPASSSPTnMTSDTPASSSPtNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANI 627
Cdd:PHA03377   887 SPQP----RAPIRPIPTRFpPPPMPLQDSMA-VGCDSSGTACP-SMPFASDYSQGAFTPLDINAQTPKRPRVEESSHGPA 960
                          490       500       510
                   ....*....|....*....|....*....|..
gi 1039789982  628 TSKAQEDSPLGVISTHPQMS-FQSSTSQQALD 658
Cdd:PHA03377   961 RCSQATTEAQEILSDNSEISvFPKDAKQTDYD 992
PRK08581 PRK08581
amidase domain-containing protein;
402-698 2.66e-08

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 59.03  E-value: 2.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  402 TPGFSSPpQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPgFSSPTQVTTAT- 480
Cdd:PRK08581    17 LPTLTSP-TAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKD-----TDKADNNNTSNQDNNDKK-FSTIDSSTSDSn 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  481 ---------LVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS-ETPASSS-PPQVTSDTSAS-ISPPQvisdTPAS 548
Cdd:PRK08581    90 niidfiyknLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDyEQPRNSEkSTNDSNKNSDSsIKNDT----DTQS 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  549 SSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSppwpvitevtrpESTIPAGRSLANIT 628
Cdd:PRK08581   166 SKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSK------------DNQSMSDSALDSIL 233
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  629 SKAQEDSPLgvisTHPQMSFQSSTSQQaldETAGERVPTIPdfqAHSEFQKACAILQRLRDFLPTSPTSA 698
Cdd:PRK08581   234 DQYSEDAKK----TQKDYASQSKKDKT---ETSNTKNPQLP---TQDELKHKSKPAQSFENDVNQSNTRS 293
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
414-666 3.39e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 58.73  E-value: 3.39e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  414 ATLVSSSPPQVTSETPASSSPTQVTsETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdT 493
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAA-PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP-A 449
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSPPqvtsdTPASSSPPQVTS-ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK12323   450 PAPAPAA-----APAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  573 ASSSPTNMTSDTPASSSptnmtsdTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHP------QM 646
Cdd:PRK12323   525 SIPDPATADPDDAFETL-------APAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPvrglaqQL 597
                          250       260
                   ....*....|....*....|
gi 1039789982  647 SFQSSTsQQALDETAGERVP 666
Cdd:PRK12323   598 ARQSEL-AGVEGDTVRLRVP 616
PHA03255 PHA03255
BDLF3; Provisional
413-575 3.57e-08

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 56.45  E-value: 3.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  413 TATLVSSSPPQVTSETPASSSpTQVTSETPASSSPTQVTSDTPASNSPPQGTsdTPGFSSPTQVTTATLVSSSPPQVTSD 492
Cdd:PHA03255    20 TSLIWTSSGSSTASAGNVTGT-TAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTSN 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  493 TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PHA03255    97 ASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQP 176

                   ...
gi 1039789982  573 ASS 575
Cdd:PHA03255   177 SLS 179
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
443-595 3.72e-08

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 55.35  E-value: 3.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  443 ASSSPTQVTSDTPASNSppqgtSDTPGFSSPTQVTTATLVSSSPPqvtsdTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:pfam09595   31 ASLILIGESNKEAALII-----TDIIDININKQHPEQEHHENPPL-----NEAAKEAPSESEDAPDIDPNNQHPSQDRSE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  523 SSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS----PTNMTSDTPAS----SSPTNMTSDTPASSSPTNMT 594
Cdd:pfam09595  101 APPLEPAAKTKPSEHEPANPPDASNRLSPPDASTAAIREARtfrkPSTGKRNNPSSaqsdQSPPRANHEAIGRANPFAMS 180

                   .
gi 1039789982  595 S 595
Cdd:pfam09595  181 S 181
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
406-569 4.01e-08

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 55.35  E-value: 4.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  406 SSPPQVTT--ATLVSSSPPQVTsetPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:pfam09595   33 LILIGESNkeAALIITDIIDIN---INKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAK 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPpqvTSDTPASssppqvTSDTPASSSPPQVTSETPASSsppqvTSDTSASISPPQVISDTPASSSPPQVTSETPASSS 563
Cdd:pfam09595  110 TKP---SEHEPAN------PPDASNRLSPPDASTAAIREA-----RTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRAN 175

                   ....*.
gi 1039789982  564 PTNMTS 569
Cdd:pfam09595  176 PFAMSS 181
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
245-670 4.81e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 58.58  E-value: 4.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  245 DTPASSSPPQVtsatsasssppQGTSDTPASSSPPQVtsatsasssPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDT 324
Cdd:PRK14949   369 DDPAEISLPEG-----------QTPSALAAAVQAPHA---------NEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEA 428
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  325 PASSSPPQVTSATSASSSPPQgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPg 404
Cdd:PRK14949   429 VAEADASAEPADTVEQALDDE-SELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQ- 506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  405 fssppqvttATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PRK14949   507 ---------VDDTSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANV 577
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  485 SPPQVT-SDTPASSSPPQVTSDTPASSSPPQ-------------VTSETPASSsppqvTSDTSASISPPQVISDTPaSSS 550
Cdd:PRK14949   578 QSAQSAaEAQPSSQSLSPISAVTTAAASLADddildavlaardsLLSDLDALS-----PKEGDGKKSSADRKPKTP-PSR 651
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  551 PPQVTSETPASSSPTNMTSDT---PASSSPTNMTSDTPASSSPTNMTSDTPASSS---PPW---PVITEVTRPESTIPAG 621
Cdd:PRK14949   652 APPASLSKPASSPDASQTSASfdlDPDFELATHQSVPEAALASGSAPAPPPVPDPydrPPWeeaPEVASANDGPNNAAEG 731
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  622 RSLANI--TSKAQEDSPLGVISTHPQM--SFQSSTSQQALDETAGERVPTIPD 670
Cdd:PRK14949   732 NLSESVedASNSELQAVEQQATHQPQVqaEAQSPASTTALTQTSSEVQDTELN 784
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
366-517 5.09e-08

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 54.96  E-value: 5.09e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  366 SSSPPqGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:pfam09595   32 SLILI-GESNKEAALIITDIIDININKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKT 110
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  446 SPTQV----TSDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS 517
Cdd:pfam09595  111 KPSEHepanPPDASNRLSPPDAST-----AAIREARTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
349-669 7.48e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.78  E-value: 7.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPQGtldTPSSSSPPQGTSDTPASSSPPQGTSEtpaSNSPPQGTSetPGFSSPPQVTtatlvssSPPQVTSET 428
Cdd:PTZ00449   503 DSDKHDEPPEG---PEASGLPPKAPGDKEGEEGEHEDSKE---SDEPKEGGK--PGETKEGEVG-------KKPGPAKEH 567
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQ--VTSDT 506
Cdd:PTZ00449   568 KPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQrpSSPER 647
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  507 PASSSPPQvTSETPASSSPP-------QVTSDTS-ASISPPQVISDTPASSSPPQVTSETPASSSPTNMTsdTPASSSPT 578
Cdd:PTZ00449   648 PEGPKIIK-SPKPPKSPKPPfdpkfkeKFYDDYLdAAAKSKETKTTVVLDESFESILKETLPETPGTPFT--TPRPLPPK 724
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  579 NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTiPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALD 658
Cdd:PTZ00449   725 LPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHET-PADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHE 803
                          330
                   ....*....|.
gi 1039789982  659 ETAGERVPTIP 669
Cdd:PTZ00449   804 DKPPGDHPSLP 814
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
384-608 9.48e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 57.58  E-value: 9.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  384 QGTSETPASNSPPQGTSETPGFSSPPQVTTAtlvSSSPPQVTSETPASSSPTQVTSETPASSSPTQVtSDTPASNSPPQG 463
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAPA---PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE-ALAAARQASARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  464 tsdTPGFSSPTQVTTATLVSSSPPQVTS-DTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVI 542
Cdd:PRK12323   444 ---PGGAPAPAPAPAAAPAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAG 520
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  543 SDTpASSSPPQVTSETPASSSPTNMTSDTPAsSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVI 608
Cdd:PRK12323   521 WVA-ESIPDPATADPDDAFETLAPAPAAAPA-PRAAAATEPVVAPRPPRASASGLPDMFDGDWPAL 584
PRK08581 PRK08581
amidase domain-containing protein;
346-569 9.74e-08

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 57.11  E-value: 9.74e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  346 GTSDTPASSSPPQGTlDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvt 425
Cdd:PRK08581    59 DTDKADNNNTSNQDN-NDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD-- 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  426 SETPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSS-PTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK08581   136 YEQPRNSEKSTNDSNKNSDSSIKNDTDT---QSSKQDKADNQKAPSSnNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTA 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  505 DTPASSSPPQVTSETPASSSPPQVTSD------------------TSASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK08581   213 NQKSSSKDNQSMSDSALDSILDQYSEDakktqkdyasqskkdkteTSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTR 292

                   ...
gi 1039789982  567 MTS 569
Cdd:PRK08581   293 STS 295
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
320-677 1.24e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.30  E-value: 1.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:PRK07764   401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP 480
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  400 SETPGFSSPPQVTTATLVSSSPPQ--VTSETPASSSPT--------------------QVTS----------ETPAS--- 444
Cdd:PRK07764   481 APAPPAAPAPAAAPAAPAAPAAPAgaDDAATLRERWPEilaavpkrsrktwaillpeaTVLGvrgdtlvlgfSTGGLarr 560
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  445 -SSP-------------------TQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK07764   561 fASPgnaevlvtalaeelggdwqVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAS 640
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  505 DTPASSS-PPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPtnmTSDTPASSSPTNMTSD 583
Cdd:PRK07764   641 AAPAPGVaAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP---APAPAATPPAGQADDP 717
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  584 TPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPlgviSTHPQMSFQSStSQQALDETAGE 663
Cdd:PRK07764   718 AAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAP----AAAPPPSPPSE-EEEMAEDDAPS 792
                          410
                   ....*....|....
gi 1039789982  664 RVPtiPDFQAHSEF 677
Cdd:PRK07764   793 MDD--EDRRDAEEV 804
PHA03247 PHA03247
large tegument protein UL36; Provisional
352-606 1.58e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 1.58e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSssPPQGTSDTPASSSPPQGTSETPAsnsPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PHA03247   243 VISHPLRGDIAAPAP--PPVVGEGADRAPETARGATGPPP---PPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPP 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  432 SSPTQVTSETPASSSPTQVTSDTPASNSP-PQGTSDT--PGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD--- 505
Cdd:PHA03247   318 PAPAGDAEEEDDEDGAMEVVSPLPRPRQHyPLGFPKRrrPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPfar 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPASSSPPQVTSETPASSSPPqvtsdtsasiSPPQVISDTPASSSPPQVTSETPASSSPTNmtsdTPASSSPTNMTSDT- 584
Cdd:PHA03247   398 GPGGDDQTRPAAPVPASVPTP----------APTPVPASAPPPPATPLPSAEPGSDDGPAP----PPERQPPAPATEPAp 463
                          250       260
                   ....*....|....*....|..
gi 1039789982  585 PASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03247   464 DDPDDATRKALDALRERRPPEP 485
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
361-667 1.83e-07

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 56.68  E-value: 1.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSSSSPPQgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQvTTATLVSSSPPQVTSETPASSSPTQVTSE 440
Cdd:COG5099     69 KITSSSSSRRK--PSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNK-SNSALSSTQQGNANSSVTLSSSTASSMFN 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  441 TPASSSPTQVTSDTPASNSPP--QGTSDTPGFSSPTQvttaTLVSSSPPQVTSDTpaSSSPPQVTSDTPASSSPpqvTSE 518
Cdd:COG5099    146 SNKLPLPNPNHSNSATTNQSGssFINTPASSSSQPLT----NLVVSSIKRFPYLT--SLSPFFNYLIDPSSDSA---TAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  519 TPASSSPPQVTSDTSASISPPQVISDTPASSSPpQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTP 598
Cdd:COG5099    217 ADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSV-ENNIILNSSSSINELTSIYGSVPSIRNLRGLNSALVSFLNVSSSSL 295
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  599 ASSSPPwpvITEVtrPESTIPAGRSLANITSK------AQEDSPLGVISTHPQMS-----FQSSTSQQALDETAGERVPT 667
Cdd:COG5099    296 AFSALN---GKEV--SPTGSPSTRSFARVLPKsspnnlLTEILTTGVNPPQSLPSllnpvFLSTSTGFSLTNLSGYLNPN 370
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
377-670 1.84e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.53  E-value: 1.84e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  377 PASSSPPQGTS---ETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPtQVTSETPASSSPTQVTSD 453
Cdd:PRK07764   365 PSASDDERGLLarlERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAPQPAPAPAPAPAPPS 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  454 TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQ------------------- 514
Cdd:PRK07764   444 PAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAgaddaatlrerwpeilaav 523
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  515 ---VTSETPASSSPPQVTS----------DTSAS---ISPPQ---------------------VISDTPASSSPPQVTSE 557
Cdd:PRK07764   524 pkrSRKTWAILLPEATVLGvrgdtlvlgfSTGGLarrFASPGnaevlvtalaeelggdwqveaVVGPAPGAAGGEGPPAP 603
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPL 637
Cdd:PRK07764   604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1039789982  638 GVISTHPQMSFQSSTSQQALDETAGERVPTIPD 670
Cdd:PRK07764   684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
349-709 1.84e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.84  E-value: 1.84e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPqgtlDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:NF033609   540 DKPVVPEQP----DEPGEIEPiPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASD 615
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  428 TPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609   616 SDSASDSDSASDSDSASDSDSASDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPA 586
Cdd:NF033609   693 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDS 772
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  587 -SSSPTNMTSDTPASSSPPwpviTEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETAGERV 665
Cdd:NF033609   773 dSDSDSDSDSDSDSDSDSD----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 848
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1039789982  666 PTIPDFQAHSEFQKACAILQRLRDFLPTSP----TSAQKNNSWSSQTP 709
Cdd:NF033609   849 DSDSDSDSESDSNSDSESGSNNNVVPPNSPkngtNASNKNEAKDSKEP 896
rne PRK10811
ribonuclease E; Reviewed
409-615 1.91e-07

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 56.59  E-value: 1.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  409 PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT----PASNSPPQGTSDTPGFSSPTQVTTATLVSs 484
Cdd:PRK10811   848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEvveePVVVAEPQPEEVVVVETTHPEVIAAPVTE- 926
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  485 sPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP-QVISDTPASSSPPQVTSETPASSS 563
Cdd:PRK10811   927 -QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAaPVVAEVAAEVETVTAVEPEVAPAQ 1005
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  564 PTNMTSDTPASSSPtnMTSdTPAsssptnmtsdtPASSSPPwPVITEVTRPE 615
Cdd:PRK10811  1006 VPEATVEHNHATAP--MTR-APA-----------PEYVPEA-PRHSDWQRPT 1042
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
240-589 1.97e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 1.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:NF033609   558 PEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA 637
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:NF033609   638 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  400 SETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTT 478
Cdd:NF033609   718 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD 797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:NF033609   798 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNS 877
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1039789982  559 PASSSPTNMTSDTPASSSPTNMT-SDTPASSS 589
Cdd:NF033609   878 PKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
352-578 2.00e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 56.26  E-value: 2.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PRK08691   363 AASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAA---AMPSEGKTAGPVSNQENNDVPPWEDA 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  432 SSPTQvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpasSSPPQVTSDTPASSS 511
Cdd:PRK08691   440 PDEAQ-TAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPN----DEAVETETFAHEAPA 514
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  512 PPQVTSETPASSSPPQvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK08691   515 EPFYGYGFPDNDCPPE----DGAEIPPPDWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFST 577
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
439-631 2.30e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 2.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  439 SETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQvtSDTPASSSPPQVTSE 518
Cdd:NF033609    33 SSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQ--QETTQSASTNATTEE 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  519 TPASSsppQVTSDTSASISPPQVI--SDTPASSSPPQVTSETpaSSSPTNMTSDTpasSSPTN--------MTSDTPASS 588
Cdd:NF033609   111 TPVTG---EATTTATNQANTPATTqsSNTNAEELVNQTSNET--TSNDTNTVSSV---NSPQNstnaenvsTTQDTSTEA 182
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1039789982  589 SPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:NF033609   183 TPSNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADA 225
DUF2967 pfam11179
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ...
324-681 2.83e-07

Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.


Pssm-ID: 402654 [Multi-domain]  Cd Length: 954  Bit Score: 56.20  E-value: 2.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  324 TPAS--SSPPQVTSATSASSSPPQGTSDTPAsSSPPQGTLDTPSSSSppQGTSDTPAS-SSPPQGTSETPASNSPPQGTS 400
Cdd:pfam11179   22 ALAGpiTAAPTGAAAAAATSTAAASAASSTI-TAPGAGPGGTPTSRS--RGAQAMTASlAHAAQGNANANKSTRNNSNSS 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  401 ETPGFSSPP-----QVTTATLVSSSPPQVTSETpASSSPTQVTSETPASSSP-TQVTSDTPASN------SPPQGTSDTP 468
Cdd:pfam11179   99 NNNGKPKPLaacymSTRSAAMMALALGQQSGEK-KDKKPAAGKAASPAQSQSqSQSQNASPHTNnravsmTRPAATRRLP 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  469 GFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVT----------------------SD-----TPASSSP--------- 512
Cdd:pfam11179  178 NAAAMSNVNAANSTCTATATSLPSNRARSKPSTPTatraaaqlngmgifsggsnssgSDndgfsASGSSAAtalrrlyfk 257
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  513 ----------PQVTSETPASSSPPQ-VTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:pfam11179  258 sgrsiknkinASTSSSTPLNGLPLNaVSNAFHNSVGGATAMHAMGTAGGVPKLVVMGTSSASIPDTTINTSTDSACTLIT 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  582 SDTPASSSPTNMTSDTPASSSPPWPVITEVTRPestipagrSLANITSKAQEDSPLGVISTHPQMSFQSSTSqqaLDETA 661
Cdd:pfam11179  338 NVTHTDTSETCDSLDLGDNSGPSEPLFSSLEEP--------LLTAIHIDSEHEGFGGMAGGRGGANGRGATE---LELTS 406
                          410       420
                   ....*....|....*....|..
gi 1039789982  662 GERVPTIPD--FQAHSEFQKAC 681
Cdd:pfam11179  407 CSRYPPRPDmnLQDSTESQESC 428
PHA03255 PHA03255
BDLF3; Provisional
363-532 2.94e-07

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 53.75  E-value: 2.94e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  363 TPSSSSPPQGTSDTPASSSppqgTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PHA03255    25 TSSGSSTASAGNVTGTTAV----TTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTI 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  443 ASSspTQVTSDTPASNSppQGTSDTPGFSSptQVTTATlvSSSPPQVTSDTPASSSPPQVTSDTpaSSSPPQVTSETPAS 522
Cdd:PHA03255   101 NVT--TKVTAQNITATE--AGTGTSTGVTS--NVTTRS--SSTTSATTRITNATTLAPTLSSKG--TSNATKTTAELPTV 170
                          170
                   ....*....|
gi 1039789982  523 SSPPQVTSDT 532
Cdd:PHA03255   171 PDERQPSLSY 180
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
270-604 3.06e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.07  E-value: 3.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  270 SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtsatsasssppqgtSDTPASSSPPQVTSATSASSSPPQGTS 348
Cdd:NF033609   561 SDSdPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSA------------------SDSDSASDSDSASDSDSASDSDSASDS 622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609   623 DSASDSDSASDS----DSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTpASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpaSSSPPQVTSDTPA 508
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  509 SSSPPQVTSETPASSSPPQVTSDT-SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:NF033609   775 DSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 854
                          330       340
                   ....*....|....*....|.
gi 1039789982  588 SSPTNMTSDTPASSS----PP 604
Cdd:NF033609   855 DSESDSNSDSESGSNnnvvPP 875
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
416-651 3.53e-07

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 54.41  E-value: 3.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  416 LVSSSPPQVTSETPASSSPtqvtsetPASssptqvtsdtPASNSPPQgtSDTPGFSSPTQVttatlvssspPQVTSDTPA 495
Cdd:pfam12287   30 IVSAQPPSQSPDLSQMVCP-------PAS----------PEQRLSQQ--SDVLQQPEQTQV----------SPVSPSSNA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPpqvisdtpasSSPPQVTSETPASSSPTNMTSdTPASS 575
Cdd:pfam12287   81 CASSGSEYQFHTSEPPQPEAIDPIQSSMSLPSELAPPSPPLSP----------ASQPQVFQSKPASSSGINVNA-APFQS 149
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  576 SPT--NMTSDTP-ASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHpqMSFQSS 651
Cdd:pfam12287  150 MQTvfNVNAPVPpRNEQELKESSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTVVGAYHPDGTIQVSNGH--LAFYPA 226
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
473-606 3.81e-07

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 55.05  E-value: 3.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  473 PTQVTTAtlvsssppQVTSDTPASSSPPQVTSDTPASSSPPQ----VTSETPASSSPPQVTSDTSASISPPQVISDTPAS 548
Cdd:pfam05539  169 KTAVTTS--------KTTSWPTEVSHPTYPSQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQ 240
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  549 SSPPQVTSETPASSSPtnmTSDTPASSSPTNMTSDTPASSSPTNmtsdTPASSSPPWP 606
Cdd:pfam05539  241 RGPSGSPQHPPSTTSQ---DQSTTGDGQEHTQRRKTPPATSNRR----SPHSTATPPP 291
motB PRK12799
flagellar motor protein MotB; Reviewed
347-465 4.57e-07

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 54.72  E-value: 4.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQGTldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVS--SSPPQV 424
Cdd:PRK12799   296 HGTVPVAAVTPSSA--VTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVAlpAAEPVN 373
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  425 TSETPASSSPTQVTSE---TPASSSPTQVTSDTPASNSPPQGTS 465
Cdd:PRK12799   374 MQPQPMSTTETQQSSTgniTSTANGPTTSLPAAPASNIPVSPTS 417
PHA03255 PHA03255
BDLF3; Provisional
347-511 5.25e-07

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 52.98  E-value: 5.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSppqgtldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:PHA03255    25 TSSGSSTAS-------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTS 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSD--TPASSSPPQVTS 504
Cdd:PHA03255    96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTT---SATTRITNATTLAPTLSSKGTSnaTKTTAELPTVPD 172

                   ....*..
gi 1039789982  505 DTPASSS 511
Cdd:PHA03255   173 ERQPSLS 179
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
423-582 5.37e-07

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 55.10  E-value: 5.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN------SPPQGTSDTPG-FSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PLN02217   508 EVQNTGPGAAITKRVTWPGIKKLSDEEILKFTPAQYiqgdawIPGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFSSD 587
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSppqvTSDTPASSSPPQVTSETPASSSpPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNM-TSDTPAS 574
Cdd:PLN02217   588 SPS----TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIkVASTESS 662

                   ....*...
gi 1039789982  575 SSPTNMTS 582
Cdd:PLN02217   663 VSMVSMST 670
PRK13914 PRK13914
invasion associated endopeptidase;
417-602 5.96e-07

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 54.42  E-value: 5.96e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  417 VSSSPPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATL--------------- 481
Cdd:PRK13914   143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVksgdtiwalsvkygv 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  482 ----------VSSSPPQVTSD----TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA 547
Cdd:PRK13914   220 svqdimswnnLSSSSIYVGQKlaikQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPT 299
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  548 SSS----PPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSS 602
Cdd:PRK13914   300 EAAkpapAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSS 358
motB PRK12799
flagellar motor protein MotB; Reviewed
477-607 6.93e-07

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 53.95  E-value: 6.93e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  477 TTATLVSSSPPqvTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799   296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  557 ETPASSSPTNMTSDTPASSSPTNMtsdTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PRK12799   369 AEPVNMQPQPMSTTETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPT 416
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
351-619 8.23e-07

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 54.16  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PASSSPPQGTLDTPSSSS------PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSS---- 420
Cdd:cd22540     66 PLPLGPGKNSIGFLSAKGniiqlqGSQLSSSAPGGQQVFAIQNPTMIIKGSQTRSSTNQQYQISPQIQAAGQINNSgqiq 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  421 -----------PPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASN-------SPPQGTSDTP-----GFSSPTQVT 477
Cdd:cd22540    146 iipgtnqaiitPVQVLQQPQQAHKPVPIK---PAPLQTSNTNSASLQVPgnviklqSGGNVALTLPvnnlvGTQDGATQL 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  478 TATLVSSSPPQVTSDTPASSSPPQVTS--------------------------DTPASSSPPQV--------TSETPASS 523
Cdd:cd22540    223 QLAAAPSKPSKKIRKKSAQAAQPAVTVaeqvetvliettadniiqagnnllivQSPGTGQPAVLqqvqvlqpKQEQQVVQ 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  524 SPP------QVTSDTSASI--SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:cd22540    303 IPQqalrvvQAASATLPTVpqKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGT 382
                          330       340
                   ....*....|....*....|....
gi 1039789982  596 DTPASSSPpwpvitevTRPESTIP 619
Cdd:cd22540    383 GTSKPNYN--------VRKERTLP 398
PAP1 pfam08601
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene ...
346-536 8.25e-07

Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene transcription in response to H2O2. This region is cysteine rich. Alkylation of cysteine residues following treatment with a cysteine alkylating agent can mask the accessibility of the nuclear exporter Crm1, triggering nuclear accumulation and Pap1 dependent transcriptional expression.


Pssm-ID: 369990  Cd Length: 363  Bit Score: 53.71  E-value: 8.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  346 GTSDTPASSSPPQGTLDTPSSSSPPqgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:pfam08601   31 QLSKAKQNTAKPGVRSDSRSPSPNA---STSTPDSQPPPSASSSTTPNQGSNGLNAFTGEDNNNYSNSAANPGATRGSTA 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  426 SETPASSSPTQVTSETPASS-SPTQvTSDTPASNSPPQGTSDTPGFSSPTQ--VTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:pfam08601  108 SSARSQSSPYSFGSGTSTSSdSPSS-SSSSHQGQLSSCGTSPEPSTQSPGGqkSVETMIGEEQCAHGTIDGEKSFCAKLG 186
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1039789982  503 TSDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:pfam08601  187 MACGNINNPIPAAMSKSNSLSNTPGHASNDSNGL 220
PHA03255 PHA03255
BDLF3; Provisional
373-559 9.46e-07

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 52.21  E-value: 9.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 TSDTPASSSPpQGTSETPASNSPpqgtseTPGFSSPPQVTTATLVSSSPPqVTSETPASSSPTQVTSeTPASSSPTQVTS 452
Cdd:PHA03255    25 TSSGSSTASA-GNVTGTTAVTTP------SPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  453 DTpasnsppqgtsdtpgfSSPTQVTTATLVSSSppqvTSDTPASSSPPQVTSDTPASSSPPQVTSE-TPASSSPPQVTSD 531
Cdd:PHA03255    96 NA----------------STINVTTKVTAQNIT----ATEAGTGTSTGVTSNVTTRSSSTTSATTRiTNATTLAPTLSSK 155
                          170       180
                   ....*....|....*....|....*...
gi 1039789982  532 TSASISPPQVISDTPASSSPPQVTSETP 559
Cdd:PHA03255   156 GTSNATKTTAELPTVPDERQPSLSYGLP 183
PHA03255 PHA03255
BDLF3; Provisional
490-670 1.01e-06

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 52.21  E-value: 1.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  490 TSDTPASSSPPQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPqvISDTPASSSPPQVTSETPASSSPTNMTS 569
Cdd:PHA03255    25 TSSGSSTASAGNVTGTTA-------VTTPSPSASGPSTNQSTTLTTTSAP--ITTTAILSTNTTTVTSTGTTVTPVPTTS 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  570 DTPASSSPTNMTSDTPASSSptnmtsdTPASSSPpwPVITEV-TRPESTIPAGRSLANITSKAQEdsplgvisthpqmsf 648
Cdd:PHA03255    96 NASTINVTTKVTAQNITATE-------AGTGTST--GVTSNVtTRSSSTTSATTRITNATTLAPT--------------- 151
                          170       180
                   ....*....|....*....|..
gi 1039789982  649 QSSTSQQALDETAGErVPTIPD 670
Cdd:PHA03255   152 LSSKGTSNATKTTAE-LPTVPD 172
GPS pfam01825
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
1021-1059 1.06e-06

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for auto-proteolysis, so is thus named, GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. In most, if not all, cell-adhesion GPCRs these undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


Pssm-ID: 460350  Cd Length: 44  Bit Score: 47.30  E-value: 1.06e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1039789982 1021 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1059
Cdd:pfam01825    2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
PHA03377 PHA03377
EBNA-3C; Provisional
408-740 1.13e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 54.29  E-value: 1.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  408 PPQVTTATLVSSSPPQVTSE----TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:PHA03377   399 PVQQRPVMFVSRVPWRKPRTlpwpTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGPSDQPSVPVEPAHLTPVEHTTVIL 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPPQVTSDTPASSSPPQV---------------------TSDTPASSSPPQVTSETPASSSppQVTSDTSASISPPQVI 542
Cdd:PHA03377   479 HQPPQSPPTVAIKPAPPPSrrrrgacvvydddiievidveTTEEEESVTQPAKPHRKVQDGF--QRSGRRQKRATPPKVS 556
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  543 -SDT-PASSSPPQV---TSETPASSSPTNMTSDT-PASSSPTNMT--SDTPASSSPTNMtsdTPASSSPPW---PVITEV 611
Cdd:PHA03377   557 pSDRgPPKASPPVMappSTGPRVMATPSTGPRDMaPPSTGPRQQAkcKDGPPASGPHEK---QPPSSAPRDmapSVVRMF 633
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  612 TR----PESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQAldetageRVPTIPDFQAHSEFQKACAILQRL 687
Cdd:PHA03377   634 LRerllEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQSTPPRPS-------WLPSVFVLPSVDAGRAQPSEESHL 706
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  688 RDFLPTSPTSAQKNNSWSSQTPAVSCPFQPLgrltTTEKSSHQMAQQDME--QHP 740
Cdd:PHA03377   707 SSMSPTQPISHEEQPRYEDPDDPLDLSLHPD----QAPPPSHQAPYSGHEepQAQ 757
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
37-142 1.41e-06

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 49.16  E-value: 1.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037     81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
362-498 1.58e-06

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 52.49  E-value: 1.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  362 DTPS-----SSSPPQGTSDTPASSSPPQgtsetpasnSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQ 436
Cdd:pfam12287   23 DKPSdsaivSAQPPSQSPDLSQMVCPPA---------SPEQRLSQQSDVLQQPEQTQVSPVSPSSNACASSGSEYQFHTS 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  437 VTSETPASSSPtqvtsdtPASNSPPqgtsdtpgfSSPTQvTTATLVSSSPPQVTSDTPASSS 498
Cdd:pfam12287   94 EPPQPEAIDPI-------QSSMSLP---------SELAP-PSPPLSPASQPQVFQSKPASSS 138
PHA03378 PHA03378
EBNA-3B; Provisional
410-632 1.76e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.53  E-value: 1.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  410 QVTTATLVSSSPPQVTSetpASSSPTQVTSETpasssptQVTSDTPASNSPPQGTS-DTPGfssPTQVTTATLVSSSPPQ 488
Cdd:PHA03378   518 QRVMATLLPPSPPQPRA---GRRAPCVYTEDL-------DIESDEPASTEPVHDQLlPAPG---LGPLQIQPLTSPTTSQ 584
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP---QVIS-DTPASSSPPQVTSETPASSSP 564
Cdd:PHA03378   585 LASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmQPITfNVLVFPTPHQPPQVEITPYKP 664
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  565 T-NMTSDTPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQ 632
Cdd:PHA03378   665 TwTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR 735
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
353-499 2.17e-06

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 52.48  E-value: 2.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSS---PPQVTTATLVSSSPPQVT--SE 427
Cdd:pfam13254  210 RSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPPPKTkelPKDSEEPAAPSKSAEASTekKE 289
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  428 TPASSSP--TQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSsppQVTSDTPASSSP 499
Cdd:pfam13254  290 PDTESSPetSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPKDFRANLRSR---EVPKDKSKKDEP 360
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
171-586 2.29e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 52.76  E-value: 2.29e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  171 AETERHMIPGNGPPLA-MCHQP-----APPELFETLCFPIDPASS-----APPKATHRMTITSLTgRPQVTSDTLASSSP 239
Cdd:COG5180    136 KVTREATSASAGVALAaALLQRsdpilAKDPDGDSASTLPPPAEKldkvlTEPRDALKDSPEKLD-RPKVEVKDEAQEEP 214
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  240 PQGTSDT----PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPpqgtsDTPASSSPPQvtsatSASS 315
Cdd:COG5180    215 PDLTGGAdhprPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIG-----DTPAAEPPGL-----PVLE 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  316 SPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETpASNSP 395
Cdd:COG5180    285 AGSEPQSDAPEAETARP--------------IDVKGVASAPPATRPVRPPGGARDPGTPRPGQPTERPAGVPEA-ASDAG 349
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  396 PQGTSETPGFSSPPqvtTATLVSSSPpqvtsetPASSSPTQVTSETPASSSPtQVTSDTPASNSPPQGTSDTPGFSSPTQ 475
Cdd:COG5180    350 QPPSAYPPAEEAVP---GKPLEQGAP-------RPGSSGGDGAPFQPPNGAP-QPGLGRRGAPGPPMGAGDLVQAALDGG 418
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  476 VTTAtlVSSSPPQvtsdTPASSSPPQVTSDTPASS----SPPQVTSETPASSSP-PQVTSDTSASISPPQVISDTPASSS 550
Cdd:COG5180    419 GRET--ASLGGAA----GGAGQGPKADFVPGDAESvsgpAGLADQAGAAASTAMaDFVAPVTDATPVDVADVLGVRPDAI 492
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1039789982  551 PPqvTSETPASSSPtnmTSDTPASSSPTNMTSDTPA 586
Cdd:COG5180    493 LG--GNVAPASGLD---AETRIIEAEGAPATEDFVA 523
PRK10856 PRK10856
cytoskeleton protein RodZ;
496-577 2.70e-06

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 51.95  E-value: 2.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK10856   168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                   ....
gi 1039789982  574 SSSP 577
Cdd:PRK10856   248 AADP 251
ARG80 COG5068
Regulator of arginine metabolism and related MADS box-containing transcription factors ...
347-604 2.85e-06

Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];


Pssm-ID: 227400 [Multi-domain]  Cd Length: 412  Bit Score: 51.94  E-value: 2.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQGTLDTPSSSSpPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpgfSSPPQVTTATLVSSSPPQVts 426
Cdd:COG5068    163 PSDSSEEPSSSASFSVDPNDNN-PMGSFQHNGSPQTNFIPLQNPQTQQYQQH-------SSRKDHPTVPHSNTNNGRP-- 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 etPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQG-TSDTPGFSSPTQVTTATLVsSSPPQVTSDTPASSSPPQVTSD 505
Cdd:COG5068    233 --PAKFMIPELHSSHSTLDLPSDFISD---SGFPNQSsTSIFPLDSAIIQITPPHLP-NNPPQENRHELYSNDSSMVSET 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:COG5068    307 PPPKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLHTEASTAPSSTIPADP 386
                          250
                   ....*....|....*....
gi 1039789982  586 ASSSPTNMTSDTPASSSPP 604
Cdd:COG5068    387 LKNAAQTNSGTRNNNFSDN 405
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
190-576 2.95e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 2.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  190 QPAPPELFEtlcfPIDPASSAPPKathrmtitSLTGRPQVTSDTlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT 269
Cdd:NF033609   547 QPDEPGEIE----PIPEDSDSDPG--------SDSGSDSSNSDS-GSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSA 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSD 349
Cdd:NF033609   614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  350 TPASSSPpqgtlDTPS-SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609   694 SDSDSDS-----DSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTP 507
Cdd:NF033609   769 DSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-----SDSD 843
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisDTPASSSPPQVTSETPASSSPTNMT-SDTPASSS 576
Cdd:NF033609   844 SDSDSDSDSDSDSESDSNSDSESGSNNNVVPP----NSPKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
356-586 2.96e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.57  E-value: 2.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  356 PPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12323   365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAA-AGPRPVAAAAAAAPARAAPAAA----PAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  516 TSETPASSSPPQVTSDTSASISPPQvisdtPASSSPPQVTSETPASSSPTnmTSDTPASSSPTNMTSDTPA 586
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPA-----PAAAPAPRAAAATEPVVAPR--PPRASASGLPDMFDGDWPA 583
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
239-551 3.41e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 52.54  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  239 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassspp 318
Cdd:PRK07003   360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAP------------- 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  319 qgtsdtPASSSPPQvtsatsasssPPQGTSDTPASSSPPQGTLDTPSS-SSPPQGTSDTPASSSPPQGtseTPASNSPPQ 397
Cdd:PRK07003   427 ------PAAPAPPA----------TADRGDDAADGDAPVPAKANARASaDSRCDERDAQPPADSGSAS---APASDAPPD 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  398 GTSEtpgfSSPPqvttATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQV- 476
Cdd:PRK07003   488 AAFE----PAPR----AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPA---ARAGGAAAALDVl 556
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  477 TTATLVSSSPPQVTSDTPASSSPPQVTSDTPAsssPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSP 551
Cdd:PRK07003   557 RNAGMRVSSDRGARAAAAAKPAAAPAAAPKPA---APRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
PPE COG5651
PPE-repeat protein [Function unknown];
363-587 3.81e-06

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 51.43  E-value: 3.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  363 TPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:COG5651    165 TPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANL-----GLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAG 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  443 ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:COG5651    240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAA 319
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  523 SSPP-QVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:COG5651    320 GATGaGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
319-547 3.82e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.19  E-value: 3.82e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  319 QGTSDTPASSSPPQVTSATSASSSPPQGTSdTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG 398
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAP-APAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  399 TSETPgfSSPPQVTTATlvsSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPtqvTT 478
Cdd:PRK12323   447 APAPA--PAPAAAPAAA---ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDA---AP 518
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqvtsDTSASISPPQVISDTPA 547
Cdd:PRK12323   519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP----RASASGLPDMFDGDWPA 583
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
485-636 4.84e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 52.02  E-value: 4.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE---TPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPAS 561
Cdd:PRK08691   379 SPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAaamPSEGKTAGPVSNQENNDVPPWE---DAPDEAQTAAGTAQTSAK 455
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  562 SSPTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSP-PWPVITEVTRPESTIPAgrslANITSKAQEDSP 636
Cdd:PRK08691   456 SIQTASEAETPPENQVSkNKAADNETDAPLSEVPSENPIQATPnDEAVETETFAHEAPAEP----FYGYGFPDNDCP 528
PHA03379 PHA03379
EBNA-3A; Provisional
211-606 6.23e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 6.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  211 PP---KATHRMTITSLTGRPQVTSDTLASssPPQGTSDTPassSPPQVTSATSASSSPPQGTSDTPASS-SPPQVTSATS 286
Cdd:PHA03379   379 PPiflRRLHRLLLMRAGKLTERAREALEK--ASEPTYGTP---RPPVEKPRPEVPQSLETATSHGSAQVpEPPPVHDLEP 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  287 ASSSPPQGTSDTPASSSPPqvtsatsasssppqgtsdtpassSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS 366
Cdd:PHA03379   454 GPLHDQHSMAPCPVAQLPP-----------------------GPLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWE 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  367 SSPPQGTSDTPASSSP----------PQGTSETPASNSPP----QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PHA03379   511 ASLSQVPGVAFAPVMPqpmpvepvpvPTVALERPVCPAPPliamQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRD 590
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  433 SPTQVTSET-----PASSSPTQVTSDTPAS-----NSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS---DTPASSSP 499
Cdd:PHA03379   591 RLARLRAEAqpyqaSVEVQPPQLTQVSPQQpmeypLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDlplQQPISQGA 670
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  500 PQVTSDTPASSSPPqVTSETPASSSPPqVTSDTSASISPPQVISDTPAssSPPQVtseTPASSSPTNMTSDTPASSSPTN 579
Cdd:PHA03379   671 PLAPLRASMGPVPP-VPATQPQYFDIP-LTEPINQGASAAHFLPQQPM--EGPLV---PERWMFQGATLSQSVRPGVAQS 743
                          410       420
                   ....*....|....*....|....*..
gi 1039789982  580 MTSDTPASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03379   744 QYFDLPLTQPINHGAPAAHFLHQPPME 770
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
371-484 7.00e-06

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 51.24  E-value: 7.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  371 QGTSDTPASSSP--PQGTSETPAS-NSPPQGTSeTPGFSSPPQVTTATLV--SSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:PLN02217   545 QGDAWIPGKGVPyiPGLFAGNPGStNSTPTGSA-ASSNTTFSSDSPSTVVapSTSPPAGHLGSPPATPSKIVSPSTSPPA 623
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982  446 SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PLN02217   624 SHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESS 662
PHA03291 PHA03291
envelope glycoprotein I; Provisional
351-448 1.01e-05

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 50.34  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PASSSPPQGTLDTPSSSSPPQ-GTSD--TPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:PHA03291   176 PLGEGSADGSCDPALPLSAPRlGPADvfVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG 255
                           90       100
                   ....*....|....*....|.
gi 1039789982  428 TPASSSPTqVTSETPASSSPT 448
Cdd:PHA03291   256 TPAPPTPG-GGEAPPANATPA 275
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1020-1059 1.10e-05

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 44.30  E-value: 1.10e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1039789982  1020 TQCYFWDRYNRTWKSDGCQVGPKS-TIlkTQCLCDHLTFFS 1059
Cdd:smart00303    3 PICVFWDESSGEWSTRGCELLETNgTH--TTCSCNHLTTFA 41
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
487-666 1.29e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 50.62  E-value: 1.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  487 PQVTSDT-PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK07003   360 PAVTGGGaPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  566 N--MTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH 643
Cdd:PRK07003   440 DdaADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519
                          170       180
                   ....*....|....*....|...
gi 1039789982  644 PQMSFQSSTSQQALDETAGERVP 666
Cdd:PRK07003   520 DAPAAAAPPAPEARPPTPAAAAP 542
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
494-656 1.31e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PHA03307    22 PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPP---TGPPPGPGTEAPANESRSTPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  574 SSSPTNmtSDTPASSSPTNMTSDTPASSSP---PWPVITEVTRPE-STIPAGRSLANITSKAQEDSPLGVISTHPQMSFQ 649
Cdd:PHA03307    99 SPAREG--SPTPPGPSSPDPPPPTPPPASPppsPAPDLSEMLRPVgSPGPPPAASPPAAGASPAAVASDAASSRQAALPL 176

                   ....*..
gi 1039789982  650 SSTSQQA 656
Cdd:PHA03307   177 SSPEETA 183
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
395-532 1.31e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 50.47  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  395 PPQGTSETPG-FSSPPQVTTATLVSSSPPQVTSETpaSSSPTqvTSETPASSSPTQVTSDTPASnsppqgtsdtpgfssP 473
Cdd:PLN02217   551 PGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFS--SDSPS--TVVAPSTSPPAGHLGSPPAT---------------P 611
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  474 TQVTTAtlvSSSPPQ----VTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PLN02217   612 SKIVSP---STSPPAshlgSPSTTPSSPESSIKVASTE-TASPESSIKVASTESSVSMVSMST 670
motB PRK12799
flagellar motor protein MotB; Reviewed
477-595 1.33e-05

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 50.10  E-value: 1.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  477 TTATLVSSSPPQVTSDTPASSSPPqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799   307 SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMST 381
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982  557 ETPASSSPTNMtsdTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK12799   382 TETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPTS 417
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
321-469 1.43e-05

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 50.05  E-value: 1.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  321 TSDTPASSSPPQ----VTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqgTSDTPASSSPPQGTSETPASNSPP 396
Cdd:pfam05539  191 SQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPS--GSPQHPPSTTSQDQSTTGDGQEHT 268
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  397 QGTSETPGFSSPPQV-TTATLVSSSPPQVTSE-TPASSSPTQVTSeTPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:pfam05539  269 QRRKTPPATSNRRSPhSTATPPPTTKRQETGRpTPRPTATTQSGS-SPPHSSPPGVQANPTTQNLVDCKELDPPK 342
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
403-558 1.45e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 50.09  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  403 PGFSSPPQVTTATLVSSSPPQVTSETPASSspTQVTSETPASSSPTQ---VTSDTPASNSPPQGTSdTPGFSSPTQVTTA 479
Cdd:PLN02217   514 PGAAITKRVTWPGIKKLSDEEILKFTPAQY--IQGDAWIPGKGVPYIpglFAGNPGSTNSTPTGSA-ASSNTTFSSDSPS 590
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  480 TLV--SSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqVTSDTSASISPPQVISDTPASSSPPQVTSE 557
Cdd:PLN02217   591 TVVapSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESS-IKVASTETASPESSIKVASTESSVSMVSMS 669

                   .
gi 1039789982  558 T 558
Cdd:PLN02217   670 T 670
PRK13914 PRK13914
invasion associated endopeptidase;
352-588 1.45e-05

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 50.19  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSSSPPQGTsdtPASSSPPQGTSETPASNSPPQG--TSETPGFSSppQVTTATLVSSSPPQVTSeTP 429
Cdd:PRK13914   143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVaeTKETPVVDQ--NATTHAVKSGDTIWALS-VK 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  430 ASSSPTQVTSETPASSSPTQVTSD----TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:PRK13914   217 YGVSVQDIMSWNNLSSSSIYVGQKlaikQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPK 296
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPASSSPPqvtseTPAssspPQVTSDTSASISPPQVISDTPASSSPPQVTSETpaSSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:PRK13914   297 APTEAAKP-----APA----PSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTN--TNSNTNTNSNTNANQGSSNNNSNSS 365

                   ...
gi 1039789982  586 ASS 588
Cdd:PRK13914   366 ASA 368
DUF612 pfam04747
Protein of unknown function, DUF612; This family includes several uncharacterized proteins ...
228-575 1.58e-05

Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.


Pssm-ID: 282585 [Multi-domain]  Cd Length: 511  Bit Score: 50.06  E-value: 1.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  228 QVTSDTLASSSP-PQGTSDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSppqgTSDTPASSSPPQ 306
Cdd:pfam04747  181 KVANDRSAAPAPePKTPTNTPAEPAE-QVQEITGKKNKKNKKKSESEATAAPASVEQVVEQPKV----VTEEPHQQAAPQ 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  307 vtSATSASSSPPQGTSDTPASSSPPQvtsatsasssppqgtsdTPASSSPPqgtldtPSSSSPPQGTSDTPASSSppQGT 386
Cdd:pfam04747  256 --EKKNKKNKRKSESENVPAASETPV-----------------EPVVETTP------PASENQKKNKKDKKKSES--EKV 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  387 SETPASNSPPQgtSETPGFSSPPQVTTATLVSSSPPQVTSETPASssPTQVTSETPASSSPTQVTSdTPASNSPPQGTSD 466
Cdd:pfam04747  309 VEEPVQAEAPK--SKKPTADDNMDFLDFVTAKEEPKDEPAETPAA--PVEEVVENVVENVVEKSTT-PPATENKKKNKKD 383
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  467 TPGfSSPTQVTTATLVSS-SPPQVT----SDTPASSSPPQVTSDTPASSSPPQVtsETPASSSPPQVTSDTSASISPPQV 541
Cdd:pfam04747  384 KKK-SESEKVTEQPVESApAPPQVEqvveTTPPASENKKKNKKDKKKSESEKAV--EEPVQAAPSSKKPTADDNMDFLDF 460
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1039789982  542 ISDTPASSSPPQVTSETPASSSPTNMTSDTPASS 575
Cdd:pfam04747  461 VTAKPDKSESVEEHIAAPMIVEPAHADEETAAAA 494
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
224-422 1.61e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039789982  382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
468-628 1.62e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 50.09  E-value: 1.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  468 PGFSSPTQVTTATLVSSSPPQVTSDTPASssppQVTSDT--PASSSP--PQVTSETPASSsppqvTSDTSASISPpqviS 543
Cdd:PLN02217   514 PGAAITKRVTWPGIKKLSDEEILKFTPAQ----YIQGDAwiPGKGVPyiPGLFAGNPGST-----NSTPTGSAAS----S 580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  544 DTPASSSPPQvTSETPASSSPTNMTSDTPASSSpTNMTSDTPASSSPTNMTSDTPAS-SSPPWPVITEVTRPESTIPAGR 622
Cdd:PLN02217   581 NTTFSSDSPS-TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSpESSIKVASTETASPESSIKVAS 658

                   ....*.
gi 1039789982  623 SLANIT 628
Cdd:PLN02217   659 TESSVS 664
Mating_C pfam12737
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ...
361-632 1.64e-05

C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.


Pssm-ID: 372279 [Multi-domain]  Cd Length: 412  Bit Score: 49.60  E-value: 1.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSSSSPPQGTSDTPASSSpPQGTSETPASNSPpqgtseTPGFSSPPQVTTATLVSSSPPQVTSEtpassspTQVTSE 440
Cdd:pfam12737  123 LDSPSSSSSPEKCLPSPAPSE-QEALSEISAACGP------TPSTLTPLNVAPSLTPSKKRKRCLSD-------GFDGPK 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  441 TPASSSPT---QVTSDT-PASNSPPQGTSDTPGFSSPtqvtTATLVSSSPPQVTSDTPASSSP-----------PQVTSD 505
Cdd:pfam12737  189 RPPNKRVQprpQTVSDPfPTSTSIPEWDEWLQNHMSP----SLTLHGDIPPPVSVEAPDSNTPldieifnfpyhPDLTPS 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPASSSPPQVTSETPASSSPP-------QVTSDTSASISPPQVISDTPAS----SSPPQVTSETPASSSPTNMTSDTPAS 574
Cdd:pfam12737  265 PAPSLSDSVIEVATPTTESDYmcngtlrQTFSWFEFDFPELIQPTNTPASnnelSLPFDPSTDIVVSRTILPLLDWRSQS 344
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  575 SSPTNMTSDTPA-----SSSPTnmTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQ 632
Cdd:pfam12737  345 FLSQTFASPPHSilrsnSSSPD--VSAFALDLTPAFTPITYSLSESEKEAKRRELEELEARLQ 405
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
476-616 1.66e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.81  E-value: 1.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  476 VTTATLVSSSPPQVTSDTPASSSPPQvtsDTPASSSPPQVTSETPASSSPPQvtsdtsASISPPQVISDTPASSSPPqvt 555
Cdd:PRK14950   355 VIEALLVPVPAPQPAKPTAAAPSPVR---PTPAPSTRPKAAAAANIPPKEPV------RETATPPPVPPRPVAPPVP--- 422
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  556 seTPASSSPTNMTSDTPASSSPtnmTSDTPASSSPTNMTSDTPASS----SPPWPVITEVTRPES 616
Cdd:PRK14950   423 --HTPESAPKLTRAAIPVDEKP---KYTPPAPPKEEEKALIADGDVleqlEAIWKQILRDVPPRS 482
rne PRK10811
ribonuclease E; Reviewed
474-620 1.81e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 50.04  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSP----------PQVTSETPASSSPPQVTSDTSASIS-----P 538
Cdd:PRK10811   848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPvveavaevveEPVVVAEPQPEEVVVVETTHPEVIAapvteQ 927
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  539 PQVISDTPASssppqVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTI 618
Cdd:PRK10811   928 PQVITESDVA-----VAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVA 1002

                   ..
gi 1039789982  619 PA 620
Cdd:PRK10811  1003 PA 1004
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
226-466 1.81e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.98  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  226 RPQVTSDTLASSSPPQGTSDTPASSSPPQVTSAtsasssppqGTSDTPASSSPPqvtsatsasssppqgtSDTPASSSPP 305
Cdd:PRK07764   583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARP---------AAPAAPAAPAAP----------------APAGAAAAPA 637
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  306 QVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQgtlDTPSSSSPPQGTSDTPASSSPPQG 385
Cdd:PRK07764   638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAP---AAPAGAAPAQPAPAPAATPPAGQA 714
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  386 TSETPASNSPPQGTSETPGFSS-----PPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSP 460
Cdd:PRK07764   715 DDPAAQPPQAAQGASAPSPAADdpvplPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMD 794

                   ....*.
gi 1039789982  461 PQGTSD 466
Cdd:PRK07764   795 DEDRRD 800
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
356-555 1.89e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 49.92  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  356 PPQGTLDTPSSSSPPQG----------TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVttatlvssSPPQVT 425
Cdd:PLN03209   382 PPTSPIPTPPSSSPASSksvdavakpaEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDL--------KPPTSP 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  426 SETPASSSPTQVTSETPASSSPtqvtsDTPASNSPPQGTSDTPGFSSPtqvTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:PLN03209   454 SPTAPTGVSPSVSSTSSVPAVP-----DTAPATAATDAAAPPPANMRP---LSPYAVYDDLKPPTSPSPAAPVGKVAPSS 525
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  506 TPAssSPPQVTSETPASSSPPQVTSDTSAS-ISPPQVISDT--PASSSPPQVT 555
Cdd:PLN03209   526 TNE--VVKVGNSAPPTALADEQHHAQPKPRpLSPYTMYEDLkpPTSPTPSPVL 576
PHA02682 PHA02682
ORF080 virion core protein; Provisional
348-564 2.38e-05

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 48.32  E-value: 2.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  348 SDTPAS--SSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGT-------SETPASNSPPQGTSetPGFSSPPqvttatlVS 418
Cdd:PHA02682    19 ADTSSSlfTKCPQATIPAPAAPCPPDADVDPLDKYSVKEAGryyqsrlKANSACMQRPSGQS--PLAPSPA-------CA 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  419 SSPPQVTSETPASSSPTqVTSETPASSSPTqvtsdTPASNSPPQGTSDTPGfssptqvttatlvsssppqvtsdTPASSS 498
Cdd:PHA02682    90 APAPACPACAPAAPAPA-VTCPAPAPACPP-----ATAPTCPPPAVCPAPA-----------------------RPAPAC 140
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  499 PPQVTSDTPAsssPPQVTSEtPASSSPPQVTSDtsaSISPPQVisdtPASSSPpqvTSETPASSSP 564
Cdd:PHA02682   141 PPSTRQCPPA---PPLPTPK-PAPAAKPIFLHN---QLPPPDY----PAASCP---TIETAPAASP 192
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
436-638 2.75e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 49.15  E-value: 2.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvtSDTPASsspPQVTSDTPASSSPPQV 515
Cdd:PLN03209   304 EVIAETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPI--EEEPPQ---PKAVVPRPLSPYTAYE 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  516 TSETPASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQV------TSET-----------------PASSSPTNM 567
Cdd:PLN03209   379 DLKPPTSPIPTPPSSSPASSksvdaVAKPAEPDVVPSPGSASNVpevepaQVEAkktrplspyaryedlkpPTSPSPTAP 458
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  568 TSDTPASSSPT--NMTSDTPASSSPTNMTSDTPASSSP--PWPVITEVTRPESTIPAGRSLANITSKAQEDSPLG 638
Cdd:PLN03209   459 TGVSPSVSSTSsvPAVPDTAPATAATDAAAPPPANMRPlsPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVG 533
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
322-589 2.78e-05

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 49.44  E-value: 2.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  322 SDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG-TS 400
Cdd:pfam08580  427 NKTPGSSPPSS--------------VIMTPVNKGSKTPSSRRGSSFDFGSSSERVINSKLRRESKLPQIASTLKQTKrPS 492
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  401 ETPGFSSPPQVTtatlvSSSPPQ-VTSETPA-SSSPTQVTSETPASSSPTQVTSDTPASNSPPQ-GTSDTPGFSSPTQVT 477
Cdd:pfam08580  493 KIPRASPNHSGF-----LSTPSNtATSETPTpALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHnFKPLTLTTPSPTPSR 567
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  478 TATLVSSSPPQVTSDTPASSSPpqVTSDTPASSSPPQVTSETPASSSPPQvtsdtsasiSPPQVISDTPASSSP-PQVTS 556
Cdd:pfam08580  568 SSRSSSTLPPVSPLSRDKSRSP--APTCRSVSRASRRRASRKPTRIGSPN---------SRTSLLDEPPYPKLTlSKGLP 636
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1039789982  557 ETPASSSPTNMTSDTPASSSPTNMTSDTPASSS 589
Cdd:pfam08580  637 RTPRNRQSYAGTSPSRSVSVSSGLGPQTRPGTS 669
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
414-621 2.80e-05

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 49.44  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQV-----TSDTPASNSP-----PQGTSDTPGFSSPTQVTTATLVS 483
Cdd:pfam08580  422 ATLVANKTPGSSPPSSVIMTPVNKGSKTPSSRRGSSFdfgssSERVINSKLRresklPQIASTLKQTKRPSKIPRASPNH 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPPQVTSDTPASSSPpqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS 563
Cdd:pfam08580  502 SGFLSTPSNTATSETP------TPALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPSRSSRSSSTL 575
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  564 PTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAG 621
Cdd:pfam08580  576 PPVSPLSRDKSRSPApTCRSVSRASRRRASRKPTRIGSPNSRTSLLDEPPYPKLTLSKG 634
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
508-599 2.81e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 49.47  E-value: 2.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT-PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PRK11907    18 LTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEART 97
                           90
                   ....*....|...
gi 1039789982  587 SSSPTnmTSDTPA 599
Cdd:PRK11907    98 VTPAA--TETSKP 108
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
345-655 2.88e-05

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 49.15  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSEtpgFSSPPQVTTatlVSSSPPQ 423
Cdd:cd22536     90 QGVSAATSSAAPSSSN----NGSTSPTKVKAGNSNASAPGQFQVIQVQNMQnPSGSVQ---YQVIPQIQT---VEGQQIQ 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  424 VTSETPASSSPTQVTSE-TPASS--SPTQVTSDTPASNSPPQGTSDT-------PGFSSPTQVTTatlVSSSPPQVTSDT 493
Cdd:cd22536    160 ISPANATALQDLQGQIQlIPAGNnqAILTTPNRTASGNIIAQNLANQtvpvqirPGVSIPLQLQT---IPGAQAQVVTTL 236
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSppQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMT----- 568
Cdd:cd22536    237 PINIG--GVTLALP-------VINNVAAGGGSGQLVQPSDGGVSNGNQLVSTPITTASVSTMPESPSSSTTCTTTastsl 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  569 --SDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVI-TEVTRPESTIPAGRSLANITSKAQE--------DSPL 637
Cdd:cd22536    308 tsSDTLVSSAETGQYASTAASSERTEEEPQTSAAESEAQSSSqLQSNGLQNVQDQSNSLQQVQIVGQPilqqiqiqQPQQ 387
                          330
                   ....*....|....*...
gi 1039789982  638 GVISTHPQMSFQSSTSQQ 655
Cdd:cd22536    388 QIIQAIQPQSFQLQSGQT 405
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
472-564 2.88e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.04  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  472 SPTQVTTATLVSSSPPQVTsDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsisPPQVISDTPASSSP 551
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH---TPESAPKLTRAAIP 437
                           90
                   ....*....|...
gi 1039789982  552 PQVTSETPASSSP 564
Cdd:PRK14950   438 VDEKPKYTPPAPP 450
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
413-520 2.90e-05

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 48.58  E-value: 2.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  413 TATLVSSSPPQVTSETPASSSPTQV--TSETPASSSPTQVTSDTPASNsppQGTSDTPGFSSPTQVTTATLVSSSPPQVT 490
Cdd:PRK13335    55 TAGANSATTQAANTRQERTPKLEKApnTNEEKTSASKIEKISQPKQEE---QKSLNISATPAPKQEQSQTTTESTTPKTK 131
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1039789982  491 SDTPASSSPPQ----VTSDTPASSSPPQVTSE-TP 520
Cdd:PRK13335   132 VTTPPSTNTPQpmqsTKSDTPQSPTIKQAQTDmTP 166
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
534-633 3.14e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 49.47  E-value: 3.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  534 ASISPPQVISDTPASSSPPQvTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTR 613
Cdd:PRK11907    18 LTASNPKLAQAEEIVTTTPA-TSTEAEQTTPV----ESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTT 92
                           90       100
                   ....*....|....*....|
gi 1039789982  614 PEStiPAGRSLANITSKAQE 633
Cdd:PRK11907    93 SEA--RTVTPAATETSKPVE 110
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
358-566 3.53e-05

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 48.83  E-value: 3.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  358 QGTLDTPSSSSP--PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12727    52 QRALETARSDTPatAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALRQPVSVPRQAPAAAPVR 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  436 QVTSETPAssspTQVTSDTPASNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12727   132 AASIPSPA----AQALAHAAAVRTAPRQEHALS--AVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAY 205
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  516 TSETPASSSPPQVTSDTS--ASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK12727   206 AQDDDEQLDDDGFDLDDAlpQILPPAALPPIVVAPAAPAALAAVAAAAPAPQN 258
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
411-549 3.79e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 48.65  E-value: 3.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  411 VTTATLVSSSPPQvtSETPASSSPTQVTsETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVT 490
Cdd:PRK14950   355 VIEALLVPVPAPQ--PAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV---PHTPESA 428
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  491 SDTPASSSPPQVtsdTPASSSPPQVTSETPASSSPPQVTSDTSASIspPQVISDTPASS 549
Cdd:PRK14950   429 PKLTRAAIPVDE---KPKYTPPAPPKEEEKALIADGDVLEQLEAIW--KQILRDVPPRS 482
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
426-559 3.93e-05

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 48.41  E-value: 3.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG--FSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQV 502
Cdd:PTZ00436   208 AAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAkaAAPPAKAAAPPAKAAAPPAKAAAPPAkAAAPPAK 287
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  503 TSDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTPA-SSSPPQVTSETP 559
Cdd:PTZ00436   288 AAAPPAkAAAAPAKAAAAPakAAAAPAKAAAPPAKAAAPPAKAATPPAkAAAPPAKAAAAP 348
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
225-461 4.18e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 4.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  225 GRPQVTSDTLASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQGTSDTPASSSppQVTSATSASSSPPQGTSDTPASSS 303
Cdd:PRK07003   383 PGARAAAAVGASAVPAVTAVTGAAGAALaPKAAAAAAATRAEAPPAAPAPPATA--DRGDDAADGDAPVPAKANARASAD 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  304 PPqvTSATSASSSPPQGTSDTPASSSPPQVTSATSasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPP 383
Cdd:PRK07003   461 SR--CDERDAQPPADSGSASAPASDAPPDAAFEPA--------PRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAP 530
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  384 QGTSETPASNSPP---------------------QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PRK07003   531 EARPPTPAAAAPAaraggaaaaldvlrnagmrvsSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPP 610
                          250
                   ....*....|....*....
gi 1039789982  443 ASSSPTQVTSDTPASnSPP 461
Cdd:PRK07003   611 NGAARAEQAAESRGA-PPP 628
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
405-521 4.75e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 48.70  E-value: 4.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  405 FSSPPQVTTATLVSSSPPQVTSETPASssptqvtseTPASSSPTQVTSDTPASNSppqGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PRK11907     6 FSKSAVALTLALLTASNPKLAQAEEIV---------TTTPATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPS 73
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039789982  485 SPPQVTSDTPASSSPPQVTSDTPasSSPPQVTSETPA 521
Cdd:PRK11907    74 SSETAETSDPTSEATDTTTSEAR--TVTPAATETSKP 108
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
293-521 5.04e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.72  E-value: 5.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  293 QGTSDTPASSSPPQVTSATSASSSPpqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQG 372
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAP---AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 TSDTPASSSPPQgtseTPASNSPPQgtsetpgfSSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12323   445 GGAPAPAPAPAA----APAAAARPA--------AAGPRPVAAAAAAAPARA----APAAAPAPADDDPPPWEELPPEFAS 508
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  453 DTPASNSPPQGTSDTPGFSSP-TQVTTATLVSSSPPQVTSDTPASSSP-----PQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK12323   509 PAPAQPDAAPAGWVAESIPDPaTADPDDAFETLAPAPAAAPAPRAAAAtepvvAPRPPRASASGLPDMFDGDWPA 583
PHA02732 PHA02732
hypothetical protein; Provisional
355-609 5.07e-05

hypothetical protein; Provisional


Pssm-ID: 165099 [Multi-domain]  Cd Length: 1467  Bit Score: 48.60  E-value: 5.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  355 SPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSETPGFS-SPPQVTTATLVSSSPPQ----VTSET 428
Cdd:PHA02732  1074 SPSYIFLNSWASSYVAPGFLGSPYALPYFMNQTSALVGNTAlPKGLNVFSGYMfGAGTVASAFLYMNSTPQspvlALLLA 1153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTP------ASNSPPQG----TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PHA02732  1154 PYISYKFNALSLGFSITADAAIFSLFGipapqlLSSYIPTGsvlyQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASS 1233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  499 PPQVTSDTPASSSPPQVTSETpASSSPP--QVTSDTSASISPPQVISdtPASSSPP--QVTSETPASSSPTNMTS-DTPA 573
Cdd:PHA02732  1234 PPAATTPTPPPSSSSSSSAQS-ISTSPGqiQIVLNGSTTIHINFLFF--PALSTPKigQILAMPIVNSSGAFISLyVNSA 1310
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1039789982  574 SSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT 609
Cdd:PHA02732  1311 ISANFNVTIEYVFSNGTVIKRFTDEPGQIFPLPLIN 1346
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
448-585 5.30e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 48.23  E-value: 5.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  448 TQVTSDTPASNSPPQGTSdtPGFS---SPTQVTTATLVSSSPPQvTSDTPASSSPPQVTsdtPASSSPPQVtSETPASSS 524
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSAT---QPAGTPPTV-SVDPPAAV 435
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  525 PPQVTSDTSASISPPQVISDTPASSSppQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:PRK14971   436 PVNPPSTAPQAVRPAQFKEEKKIPVS--KVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
470-573 5.35e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 48.70  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvTSDTPASSSPPQVTSET---PASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PRK11907     6 FSKSAVALTLALLTASNPKLAQAEEIVTTTP--ATSTEAEQTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDP 83
                           90       100
                   ....*....|....*....|....*..
gi 1039789982  547 ASSSPPQVTSETPASSSPTnmTSDTPA 573
Cdd:PRK11907    84 TSEATDTTTSEARTVTPAA--TETSKP 108
motB PRK12799
flagellar motor protein MotB; Reviewed
373-504 5.54e-05

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 48.17  E-value: 5.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  373 TSDTPASSSPPqgTSETPASNSPPQGTSETPGfssppQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12799   296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPS-----PAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  453 DTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK12799   369 AEPVNMQPQPMSTTETQQSSTGNITSTA---NGPTTSLPAAPASNIPVSPTS 417
PLN03223 PLN03223
Polycystin cation channel protein; Provisional
1996-2132 5.75e-05

Polycystin cation channel protein; Provisional


Pssm-ID: 215637 [Multi-domain]  Cd Length: 1634  Bit Score: 48.79  E-value: 5.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1996 LRGFLLLFATVRVWDLLRHHAQLQVINKTLSKAWDEVLGFILIIVVLLSSYAMTFNLLFGWSISDYQSFFRSIVTVVGLL 2075
Cdd:PLN03223  1294 LSGINIILLLGRILKLMDFQPRLGVITRTLWLAGADLMHFFVIFGMVFVGYAFIGHVIFGNASVHFSDMTDSINSLFENL 1373
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982 2076 MG-----TSKHKEVIALYPILGSLLVLSSIILMGLVIINLFVSAILIAFG--KERKACEVSNQT 2132
Cdd:PLN03223  1374 LGdityfNEDLKNLTGLQFVVGMIYFYSYNIFVFMILFNFLLAIICDAFGevKANAAETVSVHT 1437
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
347-580 5.80e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 48.53  E-value: 5.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQgTLDTPSSSSPPQgtsdTPASSSPPQGTsETPASNSPPQgtSETPGFSspPQVTTATLVSSSppqvts 426
Cdd:PTZ00449   617 LLDIPKSPKRPE-SPKSPKRPPPPQ----RPSSPERPEGP-KIIKSPKPPK--SPKPPFD--PKFKEKFYDDYL------ 680
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 eTPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASS---SPPQVT 503
Cdd:PTZ00449   681 -DAAAKSKETKTTVVLDESFESILKETLPETPGTP---FTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIeffTPPEEE 756
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  504 S----DTPASSSPPQVTSETPASsspPQVTSDTSASISP------PQVISDTPASSSP--PQ--------VTSETPASSS 563
Cdd:PTZ00449   757 RtffhETPADTPLPDILAEEFKE---EDIHAETGEPDEAmkrpdsPSEHEDKPPGDHPslPKkrhrldglALSTTDLESD 833
                          250
                   ....*....|....*..
gi 1039789982  564 PTNMTSDtpASSSPTNM 580
Cdd:PTZ00449   834 AGRIAKD--ASGKIVKL 848
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
471-591 6.57e-05

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 47.64  E-value: 6.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  471 SSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PTZ00436   220 AAPAKAAAAPAKAAAPPAKAAAAPAkAAAAPAKAAAPPAkAAAPPAKAAAPPakAAAPPAKAAAPPAKAAAPPAKAAAAP 299
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1039789982  547 A-SSSPPQVTSETPA-SSSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PTZ00436   300 AkAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
521-635 7.26e-05

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 45.72  E-value: 7.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  521 ASSSPPQVTSDTSASISPpQVISDTPASSSPPQVTSETP-----ASSSPT--------NMTSDTPASSSPTNMTSDTPAS 587
Cdd:pfam09595   31 ASLILIGESNKEAALIIT-DIIDININKQHPEQEHHENPplneaAKEAPSesedapdiDPNNQHPSQDRSEAPPLEPAAK 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  588 SSPT----NMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDS 635
Cdd:pfam09595  110 TKPSehepANPPDASNRLSPPDASTAAIREARTFRKPSTGKRNNPSSAQSDQ 161
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
321-609 7.38e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 7.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  321 TSDTPA-------SSSPPQVTSATSASSSPPQGTSDTPASSSPPQgtLDTPSSSSPPQgtSDTPASSSPPQgtSETPASN 393
Cdd:NF033839   280 TQDTPKepgnkkpSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQ--LEKPKPEVKPQ--PEKPKPEVKPQ--LETPKPE 353
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  394 SPPQGTSETPGFSSPPQvttaTLVSSSPPQVTSETPasssptQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFsSP 473
Cdd:NF033839   354 VKPQPEKPKPEVKPQPE----KPKPEVKPQPETPKP------EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEV-KP 422
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  474 TQVTTATLVSSSPPqvtsdTPASSSPPQvtSDTPASSSPPQvtSETPASSSPPQVtsdtsasisppqvisDTPASSSPPQ 553
Cdd:NF033839   423 QPEKPKPEVKPQPE-----KPKPEVKPQ--PEKPKPEVKPQ--PETPKPEVKPQP---------------EKPKPEVKPQ 478
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  554 VTSETPASSSPtnmTSDTPASSSPTNMTSDTPAS--SSPTNMTSDTPASSSPPWPVIT 609
Cdd:NF033839   479 PEKPKPDNSKP---QADDKKPSTPNNLSKDKQPSnqASTNEKATNKPKKSLPSTGSIS 533
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
415-590 7.39e-05

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 47.62  E-value: 7.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  415 TLVSSSPPQVTSETPA-----SSSPTQVTSETPASSSPTQVT---SDTPASNSPPqgTSDTPGFSSPTQVTTATlVSSSP 486
Cdd:pfam16014    1 ALGSSPRPSILRKKPAtegakPKPDIHVAVAPPVTVAVEALPgqnSEQQTASASP--PSQHPAQAIPTILAPAA-PPSQP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  487 PQVTSDTPASS--SPPQVTSDTPASSSPPQvtsetPASSSPPQ-VTSDTSASISPPQVISDTPASSSPPQVTSETPASSs 563
Cdd:pfam16014   78 SVVLSTLPAAMavTPPIPASMANVVAPPTQ-----PAASSTAAcAVSSVLPEIKIKQEAEPMDTSQSVPPLTPTSISPA- 151
                          170       180
                   ....*....|....*....|....*..
gi 1039789982  564 ptnMTSDTPASSSPtnmTSDTPASSSP 590
Cdd:pfam16014  152 ---LTSLANNLSVP---AGDLLPGASP 172
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
365-554 7.55e-05

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 46.76  E-value: 7.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  365 SSSSPpqgtsdTPASSSPPQGTSETPASNSppQGTSETPGFSSPPQVTTATLVSSSPPQVTS-ETPASSSPTQVTSETPA 443
Cdd:PLN02983     3 SLSVP------CAKTAAAAANVGSRLSRSS--FRLQPKPNISFPSKGPNPKRSAVPKVKAQLnEVAVDGSSNSAKSDDPK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  444 SSSPTQVTSDTPASNSPPQGTSDTPGFSSP--TQVTT------------------------------------ATLVSSS 485
Cdd:PLN02983    75 SEVAPSEPKDEPPSNSSSKPNLPDEESISEfmTQVSSlvklvdsrdivelqlkqldcelvirkkealpqppppAPVVMMQ 154
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  486 PPQVTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQ--VISDTPASSSPPQV 554
Cdd:PLN02983   155 PPPPHAMPPASPPAAQPAPSAPASSPPP-----TPASPPPAKAPKSSHPPLKSPMagTFYRSPAPGEPPFV 220
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
481-577 7.91e-05

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 46.93  E-value: 7.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  481 LVSSSPPQVTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSspPQVTSDTSASISPPQvisDTPASSSPPQVTSETPa 560
Cdd:PRK13042    15 LLTTGVITTTTQAANATTPSSTKVEAP-QSTPPSTKVEAPQSK--PNATTPPSTKVEAPQ---QTPNATTPSSTKVETP- 87
                           90
                   ....*....|....*..
gi 1039789982  561 sSSPTnmTSDTPASSSP 577
Cdd:PRK13042    88 -QSPT--TKQVPTEINP 101
PRK10856 PRK10856
cytoskeleton protein RodZ;
457-551 7.97e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 47.33  E-value: 7.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  457 SNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:PRK10856   159 GQSVPLDTSTTT--DPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAP 236
                           90
                   ....*....|....*
gi 1039789982  537 SPPQVISDTPASSSP 551
Cdd:PRK10856   237 LPTDQAGVSTPAADP 251
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
353-472 8.21e-05

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 45.72  E-value: 8.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  353 SSSPPQGTLDTPSSSSPPQGTSDtPASSSPPQGTSETPASNSppqGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:pfam09595   66 ENPPLNEAAKEAPSESEDAPDID-PNNQHPSQDRSEAPPLEP---AAKTKPSEHEPANPPDASNRLSPPDASTAAIREAR 141
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1039789982  433 SPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS 472
Cdd:pfam09595  142 TFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
205-585 8.35e-05

Hamartin protein; This family includes the hamartin protein which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein pfam03542. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin pfam03542. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 47.74  E-value: 8.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  205 DPASSAPPKATHRMTITSLTGrpqvTSDTLASSSPPQGTSDTPASSSPPQVTsatsasssppqGTSDTPASS-----SPP 279
Cdd:pfam04388  288 YGSSTSTPSSTPRLQLSSSSG----TSPPYLSPPSIRLKTDSFPLWSPSSVC-----------GMTTPPTSPgmvptTPS 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  280 QVTSATSASSSPPQGTSD---------TPASSSPPQvtsatsasssppqgTSDTPASSSPPQVTSatsasssppqgtsdt 350
Cdd:pfam04388  353 ELSPSSSHLSSRGSSPPEaageatpetTPAKDSPYL--------------KQPPPLSDSHVHRAL--------------- 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PASSSPpqgtldtpssSSPPQgTSDTPASSSPPQgTSETPASNSppqgtseTPGFSSPP-QVTTATLvsSSPPQVTSETP 429
Cdd:pfam04388  404 PASSQP----------SSPPR-KDGRSQSSFPPL-SKQAPTNPN-------SRGLLEPPgDKSSVTL--SELPDFIKDLA 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  430 ASSSPTQVTSETPAS-----SSPTQVTSDTPASNsppqgtsdtPGFSSPTQVTTATLVSSsppQVTSDTPASSSPPQVTS 504
Cdd:pfam04388  463 LSSEDSVEGAEEEAAisqelSEITTEKNETDCSR---------GGLDMPFSRTMESLAGS---QRSRNRIASYCSSTSQS 530
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  505 DTPASSSPPQVTSETPASSSPPQVTSDTSASISPP--QVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam04388  531 DSHGPATTPESKPSALAEDGLRRTKSCSFKQSFTPieQPIESSDDCPTDEQDGENGLETSILTPSPCKIPSRQKVSTQSG 610

                   ...
gi 1039789982  583 DTP 585
Cdd:pfam04388  611 QPL 613
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
507-660 8.47e-05

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 47.35  E-value: 8.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  507 PASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA----SSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam05539  169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAnqrlSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQ 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  583 DTPASSSP-TNMTSD----TPASSSPPWPVITEVTRPESTIPAgrslaniTSKAQED-----SPLGVISTHPQMSFQSST 652
Cdd:pfam05539  249 HPPSTTSQdQSTTGDgqehTQRRKTPPATSNRRSPHSTATPPP-------TTKRQETgrptpRPTATTQSGSSPPHSSPP 321

                   ....*...
gi 1039789982  653 SQQALDET 660
Cdd:pfam05539  322 GVQANPTT 329
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
555-603 8.48e-05

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 45.77  E-value: 8.48e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1039789982  555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
PRK10856 PRK10856
cytoskeleton protein RodZ;
451-538 9.17e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.94  E-value: 9.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  451 TSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK10856   164 LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243

                   ....*...
gi 1039789982  531 DTSASISP 538
Cdd:PRK10856   244 VSTPAADP 251
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
482-620 9.19e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.55  E-value: 9.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  482 VSSSPPQVTSDTPASssppQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK07994   367 EPEVPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSE 442
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  562 SSPTNMTsdTPASSSPTNMTSDTPASSSPTNMTSDTPA----SSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994   443 PAAASRA--RPVNSALERLASVRPAPSALEKAPAKKEAyrwkATNPVEVKKEPVATPKALKKA 503
PRK10856 PRK10856
cytoskeleton protein RodZ;
361-450 9.50e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.94  E-value: 9.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  361 LDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856   164 LDTSTTTDPaTTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243
                           90
                   ....*....|.
gi 1039789982  440 ETPASSSPTQV 450
Cdd:PRK10856   244 VSTPAADPNAL 254
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
483-604 9.62e-05

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 45.33  E-value: 9.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  483 SSSPPQVTSDTpASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASS 562
Cdd:pfam09595   32 SLILIGESNKE-AALIITDIIDININKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKT 110
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  563 SPT----NMTSDTPASSSPTNMTSDTPASSS----PTNMTSDTPAS----SSPP 604
Cdd:pfam09595  111 KPSehepANPPDASNRLSPPDASTAAIREARtfrkPSTGKRNNPSSaqsdQSPP 164
PRK08581 PRK08581
amidase domain-containing protein;
353-550 9.82e-05

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 47.48  E-value: 9.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PRK08581   129 LNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPS---SNNTKPSTSNKQPNSPKPTQPNQSNSQ 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  433 SPTQVTSETPASSSPTQVTSDTPASNSPPQgtsdtpgFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSP 512
Cdd:PRK08581   206 PASDDTANQKSSSKDNQSMSDSALDSILDQ-------YSEDAKKTQKDYASQSKKDKTE-TSNTKNPQLPTQDELKHKSK 277
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039789982  513 PQVTSETPAssspPQVTSDTSASISPPQVISDTPASSS 550
Cdd:PRK08581   278 PAQSFENDV----NQSNTRSTSLFETGPSLSNNDDSGS 311
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
239-599 1.02e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  239 PPQGTSDTPASSSPPQvtsatsasssppqGtsdtPASSSPPQVTSATSASSSPPQGTSDtpaSSSPPQVTSATSASSSPP 318
Cdd:PTZ00449   497 APIEEEDSDKHDEPPE-------------G----PEASGLPPKAPGDKEGEEGEHEDSK---ESDEPKEGGKPGETKEGE 556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  319 QGTSDTPASSSPPqvtsatsasssppqgtSDTPASSSPPQGTLDTPSSSSPpqgtsDTPASSSPPQGTSETPASNSP--P 396
Cdd:PTZ00449   557 VGKKPGPAKEHKP----------------SKIPTLSKKPEFPKDPKHPKDP-----EEPKKPKRPRSAQRPTRPKSPklP 615
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  397 QgTSETPGFSSPPQVTTAtlvSSSPPqvtsetpassSPTQVTSETPASSSPTQVTSDTPASNSPP----------QGTSD 466
Cdd:PTZ00449   616 E-LLDIPKSPKRPESPKS---PKRPP----------PPQRPSSPERPEGPKIIKSPKPPKSPKPPfdpkfkekfyDDYLD 681
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  467 TPGFSSPTqVTTATLVSSSPPQVTSDTPASSSPPQVTSDT--PASSSPPQVTSETPASSSPPQvtSDTSASISPPQ---- 540
Cdd:PTZ00449   682 AAAKSKET-KTTVVLDESFESILKETLPETPGTPFTTPRPlpPKLPRDEEFPFEPIGDPDAEQ--PDDIEFFTPPEeert 758
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  541 VISDTPASSSPPQVTSETPASSsptNMTSDTPASSSPTNmTSDTPASSSPTNmTSDTPA 599
Cdd:PTZ00449   759 FFHETPADTPLPDILAEEFKEE---DIHAETGEPDEAMK-RPDSPSEHEDKP-PGDHPS 812
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
427-588 1.10e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 47.28  E-value: 1.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 ETPASSSPTQVTSETPASSSPTQVTS---DTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSppqvtsDTPASSSPPQVT 503
Cdd:PRK13108   280 EAPGALRGSEYVVDEALEREPAELAAaavASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVT------DEVAAESVVQVA 353
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  504 SDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSS--PTNMT 581
Cdd:PRK13108   354 DRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPakPDELA 433

                   ....*..
gi 1039789982  582 SDTPASS 588
Cdd:PRK13108   434 VAGPGDD 440
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
468-591 1.11e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 47.40  E-value: 1.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  468 PGFSSPTQVT--TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTsdtsASISPPQVISDT 545
Cdd:PRK14951   366 PAAAAEAAAPaeKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP----AAAAPAAAPAAA 441
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1039789982  546 PASSSPPQVTSETPAS--SSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PRK14951   442 PAAVALAPAPPAQAAPetVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
PPE COG5651
PPE-repeat protein [Function unknown];
322-539 1.19e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 46.81  E-value: 1.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  322 SDTPASSSPPQVTSATSASSSPPQGTSDTPAS--------SSPPQGTLDTPSSSSPPQGTsdtpASSSPPQGTSETPASN 393
Cdd:COG5651    163 ALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNpgfanlglTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFAGTGAAA 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  394 SPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:COG5651    239 GAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA 318
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:COG5651    319 AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
460-636 1.19e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.54  E-value: 1.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  460 PPQGTSDTPGFSSPTQVTTAtlVSSSPPQVTSDTPASSSPPQvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003   360 PAVTGGGAPGGGVPARVAGA--VPAPGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  540 Q---VISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTnmTSDTPASSSPTNMTSDtPASSSPPWPVITEVTRPES 616
Cdd:PRK07003   434 AtadRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSG--SASAPASDAPPDAAFE-PAPRAAAPSAATPAAVPDA 510
                          170       180
                   ....*....|....*....|
gi 1039789982  617 TIPAGRSLANITSKAQEDSP 636
Cdd:PRK07003   511 RAPAAASREDAPAAAAPPAP 530
PRK10856 PRK10856
cytoskeleton protein RodZ;
522-603 1.25e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.56  E-value: 1.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  522 SSSPPQVTSDTSASISPPQVISDTPASSSP--PQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPA 599
Cdd:PRK10856   168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                   ....
gi 1039789982  600 SSSP 603
Cdd:PRK10856   248 AADP 251
PLN03131 PLN03131
hypothetical protein; Provisional
353-609 1.26e-04

hypothetical protein; Provisional


Pssm-ID: 178677 [Multi-domain]  Cd Length: 705  Bit Score: 47.08  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  353 SSSPPQGTLDTPSSSSPPQGTSdtpaSSSPPQGTSETPA-SNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PLN03131   335 AGSGSHASLDHFKAPVAPEAAA----PMAPPIDLFQLPAtSPAPPVDLFEIPPLDPAPAINAYQPPQTSLPSSIDLFGGI 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  432 SSPTQVTS---ETPASSSPTqvtSDTPASNSPPQGTSDTPGFS--SPTQVTTATLVSSSPPQVTSDTPASSSPP-QVTSD 505
Cdd:PLN03131   411 TQQQSINSldeKSPELSIPK---NEGWATFDGIQPIASTPGNEnlTPFSIGPSMAGSANFDQVPSLDKGMQWPPfQNSSD 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT-NMTSDTPASSSPT------ 578
Cdd:PLN03131   488 EESASGPAPWLGDLHNVEAPDNTSAQNWNAFEFDDSVAGIPLEGIKQSSEPQTAANMPPTaDQLIGCKALEDFNkdgikr 567
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1039789982  579 ---NMTSDTPASSSPTNMTSDtPASSSPPWPVIT 609
Cdd:PLN03131   568 tapHGQGELPGLDEPSDILAE-PSYTPPAHPIME 600
PRK10856 PRK10856
cytoskeleton protein RodZ;
362-476 1.30e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.56  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  362 DTPSSSSPPQGTSDTP--ASSSPPQGTSETPASNSPPQGTSetpgfSSPPQVTTATlvsssPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856   148 DQSSAELSQNSGQSVPldTSTTTDPATTPAPAAPVDTTPTN-----SQTPAVATAP-----APAVDPQQNAVVAPSQANV 217
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039789982  440 ETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PRK10856   218 DTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNAL 254
PLAT_RAB6IP1 cd01757
PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains ...
1190-1242 1.39e-04

PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains consists of an eight stranded beta-barrel. In RabIP1 this domain may participate in lipid-mediated modulation of Rab6IP1's function via it's generally proposed function of mediating interaction with lipids or membrane bound proteins.


Pssm-ID: 238855  Cd Length: 114  Bit Score: 43.30  E-value: 1.39e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1190 LGDLHGLRLWHDNSGDSPSWYVSQVIVSDMTTRKKWHFQCNCWL-------AVDLGNCER 1242
Cdd:cd01757     52 LGKLTTVQIGHDNSGLLAKWLVEYVMVRNEITGHTYKFPCGRWLgegvddgNGEDGSLER 111
PPE COG5651
PPE-repeat protein [Function unknown];
267-500 1.40e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 46.42  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  267 QGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsATSASSSPPQG 346
Cdd:COG5651    155 AAASAAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGP----IGLNSGPGNTG 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:COG5651    231 FAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGL 310
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:COG5651    311 GAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
208-572 1.44e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 47.30  E-value: 1.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  208 SSAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSDTPassSPPQVTSATSASSSPpqgTSDTPASSSPPQVtsatsa 287
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRD---EATPSIAMENTP---SPPRRTAKITPTTPK---NNYSPTAAGTERV------ 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  288 sssppqgTSDTPASssPPQVTSATSASSSPPQGTSDTPA------SSSPPQVTSatsasssppQGTSDTPASSSPPQGTL 361
Cdd:TIGR00927  141 -------KEDTPAT--PSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTRE---------KVRKYTPSPLGRMVNSY 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  362 DTPSSSSPPQGTSDTPASSsppQGTSETPASNS--PPQGTSETPGFSSPpqVTTATLVSSSPPQVTS--ETPASSSPTQV 437
Cdd:TIGR00927  203 APSTFMTMPRSHGITPRTT---VKDSEITATYKmlETNPSKRTAGKTTP--TPLKGMTDNTPTFLTRevETDLLTSPRSV 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  438 TsETPASSSPTQVTSDTPAS-------NSP--PQGT--SDTPGfSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSD 505
Cdd:TIGR00927  278 V-EKNTLTTPRRVESNSSTNhwglvgkNNLttPQGTvlEHTPA-TSEGQVTISIMTGSSPAETKASTAAwKIRNPLSRTS 355
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  506 TPA-------------SSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPaSSSPTNMTSDTP 572
Cdd:TIGR00927  356 APAvriasatfrglekNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAP-SPSPSALPPGQP 434
PLAT_LOX cd01753
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they ...
1132-1233 1.45e-04

PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.


Pssm-ID: 238851  Cd Length: 113  Bit Score: 43.07  E-value: 1.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982 1132 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPhHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWYV 1211
Cdd:cd01753      3 YKVTVATGSSLFAGTDDYIYLTLVGTAGESEK-QLLDRPGYDFERGAVDEYKVKVPEDLGELLLVRLRKRKYLLFDAWFC 81
                           90       100
                   ....*....|....*....|..
gi 1039789982 1212 SQVIVSDmTTRKKWHFQCNCWL 1233
Cdd:cd01753     82 NYITVTG-PGGDEYHFPCYRWI 102
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
430-603 1.48e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 47.30  E-value: 1.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  430 ASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgfssptQVTTAtlVSSSPPQVT-SDTPasSSPPQVTSDTPA 508
Cdd:TIGR00927   55 SSQQPIKLASRDLSNDEMMMVSSDPPKSSSEMEGEMLAP------QATVG--RDEATPSIAmENTP--SPPRRTAKITPT 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  509 SSSppqvTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPA------SSSPTNMTSDTPA-SSSPTNMT 581
Cdd:TIGR00927  125 TPK----NNYSPTAAGTERVKEDTPA--TPSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKVRKyTPSPLGRM 198
                          170       180
                   ....*....|....*....|..
gi 1039789982  582 SDTPASSspTNMTSDTPASSSP 603
Cdd:TIGR00927  199 VNSYAPS--TFMTMPRSHGITP 218
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
427-578 1.49e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 46.48  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 ETPASSSPTQVTSETPASSSPTQVTSdtpASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPA-SSSPPQVT 503
Cdd:PTZ00436   191 EDAAAAAAAKQKAAAKKAAAPSGKKS---AKAAAPAKAAAAPAKAAapPAKAAAAPAKAAAAPAKAAAPPAkAAAPPAKA 267
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  504 SDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTPA-SSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PTZ00436   268 AAPPAkAAAPPAKAAAPPakAAAPPAKAAAAPAKAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
PHA03255 PHA03255
BDLF3; Provisional
295-461 1.52e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 45.67  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  295 TSDTPASSSppqvtsatsassspPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSppqGTS 374
Cdd:PHA03255    25 TSSGSSTAS--------------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST---GTT 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  375 DTPASSSpPQGTSETPASNSPPQGTSET-PGFSSPPQVTtatlvSSSPPQVTSETPASSSPTQVTSETPASSS-----PT 448
Cdd:PHA03255    88 VTPVPTT-SNASTINVTTKVTAQNITATeAGTGTSTGVT-----SNVTTRSSSTTSATTRITNATTLAPTLSSkgtsnAT 161
                          170
                   ....*....|...
gi 1039789982  449 QVTSDTPasnSPP 461
Cdd:PHA03255   162 KTTAELP---TVP 171
PRK12495 PRK12495
hypothetical protein; Provisional
447-565 1.56e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 45.24  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  447 PT---QVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV-TSDTPASSSPPQVTSETPAS 522
Cdd:PRK12495    62 PTcqqPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTsATDEAATDPPATAAARDGPT 141
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1039789982  523 SSPPQVTSDTSASISPPQvisDTPASSSPPQvTSETPASSSPT 565
Cdd:PRK12495   142 PDPTAQPATPDERRSPRQ---RPPVSGEPPT-PSTPDAHVAGT 180
PRK11901 PRK11901
hypothetical protein; Reviewed
345-530 1.58e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 46.21  E-value: 1.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQgtsetPASNSPPQGTS--ETPGFssppqvttatlVSSSPP 422
Cdd:PRK11901    91 NQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQ-----AAPPQTPNGQQriELPGN-----------ISDALS 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  423 QVTSETPASSSPTQvtseTPASSSPTqvtsdTPASNSPPQGTSdtPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PRK11901   155 QQQGQVNAASQNAQ----GNTSTLPT-----APATVAPSKGAK--VPATAETHPTPPQKPATKKPAVNHHKTATVAVPPA 223
                          170       180
                   ....*....|....*....|....*....
gi 1039789982  503 TSDTPASSSPPQ-VTSETPASSSPPQVTS 530
Cdd:PRK11901   224 TSGKPKSGAASArALSSAPASHYTLQLSS 252
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
542-603 1.82e-04

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 44.61  E-value: 1.82e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  542 ISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
PRK10856 PRK10856
cytoskeleton protein RodZ;
483-564 1.82e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 46.17  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  483 SSSPPQVTSDTPASSSPPQVTSDTPASSSP--PQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPA 560
Cdd:PRK10856   168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                   ....
gi 1039789982  561 SSSP 564
Cdd:PRK10856   248 AADP 251
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
375-554 2.11e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 46.10  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  375 DTPASSSPPQGTSETPASnsPPQGTSETPGfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTqvtsdT 454
Cdd:PTZ00436   192 DAAAAAAAKQKAAAKKAA--APSGKKSAKA-AAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAA-----P 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  455 PASNSPPQGTSDTPgfssPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA-SSSPPQVTSETPASSSppqvtsdt 532
Cdd:PTZ00436   264 PAKAAAPPAKAAAP----PAKAAAPPAKAAAPPAKAAAAPAkAAAAPAKAAAAPAkAAAPPAKAAAPPAKAA-------- 331
                          170       180
                   ....*....|....*....|..
gi 1039789982  533 sasiSPPQVISDTPASSSPPQV 554
Cdd:PTZ00436   332 ----TPPAKAAAPPAKAAAAPV 349
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
429-561 2.15e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 46.25  E-value: 2.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSEtpaSSSPTQVTSDTPASNSPPQgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPA 508
Cdd:PRK14951   366 PAAAAEAAAPAE---KKTPARPEAAAPAAAPVAQ--------AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAP 434
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  509 SSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK14951   435 AAAPAAAPAAVALAPAPPAQAAPETVAIpvrvaPEPAVASAAPAPAAAPAAARLTPTE 492
PHA03255 PHA03255
BDLF3; Provisional
321-497 2.24e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 44.89  E-value: 2.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  321 TSDTPASSSPPQVTSATSASSSPPQGtSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSppqGTSETPASnsppqgts 400
Cdd:PHA03255    25 TSSGSSTASAGNVTGTTAVTTPSPSA-SGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST---GTTVTPVP-------- 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  401 eTPGFSSPPQVTTatlvsssppQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:PHA03255    93 -TTSNASTINVTT---------KVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATK 162
                          170
                   ....*....|....*..
gi 1039789982  481 LVSSSPPQVTSDTPASS 497
Cdd:PHA03255   163 TTAELPTVPDERQPSLS 179
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
147-433 2.38e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 2.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASIWRNEF-GPGPLLPmkRRGAETERHMIPGNGPPLAMCHQPAPPElfETlcfPIDPASSAPPKATHRMTiTSLTG 225
Cdd:pfam03154  294 PPQPFPLTPQSSqSQVPPGP--SPAAPGQSQQRIHTPPSQSQLQSQQPPR--EQ---PLPPAPLSMPHIKPPPT-TPIPQ 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  226 RPQVTSDT----LASSSPPQGTSDTPAsssPPQVTSATSAsssppqgTSDTPASSSPPQVTSATsasssppQGTSDTPAS 301
Cdd:pfam03154  366 LPNPQSHKhpphLSGPSPFQMNSNLPP---PPALKPLSSL-------STHHPPSAHPPPLQLMP-------QSQQLPPPP 428
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  302 SSPPQVTsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSS 381
Cdd:pfam03154  429 AQPPVLT----------QSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSA 498
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  382 PPQGTSETPASNS---PP-QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:pfam03154  499 SVSSSGPVPAAVScplPPvQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
364-499 2.39e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 46.21  E-value: 2.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPqvTSETPASSSPTQVTSETPa 443
Cdd:PRK14959   367 PVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPA--PSAAPSPRVPWDDAPPAP- 443
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  444 sssptqvtsdtPASNSPPQGTSDTPGFSSPT--QVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK14959   444 -----------PRSGIPPRPAPRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHTP 490
AF-4 pfam05110
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ...
178-606 2.41e-04

AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.


Pssm-ID: 461550 [Multi-domain]  Cd Length: 514  Bit Score: 46.27  E-value: 2.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  178 IPGNGPPLAMCHQPAPPelfetlCFPIDPASSAPPKATHrmtitsltgrpqvtsdtlaSSSPPQGTsdtPASSSPPqvts 257
Cdd:pfam05110   75 IPKNSVPQTPQEKPDQP------FFPDKTSGLPPSFHTS-------------------SHSQPMGP---PSSSSPS---- 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  258 atsasSSPPQGTSDTPASSSPPQvtsatsasssPPQGTSDTPASSSPPQvtsatSASSSPPQGTSDTPASSSPPQVTSAT 337
Cdd:pfam05110  123 -----VSSSQSQKKSQARTEPAH----------GGHSSSGSQSSQRSQG-----QSRSKGGQESHSSSHHKRQERREDLF 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  338 SASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPpqGTSETPASNSPPQGTSETPGFSSPpqVTTATLV 417
Cdd:pfam05110  183 SCASLSHSLEELSPLLSSLSSPVKPLSPSHSRQHTGSKAQNSSDH--HGKEYSHSKSPRDSEAGSHGPESP--STSLLAS 258
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  418 SSSPPQVTSETPASSSPTQVTSETPASSSP----TQVTSDTPASNSPPQG----------------TSDTPGFSSPTQVT 477
Cdd:pfam05110  259 SSQLSSQTFPPSLPSKTSAMQQKPTAYVRPmdgqDQAPSESPELKPSPEDyhgqsygklsdlkanaKAKLSKLKIPSQPL 338
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  478 TATLVS--------------SSPPQVTS-DTPASSSP---PQVTSDTPASSSPPQ------VTSETPASSSPPQVT---- 529
Cdd:pfam05110  339 EQSLSNdvhcveeilkemthSWPPPLTAiHTPSTAEPskfPFPTKESQHVTSGYQnqkqydAPSKTLPTSQQGTSMledd 418
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  530 ---------SDTSASISPPQviSDTPASSSPPQVTSETPASSSPTNMTSdtpASSSPTNMTSDTPASSSPTNMTSDTPAS 600
Cdd:pfam05110  419 lklsssedsDDDQAPEKPPP--SSAPPSAPQSQPNSVASAHSSSGESGS---SSDSESSSESDSESESSSSDSEANEPPR 493

                   ....*.
gi 1039789982  601 SSPPWP 606
Cdd:pfam05110  494 SATPEP 499
motB PRK12799
flagellar motor protein MotB; Reviewed
425-556 2.44e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 45.86  E-value: 2.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVts 504
Cdd:PRK12799   296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQS----ATTTQASAVALSSAGVLPSDVTLPGTVALPAA-- 369
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  505 dTPASSSPPQVTSETPASSSPPQVTSDTSasiSPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799   370 -EPVNMQPQPMSTTETQQSSTGNITSTAN---GPTTSLPAAPASNIPVSPTS 417
PHA03193 PHA03193
tegument protein VP11/12; Provisional
375-511 2.76e-04

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 45.86  E-value: 2.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  375 DTPASSS---PPQG--TSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS------SSPTQVTSETPA 443
Cdd:PHA03193   440 DSPFQRKramPEDGgeIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTSNEMkgdaecPAAQDAAAILPA 519
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  444 SSsptQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PHA03193   520 SF---QIENGGAADGSGLAIPAAM---CDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSK 581
PHA03291 PHA03291
envelope glycoprotein I; Provisional
350-435 2.81e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 45.72  E-value: 2.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  350 TPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnspPQGTSETPGFSSPPqvttatlvsssPPQVTSETP 429
Cdd:PHA03291   204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQ---AGTTPEAEGTPAPP-----------TPGGGEAPP 269

                   ....*.
gi 1039789982  430 ASSSPT 435
Cdd:PHA03291   270 ANATPA 275
PRK10856 PRK10856
cytoskeleton protein RodZ;
431-525 2.84e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.40  E-value: 2.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  431 SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdtpgfSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:PRK10856   168 TTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP-------AVDPQQNAVV----APSQANVDTAATPAPAAPATPDGAAP 236
                           90
                   ....*....|....*
gi 1039789982  511 SPPQVTSETPASSSP 525
Cdd:PRK10856   237 LPTDQAGVSTPAADP 251
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
487-594 2.88e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  487 PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVisdTPASSSPPQVtSETPASSSPTN 566
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSA---TQPAGTPPTV-SVDPPAAVPVN 438
                           90       100
                   ....*....|....*....|....*...
gi 1039789982  567 MTSDTPASSSPTNMTSDTPASSSPTNMT 594
Cdd:PRK14971   439 PPSTAPQAVRPAQFKEEKKIPVSKVSSL 466
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
435-572 3.03e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 3.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  435 TQVTSETPASSSPTQVTSDT---PASNSPPQGTSDtpgfssPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPSAAAA------ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039789982  512 PPqvtsetPASSSPPQVTSDTSAS---ISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK14971   437 VN------PPSTAPQAVRPAQFKEekkIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
377-513 3.09e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 3.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  377 PASSSPPQGTSETPASNSPPQGTsetPGFSSPPQVTTATLVSSSPPQVTSET---PASSSPTQVTSETPASSSPTQVTSD 453
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  454 TPASNSPPQGTSDTPGFSSPtqvttatlvsssPPQVTSDTPASSSPPQVTSDTPASSSPP 513
Cdd:PRK14951   443 AAVALAPAPPAQAAPETVAI------------PVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
366-456 3.22e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 46.00  E-value: 3.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  366 SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSET-PGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PRK11907    19 TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEARTV 98
                           90
                   ....*....|..
gi 1039789982  445 SSPTqvTSDTPA 456
Cdd:PRK11907    99 TPAA--TETSKP 108
motB PRK12799
flagellar motor protein MotB; Reviewed
399-530 3.56e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 45.48  E-value: 3.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  399 TSETPGFSSPPqvTTATLVSSSPPQVTSETPASS---SPTQVTSETPASSSPTQVTSDTPASnsppqgtSDTPGFSSPTQ 475
Cdd:PRK12799   296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSPAvipSSVTTQSATTTQASAVALSSAGVLP-------SDVTLPGTVAL 366
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  476 VTTATLVSSSPPQVTSDTPASSSppqvTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK12799   367 PAAEPVNMQPQPMSTTETQQSST----GNITSTANGPTTSLPAAPASNIPVSPTS 417
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
455-573 3.61e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 45.63  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  455 PASNSPPQGTSDTPGfsspTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSdtSA 534
Cdd:PRK07994   366 PEPEVPPQSAAPAAS----AQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK--AK 439
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1039789982  535 SISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK07994   440 KSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA 478
PHA03132 PHA03132
thymidine kinase; Provisional
354-473 4.09e-04

thymidine kinase; Provisional


Pssm-ID: 222997 [Multi-domain]  Cd Length: 580  Bit Score: 45.52  E-value: 4.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  354 SSPPQGTLDTPSSSSPPQGTSDTPasSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PHA03132    65 GVATSTIYTVPRPPRGPEQTLDKP--DSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLYQVNFPVILAPIGES 142
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1039789982  434 PTqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:PHA03132   143 NS--SSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKH 180
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
442-578 4.31e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.48  E-value: 4.31e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  442 PASSSPTQVTSDTPASNSPPQGTsdtPGFSSPTQVTTATLVSSSPPQVTSdtpASSSPPqvtsdTPASSSPPQVTSETPA 521
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAAS---APAAPP-----AAAPPAPVAAPAAAAP 434
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  522 SSSPPQVTSdtSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK14951   435 AAAPAAAPA--AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
204-410 4.44e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 4.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  204 IDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvts 283
Cdd:PRK07764   588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD--- 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  284 atSASSSPPQGTSDTPASSSPPQVtsatsasssppQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDT 363
Cdd:PRK07764   665 --GGDGWPAKAGGAAPAAPPPAPA-----------PAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSS---PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ 410
Cdd:PRK07764   732 SPAADdpvPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
PHA03291 PHA03291
envelope glycoprotein I; Provisional
460-577 4.60e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 44.95  E-value: 4.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  460 PPQGT-SDTPGFSSPTQvttatlvSSSPPQVTSDTPaSSSPPQVTsdTPASSSPPQVTSETPASSSPPQVTSDTSASISP 538
Cdd:PHA03291   167 PAEGTlAAPPLGEGSAD-------GSCDPALPLSAP-RLGPADVF--VPATPRPTPRTTASPETTPTPSTTTSPPSTTIP 236
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1039789982  539 PQVISDTPASSSPPQVTSETPASSSP-TNMTSDTPASSSP 577
Cdd:PHA03291   237 APSTTIAAPQAGTTPEAEGTPAPPTPgGGEAPPANATPAP 276
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
390-530 4.78e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 45.24  E-value: 4.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  390 PASNSPPQGTSetpgfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PRK07994   366 PEPEVPPQSAA-----PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  470 fSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASS--SPPQVTSETPASSSPPQVTS 530
Cdd:PRK07994   441 -SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkATNPVEVKKEPVATPKALKK 502
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
378-468 4.96e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 45.23  E-value: 4.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  378 ASSSPPQGTSETPASNSPPQGTSETPgfSSPPQVTTAT---LVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:PRK11907    18 LTASNPKLAQAEEIVTTTPATSTEAE--QTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEA 95
                           90
                   ....*....|....
gi 1039789982  455 PAsnSPPQGTSDTP 468
Cdd:PRK11907    96 RT--VTPAATETSK 107
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
428-518 4.99e-04

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 44.62  E-value: 4.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPpqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:PRK13042    18 TGVITTTTQAANATTPSSTKVEAPQSTPPSTKV----------EAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETP 87
                           90
                   ....*....|.
gi 1039789982  508 ASSSPPQVTSE 518
Cdd:PRK13042    88 QSPTTKQVPTE 98
CytochromB561_N pfam09786
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ...
350-558 5.01e-04

Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.


Pssm-ID: 462899  Cd Length: 579  Bit Score: 45.20  E-value: 5.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  350 TPASSSPPQGTLDTPSssspPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFS--SPPQVTTATLVSSSPPQVTSE 427
Cdd:pfam09786  129 PPKSKSSPQSPSPVLV----PLHQSVSPSSSESRKGGDKSPAGSGKKLRSFSTSSKSpaSPSVYLRGSPVPLNSSPLPSD 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSpPQVTSDTP 507
Cdd:pfam09786  205 RNYENSVQSSPEIDSAVSTPWSRKRATIGKEIRTEKMLERFLAEVDEKITESAFGKASPSNVSGSANRSGS-TRSTPLRS 283
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  508 ASSSPPQVTSETPASSSPpqvtSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:pfam09786  284 VRMSPGSQKFTTPPKKGE----GDLPSPMSMEENIEAFENLGIYPQIEQWR 330
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
357-501 5.50e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 5.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  357 PQGTLDTPSSSSPPQGTSDT---PASSSPPQgtseTPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQVtSETPASSS 433
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPS----AAAAASPSPSQSSAA--AQPSAPQSATQPAGTPPTV-SVDPPAAV 435
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  434 PTQVTSETPASSSPTQVTSDTPasNSPPQGTSDTPGFSSPTQVTTAtlvsssppQVTSDTPASSSPPQ 501
Cdd:PRK14971   436 PVNPPSTAPQAVRPAQFKEEKK--IPVSKVSSLGPSTLRPIQEKAE--------QATGNIKEAPTGTQ 493
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
396-540 5.60e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 5.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  396 PQGTSETPGFSSPPQVTT-ATLVSSSPPQVTSETPASSSPTQvTSETPASSSPTQVtsdTPASNSPPQGTSDTPgfsSPT 474
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKpVFTQPAAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPP---AAV 435
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  475 QVTTAtlvSSSPPQVtsdTPASSSPPQVTSDTPASSSPPQVTSetPASSSPPQVTSDTSASISPPQ 540
Cdd:PRK14971   436 PVNPP---STAPQAV---RPAQFKEEKKIPVSKVSSLGPSTLR--PIQEKAEQATGNIKEAPTGTQ 493
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
346-460 6.42e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 45.06  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  346 GTSDTPASSSPPQGTLDTPSSSSPpQGTSdtPASS-SPPQGTSETPASNSPPqgTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:PRK14959   382 SGSAAEGPASGGAATIPTPGTQGP-QGTA--PAAGmTPSSAAPATPAPSAAP--SPRVPWDDAPPAPPRSGIPPRPAPRM 456
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1039789982  425 TSETPASSSPTQVTSEtpASSSPTQVTSDTPASNSP 460
Cdd:PRK14959   457 PEASPVPGAPDSVASA--SDAPPTLGDPSDTAEHTP 490
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
460-647 6.49e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 6.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  460 PPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASI--- 536
Cdd:PRK12323   365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASArgp 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  537 ----SPPQVISDTPASSSPPQVTS-ETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWP--VIT 609
Cdd:PRK12323   445 ggapAPAPAPAAAPAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAgwVAE 524
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039789982  610 EVTRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMS 647
Cdd:PRK12323   525 SIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
PHA03255 PHA03255
BDLF3; Provisional
269-454 6.84e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 43.74  E-value: 6.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  269 TSDTPASSSppqvtsatsassspPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqVTSATSASSSPPQGTS 348
Cdd:PHA03255    25 TSSGSSTAS--------------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTT---AILSTNTTTVTSTGTT 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 DTPASSSPPQGTLDTPSSSsppqgTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTset 428
Cdd:PHA03255    88 VTPVPTTSNASTINVTTKV-----TAQNITATEAGTGTSTGVTSNVTTRSSSTT---SATTRITNATTLAPTLSSKG--- 156
                          170       180
                   ....*....|....*....|....*.
gi 1039789982  429 paSSSPTQVTSETPASSSPTQVTSDT 454
Cdd:PHA03255   157 --TSNATKTTAELPTVPDERQPSLSY 180
PPE COG5651
PPE-repeat protein [Function unknown];
295-526 7.04e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.50  E-value: 7.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  295 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtSATSASSSPPQGTSDTPASSSPPQGTldtpSSSSPPQGTS 374
Cdd:COG5651    162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPG-----FANLGLTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFA 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  375 DTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:COG5651    233 GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGA 312
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:COG5651    313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAA-AAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
354-502 7.06e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.17  E-value: 7.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  354 SSPPQGTlDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPgfssPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PTZ00436   208 AAAPSGK-KSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP----PAKAAAPPAKAAAPPAKAAAPPAKAA 282
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  434 ptqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PTZ00436   283 ----APPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAapPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
383-514 7.60e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.77  E-value: 7.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  383 PQGTSETPASNSPPQGTSetPGFS---SPPQVTTATLVSSSPPQvTSETPASSSPTQVtseTPASSSPTQVTSDTP-ASN 458
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPPaAVP 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  459 SPPQGT---SDTPGFSSPTQVTTATLVSSSPPQVTSdtPASSSPPQVTSDTPASSSPPQ 514
Cdd:PRK14971   437 VNPPSTapqAVRPAQFKEEKKIPVSKVSSLGPSTLR--PIQEKAEQATGNIKEAPTGTQ 493
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
206-449 7.97e-04

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 44.29  E-value: 7.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  206 PASSAPPKATHRMTitsltgRPQvtsdtlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP------- 278
Cdd:pfam03546  246 PAAATPAQAKPALK------TPQ------TKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACASSPAvargaqr 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  279 PQVTSATSASSSPPQGTSDTPASSSPPQV-------TSATSASSSPPQGTSDTPASSSPP---QVTSATSASSSPPQGTS 348
Cdd:pfam03546  314 PEEDSSSSEESESEEETAPAAAVGQAKSVgkglqgkAASAPTKGPSGQGTAPVPPGKTGPavaQVKAEAQEDSESSEEES 393
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  349 D------TPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-------PQGTSETPASNSPPQGTSETPGFSSPPQ----- 410
Cdd:pfam03546  394 DseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAPgkvvaaaAQAKQGSPAKVKPPARTPQNSAISVRGQasvpa 473
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  411 ----VTTATLVSSSPPQVTSETPASSS----------PTQV-----TSETPASSSPTQ 449
Cdd:pfam03546  474 vgkaVATAAQAQKGPVGGPQEEDSESSeeesdseeeaPAQAkpsgkTPQVRAASAPAK 531
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
500-602 8.14e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.38  E-value: 8.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  500 PQVTSDTPASSSPPQVTSET---PASSSPPQVTSDTSASispPQVISDTPASSSPPQVTseTPASSSPTNMTSdtPASSS 576
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPSAAAAASPS---PSQSSAAAQPSAPQSAT--QPAGTPPTVSVD--PPAAV 435
                           90       100
                   ....*....|....*....|....*.
gi 1039789982  577 PTNMTSDTPASSSPTNMTSDTPASSS 602
Cdd:PRK14971   436 PVNPPSTAPQAVRPAQFKEEKKIPVS 461
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
403-526 8.44e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 8.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  403 PGFS--SPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:PRK14951   366 PAAAaeAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1039789982  481 LVSSSPPQVTSDTPAsSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:PRK14951   446 ALAPAPPAQAAPETV-AIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
443-552 8.74e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 44.46  E-value: 8.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  443 ASSSPTQVTSDTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSppqVTSDTPASSSPPQVTSDTPASSSPpqvtsETPAS 522
Cdd:PRK11907    18 LTASNPKLAQAEEIVTTTP---------ATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPSSS-----ETAET 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 1039789982  523 SSPPQVTSDTSASISPPQVISDTpaSSSPP 552
Cdd:PRK11907    81 SDPTSEATDTTTSEARTVTPAAT--ETSKP 108
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
487-620 9.51e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.47  E-value: 9.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  487 PQVTSDTPASssPPQVTSDTPASssppQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK07994   361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG 434
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  567 mtSDTPASSSPTNMTSDTPASSSPTNMTSDTP-ASSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994   435 --ATKAKKSEPAAASRARPVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPV 487
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
485-609 9.59e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 9.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPASSSP 564
Cdd:PRK14951   372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAA--PPAPVAAPAAAAPAAAPAAAPAAVALAP 449
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1039789982  565 TNMTSDTPASSSPTNMTSDTPASSSPtnmtSDTPASSSPPWPVIT 609
Cdd:PRK14951   450 APPAQAAPETVAIPVRVAPEPAVASA----APAPAAAPAAARLTP 490
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
434-560 9.68e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.47  E-value: 9.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  434 PTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS----DTPAS 509
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRaqgaTKAKK 440
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  510 SSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQ-------VTSETPA 560
Cdd:PRK07994   441 SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATnpvevkkEPVATPK 498
PHA03291 PHA03291
envelope glycoprotein I; Provisional
375-512 1.04e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 43.79  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  375 DTPASSSPPQGTSETPASNSPpqgtSETPGFSSPPQvttatlvSSSPPQVTSETPaSSSPTQVTseTPASSSPTQVTSDT 454
Cdd:PHA03291   152 GATNASLFPLGLAAFPAEGTL----AAPPLGEGSAD-------GSCDPALPLSAP-RLGPADVF--VPATPRPTPRTTAS 217
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPAsSSPPQVTSDTPASSSP 512
Cdd:PHA03291   218 PETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA-PPTPGGGEAPPANATP 274
PHA03193 PHA03193
tegument protein VP11/12; Provisional
347-476 1.10e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 43.94  E-value: 1.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSsppQGTLDTPSSSsppqGTSDTPASSSPPQGTSETPASNSPPQGTsETPGFSSPPQVTTATLVSSSppQVTS 426
Cdd:PHA03193   456 IHEALANN---GQAIFPECFS----GDLPPIAQALLSADELPNDTTASTSNEM-KGDAECPAAQDAAAILPASF--QIEN 525
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039789982  427 ETPASSSPTQVTSetpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PHA03193   526 GGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKAT 572
PHA03193 PHA03193
tegument protein VP11/12; Provisional
433-583 1.15e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 43.94  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  433 SPTQVTSETPASSSPTqvtSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSD-----TPASSSPPQ-VTSDT 506
Cdd:PHA03193   441 SPFQRKRAMPEDGGEI---HEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTsnemkGDAECPAAQdAAAIL 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  507 PASSsppQVTSETPASSSPPQVTSDtsaSISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT------NM 580
Cdd:PHA03193   518 PASF---QIENGGAADGSGLAIPAA---MCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeilrrlRM 591

                   ...
gi 1039789982  581 TSD 583
Cdd:PHA03193   592 ASD 594
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
345-444 1.15e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 43.92  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSP---------PQGTLDTPSSSSPPQ---GTSDTPASSSPPQGTSETPASNSPPQGTSE--TPGFSSP-- 408
Cdd:PLN02217   545 QGDAWIPGKGVPyipglfagnPGSTNSTPTGSAASSnttFSSDSPSTVVAPSTSPPAGHLGSPPATPSKivSPSTSPPas 624
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1039789982  409 ----PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PLN02217   625 hlgsPSTTPSSPESSIKVASTETASPESSIKVASTESSVS 664
dnaA PRK14086
chromosomal replication initiator protein DnaA;
363-565 1.17e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.05  E-value: 1.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  363 TPSSSSPPQGTSDTPASSSP--PQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPPQVTSETPASSSPTqvtse 440
Cdd:PRK14086    89 DPSAGEPAPPPPHARRTSEPelPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTA--RPAYPAYQQRPEPGAWPRA----- 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  441 tPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPpqvtsetP 520
Cdd:PRK14086   162 -ADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPP-------P 233
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1039789982  521 ASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK14086   234 GAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPT 278
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
425-486 1.20e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 42.30  E-value: 1.20e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSP 486
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
PRK10856 PRK10856
cytoskeleton protein RodZ;
419-512 1.20e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.48  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  419 SSPPQVTSETPASSSPTQVtseTPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PRK10856   161 SVPLDTSTTTDPATTPAPA---APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPL 237
                           90
                   ....*....|....
gi 1039789982  499 PPQVTSDTPASSSP 512
Cdd:PRK10856   238 PTDQAGVSTPAADP 251
NupH_GANP pfam16768
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ...
371-589 1.22e-03

Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.


Pssm-ID: 435572 [Multi-domain]  Cd Length: 292  Bit Score: 43.36  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  371 QGTSDTPASSSPPQGTSETPAS---NSPP------QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSptqVTSET 441
Cdd:pfam16768    9 QQPSAFSTSSSPSTGTFQAKPPfrfGQPSlfgqnnTLSGKNSGFSQVSSFPTTSGVSHSSSGQTLGFTQTSG---VGLFS 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  442 PASSSPTQVTSDTPASNSPPQgtsdTPGFS--SPTQV----------TTATLVSSS----------PPQVTSDTP---AS 496
Cdd:pfam16768   86 GLEHTPSFVATSGPSSSSVPS----NPGFSfkSPTNLgafpststfgPESGEVASSgfgktefsfkPPENAVFRPifgAE 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  497 SSP----PQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASispPQVISDTPASSSPPQvTSETPASSSPT---NMTS 569
Cdd:pfam16768  162 SEPektqSQITSGFFTFSHPVSSGPGGLAPFSFSQVTSSSATS---SNFTFSKPVSSNNSS-SAFAPALSSQNveeEKRG 237
                          250       260
                   ....*....|....*....|
gi 1039789982  570 DTPASSSPTNMTSDTPASSS 589
Cdd:pfam16768  238 PKSFFGSSNSSFTSFPNSSS 257
PHA03269 PHA03269
envelope glycoprotein C; Provisional
471-600 1.32e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 43.95  E-value: 1.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  471 SSPTQVTTATLVSS-----SPPQVTSDTPASSSP-PQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVI 542
Cdd:PHA03269    21 NLNTNIPIPELHTSaatqkPDPAPAPHQAASRAPdPAVAPTSAASRKPDLAQAPTPAASEKfdPAPAPHQAASRAPDPAV 100
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  543 SDTPASSSPPQvtsetpASSSPTNMTSDTPASS-SPTNMTSDTPassSPTNMTSDTPAS 600
Cdd:PHA03269   101 APQLAAAPKPD------AAEAFTSAAQAHEAPAdAGTSAASKKP---DPAAHTQHSPPP 150
PHA03193 PHA03193
tegument protein VP11/12; Provisional
409-570 1.41e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 43.55  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  409 PQVTTATLVSSSPPqvTSETPASSSPTQVTSETPASSSPtqvtsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQ 488
Cdd:PHA03193   442 PFQRKRAMPEDGGE--IHEALANNGQAIFPECFSGDLPP-------IAQALLSADELPNDTTASTSNEMKGDAECPAAQD 512
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  489 VTSDTPASSsppQVTSDTPASSSPPQVTSetpASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--- 565
Cdd:PHA03193   513 AAAILPASF---QIENGGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeil 586

                   ....*...
gi 1039789982  566 ---NMTSD 570
Cdd:PHA03193   587 rrlRMASD 594
PRK10856 PRK10856
cytoskeleton protein RodZ;
418-499 1.49e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.09  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  418 SSSPPQVTSETPA--SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PRK10856   168 TTTDPATTPAPAApvDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                   ....
gi 1039789982  496 SSSP 499
Cdd:PRK10856   248 AADP 251
PHA03269 PHA03269
envelope glycoprotein C; Provisional
476-627 1.50e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 43.56  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  476 VTTATLVSSSppqvTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQV-ISDTPASSSP--P 552
Cdd:PHA03269     9 IITIACINLI----IANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLaQAPTPAASEKfdP 84
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039789982  553 QVTSETPASSSPTNMTSDTPAsSSPTNMTSDTPASSS---PTNMTSDTPASSSPPWPVITEVTRPESTIpAGRSLANI 627
Cdd:PHA03269    85 APAPHQAASRAPDPAVAPQLA-AAPKPDAAEAFTSAAqahEAPADAGTSAASKKPDPAAHTQHSPPPFA-YTRSMEHI 160
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
430-604 1.65e-03

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 42.52  E-value: 1.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  430 ASSSPTQVTSETPASSSPTQvTSDTPASNSPPQgtsdtPGFSSPTQ----------VTTATL----VSSSPPQVTSDTPA 495
Cdd:PLN02983     1 MASLSVPCAKTAAAAANVGS-RLSRSSFRLQPK-----PNISFPSKgpnpkrsavpKVKAQLnevaVDGSSNSAKSDDPK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  496 SSSPPQVTSDTPA--SSSPPQVTSETPASSSPPQVTS-----DtSASISPPQ--------VISDTPASSSPPqvtseTPA 560
Cdd:PLN02983    75 SEVAPSEPKDEPPsnSSSKPNLPDEESISEFMTQVSSlvklvD-SRDIVELQlkqldcelVIRKKEALPQPP-----PPA 148
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  561 S---SSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPA----SSSPP 604
Cdd:PLN02983   149 PvvmMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAkapkSSHPP 199
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
414-552 1.71e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.32  E-value: 1.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdt 493
Cdd:PRK07994   363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVP---PPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK-- 437
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAS-ISPPQVISDTPASSSPP 552
Cdd:PRK07994   438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYrWKATNPVEVKKEPVATP 497
PRK10856 PRK10856
cytoskeleton protein RodZ;
391-485 1.73e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.09  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  391 ASNSPPQGTSE--TPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT--QVTSETPASSSPTQVTSDTPASNSPPQGTSD 466
Cdd:PRK10856   152 AELSQNSGQSVplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPapAVDPQQNAVVAPSQANVDTAATPAPAAPATP 231
                           90       100
                   ....*....|....*....|
gi 1039789982  467 TPGFSSPT-QVTTATLVSSS 485
Cdd:PRK10856   232 DGAAPLPTdQAGVSTPAADP 251
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
494-650 1.80e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 1.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSPPQVTSdtpASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT---PASSSPPQVTSETPASSSPTNMTSD 570
Cdd:PRK14951   366 PAAAAEAAAPA---EKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  571 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT--EVTRPESTIPAGRSLANITSKAQEDSPLGVISthpQMSF 648
Cdd:PRK14951   443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAapAAARLTPTEEGDVWHATVQQLAAAEAITALAR---ELAL 519

                   ..
gi 1039789982  649 QS 650
Cdd:PRK14951   520 QS 521
PRK11901 PRK11901
hypothetical protein; Reviewed
484-655 1.85e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.75  E-value: 1.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  484 SSPPQVTSDTPASSSPPQVTS----DTPASSSPPQVTSETPASSSPPQVTSD--TSASISPPQVISDTPASSSPPQVTSE 557
Cdd:PRK11901    55 GSALKSPTEHESQQSSNNAGAekniDLSGSSSLSSGNQSSPSAANNTSDGHDasGVKNTAPPQDISAPPISPTPTQAAPP 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  558 TPASSS-----PTNMTS----------------DTPASSSPTnmtsdTPASSSPTNMTSDTPASSSPPWPVITE-----V 611
Cdd:PRK11901   135 QTPNGQqrielPGNISDalsqqqgqvnaasqnaQGNTSTLPT-----APATVAPSKGAKVPATAETHPTPPQKPatkkpA 209
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1039789982  612 TRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQ---SSTSQQ 655
Cdd:PRK11901   210 VNHHKTATVAVPPATSGKPKSGAASARALSSAPASHYTlqlSSASRS 256
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
322-553 1.87e-03

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 42.86  E-value: 1.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  322 SDTPASSSP--PQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS---SSPPQGTSDTPASSSPPQGTSETPASNSPP 396
Cdd:pfam12287   32 SAQPPSQSPdlSQMVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSnacASSGSEYQFHTSEPPQPEAIDPIQSSMSLP 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  397 qgtsetpgfSSPPQvTTATLVSSSPPQVTSETPASSSPTQVtsetpaSSSPTQVTSDTPASNSP-PQGTSDTPGFSSPTQ 475
Cdd:pfam12287  112 ---------SELAP-PSPPLSPASQPQVFQSKPASSSGINV------NAAPFQSMQTVFNVNAPvPPRNEQELKESSQYS 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  476 VTTATLVSSSPPQvtsdtpassSPPQvtSDTPASssppQVTSETPASSSPPQVTSDTSASIS--PPQvisdTPASSSPPQ 553
Cdd:pfam12287  176 SGYNQSFSSQSTQ---------TVPQ--CQLPSE----QLEQTVVGAYHPDGTIQVSNGHLAfyPAQ----TNGFPRPPQ 236
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
325-532 1.98e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 43.41  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  325 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPG 404
Cdd:PRK14948   364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEE 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  405 -----FSSPPQVTT-------ATLVSSSPPQVT------------SETP--------ASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK14948   444 lwqqiLAKLELPSTrmllsqqAELVSLDSNRAViavspnwlgmvqSRKPlleqafakVLGRSIKLNLESQSGSASNTAKT 523
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  453 DTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvtsdtPASSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PRK14948   524 PPPPQKSPPPPAP-TPPLPQPTATAPPPTPPPPPPTATQASSNAPAQI------PADSSPPPPIPEEPTPSPTKDSSPEE 596
PHA03291 PHA03291
envelope glycoprotein I; Provisional
489-623 1.99e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 43.02  E-value: 1.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSS-PPQVTSdTSASISPPQVIsdTPASSSPPQVTSETPASSSPTNM 567
Cdd:PHA03291   150 VEGATNASLFPLGLAAFPAEGTLAAPPLGEGSADGScDPALPL-SAPRLGPADVF--VPATPRPTPRTTASPETTPTPST 226
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039789982  568 TSDTPASSSPTNMTSDTPASSSPTNMTSDTPAsssPPWPVITEvTRPESTIPAGRS 623
Cdd:PHA03291   227 TTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA---PPTPGGGE-APPANATPAPEA 278
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
529-590 2.09e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 41.53  E-value: 2.09e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  529 TSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 590
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
CytochromB561_N pfam09786
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ...
417-577 2.10e-03

Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.


Pssm-ID: 462899  Cd Length: 579  Bit Score: 43.28  E-value: 2.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  417 VSSSPPQVTSETPASSSPTQVTS---ETPASSSPTQVTSDTPASNSPPQGTSDTPGfssptqvttatlvssspPQVTSDT 493
Cdd:pfam09786   89 VQSKSPSKGTKTPSRLTNQQLGLlglKPNDSSFVTTHRKKPPKSKSSPQSPSPVLV-----------------PLHQSVS 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  494 PASSSPPQVTSDTPASSSPPQVTSETPASSSppqvtsdtsasISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:pfam09786  152 PSSSESRKGGDKSPAGSGKKLRSFSTSSKSP-----------ASPSVYLRGSPVPLNSSPLPSDRNYENSVQSSPEIDSA 220

                   ....
gi 1039789982  574 SSSP 577
Cdd:pfam09786  221 VSTP 224
PHA03377 PHA03377
EBNA-3C; Provisional
270-769 2.37e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.12  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVtsatsassSPPQGTSDTPASSSPPQ----VTSATSASSSPPQ 345
Cdd:PHA03377   447 QSTPERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQS--------PPTVAIKPAPPPSRRRRgacvVYDDDIIEVIDVE 518
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  346 GTSDTPASSSP------PQGTLD-----TPSSSSPPQGTSDT-PASSSPPqgTSETPASNSPPQGTSET-PGFSSPPQVT 412
Cdd:PHA03377   519 TTEEEESVTQPakphrkVQDGFQrsgrrQKRATPPKVSPSDRgPPKASPP--VMAPPSTGPRVMATPSTgPRDMAPPSTG 596
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  413 TATL--VSSSPPQVTSE--TPASSSP-----------------TQVTSETPASSSPTQVTSDTPASNSPP---QGTSDTP 468
Cdd:PHA03377   597 PRQQakCKDGPPASGPHekQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSFWEMRAGRDGSGIQQEPssrRQPATQS 676
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  469 GFSSPTQVTTATLVSSSP---PQVTSDTPASS-SPPQVTS----------DTPASSSPPQVTSETPASSSPPQVTSDTSA 534
Cdd:PHA03377   677 TPPRPSWLPSVFVLPSVDagrAQPSEESHLSSmSPTQPISheeqpryedpDDPLDLSLHPDQAPPPSHQAPYSGHEEPQA 756
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  535 SISPPQVISDTPASSSPPQVTSETPA-----SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPvit 609
Cdd:PHA03377   757 QQAPYPGYWEPRPPQAPYLGYQEPQAqgvqvSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHL--- 833
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  610 evtrPESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETAGervPTIPDFQAHsefqkacAILQRLRD 689
Cdd:PHA03377   834 ----PPQWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYSQTLVSSSA---PSWSSPQPR-------APIRPIPT 899
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  690 FLPTSPTSAQKNNSWSSQTPAVSCPFQPLGrlttTEKSSHQMAQQDME-QHPM-----DGAHNAFGISAGGSEIQSDIQL 763
Cdd:PHA03377   900 RFPPPPMPLQDSMAVGCDSSGTACPSMPFA----SDYSQGAFTPLDINaQTPKrprveESSHGPARCSQATTEAQEILSD 975

                   ....*.
gi 1039789982  764 RSEFEV 769
Cdd:PHA03377   976 NSEISV 981
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
438-499 2.38e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 41.53  E-value: 2.38e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  438 TSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
482-596 2.46e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 41.53  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  482 VSSSPPQ--VTSDTPASSSPPQVTSDTPASSSPpqvtseTPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETP 559
Cdd:cd21441     35 VKAEPPEdsLSTDHFQTQTEPVDLSINKARTSP------TAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSS 108
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  560 ASSSPTNMTSDTPASSSPT-------NMTSDTPaSSSPTNMTSD 596
Cdd:cd21441    109 ASSVPTVLTPGPLVASASGvggqqflHIIHPVP-PSSPMNLQSN 151
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
426-535 2.57e-03

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 39.81  E-value: 2.57e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD 505
Cdd:smart01104   12 SKTPAWGSRTPGTAAGGAPTARGGSGSRTPAWGGAGSRTP-AWGGAGPTGSRTPAWGGASAWGNKSSEGSASSWAAGPGG 90
                            90       100       110
                    ....*....|....*....|....*....|..
gi 1039789982   506 TPASSSP--PQVTSeTPASSSPPQVTSDTSAS 535
Cdd:smart01104   91 AYGAPTPgyGGTPS-AYGPATPGGGAMAGSAS 121
PAP1 pfam08601
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene ...
272-470 2.81e-03

Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene transcription in response to H2O2. This region is cysteine rich. Alkylation of cysteine residues following treatment with a cysteine alkylating agent can mask the accessibility of the nuclear exporter Crm1, triggering nuclear accumulation and Pap1 dependent transcriptional expression.


Pssm-ID: 369990  Cd Length: 363  Bit Score: 42.54  E-value: 2.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  272 TPASSSPPqvtsatsasssPPQGTSDTPASSSPPQVTSATSASSsppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTP 351
Cdd:pfam08601   53 PNASTSTP-----------DSQPPPSASSSTTPNQGSNGLNAFT----GEDNNNYSNSAANPGATRGSTASSARSQSSPY 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  352 ASSSPPQGTLDTPSSSSPPQGTsdtPASSSppqGTSETPASNSPPQGTSetpgfssppqvTTATLVSSSPPQVTSETPAS 431
Cdd:pfam08601  118 SFGSGTSTSSDSPSSSSSSHQG---QLSSC---GTSPEPSTQSPGGQKS-----------VETMIGEEQCAHGTIDGEKS 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1039789982  432 S-SPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF 470
Cdd:pfam08601  181 FcAKLGMACGNINNPIPAAMSKSNSLSNTPGHASNDSNGL 220
PHA03249 PHA03249
DNA packaging tegument protein UL25; Provisional
429-561 2.88e-03

DNA packaging tegument protein UL25; Provisional


Pssm-ID: 223023  Cd Length: 653  Bit Score: 42.69  E-value: 2.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTsDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS--PPQVTSDT 506
Cdd:PHA03249    33 PRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSET-DSGSEAEAERGRRAGMGGRNKATKPSRRNKTTQcrPTSLALAT 111
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  507 PASSSPPQVTSETPASSSPPQVTSDTS-------ASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PHA03249   112 AATMPATPSSGKSPKVSSPPSIPSLSEedegaerNSGGDDSSHTDNESTQSQPEADDEPDLA 173
Mating_C pfam12737
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ...
167-534 2.90e-03

C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.


Pssm-ID: 372279 [Multi-domain]  Cd Length: 412  Bit Score: 42.28  E-value: 2.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  167 KRRGAETERHMIPGNGPPLAMChqpAPPELFETLCFPIDPASSAP-----PKATHRMTITSLT---------GRPQvtSD 232
Cdd:pfam12737   50 KKRKRQAERSMRDALAYPSPER---SPASSPERNLSPQVDVCQLTirqnnLNLKRRSSSSSDVdssnaerchKRPR--LD 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  233 TLASSSPPQGTSDTPASSSPPQVTSATSAsssppqgTSDTPASSSPPQVTSATSASSSPPQGTSD-TPASSSPPqvtsaT 311
Cdd:pfam12737  125 SPSSSSSPEKCLPSPAPSEQEALSEISAA-------CGPTPSTLTPLNVAPSLTPSKKRKRCLSDgFDGPKRPP-----N 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  312 SASSSPPQGTSDT-PASSSPPQVtsatsasssppqgtSDTPASSSPPQGTLdtpSSSSPPQGTSDTPASSSPpqgtSETP 390
Cdd:pfam12737  193 KRVQPRPQTVSDPfPTSTSIPEW--------------DEWLQNHMSPSLTL---HGDIPPPVSVEAPDSNTP----LDIE 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  391 ASNSPPQgtsetpgfsspPQVTTATLVSSSPPQVTSETPASSSPtqVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGF 470
Cdd:pfam12737  252 IFNFPYH-----------PDLTPSPAPSLSDSVIEVATPTTESD--YMCNGTLRQTFSWFEFDFPELIQPTNTPASNNEL 318
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  471 SSPTQVTTATLVSSSPPQV------TSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSA 534
Cdd:pfam12737  319 SLPFDPSTDIVVSRTILPLldwrsqSFLSQTFASPPHSILRSNSSSPDVSAFALDLTPAFTPITYSLSES 388
NupH_GANP pfam16768
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ...
354-524 3.02e-03

Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.


Pssm-ID: 435572 [Multi-domain]  Cd Length: 292  Bit Score: 41.82  E-value: 3.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  354 SSPPQGTLDTPSSSSPPQGTSDTPA---SSSPPQGTSETPASNSPPQGTSETPGFSSPPqvTTATLVSSSPPQVTSETPA 430
Cdd:pfam16768   56 SSFPTTSGVSHSSSGQTLGFTQTSGvglFSGLEHTPSFVATSGPSSSSVPSNPGFSFKS--PTNLGAFPSTSTFGPESGE 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  431 SSSPTQVTSETPASSSPTQVTSDTPASNSPP-----QGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSppqVTSD 505
Cdd:pfam16768  134 VASSGFGKTEFSFKPPENAVFRPIFGAESEPektqsQITSGFFTFSHPVSSGPGGLAPFSFSQVTSSSATSSN---FTFS 210
                          170
                   ....*....|....*....
gi 1039789982  506 TPASSSPPQvTSETPASSS 524
Cdd:pfam16768  211 KPVSSNNSS-SAFAPALSS 228
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
155-601 3.23e-03

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 42.81  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  155 RNEFGPGPLLPMKRRGAETERHMIPGNGPPLAMCHQPAPPELFetlcFPIDPASSAPPKATHRMTITSLTGRpQVTSDTL 234
Cdd:COG5099      1 PNSDTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTPNSF----SPIPSKASSSATFTLNLPINNSVNH-KITSSSS 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  235 ASSSPPqgtsdTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtsatsas 314
Cdd:COG5099     76 SRRKPS-----GSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSST----------- 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  315 sspPQGTSDTPASSSPPQVT--SATSASSSPPQGTSDTPASSSPPQGtldtPSSSSPPQGTSDTpaSSSPPQGTSETPAS 392
Cdd:COG5099    140 ---ASSMFNSNKLPLPNPNHsnSATTNQSGSSFINTPASSSSQPLTN----LVVSSIKRFPYLT--SLSPFFNYLIDPSS 210
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  393 NSPpqgTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSpTQVTSETPASSSPTQVTSDTPASNSP---PQGTSDTPG 469
Cdd:COG5099    211 DSA---TASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQS-VENNIILNSSSSINELTSIYGSVPSIrnlRGLNSALVS 286
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDtpaSSSPPQVTSETPASSSPPQVTSdtsaSISPPQVISdtPASS 549
Cdd:COG5099    287 FLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPK---SSPNNLLTEILTTGVNPPQSLP----SLLNPVFLS--TSTG 357
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039789982  550 SPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASS 601
Cdd:COG5099    358 FSLTNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSESTRNILGNISPN 409
PRK10856 PRK10856
cytoskeleton protein RodZ;
346-437 3.42e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.94  E-value: 3.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  346 GTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK10856   167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS----QANVDTAATPAPAAPATPDGAAPLPTDQA 242
                           90
                   ....*....|..
gi 1039789982  426 SETPASSSPTQV 437
Cdd:PRK10856   243 GVSTPAADPNAL 254
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
347-501 3.58e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.55  E-value: 3.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  347 TSDTPASSSPPQG----TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPP 422
Cdd:PRK07994   367 EPEVPPQSAAPAAsaqaTAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKA--KKSEPA 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN---SPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK07994   445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkaTNPVEVKKEPV-ATPKALKKALEHEKTPELAAKLAAEAIER 523

                   ..
gi 1039789982  500 PQ 501
Cdd:PRK07994   524 DP 525
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
345-510 3.93e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.16  E-value: 3.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPAssSPPQGTLDTPSSssppQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:PRK07994   362 AAPLPEPE--VPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  425 TSetPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFS-SPTQVTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:PRK07994   436 TK--AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALEHEKTPELA 513

                   ....*..
gi 1039789982  504 SDTPASS 510
Cdd:PRK07994   514 AKLAAEA 520
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
480-666 4.06e-03

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 41.84  E-value: 4.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  480 TLVSSSPPQVTSDTPAS-----------SSPPQVTSDT---PASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisdt 545
Cdd:pfam16014    1 ALGSSPRPSILRKKPATegakpkpdihvAVAPPVTVAVealPGQNSEQQTASASPPSQHPAQAIPTILAPAAPP------ 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  546 pasSSPPQVTSETPASSSptnMTSDTPASSSPTNMTSDTPASSSPTNMtsdtPASSSPPWPVITEVTRPESTipagrsla 625
Cdd:pfam16014   75 ---SQPSVVLSTLPAAMA---VTPPIPASMANVVAPPTQPAASSTAAC----AVSSVLPEIKIKQEAEPMDT-------- 136
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1039789982  626 nitskAQEDSPLGVISTHPQMSFQSSTsqqaLDETAGERVP 666
Cdd:pfam16014  137 -----SQSVPPLTPTSISPALTSLANN----LSVPAGDLLP 168
ARG80 COG5068
Regulator of arginine metabolism and related MADS box-containing transcription factors ...
443-705 4.08e-03

Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];


Pssm-ID: 227400 [Multi-domain]  Cd Length: 412  Bit Score: 41.93  E-value: 4.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  443 ASSSPTQVTSdTPASNSPPQGTSDTPGF-SSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQvtsetpa 521
Cdd:COG5068    116 SVLTGTEVLL-LVISENGLVHTFTTPKLeSVVKSLEGKSLIQSPCSNAPSDSSEEPSSSASFSVDPNDNNPMG------- 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  522 ssSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTN------MTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:COG5068    188 --SFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDHPTVPHSNTNNGrppakfMIPELHSSHSTLDLPSDFISDSGFPNQSS 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  596 dtpASSSPPWPVITEVTRPES-----TIPAGRSLANITSKAQE-----DSPLGVISTHP-------------QMSFQSST 652
Cdd:COG5068    266 ---TSIFPLDSAIIQITPPHLpnnppQENRHELYSNDSSMVSEtpppkNLPNGSPNQSPlnnlsrgnpaspnSIIRENNQ 342
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  653 SQQalDETAGERVPTIPDFQA---------HSEFQKACAILQRLRDFLPTSPT-SAQKNNSWS 705
Cdd:COG5068    343 VED--ESFNGRQGSAIWNALIsttqpnsglHTEASTAPSSTIPADPLKNAAQTnSGTRNNNFS 403
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
460-525 4.13e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 40.76  E-value: 4.13e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  460 PPQGTSDTPGFSSPTQV---------TTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSP 525
Cdd:cd21441     39 PPEDSLSTDHFQTQTEPvdlsinkarTSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
KLF10_11_N cd21974
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ...
363-528 4.55e-03

N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved a-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.


Pssm-ID: 409243 [Multi-domain]  Cd Length: 229  Bit Score: 41.07  E-value: 4.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  363 TPSSSSPpqgtSDTPASSSPPQGTSETPASNSPPQgtsetpgfsSPPQV-TTATLVSSSPPQVTSETPASSSPTQVTSET 441
Cdd:cd21974     32 TPSSDSS----DEDDAPESPKDFHSLSSLCMTPPY---------SPPFFeASHSPSVASLHPPSAASSQPPPEPESSEPP 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  442 PASSSPTQVTSdtpasnsppqgtsdtpgfssptqVT--TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSET 519
Cdd:cd21974     99 AASPQRAQATS-----------------------VIrhTADPVPVSPPPVLCQMLPVSSSSGVIVAFLKAPQQPSPQPQK 155

                   ....*....
gi 1039789982  520 PASSSPPQV 528
Cdd:cd21974    156 PALPQPQVV 164
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
162-451 4.57e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.22  E-value: 4.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  162 PLLPMKRRGAETERHMIPGNGPPLAMCHQPAPPELFETLCfPIDPASSAPPKATHRMTitsltgRPQVTSDTLASSSPPQ 241
Cdd:PLN03209   312 PLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEA-PSPPIEEEPPQPKAVVP------RPLSPYTAYEDLKPPT 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  242 GTSDTPASSSPPQVTSATSASSSPPQGTSdtPASSSPPQVTSATSASSSPPQGTSDTPASS----SPPQvtsatsasssp 317
Cdd:PLN03209   385 SPIPTPPSSSPASSKSVDAVAKPAEPDVV--PSPGSASNVPEVEPAQVEAKKTRPLSPYARyedlKPPT----------- 451
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  318 pqGTSDTPASSSPPQVTSATSASssppqGTSDTPASSSPPQGTLDTPSSSSPpqgTSDTPASSSPPQGTSETPASNSPPQ 397
Cdd:PLN03209   452 --SPSPTAPTGVSPSVSSTSSVP-----AVPDTAPATAATDAAAPPPANMRP---LSPYAVYDDLKPPTSPSPAAPVGKV 521
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  398 GTSETPgfSSPPQVTTATLVSSSPPQVTSE---TPASSSPTQVTSETPASSSPTQVT 451
Cdd:PLN03209   522 APSSTN--EVVKVGNSAPPTALADEQHHAQpkpRPLSPYTMYEDLKPPTSPTPSPVL 576
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
351-459 4.59e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 41.86  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  351 PA-SSSPPQGTLDTPS-SSSPPQGTSDTPA-SSSPPQGTSETPA-SNSPPQGTSETP--GFSSPPQVTTATLVSSSPPQV 424
Cdd:PTZ00436   229 PAkAAAPPAKAAAAPAkAAAAPAKAAAPPAkAAAPPAKAAAPPAkAAAPPAKAAAPPakAAAPPAKAAAAPAKAAAAPAK 308
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1039789982  425 TSETPA-SSSPTQVTSETPASSSPTQVTSDTPASNS 459
Cdd:PTZ00436   309 AAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKA 344
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
321-419 4.89e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 42.00  E-value: 4.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  321 TSDTPASSSPPqvtsatsassSPPQGTSDTPA-----SSSPPQGTLDTPSSSSPPQGTSDTpassSPPQGT----SETPA 391
Cdd:PLN02217   569 TNSTPTGSAAS----------SNTTFSSDSPStvvapSTSPPAGHLGSPPATPSKIVSPST----SPPASHlgspSTTPS 634
                           90       100
                   ....*....|....*....|....*...
gi 1039789982  392 SNSPPQGTSETPGFSSPPQVTTATLVSS 419
Cdd:PLN02217   635 SPESSIKVASTETASPESSIKVASTESS 662
Mating_C pfam12737
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ...
441-717 5.07e-03

C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.


Pssm-ID: 372279 [Multi-domain]  Cd Length: 412  Bit Score: 41.51  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  441 TPASSSPTQVTSDTPASNSPpqGTSDTPGFSSptqvttatlvSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETP 520
Cdd:pfam12737   72 SPASSPERNLSPQVDVCQLT--IRQNNLNLKR----------RSSSSSDVDSSNAERCHKRPRLDSPSSSSSPEKCLPSP 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  521 ASSSPPQVtSDTSASISPpqvisdTPASSSPPQVTSETPASSSPTNMTSD-TPASSSPTN--------MTSDT-PASSSP 590
Cdd:pfam12737  140 APSEQEAL-SEISAACGP------TPSTLTPLNVAPSLTPSKKRKRCLSDgFDGPKRPPNkrvqprpqTVSDPfPTSTSI 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  591 TN--------MTSDTPASSSPPWPVitEVTRPESTIPAGRSLANITSKAQED-SPLGVISTHPQMSFQSSTSQQALDETA 661
Cdd:pfam12737  213 PEwdewlqnhMSPSLTLHGDIPPPV--SVEAPDSNTPLDIEIFNFPYHPDLTpSPAPSLSDSVIEVATPTTESDYMCNGT 290
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  662 GERVPTIPDFQAHSEFQKACAILQRLRDFLPTSP-TSAQKNNSWSSQTPAVSCPFQP 717
Cdd:pfam12737  291 LRQTFSWFEFDFPELIQPTNTPASNNELSLPFDPsTDIVVSRTILPLLDWRSQSFLS 347
TYA pfam01021
Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a ...
423-543 5.30e-03

Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a fusion protein of TYA and TYB. The TYA protein is analogous to the gag protein of retroviruses. TYA a is cleaved to form 46kd protein which can form mature virion like particles. This entry corresponds to the capsid protein from Ty1 and Ty2 transposons.


Pssm-ID: 425992  Cd Length: 384  Bit Score: 41.48  E-value: 5.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  423 QVTSETPASSSPTQVtSETPASSSPTqvtsdtPASNSPPQ-GTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSP-- 499
Cdd:pfam01021   33 KANSQQTTTPGSSAV-PENHHHASPQ------PASVPPPQnGPYSQQCMMTPNQANPSGWPFYGHPSMMPYTPYQMSPmy 105
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  500 ---------PQVTSD--TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVIS 543
Cdd:pfam01021  106 fppgpqsqfPQYPSSvgTPLSTPSPESGNTFTDSSSAKSDMTSTNKYVRPPPILT 160
PRK10672 PRK10672
endolytic peptidoglycan transglycosylase RlpA;
359-448 5.31e-03

endolytic peptidoglycan transglycosylase RlpA;


Pssm-ID: 236733 [Multi-domain]  Cd Length: 361  Bit Score: 41.59  E-value: 5.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  359 GTLDTPSSSSPPQGTSDT-PASSSPPQGTSETPAsnspPQGTSetpGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQV 437
Cdd:PRK10672   202 GGMGTPSVQPAPAPQGDVlPVSNSTLKSEDPTGA----PVTSS---GFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAA 274
                           90
                   ....*....|.
gi 1039789982  438 TSETPASSSPT 448
Cdd:PRK10672   275 AAPQAAATSSS 285
PHA03193 PHA03193
tegument protein VP11/12; Provisional
492-604 5.56e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 41.63  E-value: 5.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  492 DTPASSSPPQV-----TSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISD-----TPASSSPPQ-VTSETPA 560
Cdd:PHA03193   440 DSPFQRKRAMPedggeIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTsnemkGDAECPAAQdAAAILPA 519
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  561 SSSPTNmtsDTPASSSPTNMTSDtpaSSSPTNMTSDTPASSSPP 604
Cdd:PHA03193   520 SFQIEN---GGAADGSGLAIPAA---MCDATAVESPSTVAETPP 557
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
149-447 5.67e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 5.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  149 QGASIWRNEFGPGP---------LLPM-KRRGAETERHMIPGNGPPLAMCHQPAPPELFetlcfPIDPASSAPPKATHRM 218
Cdd:PRK07003   329 QIATVGRGELGLAPdeyagftmtLLRMlAFEPAVTGGGAPGGGVPARVAGAVPAPGARA-----AAAVGASAVPAVTAVT 403
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  219 TITSLTGRPQVtsdtlASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT-SDTPASSSPPQVTSATSASSSPPQGTSD 297
Cdd:PRK07003   404 GAAGAALAPKA-----AAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVpAKANARASADSRCDERDAQPPADSGSAS 478
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  298 TPASSSPPQVTSATSasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTP 377
Cdd:PRK07003   479 APASDAPPDAAFEPA--------PRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAA 550
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  378 AS-----------SSPPQGTSETPASNSPPQGTSETPgfsSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSS 446
Cdd:PRK07003   551 AAldvlrnagmrvSSDRGARAAAAAKPAAAPAAAPKP---AAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPP 627

                   .
gi 1039789982  447 P 447
Cdd:PRK07003   628 P 628
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
455-552 5.76e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 41.03  E-value: 5.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPpqvTSDTPASSSPPQVTSDTPASSSPPQvtsetpasSSPPQVTSDTSA 534
Cdd:PHA03201     4 ARSRSPSPPRRPSPPRPTPPRSPDASPEETPP---SPPGPGAEPPPGRAAGPAAPRRRPR--------GCPAGVTFSSSA 72
                           90
                   ....*....|....*...
gi 1039789982  535 SISPPQVISDTPASSSPP 552
Cdd:PHA03201    73 PPRPPLGLDDAPAATPPP 90
PRK10905 PRK10905
cell division protein DamX; Validated
247-497 5.79e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 41.08  E-value: 5.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  247 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQ---VTSATSASSSPPQGtsD 323
Cdd:PRK10905    23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQgqtPVATDGQQRVEVQG--D 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  324 TPASSSPPQvtsatsassspPQGTSDTPASSSppqgTLDT-PSSSSPPQG---TSDTPASSSPPQGTSETPASNsppQGT 399
Cdd:PRK10905   101 LNNALTQPQ-----------NQQQLNNVAVNS----TLPTePATVAPVRNgnaSRQTAKTQTAERPATTRPARK---QAV 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  400 SEtpgfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTA 479
Cdd:PRK10905   163 IE----PKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVG 238
                          250
                   ....*....|....*...
gi 1039789982  480 TLVSSSPPQVTSDTPASS 497
Cdd:PRK10905   239 SLKSAPSSHYTLQLSSSS 256
PRK11901 PRK11901
hypothetical protein; Reviewed
456-635 5.99e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 41.21  E-value: 5.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  456 ASNSPPQGTSDTpgfsSPTQVTTATLV----SSSPPQVTSDTPASSSPPQVTSD--TPASSSPPQVTSETPASSSP---- 525
Cdd:PRK11901    57 ALKSPTEHESQQ----SSNNAGAEKNIdlsgSSSLSSGNQSSPSAANNTSDGHDasGVKNTAPPQDISAPPISPTPtqaa 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  526 PQVTSDTSASISPPQVISD-----------------TPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASS 588
Cdd:PRK11901   133 PPQTPNGQQRIELPGNISDalsqqqgqvnaasqnaqGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH 212
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1039789982  589 SPTNMTSDTPASSSPPWPVITEVTRPESTiPAGRSLANITSKAQEDS 635
Cdd:PRK11901   213 HKTATVAVPPATSGKPKSGAASARALSSA-PASHYTLQLSSASRSDT 258
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
225-395 6.18e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 41.31  E-value: 6.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  225 GRP----QVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsaTSASSSPPQGTSDTPA 300
Cdd:pfam13254  195 GRPnsfkEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPA-----PTSASEPPPKTKELPK 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  301 SSSPPQVTSATSASSSPPQgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQgtsDTPASS 380
Cdd:pfam13254  270 DSEEPAAPSKSAEASTEKK-EPDTESSPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPK---DFRANL 345
                          170
                   ....*....|....*
gi 1039789982  381 SPPQGTSETPASNSP 395
Cdd:pfam13254  346 RSREVPKDKSKKDEP 360
PHA03132 PHA03132
thymidine kinase; Provisional
357-606 6.30e-03

thymidine kinase; Provisional


Pssm-ID: 222997 [Multi-domain]  Cd Length: 580  Bit Score: 41.67  E-value: 6.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  357 PQGTLDTPSSSSPPQGTSDTPASSSPpqgTSETPASNSPPqgtSETPGFSSPPQVTTATLvsSSPPQVTSETPASSSPTQ 436
Cdd:PHA03132    21 PEGSRDENFDAERDDFLTPLGSTSEA---TSEDDDDLYPP---RETGSGGGVATSTIYTV--PRPPRGPEQTLDKPDSLP 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  437 VTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPtqvttatlvsssPPQVTSDTPASSSPPqvtSDTPASSSPPqvt 516
Cdd:PHA03132    93 ASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLYQ------------VNFPVILAPIGESNS---SSEELSEEEE--- 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  517 SETPASSSPPQVTSDTSasisppqvISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSdtpASSSPTN-MTS 595
Cdd:PHA03132   155 HSRPPPSESLKVKNGGK--------VYPKGFSKHKTHKRSEFSGLTKKAARKRKGSFVFKPSQLKE---LSGSLKNlLHL 223
                          250
                   ....*....|.
gi 1039789982  596 DTPASSSPPWP 606
Cdd:PHA03132   224 DDSAETDPATR 234
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
347-425 6.30e-03

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 41.27  E-value: 6.30e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039789982  347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSspPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK13335    89 ASKIEKISQPKQEEQKSLNISATPAPKQEQSQTT--TESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
345-549 6.70e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 41.37  E-value: 6.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  345 QGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSP--- 421
Cdd:COG3266    171 QGTLQALGAVAALLGLRKAEEALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAEVLTARLVLLlli 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  422 PQVTSETPASSSptQVTSETPASSSPTQVTSDTPASNSPPQGTSdtpgfssPTQVTTATLVSSSPPQVTSDTPASSSPPQ 501
Cdd:COG3266    251 IGSALKAPSQAS--SASAPATTSLGEQQEVSLPPAVAAQPAAAA-------AAQPSAVALPAAPAAAAAAAAPAEAAAPQ 321
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  502 VTSDTPASSSPPQVTS---ETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:COG3266    322 PTAAKPVVTETAAPAApapEAAAAAAAPAAPAVAKKLAADEQWLASQPASH 372
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
397-602 6.70e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.91  E-value: 6.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  397 QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS---SPTQVTSET-PASSSPTQVTSDTPA-SNSPPQGTSDTPGFS 471
Cdd:NF033849   247 VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTrgwSHTQSTSESeSTGQSSSVGTSESQShGTTEGTSTTDSSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  472 SPTQVTTATLVSSSpPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSdTSASISppqviSDTPASSSP 551
Cdd:NF033849   327 QSSSYNVSSGTGVS-SSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRS-SSSGVS-----GGFSGGIAG 399
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039789982  552 PQVTSETPASSSPTN---MTSDTPASSSPTNMTS-DTPASSSPTNMTSDTPASSS 602
Cdd:NF033849   400 GGVTSEGLGASQGGSegwGSGDSVQSVSQSYGSSsSTGTSSGHSDSSSHSTSSGQ 454
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
147-456 7.29e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 7.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  147 PPQGASiwrnEFGPGPLLPMKRRGAETER-----------HMIPGNGPPLAMCHQPAP---------PELFETLCFPIDP 206
Cdd:PTZ00449   510 PPEGPE----ASGLPPKAPGDKEGEEGEHedskesdepkeGGKPGETKEGEVGKKPGPakehkpskiPTLSKKPEFPKDP 585
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  207 ASSAPPKATHRmtitslTGRPQVTSDTLASSSP--PQgTSDTPASSSPPQvTSATSASSSPPQgtsdTPASSSPPQVTSA 284
Cdd:PTZ00449   586 KHPKDPEEPKK------PKRPRSAQRPTRPKSPklPE-LLDIPKSPKRPE-SPKSPKRPPPPQ----RPSSPERPEGPKI 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  285 TSasssppqgTSDTPASSSPP-------QVTSATSASSSPPQGTSDTPASSSPPQvtSATSASSSPPQGTSDTPASSSPP 357
Cdd:PTZ00449   654 IK--------SPKPPKSPKPPfdpkfkeKFYDDYLDAAAKSKETKTTVVLDESFE--SILKETLPETPGTPFTTPRPLPP 723
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  358 QGTLDtPSSSSPPQGTSDTPaSSSPPQgtsetpaSNSPPQGTS----ETPGFSSPPQVTTATLvssSPPQVTSETPASSS 433
Cdd:PTZ00449   724 KLPRD-EEFPFEPIGDPDAE-QPDDIE-------FFTPPEEERtffhETPADTPLPDILAEEF---KEEDIHAETGEPDE 791
                          330       340
                   ....*....|....*....|...
gi 1039789982  434 PTQvTSETPASSSPTQvTSDTPA 456
Cdd:PTZ00449   792 AMK-RPDSPSEHEDKP-PGDHPS 812
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
370-521 7.50e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.39  E-value: 7.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  370 PQGTSDTPASssPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSetpASSSPTQ 449
Cdd:PRK07994   361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLA---ARQQLQR 431
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  450 VTSDTPASNSPPQGTSDTPgfssPTQVTTATLVSSSP-PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07994   432 AQGATKAKKSEPAAASRAR----PVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKAL 500
FimV COG3170
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
360-516 7.86e-03

Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];


Pssm-ID: 442403 [Multi-domain]  Cd Length: 508  Bit Score: 41.32  E-value: 7.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  360 TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTS 439
Cdd:COG3170    202 VLRVPAAEEVAALSPAEARQEVQAQSADWAAYRARLAAAVEPAPAAAAPAAPPAAAAAAGPVPAAAEDTLSPEVTAAAAA 281
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  440 ETPASS--SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATL--VSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:COG3170    282 EEADALpeAAAELAERLAALEAQLAELQRLLALKNPAPAAAVSApaAAAAAATVEAAAPAAAAQPAAAAPAPALDNPLLL 361

                   .
gi 1039789982  516 T 516
Cdd:COG3170    362 A 362
COG5025 COG5025
Transcription factor of the Forkhead/HNF3 family [Transcription];
365-666 7.94e-03

Transcription factor of the Forkhead/HNF3 family [Transcription];


Pssm-ID: 227358 [Multi-domain]  Cd Length: 610  Bit Score: 41.33  E-value: 7.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  365 SSSSPPQ--GTSDTPASSSPPQGTSETPASNSPPQGTSET------PGFSSPPQVTTATLVSSSPPQVTSET---PASSS 433
Cdd:COG5025    292 SSVNPSRlaNNKDEGRKGSKSSPVPKDAAPPSTLSDLSADvnrtskPAFSYANSITQAILSSPSGKMTLSEIyswISSNL 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  434 PTQVTSETPASSSPTQVTSDTPASNSPP--QGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:COG5025    372 PYYRHKPTAWQNSIRHNLSLNKSFEKVPrsASQPGKGCFWKIDYSYIYEKESKRNPRSPKKSPSAHSVHQKLSLHVNDLY 451
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  512 PPQVTSEtpASSSPPQVTSdtsasisPPQVISDTPASSS----PPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTpaS 587
Cdd:COG5025    452 QSPATSD--IASSSSQVNS-------QPEFISTQIHSSKgvsnVDLTEQDSQKEASKGNFLDDSGSLSPNTNEINSF--S 520
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  588 SSPTNmtSDTPASSSPPWPVITEVTRPESTIPagRSLANITSKAQEDSPLGVI-STHPQMSFQSSTSQQALDETAGERVP 666
Cdd:COG5025    521 LNTTD--SQQKQSPSHNAPTNNSLNEMASKNS--NSQTQASNSNENVAAVKAIlDASAQMEKPYDLSQAATPTKATESAS 596
PHA03269 PHA03269
envelope glycoprotein C; Provisional
502-629 7.95e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.25  E-value: 7.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  502 VTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQV-TSETPASSSPTNmtsdtPASSSPTNM 580
Cdd:PHA03269    18 IIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLaQAPTPAASEKFD-----PAPAPHQAA 92
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1039789982  581 TSDTPASSSPTNMTSDTPASSSPPwpvITEVTRPESTIPAGRSLANITS 629
Cdd:PHA03269    93 SRAPDPAVAPQLAAAPKPDAAEAF---TSAAQAHEAPADAGTSAASKKP 138
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
399-451 8.30e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 39.99  E-value: 8.30e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039789982  399 TSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVT 451
Cdd:cd21441     65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLT 117
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
390-513 8.30e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 40.65  E-value: 8.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  390 PASNSPPQGTSETPGFSSPPQVTTATlvssspPQVTSETPASssptqvtsetPASSSPTQVTSDTPASNSPPQGtsdtpg 469
Cdd:PHA03201     4 ARSRSPSPPRRPSPPRPTPPRSPDAS------PEETPPSPPG----------PGAEPPPGRAAGPAAPRRRPRG------ 61
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1039789982  470 fssptqvttatlvssSPPQVTSDTPASSSPPQVTSDTPASSSPP 513
Cdd:PHA03201    62 ---------------CPAGVTFSSSAPPRPPLGLDDAPAATPPP 90
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
507-606 8.75e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 40.65  E-value: 8.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  507 PASSSPPQVTSETPASSSPPQvTSDTSASISPPqvisdtpassSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PHA03201     4 ARSRSPSPPRRPSPPRPTPPR-SPDASPEETPP----------SPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSSSA 72
                           90       100
                   ....*....|....*....|..
gi 1039789982  587 SSSPTNMTSDTPASSSPP--WP 606
Cdd:PHA03201    73 PPRPPLGLDDAPAATPPPldWT 94
PHA02682 PHA02682
ORF080 virion core protein; Provisional
324-445 9.01e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 40.61  E-value: 9.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  324 TPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSdtPASSSPPQGTSETPAsnsPPQGTSEtP 403
Cdd:PHA02682    85 SPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPAR--PAPACPPSTRQCPPA---PPLPTPK-P 158
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1039789982  404 GFSSPPQVTTATLvssSPPQVtsetPASSSPTQVTSetPASS 445
Cdd:PHA02682   159 APAAKPIFLHNQL---PPPDY----PAASCPTIETA--PAAS 191
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
364-513 9.96e-03

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 40.69  E-value: 9.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPqgtsetpgfsSPPQVTTATLVSSSPpqVTSETPASSSPTQVTSETPA 443
Cdd:pfam16014   42 PGQNSEQQTASASPPSQHPAQAIPTILAPAAPP----------SQPSVVLSTLPAAMA--VTPPIPASMANVVAPPTQPA 109
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039789982  444 SSSptqvTSDTPASNSPPQgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS-DTPASSSPP 513
Cdd:pfam16014  110 ASS----TAACAVSSVLPE--------IKIKQEAEPMDTSQSVPPLTPTSISPALTSLANNlSVPAGDLLP 168
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
377-609 9.97e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.17  E-value: 9.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  377 PASSSPPQGTSETPASNSPPQGTSETPGFSSPPqVTTATLVSSSPPQVtSETPASSSPTQVTSETPASSSPTQVTSDTPA 456
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKP-VRTGYEKYKEPEPI-PDLQVDASLWGVAPKKAAAPAPAPQPAAQPA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  457 SNSPP------------QGTSDTPGfsSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSS 524
Cdd:pfam09770  185 SLPAPsrkmmsleeveaAMRAQAKK--PAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHP 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  525 PPQVTSDTSASISPPQVISDTPASSSPPQVtseTPASSSPT------NMTSDTPAsSSPTNMTSdtPASSSPTNMTSDTP 598
Cdd:pfam09770  263 VTILQRPQSPQPDPAQPSIQPQAQQFHQQP---PPVPVQPTqilqnpNRLSAARV-GYPQNPQP--GVQPAPAHQAHRQQ 336
                          250
                   ....*....|.
gi 1039789982  599 ASSSPPWPVIT 609
Cdd:pfam09770  337 GSFGRQAPIIT 347
PPE COG5651
PPE-repeat protein [Function unknown];
419-631 9.97e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 40.65  E-value: 9.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  419 SSPPQVTSEtPASSSPTQVTSETPASSSP----TQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTP 494
Cdd:COG5651    168 TQPPPTITN-PGGLLGAQNAGSGNTSSNPgfanLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAA 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982  495 ASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPAS 574
Cdd:COG5651    247 AAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGA 326
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039789982  575 SSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:COG5651    327 ALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
348-465 9.99e-03

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 38.27  E-value: 9.99e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039789982   348 SDTPA-SSSPPQgtldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSeTPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:smart01104    2 GRTPAwGASGSK----TPAWGSRTPGTAAGGAPTARGGSGSRTPAWGGAGSRTP-AWGGAGPTGSRTPAWGGASAWGNKS 76
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*
gi 1039789982   427 ETPASSSPTQVTSETPASSSP------TQVTSDTPASNSPPQGTS 465
Cdd:smart01104   77 SEGSASSWAAGPGGAYGAPTPgyggtpSAYGPATPGGGAMAGSAS 121
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH