NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2229487482|ref|NP_001392323|]
View 

synaptojanin-1 isoform e [Mus musculus]

Protein Classification

INPP5c_Synj1 and RRM_SYNJ1 domain-containing protein( domain architecture ID 13429261)

protein containing domains COG5329, INPP5c_Synj1, RRM_SYNJ1, and Atrophin-1

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
INPP5c_Synj1 cd09098
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This ...
525-860 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This subfamily contains the INPP5c domains of human synaptojanin 1 (Synj1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro.


:

Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09098      1 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKKAGIPEFQDVRSKPVDIFAIGFEEMVELNAGNIVSASTTNQKLWA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 684
Cdd:cd09098     81 AELQKTISRDQKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  685 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 764
Cdd:cd09098    161 GQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDIPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  765 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 844
Cdd:cd09098    241 FLEGKLDFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFPDNSKEQYTWSPGTLLHYGRA 320
                          330
                   ....*....|....*.
gi 2229487482  845 ELKTSDHRPVVALIDI 860
Cdd:cd09098    321 ELKTSDHRPVVALIDI 336
COG5329 super family cl34984
Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];
60-480 2.02e-93

Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];


The actual alignment was detected with superfamily member COG5329:

Pssm-ID: 227637 [Multi-domain]  Cd Length: 570  Bit Score: 314.33  E-value: 2.02e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   60 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASDEDRIS---------EVRKVLNSGNFYF 130
Cdd:COG5329     61 YGVIGLIKLKGD----IYLIVITGASLVGVIPGHSIYKILDVDFISLNNNKWDDELEEdeanydklsELKKLLSNGTFYF 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  131 A--WSASGvSLDLSLNAHRSMQEHTTDNRFFWNQSL------HLHLKHYGVNCDD-WLLRLMCGGVEIRTIYAAHKQAKA 201
Cdd:COG5329    137 SydFDITN-SLQKNLSEGLEASVDRADLIFMWNSFLleefinHRSKLSSLEKQFDnFLTTVIRGFAETVDIKVGGNTISL 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  202 CLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDCVSSFIQIRGSVPLFWEQPGLQVGShRVRMSRGFEANAPA 281
Cdd:COG5329    216 TLISRRSSERAGTRYLSRGIDDDGNVSNFVETEQIVTDSQYIFSFTQVRGSIPLFWEQSNLLYGP-KIKVTRSSEAAQSA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  282 FDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKASEHAsDIHMVSFDYHQMVKGGKAEKLHSILKPQVQKFLDY 361
Cdd:COG5329    295 FDKHFDKLREKYGDVYVVNLLKTKGYEAPLLELYEKHLDLSKKP-KIHYTEFDFHKETSQDGFDDVKKLLYLIEQDLLEF 373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  362 GFFYFDGSEVQRC--QSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKpqlVTRFQEVFRSMWSVNGDSISKI 439
Cdd:COG5329    374 GYFAYDINEGKSIseQDGVFRTNCLDCLDRTNVIQSLISRVLLEQFRSEGVISDG---YSPFLQIHRELWADNGDAISRL 450
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2229487482  440 YAGTGALEGKA----------KLKDGARSVTRTIQNNFFDSSKQEAIDVLL 480
Cdd:COG5329    451 YTGTGALKSSFtrrgrrsfagALNDFIKSFSRYYINNFTDGQRQDAIDLLL 501
RRM_SF super family cl17169
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
859-1000 1.69e-62

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


The actual alignment was detected with superfamily member pfam08952:

Pssm-ID: 473069  Cd Length: 146  Bit Score: 209.67  E-value: 1.69e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  859 DIDIFEVEAEERQKIYKEVIAVQGPPDGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEG 937
Cdd:pfam08952    1 DVEIQEVDPEARRRVFKEVIRDQGPPDGTIVVSLCSgDLDEKNIFDENLMDELIQELTSFGEVTLVRFVEDTMWVTFRDG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2229487482  938 SSALNALSLNGKELLNRTITITLKSPDWIKHLEEEMSLEKIS-VTLPSSASSTLLGEDAEVAAD 1000
Cdd:pfam08952   81 HSALNALSKDGMKVCGRALKIRLKSKDWIKGLEEEIILCTDNtIPVSPCANSTLLAEDFDFGSP 144
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1002-1409 9.22e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.71  E-value: 9.22e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1002 DMEGDVDDySAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQ 1081
Cdd:pfam03154  153 DNESDSDS-SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1082 PAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQrppppsgrnQPSPQAGLAGPGP-------AGYGAARPTIPARAGVISA 1154
Cdd:pfam03154  232 QQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSP---------QPLPQPSLHGQMPpmphslqTGPSHMQHPVPPQPFPLTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1155 PQSQARVCAGrPTPDSQSKPSETLKGPAVLPEPLKPQAafPQQPSLPtPAqklqdplvPIAAPTM--PPSGPQPNLETPP 1232
Cdd:pfam03154  303 QSSQSQVPPG-PSPAAPGQSQQRIHTPPSQSQLQSQQP--PREQPLP-PA--------PLSMPHIkpPPTTPIPQLPNPQ 370
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1233 QPPPRSRSSQSLPsdsspqLQVKINgisgVKQEPTLKsdPFEDLSlsvlavSKAQPSVQISPvltpdpkmlIQLPSASQS 1312
Cdd:pfam03154  371 SHKHPPHLSGPSP------FQMNSN----LPPPPALK--PLSSLS------THHPPSAHPPP---------LQLMPQSQQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1313 QVNPlssvscmPTRPPGPEESKSQESMGSSANPFPSL---PCRNPFTDRTAAPGNPfrvqsqeseaTSWLSKEEPVPNSp 1389
Cdd:pfam03154  424 LPPP-------PAQPPVLTQSQSLPPPAASHPPTSGLhqvPSQSPFPQHPFVPGGP----------PPITPPSGPPTST- 485
                          410       420
                   ....*....|....*....|
gi 2229487482 1390 fPPLMPLSHDTSKASSSLGG 1409
Cdd:pfam03154  486 -SSAMPGIQPPSSASVSSSG 504
 
Name Accession Description Interval E-value
INPP5c_Synj1 cd09098
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This ...
525-860 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This subfamily contains the INPP5c domains of human synaptojanin 1 (Synj1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro.


Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09098      1 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKKAGIPEFQDVRSKPVDIFAIGFEEMVELNAGNIVSASTTNQKLWA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 684
Cdd:cd09098     81 AELQKTISRDQKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  685 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 764
Cdd:cd09098    161 GQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDIPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  765 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 844
Cdd:cd09098    241 FLEGKLDFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFPDNSKEQYTWSPGTLLHYGRA 320
                          330
                   ....*....|....*.
gi 2229487482  845 ELKTSDHRPVVALIDI 860
Cdd:cd09098    321 ELKTSDHRPVVALIDI 336
IPPc smart00128
Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+) ...
523-863 1.02e-127

Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+)-sensitive enzymes.


Pssm-ID: 214525 [Multi-domain]  Cd Length: 306  Bit Score: 399.04  E-value: 1.02e-127
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   523 KKIRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLdapklagiQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKL 602
Cdd:smart00128    1 RDIKVLIGTWNVGGLE------SPKVDVTSWLF--------QKIEVKQSEKPDIYVIGLQEVVGLAPGVILETIAGKERL 66
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   603 WAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:smart00128   67 WSDLLESSLNGDGQYNVLAKVYLVGILVLVFVKANHLVYIKDVETFTVKTGMGGLWGNKGAVAVRFKLSDTSFCFVNSHL 146
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   683 AAGQSQVKERNEDFVEIARKLSFPMGRML--FSHDYVFWCGDFNYRIDLP-NEEVKELIRQQNWDSLIAGDQLINQKNAG 759
Cdd:smart00128  147 AAGASNVEQRNQDYKTILRALSFPERALLsqFDHDVVFWFGDLNFRLDSPsYEEVRRKISKKEFDDLLEKDQLNRQREAG 226
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   760 QIFRGFLEGKVTFAPTYKYDLF-SEDYDTSEKCRTPAWTDRVLWRrrkwpfdRSAEDLDLLNAsfqdeskilytwtpgtl 838
Cdd:smart00128  227 KVFKGFQEGPITFPPTYKYDSVgTETYDTSEKKRVPAWCDRILYR-------SNGPELIQLSE----------------- 282
                           330       340
                    ....*....|....*....|....*
gi 2229487482   839 lHYGRAELKTSDHRPVVALIDIDIF 863
Cdd:smart00128  283 -YHSGMEITTSDHKPVFATFRLKVT 306
COG5329 COG5329
Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];
60-480 2.02e-93

Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];


Pssm-ID: 227637 [Multi-domain]  Cd Length: 570  Bit Score: 314.33  E-value: 2.02e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   60 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASDEDRIS---------EVRKVLNSGNFYF 130
Cdd:COG5329     61 YGVIGLIKLKGD----IYLIVITGASLVGVIPGHSIYKILDVDFISLNNNKWDDELEEdeanydklsELKKLLSNGTFYF 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  131 A--WSASGvSLDLSLNAHRSMQEHTTDNRFFWNQSL------HLHLKHYGVNCDD-WLLRLMCGGVEIRTIYAAHKQAKA 201
Cdd:COG5329    137 SydFDITN-SLQKNLSEGLEASVDRADLIFMWNSFLleefinHRSKLSSLEKQFDnFLTTVIRGFAETVDIKVGGNTISL 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  202 CLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDCVSSFIQIRGSVPLFWEQPGLQVGShRVRMSRGFEANAPA 281
Cdd:COG5329    216 TLISRRSSERAGTRYLSRGIDDDGNVSNFVETEQIVTDSQYIFSFTQVRGSIPLFWEQSNLLYGP-KIKVTRSSEAAQSA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  282 FDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKASEHAsDIHMVSFDYHQMVKGGKAEKLHSILKPQVQKFLDY 361
Cdd:COG5329    295 FDKHFDKLREKYGDVYVVNLLKTKGYEAPLLELYEKHLDLSKKP-KIHYTEFDFHKETSQDGFDDVKKLLYLIEQDLLEF 373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  362 GFFYFDGSEVQRC--QSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKpqlVTRFQEVFRSMWSVNGDSISKI 439
Cdd:COG5329    374 GYFAYDINEGKSIseQDGVFRTNCLDCLDRTNVIQSLISRVLLEQFRSEGVISDG---YSPFLQIHRELWADNGDAISRL 450
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2229487482  440 YAGTGALEGKA----------KLKDGARSVTRTIQNNFFDSSKQEAIDVLL 480
Cdd:COG5329    451 YTGTGALKSSFtrrgrrsfagALNDFIKSFSRYYINNFTDGQRQDAIDLLL 501
Syja_N pfam02383
SacI homology domain; This Pfam family represents a protein domain which shows homology to the ...
60-340 3.07e-86

SacI homology domain; This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.


Pssm-ID: 460545  Cd Length: 295  Bit Score: 283.69  E-value: 3.07e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   60 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASD----------EDRI-SEVRKVLNSGNF 128
Cdd:pfam02383    1 YGILGLIRLLSG----YYLIVITKREQVGQIGGHPIYKITDVEFIPLNSSLSDtqlakkehpdEERLlKLLKLFLSSGSF 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  129 YFAWSasgvsLDLSlnahRSMQEHTT----------DNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQ 198
Cdd:pfam02383   77 YFSYD-----YDLT----NSLQRNLTrsrspsfdslDDRFFWNRHLLKPLIDFQLDLDRWILPLIQGFVEQGKLSVFGRS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  199 AKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDC-----VSSFIQIRGSVPLFWEQPGLQVGSHRVRMSR 273
Cdd:pfam02383  148 VTLTLISRRSRKRAGTRYLRRGIDDDGNVANFVETEQIVSLNTSnsegkIFSFVQIRGSIPLFWSQDPNLKYKPKIQITR 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2229487482  274 gFEANAPAFDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKAS--EHASDIHMVSFDYHQMVK 340
Cdd:pfam02383  228 -PEATQPAFKKHFDDLIERYGPVHIVNLVEKKGRESKLSEAYEEAVKYLnqFLPDKLRYTAFDFHHECK 295
COG5411 COG5411
Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];
518-886 1.41e-68

Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];


Pssm-ID: 227698 [Multi-domain]  Cd Length: 460  Bit Score: 238.91  E-value: 1.41e-68
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  518 KYSKPKKIRVCVGTWNVNGgkqfrsiafKNQT--LTDWLLdaPklagiqefQDKRSKPTDIFAIGFEEMVELNAGNIVNA 595
Cdd:COG5411     23 KYVIEKDVSIFVSTFNPPG---------KPPKasTKRWLF--P--------EIEATELADLYVVGLQEVVELTPGSILSA 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  596 STtNQKL--W---AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLF 670
Cdd:COG5411     84 DP-YDRLriWeskVLDCLNGAQSDEKYSLLRSPQLGGILLRVFSLATNLPVVKPVSGTVKKTGFGGSSSNKGAVAIRFNY 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  671 HTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNW--DSLIA 748
Cdd:COG5411    163 ERTSFCFVNSHLAAGVNNIEERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLNYRVTSTNEEVRPEIASDDGrlDKLFE 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  749 GDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdesk 828
Cdd:COG5411    243 YDQLLWEMEVGNVFPGFKEPVITFPPTYKFDYGTDEYDTSDKGRIPSWTDRILYKS------------------------ 298
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2229487482  829 ilYTWTPGTllhYGRAE-LKTSDHRPVVALIDIDIFEVEAEERQKIYKEVIA--VQGPPDG 886
Cdd:COG5411    299 --EQLTPHS---YSSIPhLMISDHRPVYATFRAKIKVVDPSKKEGLIEKLYAeyKTELGEA 354
DUF1866 pfam08952
Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known ...
859-1000 1.69e-62

Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known function.


Pssm-ID: 286093  Cd Length: 146  Bit Score: 209.67  E-value: 1.69e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  859 DIDIFEVEAEERQKIYKEVIAVQGPPDGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEG 937
Cdd:pfam08952    1 DVEIQEVDPEARRRVFKEVIRDQGPPDGTIVVSLCSgDLDEKNIFDENLMDELIQELTSFGEVTLVRFVEDTMWVTFRDG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2229487482  938 SSALNALSLNGKELLNRTITITLKSPDWIKHLEEEMSLEKIS-VTLPSSASSTLLGEDAEVAAD 1000
Cdd:pfam08952   81 HSALNALSKDGMKVCGRALKIRLKSKDWIKGLEEEIILCTDNtIPVSPCANSTLLAEDFDFGSP 144
PLN03191 PLN03191
Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional
616-875 7.90e-50

Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional


Pssm-ID: 215624 [Multi-domain]  Cd Length: 621  Bit Score: 188.19  E-value: 7.90e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  616 KYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKE--RN 693
Cdd:PLN03191   363 KYVRIVSKQMVGIYVSVWVRKRLRRHINNLKVSPVGVGLMGYMGNKGSVSISMSLFQSRLCFVCSHLTSGHKDGAEqrRN 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  694 EDFVEIARKLSFP------MGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLE 767
Cdd:PLN03191   443 ADVYEIIRRTRFSsvldtdQPQTIPSHDQIFWFGDLNYRLNMLDTEVRKLVAQKRWDELINSDQLIKELRSGHVFDGWKE 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  768 GKVTFAPTYKYDLFSEDY-----DTSEKCRTPAWTDRVLWrrrkwpfdrsaedldlLNASFQDESkilytwtpgtllhYG 842
Cdd:PLN03191   523 GPIKFPPTYKYEINSDRYvgenpKEGEKKRSPAWCDRILW----------------LGKGIKQLC-------------YK 573
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2229487482  843 RAELKTSDHRPVVALIDIdifEVEAEERQKIYK 875
Cdd:PLN03191   574 RSEIRLSDHRPVSSMFLV---EVEVFDHRKLQR 603
RRM_SYNJ1 cd12719
RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup ...
885-961 1.03e-38

RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-1, also termed synaptojanin, or synaptic inositol-1,4,5-trisphosphate 5-phosphatase 1, originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-1 has two tissue-specific alternative splicing isoforms, synaptojanin-145 expressed in brain and synaptojanin-170 expressed in peripheral tissues. Synaptojanin-145 is very abundant in nerve terminals and may play an essential role in the clathrin-mediated endocytosis of synaptic vesicles. In contrast to synaptojanin-145, synaptojanin-170 contains three unique asparagine-proline-phenylalanine (NPF) motifs in the C-terminal region and may functions as a potential binding partner for Eps15, a clathrin coat-associated protein acting as a major substrate for the tyrosine kinase activity of the epidermal growth factor receptor.


Pssm-ID: 410118  Cd Length: 77  Bit Score: 138.69  E-value: 1.03e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2229487482  885 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 961
Cdd:cd12719      1 DGTVVVSVLSSSPEPNYFDDNLIDALLQQFSSFGEVILIRFVEDKMWVTFLEGSSALAALSLNGTEVLGRTIIISLK 77
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1002-1409 9.22e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.71  E-value: 9.22e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1002 DMEGDVDDySAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQ 1081
Cdd:pfam03154  153 DNESDSDS-SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1082 PAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQrppppsgrnQPSPQAGLAGPGP-------AGYGAARPTIPARAGVISA 1154
Cdd:pfam03154  232 QQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSP---------QPLPQPSLHGQMPpmphslqTGPSHMQHPVPPQPFPLTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1155 PQSQARVCAGrPTPDSQSKPSETLKGPAVLPEPLKPQAafPQQPSLPtPAqklqdplvPIAAPTM--PPSGPQPNLETPP 1232
Cdd:pfam03154  303 QSSQSQVPPG-PSPAAPGQSQQRIHTPPSQSQLQSQQP--PREQPLP-PA--------PLSMPHIkpPPTTPIPQLPNPQ 370
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1233 QPPPRSRSSQSLPsdsspqLQVKINgisgVKQEPTLKsdPFEDLSlsvlavSKAQPSVQISPvltpdpkmlIQLPSASQS 1312
Cdd:pfam03154  371 SHKHPPHLSGPSP------FQMNSN----LPPPPALK--PLSSLS------THHPPSAHPPP---------LQLMPQSQQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1313 QVNPlssvscmPTRPPGPEESKSQESMGSSANPFPSL---PCRNPFTDRTAAPGNPfrvqsqeseaTSWLSKEEPVPNSp 1389
Cdd:pfam03154  424 LPPP-------PAQPPVLTQSQSLPPPAASHPPTSGLhqvPSQSPFPQHPFVPGGP----------PPITPPSGPPTST- 485
                          410       420
                   ....*....|....*....|
gi 2229487482 1390 fPPLMPLSHDTSKASSSLGG 1409
Cdd:pfam03154  486 -SSAMPGIQPPSSASVSSSG 504
PHA03247 PHA03247
large tegument protein UL36; Provisional
1052-1412 8.07e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 8.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1052 SAPSLPIRPsRAPSRTPGPPSSQGSPVDTQPAAqkdssqtlepkrpppprpvapparpappqrPPPPSGRNQPSPQAGLA 1131
Cdd:PHA03247  2590 DAPPQSARP-RAPVDDRGDPRGPAPPSPLPPDT------------------------------HAPDPPPPSPSPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1132 GPG--------------PAGYGAARP---TIPARAGVISAPQSQARVCAGRPT---------PDSQSKPSETLKGPAV-- 1183
Cdd:PHA03247  2639 DPHppptvppperprddPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTvgsltsladPPPPPPTPEPAPHALVsa 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1184 LPEPLKPQA---AFPQQPSLPTPAQKLQDPLVPI-----------------AAPTMPPSGPQPNLeTPPQPPPRSRSSQS 1243
Cdd:PHA03247  2719 TPLPPGPAAarqASPALPAAPAPPAVPAGPATPGgparparppttagppapAPPAAPAAGPPRRL-TRPAVASLSESRES 2797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1244 LPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSLSVLAVSKAQPSVQISPVLTPDPKMliqLPSASQSQVNPLSSVSCM 1323
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV---APGGDVRRRPPSRSPAAK 2874
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1324 PTRPPGPEESK-SQESMGSSANPFPsLPCRNPFTDRTAAPGNPFRVQSQESEATSWLSKEEPvPNSPFPPLMPLShDTSK 1402
Cdd:PHA03247  2875 PAAPARPPVRRlARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP-PPRPQPPLAPTT-DPAG 2951
                          410
                   ....*....|
gi 2229487482 1403 ASSSLGGFED 1412
Cdd:PHA03247  2952 AGEPSGAVPQ 2961
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1044-1226 1.53e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1044 QSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGRNQ 1123
Cdd:NF033839   285 KEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQagLAGPgpagygaaRPTIPARAGvisAPQSQARVCAGRPTPDSQSKPsETLKgPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:NF033839   365 VKPQ--PEKP--------KPEVKPQPE---TPKPEVKPQPEKPKPEVKPQP-EKPK-PEVKPQPEKPKPEVKPQPEKPKP 429
                          170       180
                   ....*....|....*....|...
gi 2229487482 1204 AQKLQdPLVPIAAPTMPPSGPQP 1226
Cdd:NF033839   430 EVKPQ-PEKPKPEVKPQPEKPKP 451
RRM smart00360
RNA recognition motif;
903-958 1.70e-03

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 38.34  E-value: 1.70e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482   903 DDALIDELLRQFAHFGEVILIRFVEDKMW--------VTFLEGSSALNALS-LNGKELLNRTITI 958
Cdd:smart00360    9 PDTTEEELRELFSKFGKVESVRLVRDKETgkskgfafVEFESEEDAEKALEaLNGKELDGRPLKV 73
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1124-1399 7.00e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 40.82  E-value: 7.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQArvcAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:COG5180    163 AKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEK---LDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVDPPST 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1204 AQKLQDPLVPIAAPTMPPSGPQPNLEtppqppprSRSSQSLPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSlSVLAV 1283
Cdd:COG5180    240 SEARSRPATVDAQPEMRPPADAKERR--------RAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVK-GVASA 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1284 SKAQPSVQISPVL----TPDPKMLIQLPSA---SQSQVNPLSSVSCMPT-RPPGPEESKSQESMGSSANPfpslpcRNPF 1355
Cdd:COG5180    311 PPATRPVRPPGGArdpgTPRPGQPTERPAGvpeAASDAGQPPSAYPPAEeAVPGKPLEQGAPRPGSSGGD------GAPF 384
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2229487482 1356 TDRTAAPGNpfrvQSQESEAtswlskeePVPNSPFPPLMPLSHD 1399
Cdd:COG5180    385 QPPNGAPQP----GLGRRGA--------PGPPMGAGDLVQAALD 416
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1143-1228 9.88e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 38.62  E-value: 9.88e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  1143 PTIPARAGVIsaPQSQARVCAGRP--TPDSQSKPSetLKGPAvlPEPLKPQAAFPQQPSLPTPAQKLQDPLVPI----AA 1216
Cdd:smart00818   59 PVLPAQQPVV--PQQPLMPVPGQHsmTPTQHHQPN--LPQPA--QQPFQPQPLQPPQPQQPMQPQPPVHPIPPLppqpPL 132
                            90
                    ....*....|..
gi 2229487482  1217 PTMPPSGPQPNL 1228
Cdd:smart00818  133 PPMFPMQPLPPL 144
 
Name Accession Description Interval E-value
INPP5c_Synj1 cd09098
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This ...
525-860 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This subfamily contains the INPP5c domains of human synaptojanin 1 (Synj1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro.


Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09098      1 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKKAGIPEFQDVRSKPVDIFAIGFEEMVELNAGNIVSASTTNQKLWA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 684
Cdd:cd09098     81 AELQKTISRDQKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  685 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 764
Cdd:cd09098    161 GQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDIPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  765 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 844
Cdd:cd09098    241 FLEGKLDFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFPDNSKEQYTWSPGTLLHYGRA 320
                          330
                   ....*....|....*.
gi 2229487482  845 ELKTSDHRPVVALIDI 860
Cdd:cd09098    321 ELKTSDHRPVVALIDI 336
INPP5c_Synj cd09089
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanins; This ...
525-860 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanins; This subfamily contains the INPP5c domains of two human synaptojanins, synaptojanin 1 (Synj1) and synaptojanin 2 (Synj2), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs). They belong to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, Synjs contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro. Synj2 can hydrolyze phosphatidylinositol diphosphate (PIP2) to phosphatidylinositol phosphate (PIP). Synj2 occurs as multiple alternative splice variants in various tissues. These variants share the INPP5c domain and the Sac1 domain. Synj2A is recruited to the mitochondria via its interaction with OMP25 (a mitochondrial outer membrane protein). Synj2B is found at nerve terminals in the brain and at the spermatid manchette in testis. Synj2B undergoes further alternative splicing to give 2B1 and 2B2. In clathrin-mediated endocytosis, Synj2 participates in the formation of clathrin-coated pits, and perhaps also in vesicle decoating. Rac1 GTPase regulates the intracellular localization of Synj2 forms, but not Synj1. Synj2 may contribute to the role of Rac1 in cell migration and invasion, and is a potential target for therapeutic intervention in malignant tumors.


Pssm-ID: 197323 [Multi-domain]  Cd Length: 328  Bit Score: 685.66  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQ-EFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLW 603
Cdd:cd09089      1 LRVFVGTWNVNGGKHFRSIAFKHQSMTDWLLDNPKLAGQCsNDSEEDEKPVDIFAIGFEEMVDLNASNIVSASTTNQKEW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  604 AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFA 683
Cdd:cd09089     81 GEELQKTISRDHKYVLLTSEQLVGVCLFVFVRPQHAPFIRDVAVDTVKTGLGGAAGNKGAVAIRFLLHSTSLCFVCSHFA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  684 AGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFR 763
Cdd:cd09089    161 AGQSQVKERNEDFAEIARKLSFPMGRTLDSHDYVFWCGDFNYRIDLPNDEVKELVRNGDWLKLLEFDQLTKQKAAGNVFK 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  764 GFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDllnasfqdeSKILYTWTPGTLLHYGR 843
Cdd:cd09089    241 GFLEGEINFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPSDKTEESLV---------ETNDPTWNPGTLLYYGR 311
                          330
                   ....*....|....*..
gi 2229487482  844 AELKTSDHRPVVALIDI 860
Cdd:cd09089    312 AELKTSDHRPVVAIIDI 328
INPP5c_Synj2 cd09099
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 2; This ...
525-860 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 2; This subfamily contains the INPP5c domains of human synaptojanin 2 (Synj2) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj2 can hydrolyze phosphatidylinositol diphosphate (PIP2) to phosphatidylinositol phosphate (PIP). In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro. Synj2 occurs as multiple alternative splice variants in various tissues. These variants share the INPP5c domain and the Sac1 domain. Synj2A is recruited to the mitochondria via its interaction with OMP25, a mitochondrial outer membrane protein. Synj2B is found at nerve terminals in the brain and at the spermatid manchette in testis. Synj2B undergoes further alternative splicing to give 2B1 and 2B2. In clathrin-mediated endocytosis, Synj2 participates in the formation of clathrin-coated pits, and perhaps also in vesicle decoating. Rac1 GTPase regulates the intracellular localization of Synj2 forms, but not Synj1. Synj2 may contribute to the role of Rac1 in cell migration and invasion, and is a potential target for therapeutic intervention in malignant tumors.


Pssm-ID: 197333  Cd Length: 336  Bit Score: 566.96  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09099      1 TRVAMGTWNVNGGKQFRSNILGTSELTDWLLDSPKLSGTPDFQDDESNPPDIFAVGFEEMVELSAGNIVNASTTNRKMWG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 684
Cdd:cd09099     81 EQLQKAISRSHRYILLTSAQLVGVCLFIFVRPYHVPFIRDVAIDTVKTGMGGKAGNKGAVAIRFQFYSTSFCFICSHLTA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  685 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 764
Cdd:cd09099    161 GQNQVKERNEDYKEITQKLSFPMGRNVFSHDYVFWCGDFNYRIDLTYEEVFYFIKRQDWKKLLEFDQLQLQKSSGKIFKD 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  765 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 844
Cdd:cd09099    241 FHEGTINFGPTYKYDVGSEAYDTSDKCRTPAWTDRVLWWRKKWPFEKTAGEINLLDSDLDFDTKIRHTWTPGALMYYGRA 320
                          330
                   ....*....|....*.
gi 2229487482  845 ELKTSDHRPVVALIDI 860
Cdd:cd09099    321 ELQASDHRPVLAIVEV 336
IPPc smart00128
Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+) ...
523-863 1.02e-127

Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+)-sensitive enzymes.


Pssm-ID: 214525 [Multi-domain]  Cd Length: 306  Bit Score: 399.04  E-value: 1.02e-127
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   523 KKIRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLdapklagiQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKL 602
Cdd:smart00128    1 RDIKVLIGTWNVGGLE------SPKVDVTSWLF--------QKIEVKQSEKPDIYVIGLQEVVGLAPGVILETIAGKERL 66
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   603 WAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:smart00128   67 WSDLLESSLNGDGQYNVLAKVYLVGILVLVFVKANHLVYIKDVETFTVKTGMGGLWGNKGAVAVRFKLSDTSFCFVNSHL 146
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   683 AAGQSQVKERNEDFVEIARKLSFPMGRML--FSHDYVFWCGDFNYRIDLP-NEEVKELIRQQNWDSLIAGDQLINQKNAG 759
Cdd:smart00128  147 AAGASNVEQRNQDYKTILRALSFPERALLsqFDHDVVFWFGDLNFRLDSPsYEEVRRKISKKEFDDLLEKDQLNRQREAG 226
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   760 QIFRGFLEGKVTFAPTYKYDLF-SEDYDTSEKCRTPAWTDRVLWRrrkwpfdRSAEDLDLLNAsfqdeskilytwtpgtl 838
Cdd:smart00128  227 KVFKGFQEGPITFPPTYKYDSVgTETYDTSEKKRVPAWCDRILYR-------SNGPELIQLSE----------------- 282
                           330       340
                    ....*....|....*....|....*
gi 2229487482   839 lHYGRAELKTSDHRPVVALIDIDIF 863
Cdd:smart00128  283 -YHSGMEITTSDHKPVFATFRLKVT 306
INPP5c cd09074
Catalytic domain of inositol polyphosphate 5-phosphatases; Inositol polyphosphate ...
525-860 7.08e-107

Catalytic domain of inositol polyphosphate 5-phosphatases; Inositol polyphosphate 5-phosphatases (5-phosphatases) are signal-modifying enzymes, which hydrolyze the 5-phosphate from the inositol ring of specific 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), such as PI(4,5)P2, PI(3,4,5)P3, PI(3,5)P2, I(1,4,5)P3, and I(1,3,4,5)P4. These enzymes are Mg2+-dependent, and belong to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. In addition to this INPP5c domain, 5-phosphatases often contain additional domains and motifs, such as the SH2 domain, the Sac-1 domain, the proline-rich domain (PRD), CAAX, RhoGAP (RhoGTPase-activating protein), and SKICH [SKIP (skeletal muscle- and kidney-enriched inositol phosphatase) carboxyl homology] domains, that are important for protein-protein interactions and/or for the subcellular localization of these enzymes. 5-phosphatases incorporate into large signaling complexes, and regulate diverse cellular processes including postsynaptic vesicular trafficking, insulin signaling, cell growth and survival, and endocytosis. Loss or gain of function of 5-phosphatases is implicated in certain human diseases. This family also contains a functionally unrelated nitric oxide transport protein, Cimex lectularius (bedbug) nitrophorin, which catalyzes a heme-assisted S-nitrosation of a proximal thiolate; the heme however binds at a site distinct from the active site of the 5-phosphatases.


Pssm-ID: 197308 [Multi-domain]  Cd Length: 299  Bit Score: 341.62  E-value: 7.08e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLDAPklagiqefqdkrSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09074      1 VKIFVVTWNVGGGI------SPPENLENWLSPKG------------TEAPDIYAVGVQEVDMSVQGFVGNDDSAKAREWV 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAV--DTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:cd09074     63 DNIQEALNEKENYVLLGSAQLVGIFLFVFVKKEHLPQIKDLEVegVTVGTGGGGKLGNKGGVAIRFQINDTSFCFVNSHL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  683 AAGQSQVKERNEDFVEIARKLSFPMG----RMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNA 758
Cdd:cd09074    143 AAGQEEVERRNQDYRDILSKLKFYRGdpaiDSIFDHDVVFWFGDLNYRIDSTDDEVRKLISQGDLDDLLEKDQLKKQKEK 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  759 GQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdeskilYTWTPGTL 838
Cdd:cd09074    223 GKVFDGFQELPITFPPTYKFDPGTDEYDTSDKKRIPAWCDRILYKS--------------------------KAGSEIQP 276
                          330       340
                   ....*....|....*....|...
gi 2229487482  839 LHYGRAEL-KTSDHRPVVALIDI 860
Cdd:cd09074    277 LSYTSVPLyKTSDHKPVRATFRV 299
INPP5c_ScInp51p-like cd09090
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Saccharomyces cerevisiae ...
525-856 1.69e-104

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Saccharomyces cerevisiae Inp51p, Inp52p, and Inp53p, and related proteins; This subfamily contains the INPP5c domain of three Saccharomyces cerevisiae synaptojanin-like inositol polyphosphate 5-phosphatases (INP51, INP52, and INP53), Schizosaccharomyces pombe synaptojanin (SPsynaptojanin), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. In addition to this INPP5c domain, these proteins have an N-terminal catalytic Sac1-like domain (found in other proteins including the phophoinositide phosphatase Sac1p), and a C-terminal proline-rich domain (PRD). The Sac1 domain allows Inp52p and Inp53p to recognize and dephosphorylate a wider range of substrates including PI3P, PI4P, and PI(3,5)P2. The Sac1 domain of Inp51p is non-functional. Disruption of any two of INP51, INP52, and INP53, in S. cerevisiae leads to abnormal vacuolar and plasma membrane morphology. During hyperosmotic stress, Inp52p and Inp53p localize at actin patches, where they may facilitate the hydrolysis of PI(4,5)P2, and consequently promote actin rearrangement to regulate cell growth. SPsynaptojanin is also active against a range of soluble and lipid inositol phosphates, including I(1,4,5)P3, I(1,3,4,5)P4, I(1,4,5,6)P4, PI(4,5)P2, and PIP3. Transformation of S. cerevisiae with a plasmid expressing the SPsynaptojanin 5-phosphatase domain rescues inp51/inp52/inp53 triple-mutant strains.


Pssm-ID: 197324  Cd Length: 291  Bit Score: 334.69  E-value: 1.69e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGgkqfrsiAFKNQTLTDWLldapklagiqeFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 604
Cdd:cd09090      1 INIFVGTFNVNG-------KSYKDDLSSWL-----------FPEENDELPDIVVIGLQEVVELTAGQILNSDPSKSSFWE 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:cd09090     63 KKIKTTLNGrgGEKYVLLRSEQLVGTALLFFVKESQLPKVKNVEGSTKKTGLGGMSGNKGAVAIRFDYGDTSFCFVTSHL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  683 AAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIF 762
Cdd:cd09090    143 AAGLTNYEERNNDYKTIARGLRFSRGRTIKDHDHVIWLGDFNYRISLTNEDVRRFILNGKLDKLLEYDQLNQQMNAGEVF 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  763 RGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsAEDLDLLNasfqdeskilytwtpgtllhYG 842
Cdd:cd09090    223 PGFSEGPITFPPTYKYDKGTDNYDTSEKQRIPAWTDRILYR---------GENLRQLS--------------------YN 273
                          330
                   ....*....|....
gi 2229487482  843 RAELKTSDHRPVVA 856
Cdd:cd09090    274 SAPLRFSDHRPVYA 287
INPP5c_INPP5B cd09093
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Type II inositol ...
525-860 1.60e-99

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Type II inositol polyphosphate 5-phosphatase I, Oculocerebrorenal syndrome of Lowe 1, and related proteins; This subfamily contains the INPP5c domain of type II inositol polyphosphate 5-phosphatase I (INPP5B), Oculocerebrorenal syndrome of Lowe 1 (OCRL-1), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5B and OCRL1 preferentially hydrolyze the 5-phosphate of phosphatidylinositol (4,5)- bisphosphate [PI(4,5)P2] and phosphatidylinositol (3,4,5)- trisphosphate [PI(3,4,5)P3]. INPP5B can also hydrolyze soluble inositol (1,4,5)-trisphosphate [I(1,4,5)P3] and inositol (1,3,4,5)-tetrakisphosphate [I(1,3,4,5)P4]. INPP5B participates in the endocytic pathway and in the early secretory pathway. In the latter, it may function in retrograde ERGIC (ER-to-Golgi intermediate compartment)-to-ER transport; it binds specific RAB proteins within the secretory pathway. In the endocytic pathway, it binds RAB5 and during endocytosis, may function in a RAB5-controlled cascade for converting PI(3,4,5)P3 to phosphatidylinositol 3-phosphate (PI3P). This cascade may link growth factor signaling and membrane dynamics. Mutation in OCRL1 is implicated in Lowe syndrome, an X-linked recessive multisystem disorder, which includes defects in eye, brain, and kidney function, and in Type 2 Dent's disease, a disorder with only the renal symptoms. OCRL-1 may have a role in membrane trafficking within the endocytic pathway and at the trans-Golgi network, and may participate in actin dynamics or signaling from endomembranes. OCRL1 and INPP5B have overlapping functions: deletion of both 5-phosphatases in mice is embryonic lethal, deletion of OCRL1 alone has no phenotype, and deletion of Inpp5b alone has only a mild phenotype (male sterility). Several of the proteins that interact with OCRL1 also bind INPP5B, for examples, inositol polyphosphate phosphatase interacting protein of 27kDa (IPIP27)A and B (also known as Ses1 and 2), and endocytic signaling adaptor APPL1. OCRL1, but not INPP5B, binds clathrin heavy chain, the plasma membrane AP2 adaptor subunit alpha-adaptin. In addition to this INPP5c domain, most proteins in this subfamily have a C-terminal RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain.


Pssm-ID: 197327  Cd Length: 292  Bit Score: 320.80  E-value: 1.60e-99
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKqfrsiafKNQTLTDWLldapklagiqefqDKRSKPTDIFAIGFEEmVELNAGNIVNASTTNQKLWA 604
Cdd:cd09093      1 FRIFVGTWNVNGQS-------PDESLRPWL-------------SCDEEPPDIYAIGFQE-LDLSAEAFLFNDSSREQEWV 59
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 684
Cdd:cd09093     60 KAVERGLHPDAKYKKVKLIRLVGMMLLVFVKKEHRQHIKEVAAETVGTGIMGKMGNKGGVAVRFQFHNTTFCFVNSHLAA 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  685 GQSQVKERNEDFVEIARKLSFPMG----RMLFSHDYVFWCGDFNYRI-DLPNEEVKELIRQQNWDSLIAGDQLINQKNAG 759
Cdd:cd09093    140 HMEEVERRNQDYKDICARMKFEDPdgppLSISDHDVVFWLGDLNYRIqELPTEEVKELIEKNDLEELLKYDQLNIQRRAG 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  760 QIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsaedldllnasfqdESKIlytwtpgTLL 839
Cdd:cd09093    220 KVFEGFTEGEINFIPTYKYDPGTDNWDSSEKCRAPAWCDRILWR----------------------GTNI-------VQL 270
                          330       340
                   ....*....|....*....|..
gi 2229487482  840 HYGR-AELKTSDHRPVVALIDI 860
Cdd:cd09093    271 SYRShMELKTSDHKPVSALFDI 292
COG5329 COG5329
Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];
60-480 2.02e-93

Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];


Pssm-ID: 227637 [Multi-domain]  Cd Length: 570  Bit Score: 314.33  E-value: 2.02e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   60 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASDEDRIS---------EVRKVLNSGNFYF 130
Cdd:COG5329     61 YGVIGLIKLKGD----IYLIVITGASLVGVIPGHSIYKILDVDFISLNNNKWDDELEEdeanydklsELKKLLSNGTFYF 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  131 A--WSASGvSLDLSLNAHRSMQEHTTDNRFFWNQSL------HLHLKHYGVNCDD-WLLRLMCGGVEIRTIYAAHKQAKA 201
Cdd:COG5329    137 SydFDITN-SLQKNLSEGLEASVDRADLIFMWNSFLleefinHRSKLSSLEKQFDnFLTTVIRGFAETVDIKVGGNTISL 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  202 CLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDCVSSFIQIRGSVPLFWEQPGLQVGShRVRMSRGFEANAPA 281
Cdd:COG5329    216 TLISRRSSERAGTRYLSRGIDDDGNVSNFVETEQIVTDSQYIFSFTQVRGSIPLFWEQSNLLYGP-KIKVTRSSEAAQSA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  282 FDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKASEHAsDIHMVSFDYHQMVKGGKAEKLHSILKPQVQKFLDY 361
Cdd:COG5329    295 FDKHFDKLREKYGDVYVVNLLKTKGYEAPLLELYEKHLDLSKKP-KIHYTEFDFHKETSQDGFDDVKKLLYLIEQDLLEF 373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  362 GFFYFDGSEVQRC--QSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKpqlVTRFQEVFRSMWSVNGDSISKI 439
Cdd:COG5329    374 GYFAYDINEGKSIseQDGVFRTNCLDCLDRTNVIQSLISRVLLEQFRSEGVISDG---YSPFLQIHRELWADNGDAISRL 450
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2229487482  440 YAGTGALEGKA----------KLKDGARSVTRTIQNNFFDSSKQEAIDVLL 480
Cdd:COG5329    451 YTGTGALKSSFtrrgrrsfagALNDFIKSFSRYYINNFTDGQRQDAIDLLL 501
Syja_N pfam02383
SacI homology domain; This Pfam family represents a protein domain which shows homology to the ...
60-340 3.07e-86

SacI homology domain; This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.


Pssm-ID: 460545  Cd Length: 295  Bit Score: 283.69  E-value: 3.07e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482   60 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASD----------EDRI-SEVRKVLNSGNF 128
Cdd:pfam02383    1 YGILGLIRLLSG----YYLIVITKREQVGQIGGHPIYKITDVEFIPLNSSLSDtqlakkehpdEERLlKLLKLFLSSGSF 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  129 YFAWSasgvsLDLSlnahRSMQEHTT----------DNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQ 198
Cdd:pfam02383   77 YFSYD-----YDLT----NSLQRNLTrsrspsfdslDDRFFWNRHLLKPLIDFQLDLDRWILPLIQGFVEQGKLSVFGRS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  199 AKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDC-----VSSFIQIRGSVPLFWEQPGLQVGSHRVRMSR 273
Cdd:pfam02383  148 VTLTLISRRSRKRAGTRYLRRGIDDDGNVANFVETEQIVSLNTSnsegkIFSFVQIRGSIPLFWSQDPNLKYKPKIQITR 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2229487482  274 gFEANAPAFDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKAS--EHASDIHMVSFDYHQMVK 340
Cdd:pfam02383  228 -PEATQPAFKKHFDDLIERYGPVHIVNLVEKKGRESKLSEAYEEAVKYLnqFLPDKLRYTAFDFHHECK 295
INPP5c_INPP5J-like cd09094
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of inositol polyphosphate ...
526-860 9.39e-72

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of inositol polyphosphate 5-phosphatase J and related proteins; INPP5c domain of Inositol polyphosphate-5-phosphatase J (INPP5J), also known as PIB5PA or PIPP, and related proteins. This subfamily belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5J hydrolyzes PI(4,5)P2, I(1,4,5)P3, and I(1,3,4,5)P4 at ruffling membranes. These proteins contain a C-terminal, SKIP carboxyl homology domain (SKICH), which may direct plasma membrane ruffle localization.


Pssm-ID: 197328  Cd Length: 300  Bit Score: 242.28  E-value: 9.39e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  526 RVCVGTWNVnggkqfrSIAFKNQTLTdwlldapKLAGIQEFQDKrskpTDIFAIGFEEmvelnagniVNASTTNQKL--- 602
Cdd:cd09094      2 RVYVVTWNV-------ATAPPPIDVR-------SLLGLQSPEVA----PDIYIIGLQE---------VNSKPVQFVSdli 54
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  603 ----WAvELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFV 678
Cdd:cd09094     55 fddpWS-DLFMDILSPKGYVKVSSIRLQGLLLLVFVKIQHLPFIRDVQTNYTRTGLGGYWGNKGAVTVRFSLYGHMICFL 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  679 CSHFAAGQSQVKERNEDFVEIARKLSFPMGRM--LFSHDYVFWCGDFNYRI-DLPNEEVKELIRQQNWDSLIAGDQLINQ 755
Cdd:cd09094    134 NCHLPAHMEKWEQRIDDFETILSTQVFNECNTpsILDHDYVFWFGDLNFRIeDVSIEFVRELVNSKKYHLLLEKDQLNMA 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  756 KNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsaedLDLLNASFQDESKIlytwtp 835
Cdd:cd09094    214 KRKEEAFQGFQEGPLNFAPTYKFDLGTDEYDTSGKKRKPAWTDRILWK------------VNPDASTEEKFLSI------ 275
                          330       340
                   ....*....|....*....|....*.
gi 2229487482  836 gTLLHY-GRAELKTSDHRPVVALIDI 860
Cdd:cd09094    276 -TQTSYkSHMEYGISDHKPVTAQFRL 300
COG5411 COG5411
Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];
518-886 1.41e-68

Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];


Pssm-ID: 227698 [Multi-domain]  Cd Length: 460  Bit Score: 238.91  E-value: 1.41e-68
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  518 KYSKPKKIRVCVGTWNVNGgkqfrsiafKNQT--LTDWLLdaPklagiqefQDKRSKPTDIFAIGFEEMVELNAGNIVNA 595
Cdd:COG5411     23 KYVIEKDVSIFVSTFNPPG---------KPPKasTKRWLF--P--------EIEATELADLYVVGLQEVVELTPGSILSA 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  596 STtNQKL--W---AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLF 670
Cdd:COG5411     84 DP-YDRLriWeskVLDCLNGAQSDEKYSLLRSPQLGGILLRVFSLATNLPVVKPVSGTVKKTGFGGSSSNKGAVAIRFNY 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  671 HTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNW--DSLIA 748
Cdd:COG5411    163 ERTSFCFVNSHLAAGVNNIEERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLNYRVTSTNEEVRPEIASDDGrlDKLFE 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  749 GDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdesk 828
Cdd:COG5411    243 YDQLLWEMEVGNVFPGFKEPVITFPPTYKFDYGTDEYDTSDKGRIPSWTDRILYKS------------------------ 298
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2229487482  829 ilYTWTPGTllhYGRAE-LKTSDHRPVVALIDIDIFEVEAEERQKIYKEVIA--VQGPPDG 886
Cdd:COG5411    299 --EQLTPHS---YSSIPhLMISDHRPVYATFRAKIKVVDPSKKEGLIEKLYAeyKTELGEA 354
DUF1866 pfam08952
Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known ...
859-1000 1.69e-62

Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known function.


Pssm-ID: 286093  Cd Length: 146  Bit Score: 209.67  E-value: 1.69e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  859 DIDIFEVEAEERQKIYKEVIAVQGPPDGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEG 937
Cdd:pfam08952    1 DVEIQEVDPEARRRVFKEVIRDQGPPDGTIVVSLCSgDLDEKNIFDENLMDELIQELTSFGEVTLVRFVEDTMWVTFRDG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2229487482  938 SSALNALSLNGKELLNRTITITLKSPDWIKHLEEEMSLEKIS-VTLPSSASSTLLGEDAEVAAD 1000
Cdd:pfam08952   81 HSALNALSKDGMKVCGRALKIRLKSKDWIKGLEEEIILCTDNtIPVSPCANSTLLAEDFDFGSP 144
INPP5c_INPP5E-like cd09095
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Inositol ...
523-860 2.51e-56

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Inositol polyphosphate-5-phosphatase E and related proteins; INPP5c domain of Inositol polyphosphate-5-phosphatase E (also called type IV or 72 kDa 5-phosphatase), rat pharbin, and related proteins. This subfamily belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5E hydrolyzes the 5-phosphate from PI(3,5)P2, PI(4,5)P2 and PI(3,4,5)P3, forming PI3P, PI4P, and PI(3,4)P2, respectively. It is a very potent PI(3,4,5)P3 5-phosphatase. Its intracellular localization is chiefly cytosolic, with pronounced perinuclear/Golgi localization. INPP5E also has an N-terminal proline rich domain (PRD) and a C-terminal CAAX motif. This protein is expressed in a variety of tissues, including the breast, brain, testis, and haemopoietic cells. It is differentially expressed in several cancers, for example, it is up-regulated in cervical cancer and down-regulated in stomach cancer. It is a candidate target for therapeutics of obesity and related disorders, as it is expressed in the hypothalamus, and following insulin stimulation, it undergoes tyrosine phosphorylation, associates with insulin receptor substrate-1, -2, and PI3-kinase, and become active as a 5-phosphatase. INPP5E may play a role, along with other 5-phosphatases SHIP2 and SKIP, in regulating glucose homoeostasis and energy metabolism. Mice deficient in INPPE5 develop a multi-organ disorder associated with structural defects of the primary cilium.


Pssm-ID: 197329  Cd Length: 298  Bit Score: 198.03  E-value: 2.51e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  523 KKIRVCVGTWNVNGGKQFrsiafkNQTLTDWLLdapklAGIQEFQdkrskpTDIFAIGFEEmvelnagnivnaSTTNQKL 602
Cdd:cd09095      3 RNVGIFVATWNMQGQKEL------PENLDDFLL-----PTSADFA------QDIYVIGVQE------------GCSDRRE 53
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  603 WAVELQKTISrdNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:cd09095     54 WEIRLQETLG--PSHVLLHSASHGVLHLAVFIRRDLIWFCSEVESATVTTRIVSQIKTKGALAISFTFFGTSFLFITSHF 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  683 AAGQSQVKERNEDFVEIARKLSFPmgRMLFSHDY-------------VFWCGDFNYRIDLPNEEVKELIRQ---QNWDSL 746
Cdd:cd09095    132 TSGDGKVKERVLDYNKIIQALNLP--RNVPTNPYksesgdvttrfdeVFWFGDFNFRLSGPRHLVDALINQgqeVDVSAL 209
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  747 IAGDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRkwpfdrsaedldllnasfqde 826
Cdd:cd09095    210 LQHDQLTREMSKGSIFKGFQEAPIHFPPTYKFDIGSDVYDTSSKQRVPSYTDRILYRSR--------------------- 268
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 2229487482  827 skilytwTPGTL--LHYGRAE-LKTSDHRPVVALIDI 860
Cdd:cd09095    269 -------QKGDVccLKYNSCPsIKTSDHRPVFALFRV 298
PLN03191 PLN03191
Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional
616-875 7.90e-50

Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional


Pssm-ID: 215624 [Multi-domain]  Cd Length: 621  Bit Score: 188.19  E-value: 7.90e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  616 KYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKE--RN 693
Cdd:PLN03191   363 KYVRIVSKQMVGIYVSVWVRKRLRRHINNLKVSPVGVGLMGYMGNKGSVSISMSLFQSRLCFVCSHLTSGHKDGAEqrRN 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  694 EDFVEIARKLSFP------MGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLE 767
Cdd:PLN03191   443 ADVYEIIRRTRFSsvldtdQPQTIPSHDQIFWFGDLNYRLNMLDTEVRKLVAQKRWDELINSDQLIKELRSGHVFDGWKE 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  768 GKVTFAPTYKYDLFSEDY-----DTSEKCRTPAWTDRVLWrrrkwpfdrsaedldlLNASFQDESkilytwtpgtllhYG 842
Cdd:PLN03191   523 GPIKFPPTYKYEINSDRYvgenpKEGEKKRSPAWCDRILW----------------LGKGIKQLC-------------YK 573
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2229487482  843 RAELKTSDHRPVVALIDIdifEVEAEERQKIYK 875
Cdd:PLN03191   574 RSEIRLSDHRPVSSMFLV---EVEVFDHRKLQR 603
INPP5c_SHIP1-INPP5D cd09100
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
525-803 5.85e-41

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol polyphosphate 5-phosphatase-1 and related proteins; This subfamily contains the INPP5c domain of SHIP1 (SH2 domain containing inositol polyphosphate 5-phosphatase-1, also known as SHIP/INPP5D) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. SHIP1's enzymic activity is restricted to phosphatidylinositol 3,4,5-trisphosphate [PI (3,4,5)P3] and inositol-1,3,4,5- polyphosphate [I(1,3,4,5)P4]. It converts these two phosphoinositides to phosphatidylinositol 3,4-bisphosphate [PI (3,4)P2] and inositol-1,3,4-polyphosphate [I(1,3,4)P3], respectively. SHIP1 is a negative regulator of cell growth and plays a major part in mediating the inhibitory signaling in B cells; it is predominantly expressed in hematopoietic cells. In addition to this INPP5c domain, SHIP1 has an N-terminal SH2 domain, two NPXY motifs, and a C-terminal proline-rich region (PRD). SHIP1's phosphorylated NPXY motifs interact with proteins with phosphotyrosine binding (PTB) domains, and facilitate the translocation of SHIP1 to the plasma membrane to hydrolyze PI(3,4,5)P3. SHIP1 generally acts to oppose the activity of phosphatidylinositol 3-kinase (PI3K). It acts as a negative signaling molecule, reducing the levels of PI(3,4,5)P3, thereby removing the latter as a membrane-targeting signal for PH domain-containing effector molecules. SHIP1 may also, in certain contexts, amplify PI3K signals. SHIP1 and SHIP2 have little overlap in their in vivo functions.


Pssm-ID: 197334  Cd Length: 307  Bit Score: 153.99  E-value: 5.85e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIafknqtlTDWLLDApklaGIQEFQDKRSK--PTDIFAIGFEEmvelnagnivnaSTTNQKL 602
Cdd:cd09100      1 ITIFIGTWNMGNAPPPKKI-------TSWFQCK----GQGKTRDDTADyiPHDIYVIGTQE------------DPLGEKE 57
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  603 WAVELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCS 680
Cdd:cd09100     58 WLDTLKHSLREitSISFKVIAIQTLWNIRIVVLAKPEHENRISHICTDSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNS 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  681 HFAAGQSQVKERNEDFVEIARKLSF---PMGRMLFSH--DYVFWCGDFNYRIDLPNEEVKEL---IRQQNWDSLIAGDQL 752
Cdd:cd09100    138 HLTSGSEKKLRRNQNYFNILRFLVLgdkKLSPFNITHrfTHLFWLGDLNYRVELPNTEAENIiqkIKQQQYQELLPHDQL 217
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2229487482  753 INQKNAGQIFRGFLEGKVTFAPTYKYD-------LFSEDYDTSEKCRTPAWTDRVLWR 803
Cdd:cd09100    218 LIERKESKVFLQFEEEEITFAPTYRFErgtreryAYTKQKATGMKYNLPSWCDRVLWK 275
INPP5c_SHIP cd09091
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
525-860 2.17e-40

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol polyphosphate 5-phosphatase-1 and -2, and related proteins; This subfamily contains the INPP5c domain of SHIP1 (SH2 domain containing inositol polyphosphate 5-phosphatase-1, also known as SHIP/INPP5D), and SHIP2 (also known as INPPL1). It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Both SHIP1 and -2 catalyze the dephosphorylation of the PI, phosphatidylinositol 3,4,5-trisphosphate [PI(3,4,5)P3], to phosphatidylinositol 3,4-bisphosphate [PI(3,4)P2]. SHIP1 also converts inositol-1,3,4,5- polyphosphate [I(1,3,4,5)P4] to inositol-1,3,4-polyphosphate [I(1,3,4)P3]. SHIP1 and SHIP2 have little overlap in their in vivo functions. SHIP1 is a negative regulator of cell growth and plays a major part in mediating the inhibitory signaling in B cells; it is predominantly expressed in hematopoietic cells. SHIP2 is as an inhibitor of the insulin signaling pathway, and is implicated in actin structure remodeling, cell adhesion and cell spreading, receptor endocytosis and degradation, and in the JIP1-mediated JNK pathway. SHIP2 is widely expressed, most prominently in brain, heart and in skeletal muscle. In addition to this INPP5c domain, SHIP1 has an N-terminal SH2 domain, two NPXY motifs, and a C-terminal proline-rich region (PRD), while SHIP2 has an N-terminal SH2 domain, a C-terminal proline-rich domain (PRD), which includes a WW-domain binding motif (PPLP), an NPXY motif, and a sterile alpha motif (SAM) domain. The gene encoding SHIP2 is a candidate gene for conferring a predisposition for type 2 diabetes.


Pssm-ID: 197325  Cd Length: 307  Bit Score: 152.02  E-value: 2.17e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSIafknqtlTDWLLDAPKLAGIQEFQDkrSKPTDIFAIGFEEmvelnagnivnaSTTNQKLWA 604
Cdd:cd09091      1 ISIFIGTWNMGSAPPPKNI-------TSWFTSKGQGKTRDDVAD--YIPHDIYVIGTQE------------DPLGEKEWL 59
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  605 VELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 682
Cdd:cd09091     60 DLLRHSLKEltSLDYKPIAMQTLWNIRIVVLAKPEHENRISHVCTSSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNSHL 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  683 AAGQSQVKERNEDFVEIARKLSF---PMGRMLFSH--DYVFWCGDFNYRIDLPNEEVKELI---RQQNWDSLIAGDQLIN 754
Cdd:cd09091    140 TSGSEKKLRRNQNYLNILRFLSLgdkKLSAFNITHrfTHLFWLGDLNYRLDLPIQEAENIIqkiEQQQFEPLLRHDQLNL 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  755 QKNAGQIFRGFLEGKVTFAPTYKYDLFSEDY-------DTSEKCRTPAWTDRVLWrrrkwpfdRSAEDLDLLNASFQDES 827
Cdd:cd09091    220 EREEHKVFLRFSEEEITFPPTYRYERGSRDTyaytkqkATGVKYNLPSWCDRILW--------KSYPETHIICQSYGCTD 291
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2229487482  828 KILytwtpgtllhygraelkTSDHRPVVALIDI 860
Cdd:cd09091    292 DIV-----------------TSDHSPVFGTFEV 307
INPP5c_SHIP2-INPPL1 cd09101
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
525-803 4.19e-39

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol 5-phosphatase-2 and related proteins; This subfamily contains the INPP5c domain of SHIP2 (SH2 domain containing inositol 5-phosphatase-2, also called INPPL1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. SHIP2 catalyzes the dephosphorylation of the PI, phosphatidylinositol 3,4,5-trisphosphate [PI(3,4,5)P3], to phosphatidylinositol 3,4-bisphosphate [PI(3,4)P2]. SHIP2 is widely expressed, most prominently in brain, heart and in skeletal muscle. SHIP2 is an inhibitor of the insulin signaling pathway. It is implicated in actin structure remodeling, cell adhesion and cell spreading, receptor endocytosis and degradation, and in the JIP1-mediated JNK pathway. Its interacting partners include filamin/actin, p130Cas, Shc, Vinexin, Interesectin 1, and c-Jun NH2-terminal kinase (JNK)-interacting protein 1 (JIP1). A large variety of extracellular stimuli appear to lead to the tyrosine phosphorylation of SHIP2, including epidermal growth factor (EGF), platelet-derived growth factor (PDGF), insulin, macrophage colony-stimulating factor (M-CSF) and hepatocyte growth factor (HGF). SHIP2 is localized to the cytosol in quiescent cells; following growth factor stimulation and /or cell adhesion, it relocalizes to membrane ruffles. In addition to this INPP5c domain, SHIP2 has an N-terminal SH2 domain, a C-terminal proline-rich domain (PRD), which includes a WW-domain binding motif (PPLP), an NPXY motif and a sterile alpha motif (SAM) domain. The gene encoding SHIP2 is a candidate for conferring a predisposition for type 2 diabetes; it has been suggested that suppression of SHIP2 may be of benefit in the treatment of obesity and thereby prevent type 2 diabetes. SHIP2 and SHIP1 have little overlap in their in vivo functions.


Pssm-ID: 197335  Cd Length: 304  Bit Score: 148.20  E-value: 4.19e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  525 IRVCVGTWNVNGGKQFRSiafknqtLTDWLLDApklaGIQEFQDKRSK--PTDIFAIGFEEmvelnagnivnaSTTNQKL 602
Cdd:cd09101      1 ISIFIGTWNMGSVPPPKS-------LASWLTSR----GLGKTLDETTVtiPHDIYVFGTQE------------NSVGDRE 57
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  603 WAVELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCS 680
Cdd:cd09101     58 WVDFLRASLKEltDIDYQPIALQCLWNIKMVVLVKPEHENRISHVHTSSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNC 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  681 HFAAGQSQVKERNEDFVEIARKLSFPMGR-------MLFSHdyVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLI 753
Cdd:cd09101    138 HLTSGNEKTHRRNQNYLDILRSLSLGDKQlnafdisLRFTH--LFWFGDLNYRLDMDIQEILNYITRKEFDPLLAVDQLN 215
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2229487482  754 NQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRT-------PAWTDRVLWR 803
Cdd:cd09101    216 LEREKNKVFLRFREEEISFPPTYRYERGSRDTYMWQKQKTtgmrtnvPSWCDRILWK 272
RRM_SYNJ1 cd12719
RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup ...
885-961 1.03e-38

RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-1, also termed synaptojanin, or synaptic inositol-1,4,5-trisphosphate 5-phosphatase 1, originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-1 has two tissue-specific alternative splicing isoforms, synaptojanin-145 expressed in brain and synaptojanin-170 expressed in peripheral tissues. Synaptojanin-145 is very abundant in nerve terminals and may play an essential role in the clathrin-mediated endocytosis of synaptic vesicles. In contrast to synaptojanin-145, synaptojanin-170 contains three unique asparagine-proline-phenylalanine (NPF) motifs in the C-terminal region and may functions as a potential binding partner for Eps15, a clathrin coat-associated protein acting as a major substrate for the tyrosine kinase activity of the epidermal growth factor receptor.


Pssm-ID: 410118  Cd Length: 77  Bit Score: 138.69  E-value: 1.03e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2229487482  885 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 961
Cdd:cd12719      1 DGTVVVSVLSSSPEPNYFDDNLIDALLQQFSSFGEVILIRFVEDKMWVTFLEGSSALAALSLNGTEVLGRTIIISLK 77
RRM_SYNJ cd12440
RNA recognition motif (RRM) found in synaptojanin-1, synaptojanin-2 and similar proteins; This ...
885-961 4.19e-31

RNA recognition motif (RRM) found in synaptojanin-1, synaptojanin-2 and similar proteins; This subfamily corresponds to the RRM of two active phosphatidylinositol phosphate phosphatases, synaptojanin-1 and synaptojanin-2. They have different interaction partners and are likely to have different biological functions. Synaptojanin-1 was originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-2 is a ubiquitously expressed homolog of synaptojanin-1. It is a novel Rac1 effector regulating the early step of clathrin-mediated endocytosis. Synaptojanin-2 directly and specifically interacts with Rac1 in a GTP-dependent manner. It mediates the inhibitory effect of Rac1 on endocytosis and plays an important role in the Rac1-mediated control of cell growth. Both, synaptojanin-1 and synaptojanin-2, have two tissue-specific alternative splicing isoforms, a shorter isoform expressed in brain and a longer isoform in peripheral tissues. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-2 shows high sequence homology to the N-terminal Sac1p homology domain, the central inositol 5-phosphatase domain, the putative RNA recognition motif (RRM) of synaptojanin-1, but differs in the proline-rich region.


Pssm-ID: 409874 [Multi-domain]  Cd Length: 77  Bit Score: 117.14  E-value: 4.19e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2229487482  885 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 961
Cdd:cd12440      1 DATVVVSLDSKSEEWNEFEDALIGELLRVLASYGDVVLVRFAHEGMLVTFRDGRSALAALALNGKQILGRTLKIRLK 77
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1002-1409 9.22e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.71  E-value: 9.22e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1002 DMEGDVDDySAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQ 1081
Cdd:pfam03154  153 DNESDSDS-SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1082 PAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQrppppsgrnQPSPQAGLAGPGP-------AGYGAARPTIPARAGVISA 1154
Cdd:pfam03154  232 QQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSP---------QPLPQPSLHGQMPpmphslqTGPSHMQHPVPPQPFPLTP 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1155 PQSQARVCAGrPTPDSQSKPSETLKGPAVLPEPLKPQAafPQQPSLPtPAqklqdplvPIAAPTM--PPSGPQPNLETPP 1232
Cdd:pfam03154  303 QSSQSQVPPG-PSPAAPGQSQQRIHTPPSQSQLQSQQP--PREQPLP-PA--------PLSMPHIkpPPTTPIPQLPNPQ 370
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1233 QPPPRSRSSQSLPsdsspqLQVKINgisgVKQEPTLKsdPFEDLSlsvlavSKAQPSVQISPvltpdpkmlIQLPSASQS 1312
Cdd:pfam03154  371 SHKHPPHLSGPSP------FQMNSN----LPPPPALK--PLSSLS------THHPPSAHPPP---------LQLMPQSQQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1313 QVNPlssvscmPTRPPGPEESKSQESMGSSANPFPSL---PCRNPFTDRTAAPGNPfrvqsqeseaTSWLSKEEPVPNSp 1389
Cdd:pfam03154  424 LPPP-------PAQPPVLTQSQSLPPPAASHPPTSGLhqvPSQSPFPQHPFVPGGP----------PPITPPSGPPTST- 485
                          410       420
                   ....*....|....*....|
gi 2229487482 1390 fPPLMPLSHDTSKASSSLGG 1409
Cdd:pfam03154  486 -SSAMPGIQPPSSASVSSSG 504
RRM_SYNJ2 cd12720
RNA recognition motif (RRM) found in synaptojanin-2 and similar proteins; This subgroup ...
885-958 1.28e-10

RNA recognition motif (RRM) found in synaptojanin-2 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-2, also termed synaptic inositol-1,4,5-trisphosphate 5-phosphatase 2, an ubiquitously expressed central regulatory enzyme in the phosphoinositide-signaling cascade. As a novel Rac1 effector regulating the early step of clathrin-mediated endocytosis, synaptojanin-2 acts as a polyphosphoinositide phosphatase directly and specifically interacting with Rac1 in a GTP-dependent manner. It mediates the inhibitory effect of Rac1 on endocytosis and plays an important role in the Rac1-mediated control of cell growth. Synaptojanin-2 shows high sequence homology to the N-terminal Sac1p homology domain, the central inositol 5-phosphatase domain, the putative RNA recognition motif (RRM) of synaptojanin-1, but differs in the proline-rich region.


Pssm-ID: 410119 [Multi-domain]  Cd Length: 78  Bit Score: 59.03  E-value: 1.28e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482  885 DGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITI 958
Cdd:cd12720      1 DATVVVNLLSpTLEEKNDFPEDLSTELVQCFQSYGTVILVRFNRGQMLVTFEDSRSALRVLDLDGIKVNGRAVKI 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
1052-1412 8.07e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 8.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1052 SAPSLPIRPsRAPSRTPGPPSSQGSPVDTQPAAqkdssqtlepkrpppprpvapparpappqrPPPPSGRNQPSPQAGLA 1131
Cdd:PHA03247  2590 DAPPQSARP-RAPVDDRGDPRGPAPPSPLPPDT------------------------------HAPDPPPPSPSPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1132 GPG--------------PAGYGAARP---TIPARAGVISAPQSQARVCAGRPT---------PDSQSKPSETLKGPAV-- 1183
Cdd:PHA03247  2639 DPHppptvppperprddPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTvgsltsladPPPPPPTPEPAPHALVsa 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1184 LPEPLKPQA---AFPQQPSLPTPAQKLQDPLVPI-----------------AAPTMPPSGPQPNLeTPPQPPPRSRSSQS 1243
Cdd:PHA03247  2719 TPLPPGPAAarqASPALPAAPAPPAVPAGPATPGgparparppttagppapAPPAAPAAGPPRRL-TRPAVASLSESRES 2797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1244 LPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSLSVLAVSKAQPSVQISPVLTPDPKMliqLPSASQSQVNPLSSVSCM 1323
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV---APGGDVRRRPPSRSPAAK 2874
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1324 PTRPPGPEESK-SQESMGSSANPFPsLPCRNPFTDRTAAPGNPFRVQSQESEATSWLSKEEPvPNSPFPPLMPLShDTSK 1402
Cdd:PHA03247  2875 PAAPARPPVRRlARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP-PPRPQPPLAPTT-DPAG 2951
                          410
                   ....*....|
gi 2229487482 1403 ASSSLGGFED 1412
Cdd:PHA03247  2952 AGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
1019-1407 9.65e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 9.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1019 PQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPeySAPSLPIRPSrAPSRTPGPPSSQGSPVDTQPAAQKDSSQtlepkrpp 1098
Cdd:PHA03247  2712 PHALVSATPLPPGPAAARQASPALPAAPAPP--AVPAGPATPG-GPARPARPPTTAGPPAPAPPAAPAAGPP-------- 2780
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1099 pPRPVAPPARPAPPQRPPPPSGRNQPSPQAGLAGPGPAGYGAARPTIParagvisAPQSQARVCAGRPTPDSQSKPSETL 1178
Cdd:PHA03247  2781 -RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP-------LPPPTSAQPTAPPPPPGPPPPSLPL 2852
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1179 KGpAVLP-------EPLKPQAAFPQQPSLPtPAQKLQDPLVPIAAPT--MPPSGPQPNLETPPQPPPRSRSSQSLPSDSS 1249
Cdd:PHA03247  2853 GG-SVAPggdvrrrPPSRSPAAKPAAPARP-PVRRLARPAVSRSTESfaLPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1250 PQLQvkingisgvkqePTLKSDPFEDLSLSVLAVSKAQPSV------QISPVLTPDPKMLIqlPSASQSQVNPLSSVSCm 1323
Cdd:PHA03247  2931 PPPP------------PPPRPQPPLAPTTDPAGAGEPSGAVpqpwlgALVPGRVAVPRFRV--PQPAPSREAPASSTPP- 2995
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1324 PTRPPGPEESKSQESMG----SSANPFPSLPCRNPFTDRTAAPGNPFRVQSQESEATSWLSKEEPVPNSPF--PPLMPLS 1397
Cdd:PHA03247  2996 LTGHSLSRVSSWASSLAlheeTDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFahEPDPATP 3075
                          410
                   ....*....|
gi 2229487482 1398 HDTSKASSSL 1407
Cdd:PHA03247  3076 EAGARESPSS 3085
PHA03247 PHA03247
large tegument protein UL36; Provisional
1046-1405 1.97e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 1.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1046 PTVPEYSAPSLPIRPSRAPSRTPgPPSSQGSPVDTQPAAQKDSS--QTLEPKRPPPPRPVAPPARPAPPQRPPPPSGRNQ 1123
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAP-HALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPagygaarPTIPARAGvisAPQSQARVCAgrPTPDSQSKPSETLKGPAvlpePLKPQAAFPQQPSLPTP 1203
Cdd:PHA03247  2769 PAPPAAPAAGPP-------RRLTRPAV---ASLSESRESL--PSPWDPADPPAAVLAPA----AALPPAASPAGPLPPPT 2832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1204 AqklqdplvPIAAPTMPPSGPQPNLETppqppprsrssqsLPSDSSPQLQVKINGISG-VKQEPTLKSDPfedlSLSVLA 1282
Cdd:PHA03247  2833 S--------AQPTAPPPPPGPPPPSLP-------------LGGSVAPGGDVRRRPPSRsPAAKPAAPARP----PVRRLA 2887
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1283 VSKAQPSVQISPVLTPDPKMLIQLPSASQSQVNPLSSVSCMPT---RPPGPEESKSQESMGSSANPFPSLPCRNPFTDrT 1359
Cdd:PHA03247  2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpppPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-A 2966
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 2229487482 1360 AAPGNPFRVQSQESEATSwlSKEEPVPNSPFPPLMPLSHDTSKASS 1405
Cdd:PHA03247  2967 LVPGRVAVPRFRVPQPAP--SREAPASSTPPLTGHSLSRVSSWASS 3010
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1124-1427 5.29e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 5.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPdsQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:pfam17823  131 PAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAP--RTAASSTTAASSTTAASSAPTTAASSAPATLTP 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1204 AQKLQDPLVPIAAPTM-------PPSGPQPNLETPPQPPPRSRS--SQSLPSDSSPQLQVKINGISGVKQEPT-LKSDPF 1273
Cdd:pfam17823  209 ARGISTAATATGHPAAgtalaavGNSSPAAGTVTAAVGTVTPAAlaTLAAAAGTVASAAGTINMGDPHARRLSpAKHMPS 288
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1274 EDLSLSVLAVSKAQ---PSVQIS---PVL------TPDPKMLIQLPSASQSQVNPLSSVSCM-------PTRPPGP--EE 1332
Cdd:pfam17823  289 DTMARNPAAPMGAQaqgPIIQVStdqPVHntagepTPSPSNTTLEPNTPKSVASTNLAVVTTtkaqakePSASPVPvlHT 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1333 SKSQESMGSSANPFPSlPCrnPFTDRTAAPGNPFRVQSQESEATSWLSKEEPVPNSPFPPLMPlSHDTSKASSslggfed 1412
Cdd:pfam17823  369 SMIPEVEATSPTTQPS-PL--LPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDPKTL-AMASCQLST------- 437
                          330
                   ....*....|....*
gi 2229487482 1413 nfdlQSQSTVKTSNP 1427
Cdd:pfam17823  438 ----QGQYLVVTTDP 448
EEP cd08372
Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; This large superfamily includes ...
632-729 1.18e-06

Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; This large superfamily includes the catalytic domain (exonuclease/endonuclease/phosphatase or EEP domain) of a diverse set of proteins including the ExoIII family of apurinic/apyrimidinic (AP) endonucleases, inositol polyphosphate 5-phosphatases (INPP5), neutral sphingomyelinases (nSMases), deadenylases (such as the vertebrate circadian-clock regulated nocturnin), bacterial cytolethal distending toxin B (CdtB), deoxyribonuclease 1 (DNase1), the endonuclease domain of the non-LTR retrotransposon LINE-1, and related domains. These diverse enzymes share a common catalytic mechanism of cleaving phosphodiester bonds; their substrates range from nucleic acids to phospholipids and perhaps proteins.


Pssm-ID: 197306 [Multi-domain]  Cd Length: 241  Bit Score: 51.33  E-value: 1.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  632 VFIRPqhaPFIRDVAVDTVKTGMGgATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFpmgRML 711
Cdd:cd08372     71 ILSKT---PKFKIVEKHQYKFGEG-DSGERRAVVVKFDVHDKELCVVNAHLQAGGTRADVRDAQLKEVLEFLKR---LRQ 143
                           90
                   ....*....|....*...
gi 2229487482  712 FSHDYVFWCGDFNYRIDL 729
Cdd:cd08372    144 PNSAPVVICGDFNVRPSE 161
PHA03247 PHA03247
large tegument protein UL36; Provisional
1112-1396 2.79e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1112 PQRPPPPSGRNQPSPQAGLAGPGPA-GYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKP 1190
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAvTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1191 QAAFPQQPSLPTPAQKLQDPLVP-------------IAAPTMPPSGPQP--------NLETPPQPPPRSRSSQSLPSDSS 1249
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgrAAQASSPPQRPRRraarptvgSLTSLADPPPPPPTPEPAPHALV 2716
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1250 PQLQVKINGISGVKQEPTLKSDPfedlslsvlaVSKAQPSVQISPVlTPDPKMLIQLPSASQSQVNPLSSVSCMP---TR 1326
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAP----------APPAVPAGPATPG-GPARPARPPTTAGPPAPAPPAAPAAGPPrrlTR 2785
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2229487482 1327 PPGPEESKSQESMGSSANPFP---SLPCRNPFTDRTAAPGNPfrvqsqESEATSWLSKEEPVPNSPFPPLMPL 1396
Cdd:PHA03247  2786 PAVASLSESRESLPSPWDPADppaAVLAPAAALPPAASPAGP------LPPPTSAQPTAPPPPPGPPPPSLPL 2852
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1022-1224 3.27e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.80  E-value: 3.27e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1022 LQPSSSSGlgtspsssprtspcQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSqgSPVDTQPAAQKDSSQTLEPKRPPPPR 1101
Cdd:PRK12323   363 FRPGQSGG--------------GAGPATAAAAPVAQPAPAAAAPAAAAPAPA--APPAAPAAAPAAAAAARAVAAAPARR 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1102 PVAPPARPAPPQRPPPPSGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSqSKPSETLKGP 1181
Cdd:PRK12323   427 SPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDD-PPPWEELPPE 505
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2229487482 1182 AVLPEPLKPQAAFPQQPSLPTPAQKLQDPLVPIAAPTMPPSGP 1224
Cdd:PRK12323   506 FASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA 548
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1045-1226 3.45e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 3.45e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1045 SPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGR--- 1121
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKagg 675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1122 ----NQPSPQAGLAGPGPAGyGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQ 1197
Cdd:PRK07764   676 aapaAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGA 754
                          170       180
                   ....*....|....*....|....*....
gi 2229487482 1198 PSLPTPAQKLQDPLVPIAAPTMPPSGPQP 1226
Cdd:PRK07764   755 PAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
PHA03247 PHA03247
large tegument protein UL36; Provisional
1052-1347 6.86e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 6.86e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1052 SAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTlepkrpppprPVAPPARPAPPQRPPPPSGRNQPSPQAGLA 1131
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS----------AQPTAPPPPPGPPPPSLPLGGSVAPGGDVR 2863
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1132 GPGPAGYGAARPTIPARAGVISAPQSQArvcagRPTPDSQSKPSEtlkGPAVLPEPLKPQAAFPQQPSLPTPAQKLQDPL 1211
Cdd:PHA03247  2864 RRPPSRSPAAKPAAPARPPVRRLARPAV-----SRSTESFALPPD---QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP 2935
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1212 VPIAAPTMPP-SGPQPNLETPPQPPPRSRSSQSLPSDSSPQLQVKINGISgvKQEPTLKSDPFEDLSLSvlAVSKAQPSV 1290
Cdd:PHA03247  2936 PPRPQPPLAPtTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS--REAPASSTPPLTGHSLS--RVSSWASSL 3011
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1291 QISPVLTPDPKMLIQ---LPS-----------ASQSQVNPLSSVSCMPTRP---------PGPEESKSQESMGSSANPFP 1347
Cdd:PHA03247  3012 ALHEETDPPPVSLKQtlwPPDdtedsdadslfDSDSERSDLEALDPLPPEPhdpfahepdPATPEAGARESPSSQFGPPP 3091
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1052-1409 9.92e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 50.30  E-value: 9.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1052 SAPSLPIRPSRAPSRTPGPPSSQGSPVD-TQPAAQKDSSQTLEPKRPPPPRPVAPPARPAppqrppppsgrnqPSPQagl 1130
Cdd:pfam05109  430 TSPTLNTTGFAAPNTTTGLPSSTHVPTNlTAPASTGPTVSTADVTSPTPAGTTSGASPVT-------------PSPS--- 493
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1131 agPGPAGYGAARPTIPARAGVISAPQSQARV---CAGRPTPDSQSkPSETLKGPAVLPEPLKPQAAFPqQPSLPTPAQKL 1207
Cdd:pfam05109  494 --PRDNGTESKAPDMTSPTSAVTTPTPNATSptpAVTTPTPNATS-PTLGKTSPTSAVTTPTPNATSP-TPAVTTPTPNA 569
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1208 QDPLVPIAAPTMPPSGPQPNLETppqppprsrssqSLPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSlsvlAVSKAQ 1287
Cdd:pfam05109  570 TIPTLGKTSPTSAVTTPTPNATS------------PTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS----AVTTGQ 633
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1288 PSVQISPV--LTPDPKMLIQLPSASQSQvNPLSSVSCMPTRPPGPEESKSQESMGSSANPFPSLPCRNP---FTDRTAAP 1362
Cdd:pfam05109  634 HNITSSSTssMSLRPSSISETLSPSTSD-NSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPrpgTTSQASGP 712
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2229487482 1363 GNPfRVQSQESEATswLSKEEPvPNSPFPPLMPLSHDTS---------KASSSLGG 1409
Cdd:pfam05109  713 GNS-STSTKPGEVN--VTKGTP-PKNATSPQAPSGQKTAvptvtstggKANSTTGG 764
INPP5A cd09092
Type I inositol polyphosphate 5-phosphatase I; Type I inositol polyphosphate 5-phosphatase I ...
669-854 2.81e-05

Type I inositol polyphosphate 5-phosphatase I; Type I inositol polyphosphate 5-phosphatase I (INPP5A) hydrolyzes the 5-phosphate from inositol 1,3,4,5-tetrakisphosphate [I(1,3,4,5)P4] and inositol 1,4,5-trisphosphate [I(1,4,5)P3]. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. As the substrates of INPP5A mobilize intracellular calcium ions, INPP5A is a calcium signal-terminating enzyme. In platelets, phosphorylated pleckstrin binds and activates INPP5A in a 1:1 complex, and accelerates the degradation of the calcium ion-mobilizing I(1,4,5)P3.


Pssm-ID: 197326  Cd Length: 383  Bit Score: 48.23  E-value: 2.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  669 LFHTTSLCFVCSHFAAGQSQVKERNEDFVeIARKLSFPMGRMLFshdYVFwcGDFNYRIDL------------------- 729
Cdd:cd09092    176 LFHDASNLAACESSPSVYSQNRHRALGYV-LERLTDERFEKVPF---FVF--GDFNFRLDTksvvetlcakatmqtvrka 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  730 -PNEEVKELIRQQNWD----------SLIAGDQLINQKNAGQIFRGF-----------LEGKVTFAPTYKYdlfSEDYDT 787
Cdd:cd09092    250 dSNIVVKLEFREKDNDnkvvlqiekkKFDYFNQDVFRDNNGKALLKFdkelevfkdvlYELDISFPPSYPY---SEDPEQ 326
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  788 SE---KCRTPAWTDRVLwrrrkwpFDRSAEDLDLLNasfqDESKILYTwtpgtllHYGRaELKTSDHRPV 854
Cdd:cd09092    327 GTqymNTRCPAWCDRIL-------MSHSARELKSEN----EEKSVTYD-------MIGP-NVCMGDHKPV 377
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1124-1318 4.08e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 4.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPAGYGAARPtiPARAGVISAPQSQARVCAGRPTPDSQ--SKPSETLKGPAVLPEPLKPQAAFPQQPSLP 1201
Cdd:PRK12323   408 AAPAAAAAARAVAAAPARRS--PAPEALAAARQASARGPGGAPAPAPApaAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1202 TPAQKLQDPLVPiAAPTMPPSGPQPNLETPPQPPPRSRSSQSLPSDSSPQlqvkingisgvkqeptlkSDPFEDLSLSVL 1281
Cdd:PRK12323   486 AAAPAPADDDPP-PWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP------------------DDAFETLAPAPA 546
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 2229487482 1282 AVSKAQPSVQISPVLTPDPkmliqlPSASQSQVNPLS 1318
Cdd:PRK12323   547 AAPAPRAAAATEPVVAPRP------PRASASGLPDMF 577
RRM3_Prp24 cd12298
RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar ...
907-960 7.05e-05

RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar proteins; This subfamily corresponds to the RRM3 of Prp24, also termed U4/U6 snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an RNA-binding protein with four well conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It facilitates U6 RNA base-pairing with U4 RNA during spliceosome assembly. Prp24 specifically binds free U6 RNA primarily with RRMs 1 and 2 and facilitates pairing of U6 RNA bases with U4 RNA bases. Additionally, it may also be involved in dissociation of the U4/U6 complex during spliceosome activation.


Pssm-ID: 409739 [Multi-domain]  Cd Length: 78  Bit Score: 42.63  E-value: 7.05e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482  907 IDELLRQFAHFGEV---ILIRFVEDKM--------WVTFLEGSSALNALSLNGKELLNRTITITL 960
Cdd:cd12298     14 EEALRGIFEKFGEIesiNIPKKQKNRKgrhnngfaFVTFEDADSAESALQLNGTLLDNRKISVSL 78
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1044-1226 1.53e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1044 QSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGRNQ 1123
Cdd:NF033839   285 KEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQagLAGPgpagygaaRPTIPARAGvisAPQSQARVCAGRPTPDSQSKPsETLKgPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:NF033839   365 VKPQ--PEKP--------KPEVKPQPE---TPKPEVKPQPEKPKPEVKPQP-EKPK-PEVKPQPEKPKPEVKPQPEKPKP 429
                          170       180
                   ....*....|....*....|...
gi 2229487482 1204 AQKLQdPLVPIAAPTMPPSGPQP 1226
Cdd:NF033839   430 EVKPQ-PEKPKPEVKPQPEKPKP 451
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1123-1226 1.56e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1123 QPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPT 1202
Cdd:PRK07764   398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPT 477
                           90       100
                   ....*....|....*....|....
gi 2229487482 1203 PAQklqdplVPIAAPTMPPSGPQP 1226
Cdd:PRK07764   478 AAP------APAPPAAPAPAAAPA 495
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
908-958 1.70e-04

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 41.50  E-value: 1.70e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2229487482  908 DELLRQFAHFGEVILIRFVEDKM-------WVTFLEGSSALNALS-LNGKELLNRTITI 958
Cdd:cd00590     13 EDLRELFSKFGEVVSVRIVRDRDgkskgfaFVEFESPEDAEKALEaLNGTELGGRPLKV 71
PHA03247 PHA03247
large tegument protein UL36; Provisional
1046-1512 2.27e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1046 PTVPEYSA-PSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKrpppprpvapparpappqrppppSGRNQP 1124
Cdd:PHA03247  2589 PDAPPQSArPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPD-----------------------PHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1125 SPQAGLAGPGPAGYGAARP---TIPARAGVISAPQSQARVCAGRPTPDSQSKPSETlKGPAVLPEPlKPQAAFPQQPSLP 1201
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-PPPPPTPEP-APHALVSATPLPP 2723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1202 TPAQKLQDPLVPIAAPTMPPSGPQPNLETPPQPPPRSRSSQSLPSDSSPQLQVKingisgvKQEPTLKSDPFEDLSLSVL 1281
Cdd:PHA03247  2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA-------GPPRRLTRPAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1282 AV-SKAQPSVQISPVLTPDPKMLIQLPSASQSQvNPLSSVSCMPTRPPGPeeSKSQESMGSSANPFPSL----PCRNPFT 1356
Cdd:PHA03247  2797 SLpSPWDPADPPAAVLAPAAALPPAASPAGPLP-PPTSAQPTAPPPPPGP--PPPSLPLGGSVAPGGDVrrrpPSRSPAA 2873
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1357 DRTAAPGNPFRVQSQESEATSWLSKEEPvPNSPFPPLMPlshdtskassslggfednfDLQSQSTVKTSNPkgwVTFDED 1436
Cdd:PHA03247  2874 KPAAPARPPVRRLARPAVSRSTESFALP-PDQPERPPQP-------------------QAPPPPQPQPQPP---PPPQPQ 2930
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1437 DNFPTTGKSKSVCPDLVGNAPAS-----FDDDWSKGASVSFCVLPARRPPPPPPPVPLLPPGTTSSAGPSTTLPSKAPST 1511
Cdd:PHA03247  2931 PPPPPPPRPQPPLAPTTDPAGAGepsgaVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010

                   .
gi 2229487482 1512 L 1512
Cdd:PHA03247  3011 L 3011
PHA03247 PHA03247
large tegument protein UL36; Provisional
1125-1512 2.30e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 2.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1125 SPQAGlaGPGPAGYGAARPTIPARagviSAPQSQarvCAGRPT-PDSQSKPSEtlkgPAVLPEPLKPQAafPQQPSLPTP 1203
Cdd:PHA03247  2545 SDDAG--DPPPPLPPAAPPAAPDR----SVPPPR---PAPRPSePAVTSRARR----PDAPPQSARPRA--PVDDRGDPR 2609
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1204 AQKLQDPLVP-IAAPTMPPSGPQPnletppqpppRSRSSQSLPSDSSPQLQVKINGISGVKqeptlksdpfedLSLSVLA 1282
Cdd:PHA03247  2610 GPAPPSPLPPdTHAPDPPPPSPSP----------AANEPDPHPPPTVPPPERPRDDPAPGR------------VSRPRRA 2667
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1283 VSKAQPSVQISPVLTPDPKmliqlpsASQSQVNPLSSVSCMPTRPPGPEES-----------KSQESMGSSANPFPSLPC 1351
Cdd:PHA03247  2668 RRLGRAAQASSPPQRPRRR-------AARPTVGSLTSLADPPPPPPTPEPAphalvsatplpPGPAAARQASPALPAAPA 2740
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1352 RNPFTDRTAAPGNPFRVQSQESEATswlskeepvPNSPFPPLMPLSHDTSKASSSLGGfednfdlqSQSTVKTSNPKGWV 1431
Cdd:PHA03247  2741 PPAVPAGPATPGGPARPARPPTTAG---------PPAPAPPAAPAAGPPRRLTRPAVA--------SLSESRESLPSPWD 2803
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1432 TFDeddnfpttgksksvcPDLVGNAPASFDDDWSKGASVSfcvlparrpppppppvpllPPGTTSSAGPSTTLPSKAPST 1511
Cdd:PHA03247  2804 PAD---------------PPAAVLAPAAALPPAASPAGPL-------------------PPPTSAQPTAPPPPPGPPPPS 2849

                   .
gi 2229487482 1512 L 1512
Cdd:PHA03247  2850 L 2850
PHA03378 PHA03378
EBNA-3B; Provisional
1045-1227 3.92e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.06  E-value: 3.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1045 SPTVPEYSAPSLPIRPSRAPSRTPGPpssQGSPVDTQPAAqkdssqtlepkrpppprpvappARPAPPQRPPPPSGRNQP 1124
Cdd:PHA03378   702 TPMRPPAAPPGRAQRPAAATGRARPP---AAAPGRARPPA----------------------AAPGRARPPAAAPGRARP 756
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1125 SPQAGLAGPGPAGY-GAARPTIPARAGviSAPQSQARvCAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQpsLPTP 1203
Cdd:PHA03378   757 PAAAPGRARPPAAApGAPTPQPPPQAP--PAPQQRPR-GAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQ--LLTG 831
                          170       180
                   ....*....|....*....|....*.
gi 2229487482 1204 AQKLQDPLV--PIAAPTMPPSGPQPN 1227
Cdd:PHA03378   832 GVKRGRPSLkkPAALERQAAAGPTPS 857
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1143-1511 7.22e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 7.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1143 PTIPARAGVISAPQSqarvcaGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTPAQKLqdplVPIAAPTMPPS 1222
Cdd:pfam03154  172 PVLQAQSGAASPPSP------PPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL----IQQTPTLHPQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1223 GPQPNletppqppprsrssqslpsdssPQLQvkingisGVKQEPTLKSDPFEDLSLSVLAvSKAQP---SVQISPVLTPD 1299
Cdd:pfam03154  242 LPSPH----------------------PPLQ-------PMTQPPPPSQVSPQPLPQPSLH-GQMPPmphSLQTGPSHMQH 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1300 P---KMLIQLPSASQSQVNPLSSVSC------MPTRPPgpEESKSQESMGSSANPFPSLPCRNPFTDRTAAPGNPFRVQS 1370
Cdd:pfam03154  292 PvppQPFPLTPQSSQSQVPPGPSPAApgqsqqRIHTPP--SQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNP 369
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1371 QESEATSWLSKEEPV---PNSPFPP-LMPLSHDTSKASSSlgGFEDNFDLQSQSTVKTSNPKGWVTFDEDDNFPTTGKSK 1446
Cdd:pfam03154  370 QSHKHPPHLSGPSPFqmnSNLPPPPaLKPLSSLSTHHPPS--AHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASH 447
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482 1447 SVCPDLVGNAPASfdddwskgasvSFCVLPARRPPPPPPPVPLLPPGTTSSAGPSTTLPSKAPST 1511
Cdd:pfam03154  448 PPTSGLHQVPSQS-----------PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVS 501
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1123-1362 1.04e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1123 QPSPQAGLAGPGPAGYGAarptiPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPlkpqAAFPQQPSLPT 1202
Cdd:PRK10263   318 EPVAVAAAATTATQSWAA-----PVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAP----EGYPQQSQYAQ 388
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1203 PAQKLQDPLvpiaaptMPPSGPQPNLETPPQPPPRSRSSQSLPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSLSVLA 1282
Cdd:PRK10263   389 PAVQYNEPL-------QQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQST 461
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1283 VSKAQPSVQisPVlTPDPKMLIQLPSASQSQVNPLSSV-SCMPTRPP-------GPEESKSQESMGSSANPFPSlPCRNP 1354
Cdd:PRK10263   462 YQTEQTYQQ--PA-AQEPLYQQPQPVEQQPVVEPEPVVeETKPARPPlyyfeevEEKRAREREQLAAWYQPIPE-PVKEP 537

                   ....*...
gi 2229487482 1355 FTDRTAAP 1362
Cdd:PRK10263   538 EPIKSSLK 545
PHA03379 PHA03379
EBNA-3A; Provisional
1059-1396 1.10e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.51  E-value: 1.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1059 RPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGRNQPSP----QAGLAGPG 1134
Cdd:PHA03379   407 KASEPTYGTPRPPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPlqdlEPGDQLPG 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1135 PAGYGAARPT-IPARAGVISAPQsqarvcagrptpdsQSKPSETlkgPAVLPEPLKPQaAFPQQPsLPTPAQKLQDPLVP 1213
Cdd:PHA03379   487 VVQDGRPACApVPAPAGPIVRPW--------------EASLSQV---PGVAFAPVMPQ-PMPVEP-VPVPTVALERPVCP 547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1214 iAAPTMPPSGPQpnlETPPQPPPRSRSSQSLPSDSSPQLQVKI---NGISGVKQEPTLKSDPFEDLSLSVLAVSKAQPsv 1290
Cdd:PHA03379   548 -APPLIAMQGPG---ETSGIVRVRERWRPAPWTPNPPRSPSQMsvrDRLARLRAEAQPYQASVEVQPPQLTQVSPQQP-- 621
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1291 qISPVLTPDPKML-------------------IQLPSASQSQVNPLSSVSC-------MPTRPPGPEESK-------SQE 1337
Cdd:PHA03379   622 -MEYPLEPEQQMFpgspfsqvadvmraggvpaMQPQYFDLPLQQPISQGAPlaplrasMGPVPPVPATQPqyfdiplTEP 700
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2229487482 1338 -SMGSSANPF--------PSLPCRNPFTDRTAAPG-NPFRVQSQESEATSWLSKEEPVPNSPFPPLMPL 1396
Cdd:PHA03379   701 iNQGASAAHFlpqqpmegPLVPERWMFQGATLSQSvRPGVAQSQYFDLPLTQPINHGAPAAHFLHQPPM 769
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1123-1356 1.22e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1123 QPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKP------SETLKGPAVLPEPLKPQAAFPQ 1196
Cdd:PRK12323   364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAaravaaAPARRSPAPEALAAARQASARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1197 QPSLPTPAQKLQDPLVPIAAPTMPPSGPQPNLETPPQPPPRSRSSQSLPSDSSPqlqvkingisgvkqeptlksdPFEDL 1276
Cdd:PRK12323   444 PGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP---------------------PWEEL 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1277 SLSVLAVSKAQPSVQISPV---LTPDPKMLIQLPSASQSQVNPLSSVSCMPTRPPGPEESKSQESMGSSANP------FP 1347
Cdd:PRK12323   503 PPEFASPAPAQPDAAPAGWvaeSIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPdmfdgdWP 582

                   ....*....
gi 2229487482 1348 SLPCRNPFT 1356
Cdd:PRK12323   583 ALAARLPVR 591
RRM smart00360
RNA recognition motif;
903-958 1.70e-03

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 38.34  E-value: 1.70e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482   903 DDALIDELLRQFAHFGEVILIRFVEDKMW--------VTFLEGSSALNALS-LNGKELLNRTITI 958
Cdd:smart00360    9 PDTTEEELRELFSKFGKVESVRLVRDKETgkskgfafVEFESEEDAEKALEaLNGKELDGRPLKV 73
RRM2_SREK1 cd12260
RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 ...
908-959 3.30e-03

RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 (SREK1) and similar proteins; This subfamily corresponds to the RRM2 of SREK1, also termed serine/arginine-rich-splicing regulatory protein 86-kDa (SRrp86), or splicing factor arginine/serine-rich 12 (SFRS12), or splicing regulatory protein 508 amino acid (SRrp508). SREK1 belongs to a family of proteins containing regions rich in serine-arginine dipeptides (SR proteins family), which is involved in bridge-complex formation and splicing by mediating protein-protein interactions across either introns or exons. It is a unique SR family member and it may play a crucial role in determining tissue specific patterns of alternative splicing. SREK1 can alter splice site selection by both positively and negatively modulating the activity of other SR proteins. For instance, SREK1 can activate SRp20 and repress SC35 in a dose-dependent manner both in vitro and in vivo. In addition, SREK1 contains two (some contain only one) RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and two serine-arginine (SR)-rich domains (SR domains) separated by an unusual glutamic acid-lysine (EK) rich region. The RRM and SR domains are highly conserved among other members of the SR superfamily. However, the EK domain is unique to SREK1. It plays a modulatory role controlling SR domain function by involvement in the inhibition of both constitutive and alternative splicing and in the selection of splice-site.


Pssm-ID: 409705 [Multi-domain]  Cd Length: 85  Bit Score: 38.06  E-value: 3.30e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2229487482  908 DELLRQFAHFGEVILIRFVEDK------MWVTFLEGSSALNALSLNGKELLNRTITIT 959
Cdd:cd12260     19 DQLLEFFSQAGEVKYVRMAGDEtqptryAFVEFAEQTSVINALKLNGKMFGGRPLKVN 76
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1012-1222 4.28e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 4.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1012 AEVEELLPQHLQP--SSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSS 1089
Cdd:PRK07003   420 ATRAEAPPAAPAPpaTADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAP 499
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1090 QTLEPKRPPPPRPVAPPARPAPPQRPPppsgrnQPSPQAglAGPGPAgygAARPtiPARAGVISAPQSQARVCAGRPTPD 1169
Cdd:PRK07003   500 SAATPAAVPDARAPAAASREDAPAAAA------PPAPEA--RPPTPA---AAAP--AARAGGAAAALDVLRNAGMRVSSD 566
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2229487482 1170 SQSKPSETLKGPAVLPEPLKPQAAFPQQPsLPTPAQKLQDPLVPI--AAPTMPPS 1222
Cdd:PRK07003   567 RGARAAAAAKPAAAPAAAPKPAAPRVAVQ-VPTPRARAATGDAPPngAARAEQAA 620
PHA03247 PHA03247
large tegument protein UL36; Provisional
1046-1226 4.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 4.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1046 PTVPEYSAPSLPIRPSR---------APSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPappqrpp 1116
Cdd:PHA03247   279 PPPPEAAAPNGAAAPPDgvwgaalagAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPLGF------- 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1117 ppSGRNQPS--PQAGLAGPGpAGYGAARPTIPARAGVISAPQSQARVCAGrPTPDSQSKPSETLKGPAVLPEPLKPQAAF 1194
Cdd:PHA03247   352 --PKRRRPTwtPPSSLEDLS-AGRHHPKRASLPTRKRRSARHAATPFARG-PGGDDQTRPAAPVPASVPTPAPTPVPASA 427
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2229487482 1195 PQQPSLPTPAQKLQDPlvpiAAPTMPPSGPQP 1226
Cdd:PHA03247   428 PPPPATPLPSAEPGSD----DGPAPPPERQPP 455
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1260-1405 4.77e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 4.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1260 SGVKQEPTLKSDPFEDLSLSVLAVSKAQPSVQISPVLTPDPKMliqlPSASQSQVNPlssvsCMPTRPPGPEESKSQESM 1339
Cdd:PTZ00449   551 ETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEE----PKKPKRPRSA-----QRPTRPKSPKLPELLDIP 621
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2229487482 1340 GSSANP-FPSLPCRNPFTDRTAAPGNPfrvqsqesEATSWLSKEEPvPNSPFPPLMP-----LSHDTSKASS 1405
Cdd:PTZ00449   622 KSPKRPeSPKSPKRPPPPQRPSSPERP--------EGPKIIKSPKP-PKSPKPPFDPkfkekFYDDYLDAAA 684
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1124-1221 5.00e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.33  E-value: 5.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPAGYGAARPTIparagvisAPQSQARVCAGRPTPDSQSKPsETLKGPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVRPTP--------APSTRPKAAAAANIPPKEPVR-ETATPPPVPPRPVAPPVPHTPESAPKLT 432
                           90
                   ....*....|....*...
gi 2229487482 1204 AQKLQDPLVPIAAPTMPP 1221
Cdd:PRK14950   433 RAAIPVDEKPKYTPPAPP 450
PHA03378 PHA03378
EBNA-3B; Provisional
964-1336 5.00e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 5.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  964 DWIKHLEEEMSLEKISVTLPSSASSTLLGEDAEVA--ADFDMEGDvddysaEVEELLPQHLQPSSSSGLGTSPSSSPRts 1041
Cdd:PHA03378   507 DLLEKDDEDMEQRVMATLLPPSPPQPRAGRRAPCVytEDLDIESD------EPASTEPVHDQLLPAPGLGPLQIQPLT-- 578
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1042 pcqSPTVPEY--SAPSLPIRPSRA--PSRTPGPPSSQGSPvdtqpaaqkdssqtlepkrpppPRPVAPPARPAPPQRPPP 1117
Cdd:PHA03378   579 ---SPTTSQLasSAPSYAQTPWPVphPSQTPEPPTTQSHI----------------------PETSAPRQWPMPLRPIPM 633
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1118 PSGRNQPSPQAGLAGPGPAGYGAARPTIP----ARAGVISAPQSQARVCAGRPTpdsQSKPSeTLKGPAVLPEPLKPqaa 1193
Cdd:PHA03378   634 RPLRMQPITFNVLVFPTPHQPPQVEITPYkptwTQIGHIPYQPSPTGANTMLPI---QWAPG-TMQPPPRAPTPMRP--- 706
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1194 fPQQPslPTPAQKLQDPLVPIAAPTMPPSGPQPnletppqppprsrssqslpSDSSPQLQVKINGISGVKQEPTLKSDPf 1273
Cdd:PHA03378   707 -PAAP--PGRAQRPAAATGRARPPAAAPGRARP-------------------PAAAPGRARPPAAAPGRARPPAAAPGR- 763
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2229487482 1274 edlsLSVLAVSKAQPSVQISPVLTPDPKmliQLPSAS-----QSQVNPlSSVSCMPTRPPGPEESKSQ 1336
Cdd:PHA03378   764 ----ARPPAAAPGAPTPQPPPQAPPAPQ---QRPRGAptpqpPPQAGP-TSMQLMPRAAPGQQGPTKQ 823
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
1116-1225 5.82e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 39.25  E-value: 5.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1116 PPPSGRNQPSPQAGLAGP-GPAGYGAARPTIPARAGvisAPQSQARVCAGRPTPDSQSKPSETLKGPAvlpeplkPQAAF 1194
Cdd:pfam15240   68 PPPGGPQQPPPQGGKQKPqGPPPQGGPRPPPGKPQG---PPPQGGNQQQGPPPPGKPQGPPPQGGGPP-------PQGGN 137
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2229487482 1195 PQQPSLPtPAQKLQDPLVPIAAPTMPPSGPQ 1225
Cdd:pfam15240  138 QQGPPPP-PPGNPQGPPQRPPQPGNPQGPPQ 167
PHA03378 PHA03378
EBNA-3B; Provisional
1046-1365 5.83e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.21  E-value: 5.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1046 PTVPEYSAPSLPIRPSRAPSRTPGPPSSQGspvdtqpaaqkdssqtlepkrpppprpvapparpapPQRPPPPS-GRNQP 1124
Cdd:PHA03378   673 PYQPSPTGANTMLPIQWAPGTMQPPPRAPT------------------------------------PMRPPAAPpGRAQR 716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1125 sPQA--GLAGPGPAGYGAARP--TIPARAGVISAPQSQARVCAGRPTPdsQSKPSETLKGPAVLPEPLKPQAAFPQQPSL 1200
Cdd:PHA03378   717 -PAAatGRARPPAAAPGRARPpaAAPGRARPPAAAPGRARPPAAAPGR--ARPPAAAPGAPTPQPPPQAPPAPQQRPRGA 793
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1201 PTPAQKLQDPlvPIAAPTMPpsgPQPNLETPPQPPPRSRSSQSLPSDSSPQLQVKingISGVKQEPT-LKSDPFEDLSLS 1279
Cdd:PHA03378   794 PTPQPPPQAG--PTSMQLMP---RAAPGQQGPTKQILRQLLTGGVKRGRPSLKKP---AALERQAAAgPTPSPGSGTSDK 865
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1280 VLavskaQPSVQISPVLTPdpkmlIQLPSASQSqVNPLSSvscmPTRPPGPEESKSQESMGSSANPFPSLPCRNPFTDRT 1359
Cdd:PHA03378   866 IV-----QAPVFYPPVLQP-----IQVMRQLGS-VRAAAA----STVTQAPTEYTGERRGVGPMHPTDIPPSKRAKTDAY 930

                   ....*.
gi 2229487482 1360 AAPGNP 1365
Cdd:PHA03378   931 VESQPP 936
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1126-1226 6.72e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.91  E-value: 6.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1126 PQAGLAGPGPAGygAARPTIPARAGVISAPQSQARVCAgrptpdSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPtpaq 1205
Cdd:PRK14971   363 TQKGDDASGGRG--PKQHIKPVFTQPAAAPQPSAAAAA------SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVD---- 430
                           90       100
                   ....*....|....*....|.
gi 2229487482 1206 klqdplVPIAAPTMPPSGPQP 1226
Cdd:PRK14971   431 ------PPAAVPVNPPSTAPQ 445
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1124-1399 7.00e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 40.82  E-value: 7.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1124 PSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQArvcAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTP 1203
Cdd:COG5180    163 AKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEK---LDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVDPPST 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1204 AQKLQDPLVPIAAPTMPPSGPQPNLEtppqppprSRSSQSLPSDSSPQLQVKINGISGVKQEPTLKSDPFEDLSlSVLAV 1283
Cdd:COG5180    240 SEARSRPATVDAQPEMRPPADAKERR--------RAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVK-GVASA 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1284 SKAQPSVQISPVL----TPDPKMLIQLPSA---SQSQVNPLSSVSCMPT-RPPGPEESKSQESMGSSANPfpslpcRNPF 1355
Cdd:COG5180    311 PPATRPVRPPGGArdpgTPRPGQPTERPAGvpeAASDAGQPPSAYPPAEeAVPGKPLEQGAPRPGSSGGD------GAPF 384
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2229487482 1356 TDRTAAPGNpfrvQSQESEAtswlskeePVPNSPFPPLMPLSHD 1399
Cdd:COG5180    385 QPPNGAPQP----GLGRRGA--------PGPPMGAGDLVQAALD 416
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1123-1270 8.69e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482 1123 QPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPT 1202
Cdd:PRK14951   374 APAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPA 453
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2229487482 1203 PAQKLQDPLVPIAAPTMPPSGPQPNLETPPQPPPRSRSSQSLPSDSSPQLQVKINGISGVKQEPTLKS 1270
Cdd:PRK14951   454 QAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDVWHATVQQLAAAEAITALARELALQS 521
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1143-1228 9.88e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 38.62  E-value: 9.88e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2229487482  1143 PTIPARAGVIsaPQSQARVCAGRP--TPDSQSKPSetLKGPAvlPEPLKPQAAFPQQPSLPTPAQKLQDPLVPI----AA 1216
Cdd:smart00818   59 PVLPAQQPVV--PQQPLMPVPGQHsmTPTQHHQPN--LPQPA--QQPFQPQPLQPPQPQQPMQPQPPVHPIPPLppqpPL 132
                            90
                    ....*....|..
gi 2229487482  1217 PTMPPSGPQPNL 1228
Cdd:smart00818  133 PPMFPMQPLPPL 144
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH