Conserved domains on  [gi|1889988573|ref|XP_035666550|]
mucin-17-like [Branchiostoma floridae]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
AP-like_stonins_MHD cd09255
Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of ...
1436-1781 4.32e-172

Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of proteins named stonins has been characterized as clathrin-dependent AP-2 mu2 chain related factors, which may act as cargo-specific sorting adaptors in endocytosis. Stonins include stonin 1 and stonin 2, which are the only mammalian homologs of Drosophila stoned B, a presynaptic protein implicated in neurotransmission and synaptic vesicle (SV) recycling. They are conserved from C. elegans to humans, but are not found in prokaryotes or yeasts. This family corresponds to the mu homology domain of stonins, which is distantly related to the C-terminal domain of mu chains among AP complexes. Due to the low degree of sequence conservation of the corresponding binding site, the mu homology domain of stonins is unable to recognize tyrosine-based endocytic sorting signals. To date, little is known about the localization and function of stonin 1. Stonin 2, also known as stoned B, acts as an AP-2-dependent synaptotagmin-specific sorting adaptor for SV endocytosis. Stoned A is not a stonin. It is structurally unrelated to the adaptins and does not appear to have mammalian homologs. It is not included in this family.



Pssm-ID: 271163 [Multi-domain]  Cd Length: 315  Bit Score: 528.13  E-value: 4.32e-172
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1436 YRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRGNEVVRRQDIIPNKSDEW 1515
Cdd:cd09255      1 YRDRGITYREDEITVDVTDEFHGKVTKTGEIKKLGVTVQIHILSFVTGDPECVLGLNDLEVEGREVVRRQDIMPSSTDQW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1516 IKLLNPEFHSCVNKEHFISTRQIKFCPLDGNKFQLMRYKVKSNKKELPLVVKARYINKGPHFELRCDATCTGYVNKKDPE 1595
Cdd:cd09255     81 IKLHNCEFHSCVDVEEFEQSRSIKFHPLDACRFELMRFRTRYNKKNLPLTLKSVVSVKGAHVELRADVRMSGYHSRNPLA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1596 AVPCENIMIRLPVPDLWMKYLRNEtpvGPLGTKSLRSSKKKLPKGSSTFEQYTSPRIEVSVGTAKYEYAFRAIVWKISRL 1675
Cdd:cd09255    161 QVPCENIMIRFPVPESWVPAFRTE---KRFREKSLKSKKNKKASGGSTAESLSEPVIEVSVGSAKYEHAYRAVVWRIDRL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1676 PERNQvvpftgnvpftgkplpftaskekaGAYSTQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSN 1755
Cdd:cd09255    238 PDKNS------------------------AADTPHTFSCRLDLASDLEIPSSTYPHAEVEFTMPSTTASKTTVRSISVSN 293
                          330       340
                   ....*....|....*....|....*.
gi 1889988573 1756 plaaPVIPEKWVRYKAHYSYKVAIER 1781
Cdd:cd09255    294 ----KNIPEKWVRYRAHYSYKVEIEV 315
TFIIA pfam03153
Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a ...
1813-2154 5.72e-52

Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N- and C-terminal domains of the gamma subunit are represented in pfam02268 and pfam02751, respectively. This family represents the precursor that yields both the alpha and beta subunits. The TFIIA heterotrimer is an essential general transcription initiation factor for the expression of genes transcribed by RNA polymerase II. Together with TFIID, TFIIA binds to the promoter region; this is the first step in the formation of a pre-initiation complex (PIC). Binding of the rest of the transcription machinery follows this step. After initiation, the PIC does not completely dissociate from the promoter. Some components, including TFIIA, remain attached and re-initiate a subsequent round of transcription.



Pssm-ID: 460829 [Multi-domain]  Cd Length: 344  Bit Score: 187.63  E-value: 5.72e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1813 LYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVeglnpdvHLVAGGFSHHAQAHTASrIQQSPRGGLPT 1892
Cdd:pfam03153    1 VYRSVIEDVINASRVDFEEEGVDEQVLEELKQLWQSKLSQSKVA-------EFPWDPKPEPPHPPPFV-IKQEPGVTLQP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1893 AGPTGWQVGTQQSNAITTCIHSlvwpSLVHQSLAYAAQlGNGVDLTSAASSGSATVALPQGQVVYQQGQVMRTVAPGLTL 1972
Cdd:pfam03153   73 ASGPQAQLYKNVQAQGAAQRAA----QPLQQQQGSAAQ-SSINALQAGASQQQQALALPQQQPQPQQQQQQQQQQQQQQQ 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1973 PQGQTTMSFPGY------QPGQVYIQQPGGQTIQIQQPQTIVQQpSGSQTQPVQQQGQ--------QAQGQAGQMPSIVQ 2038
Cdd:pfam03153  148 QQQQQQQQQQQEqaqqapQQGGLNNSQTDGADDALEDWEGVLAQ-RRAAGAPEELGRVeidrmlreQIAARAKQMGGGLM 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 2039 VQSLATTTAH--------------PIIQVDGANDTSDEDDDDddddkedeedkddenddAEGEEEEPLNSED-DVSDDDP 2103
Cdd:pfam03153  227 LPLKEALKGKsrakksrkaprataEIAQLDGAGDDSDDEESS-----------------NEDDDEDAINSDLdDPDDDDD 289
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573 2104 TDLFDIDN----VVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:pfam03153  290 DDLEDTDNalghVMLCQYDKVQRVKNKWKCTLKDGVMTVNGKDYVFHKATGEFEW 344
PHA03247 super family cl33720
large tegument protein UL36; Provisional
662-1308 9.49e-12

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 9.49e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  662 QSPTAPGAAPptflpigimqseavEPEKESPhfsPPPDSLPTPIAPAPASMP--SPTEAV----IAPVASTPPLASPQTS 735
Cdd:PHA03247  2487 RFPFAAGAAP--------------DPGGGGP---PDPDAPPAPSRLAPAILPdePVGEPVhprmLTWIRGLEELASDDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  736 PPNPQPPAVTKPSTPKEKAPsakpsqPSKPSAQPS-PAFDAIEAAkllglPGVPVTSAVKkrvplnqlaqakvvtRSPVQ 814
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSVP------PPRPAPRPSePAVTSRARR-----PDAPPQSARP---------------RAPVD 2603
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  815 ETAKSPVEEAIGATPPEsksAHVEEPAASAAQPvdmfedafvpdelNASEKSAPTKSASahfedafvpgglmdtdeggpt 894
Cdd:PHA03247  2604 DRGDPRGPAPPSPLPPD---THAPDPPPPSPSP-------------AANEPDPHPPPTV--------------------- 2646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  895 ksfsanfedafvvqewPEPEQPLSQKSPSThfedsfVPMTEKRPAPEEAKKSTSdfdlfsdmettakPLSMPADPFAStg 974
Cdd:PHA03247  2647 ----------------PPPERPRDDPAPGR------VSRPRRARRLGRAAQASS-------------PPQRPRRRAAR-- 2689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  975 gdaktpelftsaakaatppaamtiDFFGSVAEDVPPPDllggsPEAELKPAMIPTNAFVPKPdeavkqdvlEGQAAAQPe 1054
Cdd:PHA03247  2690 ------------------------PTVGSLTSLADPPP-----PPPTPEPAPHALVSATPLP---------PGPAAARQ- 2730
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1055 tmaidllSSPSQPEAPTSNAVAPATLSAGEVfapsppdaaaaddlfASPGEPAAMPDDPFASKPGDEANTPDLFAGATGA 1134
Cdd:PHA03247  2731 -------ASPALPAAPAPPAVPAGPATPGGP---------------ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1135 AAMSPSIPSPPTAISPSRESLKLeeLDPFAPQKSASKEGTPVSSPPKKTEVVAPLASPEEELNLSPMQHISVADI---KP 1211
Cdd:PHA03247  2789 ASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrrRP 2866
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1212 TAQSPVEDAGLAMGPPLTTIDPFSPKAAVGSPATPPRQTKKKYNPFKAETPTTPPDEEASPFPTVKLPISVVQPLPASPD 1291
Cdd:PHA03247  2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                          650
                   ....*....|....*..
gi 1889988573 1292 TSPLDESFPPKCEEDGW 1308
Cdd:PHA03247  2947 TDPAGAGEPSGAVPQPW 2963
PRK13335 super family cl31400
superantigen-like protein SSL3; Reviewed;
218-322 3.36e-04

superantigen-like protein SSL3; Reviewed;


The actual alignment was detected with superfamily member PRK13335:

Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 45.12  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  218 SQSVPTTPSSPPVAEQVARSPSMQTTRnldSQDEKITQNPQKKQKSDTDTPPLQTSASLEEATRSFTekkapcsakTPTE 297
Cdd:PRK13335    63 TQAANTRQERTPKLEKAPNTNEEKTSA---SKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTEST---------TPKT 130
                           90       100
                   ....*....|....*....|....*
gi 1889988573  298 EPGLPEGSAVEQPAIPTVSVTPHTP 322
Cdd:PRK13335   131 KVTTPPSTNTPQPMQSTKSDTPQSP 155
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
521-769 2.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  521 TPTQPSPT-NPFLADIAAKQQALSPTNPLktqegakkdnfdpfgTQQPSKAVDPFGAEFEGDSFEAEPVTMPDDPFAPKN 599
Cdd:pfam03154  145 SPSIPSPQdNESDSDSSAQQQILQTQPPV---------------LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQ 209
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  600 GDTAVSaskskggindlfaissSKPEQALNLFASSSLDAAEPGKDPFLSPKPaepevdlfniQSPTAPGAAPPTflpigi 679
Cdd:pfam03154  210 GSPATS----------------QPPNQTQSTAAPHTLIQQTPTLHPQRLPSP----------HPPLQPMTQPPP------ 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  680 mqseavePEKESPHFSPPPdSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKP 759
Cdd:pfam03154  258 -------PSQVSPQPLPQP-SLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP 329
                          250
                   ....*....|
gi 1889988573  760 SQPSKPSAQP 769
Cdd:pfam03154  330 SQSQLQSQQP 339
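
The per-hit summary lines and alignment rows in this listing are regular enough to read programmatically. The following is a minimal Python sketch, not part of any NCBI tool: the names `HitSummary`, `parse_summary`, and `aligned_identity` are hypothetical, and the regular expression assumes the summary-line layout exactly as it appears on this page. The identity count also assumes the CD-Search display convention in which lowercase query residues and dashes mark unaligned positions.

```python
import re
from dataclasses import dataclass
from typing import Tuple

# Assumed layout of a hit summary line, e.g.
# "Pssm-ID: 271163 [Multi-domain]  Cd Length: 315  Bit Score: 528.13  E-value: 4.32e-172"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


@dataclass
class HitSummary:
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    evalue: float


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ...' summary line into a HitSummary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m.group("pssm_id")),
        multi_domain=(m.group("flag") or "").lower() == "multi-domain",
        cd_length=int(m.group("cd_length")),
        bit_score=float(m.group("bit_score")),
        evalue=float(m.group("evalue")),
    )


def aligned_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Only columns where both rows hold an uppercase residue are counted;
    gap characters and lowercase (unaligned) residues are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned
```

For example, `parse_summary` applied to the cd09255 summary line above yields a record with `cd_length` 315 and `evalue` 4.32e-172, and hits could then be filtered with a comparison such as `hit.evalue < 1e-5`. The sequence strings passed to `aligned_identity` are the residue columns only, with the `gi ...` or `Cdd:...` label and the start and end coordinates stripped from each row.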
 
Name Accession Description Interval E-value
AP-like_stonins_MHD cd09255
Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of ...
1436-1781 4.32e-172

Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of proteins named stonins has been characterized as clathrin-dependent AP-2 mu2 chain related factors, which may act as cargo-specific sorting adaptors in endocytosis. Stonins include stonin 1 and stonin 2, which are the only mammalian homologs of Drosophila stoned B, a presynaptic protein implicated in neurotransmission and synaptic vesicle (SV) recycling. They are conserved from C. elegans to humans, but are not found in prokaryotes or yeasts. This family corresponds to the mu homology domain of stonins, which is distantly related to the C-terminal domain of mu chains among AP complexes. Due to the low degree of sequence conservation of the corresponding binding site, the mu homology domain of stonins is unable to recognize tyrosine-based endocytic sorting signals. To date, little is known about the localization and function of stonin 1. Stonin 2, also known as stoned B, acts as an AP-2-dependent synaptotagmin-specific sorting adaptor for SV endocytosis. Stoned A is not a stonin. It is structurally unrelated to the adaptins and does not appear to have mammalian homologs. It is not included in this family.


Pssm-ID: 271163 [Multi-domain]  Cd Length: 315  Bit Score: 528.13  E-value: 4.32e-172
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1436 YRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRGNEVVRRQDIIPNKSDEW 1515
Cdd:cd09255      1 YRDRGITYREDEITVDVTDEFHGKVTKTGEIKKLGVTVQIHILSFVTGDPECVLGLNDLEVEGREVVRRQDIMPSSTDQW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1516 IKLLNPEFHSCVNKEHFISTRQIKFCPLDGNKFQLMRYKVKSNKKELPLVVKARYINKGPHFELRCDATCTGYVNKKDPE 1595
Cdd:cd09255     81 IKLHNCEFHSCVDVEEFEQSRSIKFHPLDACRFELMRFRTRYNKKNLPLTLKSVVSVKGAHVELRADVRMSGYHSRNPLA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1596 AVPCENIMIRLPVPDLWMKYLRNEtpvGPLGTKSLRSSKKKLPKGSSTFEQYTSPRIEVSVGTAKYEYAFRAIVWKISRL 1675
Cdd:cd09255    161 QVPCENIMIRFPVPESWVPAFRTE---KRFREKSLKSKKNKKASGGSTAESLSEPVIEVSVGSAKYEHAYRAVVWRIDRL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1676 PERNQvvpftgnvpftgkplpftaskekaGAYSTQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSN 1755
Cdd:cd09255    238 PDKNS------------------------AADTPHTFSCRLDLASDLEIPSSTYPHAEVEFTMPSTTASKTTVRSISVSN 293
                          330       340
                   ....*....|....*....|....*.
gi 1889988573 1756 plaaPVIPEKWVRYKAHYSYKVAIER 1781
Cdd:cd09255    294 ----KNIPEKWVRYRAHYSYKVEIEV 315
TFIIA pfam03153
Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a ...
1813-2154 5.72e-52

Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N- and C-terminal domains of the gamma subunit are represented in pfam02268 and pfam02751, respectively. This family represents the precursor that yields both the alpha and beta subunits. The TFIIA heterotrimer is an essential general transcription initiation factor for the expression of genes transcribed by RNA polymerase II. Together with TFIID, TFIIA binds to the promoter region; this is the first step in the formation of a pre-initiation complex (PIC). Binding of the rest of the transcription machinery follows this step. After initiation, the PIC does not completely dissociate from the promoter. Some components, including TFIIA, remain attached and re-initiate a subsequent round of transcription.


Pssm-ID: 460829 [Multi-domain]  Cd Length: 344  Bit Score: 187.63  E-value: 5.72e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1813 LYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVeglnpdvHLVAGGFSHHAQAHTASrIQQSPRGGLPT 1892
Cdd:pfam03153    1 VYRSVIEDVINASRVDFEEEGVDEQVLEELKQLWQSKLSQSKVA-------EFPWDPKPEPPHPPPFV-IKQEPGVTLQP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1893 AGPTGWQVGTQQSNAITTCIHSlvwpSLVHQSLAYAAQlGNGVDLTSAASSGSATVALPQGQVVYQQGQVMRTVAPGLTL 1972
Cdd:pfam03153   73 ASGPQAQLYKNVQAQGAAQRAA----QPLQQQQGSAAQ-SSINALQAGASQQQQALALPQQQPQPQQQQQQQQQQQQQQQ 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1973 PQGQTTMSFPGY------QPGQVYIQQPGGQTIQIQQPQTIVQQpSGSQTQPVQQQGQ--------QAQGQAGQMPSIVQ 2038
Cdd:pfam03153  148 QQQQQQQQQQQEqaqqapQQGGLNNSQTDGADDALEDWEGVLAQ-RRAAGAPEELGRVeidrmlreQIAARAKQMGGGLM 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 2039 VQSLATTTAH--------------PIIQVDGANDTSDEDDDDddddkedeedkddenddAEGEEEEPLNSED-DVSDDDP 2103
Cdd:pfam03153  227 LPLKEALKGKsrakksrkaprataEIAQLDGAGDDSDDEESS-----------------NEDDDEDAINSDLdDPDDDDD 289
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573 2104 TDLFDIDN----VVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:pfam03153  290 DDLEDTDNalghVMLCQYDKVQRVKNKWKCTLKDGVMTVNGKDYVFHKATGEFEW 344
Adap_comp_sub pfam00928
Adaptor complexes medium subunit family; This family also contains members which are coatomer ...
1434-1777 1.72e-41

Adaptor complexes medium subunit family; This family also contains members which are coatomer subunits.


Pssm-ID: 395742  Cd Length: 259  Bit Score: 154.38  E-value: 1.72e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1434 PAYRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRgnevvrrqdiipnksd 1513
Cdd:pfam00928    1 VPWRPPGIKYKKNEVFLDVIERVSVIVDKDGGLLNSEVQGTIDLKCFLSGMPELRLGLNDKLLL---------------- 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1514 ewIKLLNPEFHSCVNKEHFISTRQIKFCPLDGNkFQLMRYKVKSNKKELPLVVKARYINKGPHFELRCDATCTGYVNKKd 1593
Cdd:pfam00928   65 --IELDDVSFHQCVNLDKFESERVISFIPPDGE-FELMRYRLSTNEVKLPFTVKPIVSVSGDEGRVEIEVKLRSDFPKK- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1594 peaVPCENIMIRLPVPdlwmkylrnetpvgplgtkslrsskkklpkgsstfEQYTSPRIEVSVGTAKYEYAFRAIVWKIS 1673
Cdd:pfam00928  141 ---LTAENVVISIPVP-----------------------------------KEASSPVLRVSDGKAKYDPEENALEWSIK 182
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1674 RLPERNQVVpFTGNVPFTGKplpfTASKEKAGAYSTQTlicridmaseldiptslgphfeVDFTMPLTTASKSIVRSISV 1753
Cdd:pfam00928  183 KIPGGNESS-LSGELELSVE----SSSDDEFPSDPPIS----------------------VEFSIPMFTASGLKVRYLKV 235
                          330       340
                   ....*....|....*....|....*
gi 1889988573 1754 SNPlaaPVIPEKWVRYKAHY-SYKV 1777
Cdd:pfam00928  236 EEE---NYKPYKWVRYVTQSgSYSI 257
TFIIA_alpha_beta_like cd07976
Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A ...
2109-2154 5.39e-28

Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A (TFIIA) is one of the general transcription factors for RNA polymerase II. TFIIA increases the affinity of TATA-binding protein (TBP) for DNA in order to assemble the initiation complex. TFIIA also functions as an activator during development and differentiation, and is involved in transcription from TATA-less promoters. TFIIA is composed of more than one subunit in various organisms. Mammalian TFIIA large subunits (TFIIA alpha and beta) and the smaller subunit (TFIIA gamma) form a heterotrimer. TFIIA alpha and beta are encoded by a single gene (TFIIA_alpha_beta); its protein product is post-translationally processed and cleaved. TOA1 and TOA2 are the two subunits of yeast TFIIA, which correspond to mammalian TFIIA_alpha_beta and TFIIA gamma, respectively. TOA1 and TOA2 form a heterodimeric protein complex. TFIIA_alpha_beta alone is sufficient for transcription in early embryogenesis, but the cleaved forms, TFIIA alpha and TFIIA beta, represent the vast majority of TFIIA in most differentiated cells. The exact functional differences between cleaved and uncleaved forms are not yet clear. This model also contains paralogs of the canonical TFIIA_alpha_beta, such as the human ALF, which may be involved in gametogenesis and early embryogenesis (and is also subject to proteolytic cleavage).


Pssm-ID: 199899 [Multi-domain]  Cd Length: 102  Bit Score: 109.54  E-value: 5.39e-28
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1889988573 2109 IDNVVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:cd07976     57 TENIVLCQYDKVTRVKNKWKCTLKDGIMTLNGKDYVFQKATGEFEW 102
TOA1 COG5149
Transcription initiation factor IIA, large chain [Transcription];
1811-2154 3.89e-12

Transcription initiation factor IIA, large chain [Transcription];


Pssm-ID: 227478 [Multi-domain]  Cd Length: 293  Bit Score: 69.32  E-value: 3.89e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1811 SKLYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVeglnpdvhlvaggfshhaqahTASRIQQSPRGGL 1890
Cdd:COG5149      7 GEVYHHVILDVIANSRSDFEENGVDDATLRELQNLWQSKLVATDVA---------------------TFPWAQAFPIGQL 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1891 ptagptgwqvgtqqsNAITTCIHSLVWPSLVHQSLAYaaqlgngvdlTSAASSGSATVALPQGqvvyqqgqvmrtvapgl 1970
Cdd:COG5149     66 ---------------FGLRTDSLDVTAPAVANSPILN----------QSATNISFDSSAIPNV----------------- 103
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1971 tlpQGQTTMSFPGYQPGQVYIQQPggqtiQIQQPQTIVQQPSGSQTQPVQQQGQQAQGQAGQMpsivqvqSLATTTAHPI 2050
Cdd:COG5149    104 ---QSNNTAPFPSYSSTNQTADSP-----IINDHSTANLKIYGDIIAEVISLPNRLEQVEDEL-------SIGKSAITTL 168
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 2051 IQVDGA----NDTSDEDDDDDDDDKEDEEDKDDENDDAEGEEE---------------EPLNSEDDVSDDDPTDLFDID- 2110
Cdd:COG5149    169 RNTDWRerliDDTQSEWDGERMRRRDGKQGIHQYERLSEGPAHafkgkpttakdegmfSDLDDSDVDSGDSEIEGTKGSt 248
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1889988573 2111 NVVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:COG5149    249 NCMLCLYDKVNMSKGKWKCTFKDGVVSINNIDYVFNKAQGELEW 292
PHA03247 PHA03247
large tegument protein UL36; Provisional
662-1308 9.49e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 9.49e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  662 QSPTAPGAAPptflpigimqseavEPEKESPhfsPPPDSLPTPIAPAPASMP--SPTEAV----IAPVASTPPLASPQTS 735
Cdd:PHA03247  2487 RFPFAAGAAP--------------DPGGGGP---PDPDAPPAPSRLAPAILPdePVGEPVhprmLTWIRGLEELASDDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  736 PPNPQPPAVTKPSTPKEKAPsakpsqPSKPSAQPS-PAFDAIEAAkllglPGVPVTSAVKkrvplnqlaqakvvtRSPVQ 814
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSVP------PPRPAPRPSePAVTSRARR-----PDAPPQSARP---------------RAPVD 2603
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  815 ETAKSPVEEAIGATPPEsksAHVEEPAASAAQPvdmfedafvpdelNASEKSAPTKSASahfedafvpgglmdtdeggpt 894
Cdd:PHA03247  2604 DRGDPRGPAPPSPLPPD---THAPDPPPPSPSP-------------AANEPDPHPPPTV--------------------- 2646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  895 ksfsanfedafvvqewPEPEQPLSQKSPSThfedsfVPMTEKRPAPEEAKKSTSdfdlfsdmettakPLSMPADPFAStg 974
Cdd:PHA03247  2647 ----------------PPPERPRDDPAPGR------VSRPRRARRLGRAAQASS-------------PPQRPRRRAAR-- 2689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  975 gdaktpelftsaakaatppaamtiDFFGSVAEDVPPPDllggsPEAELKPAMIPTNAFVPKPdeavkqdvlEGQAAAQPe 1054
Cdd:PHA03247  2690 ------------------------PTVGSLTSLADPPP-----PPPTPEPAPHALVSATPLP---------PGPAAARQ- 2730
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1055 tmaidllSSPSQPEAPTSNAVAPATLSAGEVfapsppdaaaaddlfASPGEPAAMPDDPFASKPGDEANTPDLFAGATGA 1134
Cdd:PHA03247  2731 -------ASPALPAAPAPPAVPAGPATPGGP---------------ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1135 AAMSPSIPSPPTAISPSRESLKLeeLDPFAPQKSASKEGTPVSSPPKKTEVVAPLASPEEELNLSPMQHISVADI---KP 1211
Cdd:PHA03247  2789 ASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrrRP 2866
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1212 TAQSPVEDAGLAMGPPLTTIDPFSPKAAVGSPATPPRQTKKKYNPFKAETPTTPPDEEASPFPTVKLPISVVQPLPASPD 1291
Cdd:PHA03247  2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                          650
                   ....*....|....*..
gi 1889988573 1292 TSPLDESFPPKCEEDGW 1308
Cdd:PHA03247  2947 TDPAGAGEPSGAVPQPW 2963
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
648-980 5.14e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 48.76  E-value: 5.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  648 SPKPAEPEVDLFNIQSPT----APGAAPPTFLPigimQSEAVEPEKESPHFSPPPDSLPTPIAPAPAsmPSPTEAVIAPV 723
Cdd:pfam05109  460 APASTGPTVSTADVTSPTpagtTSGASPVTPSP----SPRDNGTESKAPDMTSPTSAVTTPTPNATS--PTPAVTTPTPN 533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  724 ASTPPLA--SPQTSPPNPQP------PAVTKPsTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAK-----------LLGL 784
Cdd:pfam05109  534 ATSPTLGktSPTSAVTTPTPnatsptPAVTTP-TPNATIPTLGKTSPTSAVTTPTPNATSPTVGEtspqanttnhtLGGT 612
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  785 PGVPVTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVdmfedafvpDELNASE 864
Cdd:pfam05109  613 SSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPT---------GGENITQ 683
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  865 KSAPTKSASAHFEDAFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPEPEQPLSQKSPSthfedsfvpmTEKRPAPEEAK 944
Cdd:pfam05109  684 VTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPS----------GQKTAVPTVTS 753
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1889988573  945 KSTSDFDLFSDMETTAKPLSMPADPFASTGGDAKTP 980
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTP 789
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
218-322 3.36e-04

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 45.12  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  218 SQSVPTTPSSPPVAEQVARSPSMQTTRnldSQDEKITQNPQKKQKSDTDTPPLQTSASLEEATRSFTekkapcsakTPTE 297
Cdd:PRK13335    63 TQAANTRQERTPKLEKAPNTNEEKTSA---SKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTEST---------TPKT 130
                           90       100
                   ....*....|....*....|....*
gi 1889988573  298 EPGLPEGSAVEQPAIPTVSVTPHTP 322
Cdd:PRK13335   131 KVTTPPSTNTPQPMQSTKSDTPQSP 155
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
521-769 2.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  521 TPTQPSPT-NPFLADIAAKQQALSPTNPLktqegakkdnfdpfgTQQPSKAVDPFGAEFEGDSFEAEPVTMPDDPFAPKN 599
Cdd:pfam03154  145 SPSIPSPQdNESDSDSSAQQQILQTQPPV---------------LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQ 209
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  600 GDTAVSaskskggindlfaissSKPEQALNLFASSSLDAAEPGKDPFLSPKPaepevdlfniQSPTAPGAAPPTflpigi 679
Cdd:pfam03154  210 GSPATS----------------QPPNQTQSTAAPHTLIQQTPTLHPQRLPSP----------HPPLQPMTQPPP------ 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  680 mqseavePEKESPHFSPPPdSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKP 759
Cdd:pfam03154  258 -------PSQVSPQPLPQP-SLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP 329
                          250
                   ....*....|
gi 1889988573  760 SQPSKPSAQP 769
Cdd:pfam03154  330 SQSQLQSQQP 339
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
693-763 5.53e-03

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 41.91  E-value: 5.53e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1889988573  693 HFSPPPDSLPTPIAPAPASMPSPTEAviAPVA-STPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPS 763
Cdd:COG5373     36 ELAEAAEAASAPAEPEPEAAAAATAA--APEAaPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAAS 105
 
Name Accession Description Interval E-value
AP-like_stonins_MHD cd09255
Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of ...
1436-1781 4.32e-172

Mu homology domain (MHD) of adaptor-like proteins (AP-like), stonins; A small family of proteins named stonins has been characterized as clathrin-dependent AP-2 mu2 chain related factors, which may act as cargo-specific sorting adaptors in endocytosis. Stonins include stonin 1 and stonin 2, which are the only mammalian homologs of Drosophila stoned B, a presynaptic protein implicated in neurotransmission and synaptic vesicle (SV) recycling. They are conserved from C. elegans to humans, but are not found in prokaryotes or yeasts. This family corresponds to the mu homology domain of stonins, which is distantly related to the C-terminal domain of mu chains among AP complexes. Due to the low degree of sequence conservation of the corresponding binding site, the mu homology domain of stonins is unable to recognize tyrosine-based endocytic sorting signals. To date, little is known about the localization and function of stonin 1. Stonin 2, also known as stoned B, acts as an AP-2-dependent synaptotagmin-specific sorting adaptor for SV endocytosis. Stoned A is not a stonin. It is structurally unrelated to the adaptins and does not appear to have mammalian homologs. It is not included in this family.


Pssm-ID: 271163 [Multi-domain]  Cd Length: 315  Bit Score: 528.13  E-value: 4.32e-172
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1436 YRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRGNEVVRRQDIIPNKSDEW 1515
Cdd:cd09255      1 YRDRGITYREDEITVDVTDEFHGKVTKTGEIKKLGVTVQIHILSFVTGDPECVLGLNDLEVEGREVVRRQDIMPSSTDQW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1516 IKLLNPEFHSCVNKEHFISTRQIKFCPLDGNKFQLMRYKVKSNKKELPLVVKARYINKGPHFELRCDATCTGYVNKKDPE 1595
Cdd:cd09255     81 IKLHNCEFHSCVDVEEFEQSRSIKFHPLDACRFELMRFRTRYNKKNLPLTLKSVVSVKGAHVELRADVRMSGYHSRNPLA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1596 AVPCENIMIRLPVPDLWMKYLRNEtpvGPLGTKSLRSSKKKLPKGSSTFEQYTSPRIEVSVGTAKYEYAFRAIVWKISRL 1675
Cdd:cd09255    161 QVPCENIMIRFPVPESWVPAFRTE---KRFREKSLKSKKNKKASGGSTAESLSEPVIEVSVGSAKYEHAYRAVVWRIDRL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1676 PERNQvvpftgnvpftgkplpftaskekaGAYSTQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSN 1755
Cdd:cd09255    238 PDKNS------------------------AADTPHTFSCRLDLASDLEIPSSTYPHAEVEFTMPSTTASKTTVRSISVSN 293
                          330       340
                   ....*....|....*....|....*.
gi 1889988573 1756 plaaPVIPEKWVRYKAHYSYKVAIER 1781
Cdd:cd09255    294 ----KNIPEKWVRYRAHYSYKVEIEV 315
AP_stonin-2_MHD cd09263
Mu homology domain (MHD) of adaptor-like protein (AP-like), stonin-2; A small family of ...
1440-1781 4.59e-87

Mu homology domain (MHD) of adaptor-like protein (AP-like), stonin-2; A small family of proteins named stonins has been characterized as clathrin-dependent AP-2 mu2 chain related factors, which may act as cargo-specific sorting adaptors in endocytosis. Stonins include stonin 1 and stonin 2, which are the only mammalian homologs of Drosophila stoned B, a presynaptic protein implicated in neurotransmission and synaptic vesicle (SV) recycling. They are conserved from C. elegans to humans, but are not found in prokaryotes or yeasts. This family corresponds to the mu homology domain of stonin 2, which is distantly related to the C-terminal domain of mu chains among AP complexes. Due to the low degree of sequence conservation of the corresponding binding site, the mu homology domain of stonin-2 is unable to recognize tyrosine-based endocytic sorting signals. It acts as an AP-2-dependent synaptotagmin-specific sorting adaptor for SV endocytosis.


Pssm-ID: 271169  Cd Length: 318  Bit Score: 287.68  E-value: 4.59e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1440 GTTYIEDEITVEVTDEFRGIVAR-TGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRGNEVVRRQDIIPNKSDEWIKL 1518
Cdd:cd09263      5 GLNYTEEEITVDVRDEFYGILSKgDSRILQHLVLTRINMLSFLSGLAECRLGLNDILIKGNEIVSRQDIMPTTTTKWIKL 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1519 LNPEFHSCVNKEHFISTRQIKFCPLDGNKFQLMRYKVKSNKKELPLVVKARYINKGPHFELRC-DATCTGYVNKKDP-EA 1596
Cdd:cd09263     85 RDCRFHECVDEDEFNNSRAILFNPLDACRFELMRFRTVFAEKTLPFTLRTAASVNGAEVEVQSwLVMSTGFSSNRDPlTQ 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1597 VPCENIMIRLPVPDLWMKYLRNETpvgPLGTKSLRSSKKKLPKGSSTFEQYTSPRIEVSVGTAKYEYAFRAIVWKISRLP 1676
Cdd:cd09263    165 VPCENVMIRYPVPEEWVKNFRRES---VLGEKSLKAKVNKGASFGSTSTSGSEPVMRVTLGTAKYEHAFNSIVWRINRLP 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1677 ERNQVvpftgnvpfTGKPlpftaskekagaystQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSNP 1756
Cdd:cd09263    242 DKNSA---------SGHP---------------HCFFCHLELGSDREVPSTFECHVEVEFDMPTTSASKAAVRSISVEDK 297
                          330       340
                   ....*....|....*....|....*
gi 1889988573 1757 LAApvipEKWVRYKAHYSYKVAIER 1781
Cdd:cd09263    298 TDV----RKWVNYSAHYSYQVAVEQ 318
TFIIA pfam03153
Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a ...
1813-2154 5.72e-52

Transcription factor IIA, alpha/beta subunit; Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N- and C-terminal domains of the gamma subunit are represented in pfam02268 and pfam02751, respectively. This family represents the precursor that yields both the alpha and beta subunits. The TFIIA heterotrimer is an essential general transcription initiation factor for the expression of genes transcribed by RNA polymerase II. Together with TFIID, TFIIA binds to the promoter region; this is the first step in the formation of a pre-initiation complex (PIC). Binding of the rest of the transcription machinery follows this step. After initiation, the PIC does not completely dissociate from the promoter. Some components, including TFIIA, remain attached and re-initiate a subsequent round of transcription.


Pssm-ID: 460829 [Multi-domain]  Cd Length: 344  Bit Score: 187.63  E-value: 5.72e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1813 LYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVeglnpdvHLVAGGFSHHAQAHTASrIQQSPRGGLPT 1892
Cdd:pfam03153    1 VYRSVIEDVINASRVDFEEEGVDEQVLEELKQLWQSKLSQSKVA-------EFPWDPKPEPPHPPPFV-IKQEPGVTLQP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1893 AGPTGWQVGTQQSNAITTCIHSlvwpSLVHQSLAYAAQlGNGVDLTSAASSGSATVALPQGQVVYQQGQVMRTVAPGLTL 1972
Cdd:pfam03153   73 ASGPQAQLYKNVQAQGAAQRAA----QPLQQQQGSAAQ-SSINALQAGASQQQQALALPQQQPQPQQQQQQQQQQQQQQQ 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1973 PQGQTTMSFPGY------QPGQVYIQQPGGQTIQIQQPQTIVQQpSGSQTQPVQQQGQ--------QAQGQAGQMPSIVQ 2038
Cdd:pfam03153  148 QQQQQQQQQQQEqaqqapQQGGLNNSQTDGADDALEDWEGVLAQ-RRAAGAPEELGRVeidrmlreQIAARAKQMGGGLM 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 2039 VQSLATTTAH--------------PIIQVDGANDTSDEDDDDddddkedeedkddenddAEGEEEEPLNSED-DVSDDDP 2103
Cdd:pfam03153  227 LPLKEALKGKsrakksrkaprataEIAQLDGAGDDSDDEESS-----------------NEDDDEDAINSDLdDPDDDDD 289
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573 2104 TDLFDIDN----VVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:pfam03153  290 DDLEDTDNalghVMLCQYDKVQRVKNKWKCTLKDGVMTVNGKDYVFHKATGEFEW 344
AP_stonin-1_MHD cd09262
Mu homology domain (MHD) of adaptor-like protein (AP-like), stonin-1 (also called Stoned ...
1443-1781 1.87e-51

Mu homology domain (MHD) of adaptor-like protein (AP-like), stonin-1 (also called Stoned B-like factor); A small family of proteins named stonins has been characterized as clathrin-dependent AP-2 mu2 chain related factors, which may act as cargo-specific sorting adaptors in endocytosis. Stonins include stonin 1 and stonin 2, which are the only mammalian homologs of Drosophila stoned B, a presynaptic protein implicated in neurotransmission and synaptic vesicle (SV) recycling. They are conserved from C. elegans to humans, but are not found in prokaryotes or yeasts. This family corresponds to the mu homology domain of stonin 1, which is distantly related to the C-terminal domain of mu chains among AP complexes. Due to the low degree of sequence conservation of the corresponding binding site, the mu homology domain of stonin-1 is unable to recognize tyrosine-based endocytic sorting signals. To date, little is known about the localization and function of stonin-1.


Pssm-ID: 271168  Cd Length: 314  Bit Score: 185.15  E-value: 1.87e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1443 YIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKqirgnEVVRRQDIIPNKSDE--WIKLLN 1520
Cdd:cd09262      8 YEEQELSLEIVDNFWGKVTKEGKVVESAVITQIYCLCFVNGPGECFLTLNDL-----ELLKRDESYGEKEAGkkWIEILD 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1521 PEFHSCVNKEHFISTRQIKFCPLDGNKFQLMRYKVKSNKKELPLVVKARYINKGPHFELRC--DATCTGYVNKKDPEAVP 1598
Cdd:cd09262     83 CHFHKCVNEQEFEQSRIIKFSPLDACRAELMRFKTAYNGTQLPFSVKATVVVQGAYVELQAflNMASTALSFGVSDSHPL 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1599 CENIMIRLPVPDLWMKYLRNetpVGPLGTKSLRSSKKKLPKGSSTFEQYTSPRIEVSVGTAKYEYAFRAIVWKISRLPER 1678
Cdd:cd09262    163 CENVVIRFPVPAQWIKALWT---MNLQRQKSLKAKMNRRACLGALRETESRPVIQVSVGTAKYESAYSAVVWKIDRLPDK 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1679 NQVVPftgnvpftgkplpftaskekagaySTQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSNPLA 1758
Cdd:cd09262    240 NSSLD------------------------HPHSLSYKLELGSDQEIPSDWYPFATVQFEVMDTCASQTEVKSLGTESDMQ 295
                          330       340
                   ....*....|....*....|...
gi 1889988573 1759 apviPEKWVRYKAHYSYKVAIER 1781
Cdd:cd09262    296 ----PQKHVTQWARYHCQAEFYK 314
Adap_comp_sub pfam00928
Adaptor complexes medium subunit family; This family also contains members which are coatomer ...
1434-1777 1.72e-41

Adaptor complexes medium subunit family; This family also contains members which are coatomer subunits.


Pssm-ID: 395742  Cd Length: 259  Bit Score: 154.38  E-value: 1.72e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1434 PAYRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRgnevvrrqdiipnksd 1513
Cdd:pfam00928    1 VPWRPPGIKYKKNEVFLDVIERVSVIVDKDGGLLNSEVQGTIDLKCFLSGMPELRLGLNDKLLL---------------- 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1514 ewIKLLNPEFHSCVNKEHFISTRQIKFCPLDGNkFQLMRYKVKSNKKELPLVVKARYINKGPHFELRCDATCTGYVNKKd 1593
Cdd:pfam00928   65 --IELDDVSFHQCVNLDKFESERVISFIPPDGE-FELMRYRLSTNEVKLPFTVKPIVSVSGDEGRVEIEVKLRSDFPKK- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1594 peaVPCENIMIRLPVPdlwmkylrnetpvgplgtkslrsskkklpkgsstfEQYTSPRIEVSVGTAKYEYAFRAIVWKIS 1673
Cdd:pfam00928  141 ---LTAENVVISIPVP-----------------------------------KEASSPVLRVSDGKAKYDPEENALEWSIK 182
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1674 RLPERNQVVpFTGNVPFTGKplpfTASKEKAGAYSTQTlicridmaseldiptslgphfeVDFTMPLTTASKSIVRSISV 1753
Cdd:pfam00928  183 KIPGGNESS-LSGELELSVE----SSSDDEFPSDPPIS----------------------VEFSIPMFTASGLKVRYLKV 235
                          330       340
                   ....*....|....*....|....*
gi 1889988573 1754 SNPlaaPVIPEKWVRYKAHY-SYKV 1777
Cdd:pfam00928  236 EEE---NYKPYKWVRYVTQSgSYSI 257
AP_MHD_Cterm cd07954
C-terminal domain of adaptor protein (AP) complexes medium mu subunits and its homologs (MHD); ...
1447-1778 6.27e-31

C-terminal domain of adaptor protein (AP) complexes medium mu subunits and its homologs (MHD); This family corresponds to the C-terminal domain of heterotetrameric AP complexes medium mu subunits and its homologs existing in monomeric stonins, delta-subunit of the heteroheptameric coat protein I (delta-COPI), a protein encoded by a pro-death gene referred to as MuD (also known as MUDENG, mu-2 related death-inducing gene), an endocytic adaptor syp1, the mammalian FCH domain only proteins (FCHo1/2), SH3-containing GRB2-like protein 3-interacting protein 1 (SGIP1), and related proteins. AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. Stonins have been characterized as clathrin-dependent AP-2 mu chain related factors and may act as cargo-specific sorting adaptors in endocytosis. Coat protein complex I (COPI)-coated vesicles function in the early secretory pathway. They mediate the retrograde transport from the Golgi to the ER, and intra-Golgi transport. MuD is distantly related to the C-terminal domain of mu2 subunit of AP-2. It is able to induce cell death by itself and plays an important role in cell death in various tissues. Syp1 represents a novel type of endocytic adaptor protein that participates in endocytosis, promotes vesicle tubulation, and contributes to cell polarity and stress responses. It shares the same domain architecture with its two ubiquitously expressed mammalian counterparts, FCHo1/2, which represent key initial proteins ultimately controlling cellular nutrient uptake, receptor regulation, and synaptic vesicle retrieval. They bind specifically to the plasma membrane and recruit the scaffold proteins eps15 and intersectin, which subsequently engage the adaptor complex AP2 and clathrin, leading to coated vesicle formation.
Another mammalian neuronal-specific protein SGIP1 does have a C-terminal MHD and has been classified into this family as well. It is an endophilin-interacting protein that plays an obligatory role in the regulation of energy homeostasis. It is also involved in clathrin-mediated endocytosis by interacting with phospholipids and eps15.


Pssm-ID: 271157  Cd Length: 245  Bit Score: 123.28  E-value: 6.27e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1447 EITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKqirgnevvrrqdiipnksDEWIKLLNPEFHSC 1526
Cdd:cd07954      1 EVFLDVVEKVNLLISKDGSLLNSEVQGEIALKSFLSGMPEIRLGLNNP------------------DVGIKLDDVSFHPC 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1527 VNKEHFISTRQIKFCPLDGNkFQLMRYKVKSNKKELPLVVKAR----------YINKGPHFELRcdatctgyvnkkdpea 1596
Cdd:cd07954     63 VRLKRFESERVISFIPPDGE-FELMSYRTVEPWSILPITIFPVvseegsqlevVITLKLSESLQ---------------- 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1597 VPCENIMIRLPVPdlwmkylrNETpvgplgtkslrsskkklpkgsstfeqyTSPRIEVSVGTAKYEYAFRAIVWKISRLP 1676
Cdd:cd07954    126 LTAENVEVHIPLP--------SGV---------------------------TSLKSKPSDGQAKFDPEKNALVWRIKRIP 170
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1677 ERnqvvpftgnvpftGKPlpftaskekagaystQTLICRIDMASELDIPTSLGPHFEVDFTMPLTTASKSIVRSISVSNP 1756
Cdd:cd07954    171 VG-------------GKE---------------QSLSAHVELGSLAHECPEEAPPVSVSFEIPETTGSGIQVRSLQVFDE 222
                          330       340
                   ....*....|....*....|..
gi 1889988573 1757 LAAPVIPEKWVRYKAHYSYKVA 1778
Cdd:cd07954    223 KNPGHDPIKWVRYITHTGKYVA 244
TFIIA_alpha_beta_like cd07976
Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A ...
2109-2154 5.39e-28

Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A (TFIIA) is one of the general transcription factors for RNA polymerase II. TFIIA increases the affinity of TATA-binding protein (TBP) for DNA in order to assemble the initiation complex. TFIIA also functions as an activator during development and differentiation, and is involved in transcription from TATA-less promoters. TFIIA is composed of more than one subunit in various organisms. Mammalian TFIIA large subunits (TFIIA alpha and beta) and the smaller subunit (TFIIA gamma) form a heterotrimer. TFIIA alpha and beta are encoded by a single gene (TFIIA_alpha_beta); its protein product is post-translationally processed and cleaved. TOA1 and TOA2 are the two subunits of yeast TFIIA, which correspond to mammalian TFIIA_alpha_beta and TFIIA gamma, respectively. TOA1 and TOA2 form a heterodimeric protein complex. TFIIA_alpha_beta alone is sufficient for transcription in early embryogenesis, but the cleaved forms, TFIIA alpha and TFIIA beta, represent the vast majority of TFIIA in most differentiated cells. The exact functional differences between cleaved and uncleaved forms are not yet clear. This model also contains paralogs of the canonical TFIIA_alpha_beta, such as the human ALF, which may be involved in gametogenesis and early embryogenesis (and is also subject to proteolytic cleavage).


Pssm-ID: 199899 [Multi-domain]  Cd Length: 102  Bit Score: 109.54  E-value: 5.39e-28
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1889988573 2109 IDNVVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:cd07976     57 TENIVLCQYDKVTRVKNKWKCTLKDGIMTLNGKDYVFQKATGEFEW 102
TFIIA_alpha_beta_like cd07976
Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A ...
1811-1864 3.13e-23

Precursor of TFIIA alpha and beta subunits and similar proteins; Transcription factor II A (TFIIA) is one of the general transcription factors for RNA polymerase II. TFIIA increases the affinity of TATA-binding protein (TBP) for DNA in order to assemble the initiation complex. TFIIA also functions as an activator during development and differentiation, and is involved in transcription from TATA-less promoters. TFIIA is composed of more than one subunit in various organisms. Mammalian TFIIA large subunits (TFIIA alpha and beta) and the smaller subunit (TFIIA gamma) form a heterotrimer. TFIIA alpha and beta are encoded by a single gene (TFIIA_alpha_beta); its protein product is post-translationally processed and cleaved. TOA1 and TOA2 are the two subunits of yeast TFIIA, which correspond to mammalian TFIIA_alpha_beta and TFIIA gamma, respectively. TOA1 and TOA2 form a heterodimeric protein complex. TFIIA_alpha_beta alone is sufficient for transcription in early embryogenesis, but the cleaved forms, TFIIA alpha and TFIIA beta, represent the vast majority of TFIIA in most differentiated cells. The exact functional differences between cleaved and uncleaved forms are not yet clear. This model also contains paralogs of the canonical TFIIA_alpha_beta, such as the human ALF, which may be involved in gametogenesis and early embryogenesis (and is also subject to proteolytic cleavage).


Pssm-ID: 199899 [Multi-domain]  Cd Length: 102  Bit Score: 96.05  E-value: 3.13e-23
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1889988573 1811 SKLYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVEGLNPDVH 1864
Cdd:cd07976      2 SKVYESVIEDVINNVREDFEDEGVDESVLQELKQLWEEKLSQSGAASFPWDPKP 55
AP-1_Mu1_Cterm cd09250
C-terminal domain of medium Mu1 subunit in clathrin-associated adaptor protein (AP) complex ...
1437-1770 5.89e-21

C-terminal domain of medium Mu1 subunit in clathrin-associated adaptor protein (AP) complex AP-1; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, AP-2, AP-3, and AP-4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from the different AP complexes exhibits similarity with each other. This family corresponds to the C-terminal domain of heterotetrameric clathrin-associated adaptor protein complex 1 (AP-1) medium mu1 subunit, which includes two closely related homologs, mu1A (encoded by ap1m1) and mu1B (encoded by ap1m2). Mu1A is ubiquitously expressed, but mu1B is expressed exclusively in polarized epithelial cells. AP-1 has been implicated in bi-directional transport between the trans-Golgi network (TGN) and endosomes. It plays an essential role in the formation of clathrin-coated vesicles (CCVs) from the trans-Golgi network (TGN). Epithelial cell-specific AP-1 is also involved in sorting to the basolateral surface of polarized epithelial cells. Recruitment of AP-1 to the TGN membrane is regulated by a small GTPase, ADP-ribosylation factor 1 (ARF1). Phosphorylation/dephosphorylation events can also regulate the function of AP-1. The membrane-anchored cargo molecules can be linked to the outer lattice of CCVs by AP-1. Those cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytotic signals are one of the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. 
These kinds of sorting signals can be recognized by the C-terminal domain of AP-1 mu1 subunit, also known as Y-X-X-Phi signal-binding domain that contains two hydrophobic pockets, one for the tyrosine-binding and one for the bulky hydrophobic residue-binding.
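
The Y-X-X-Phi pattern described above can be written as a simple sequence search. The following is a minimal sketch, not part of the CDD record: the function name `find_yxxphi` and the example tail sequence are hypothetical and invented for illustration, and the regular expression assumes one-letter amino acid codes. A pattern match only marks a candidate site; it says nothing about whether a mu subunit actually binds it.

```python
import re

# Y-X-X-Phi: tyrosine, any two residues, then a bulky hydrophobic residue
# (Leu, Ile, Met, Phe, or Val), written in one-letter amino acid codes.
# The lookahead lets overlapping candidates be reported as well.
YXXPHI = re.compile(r"(?=(Y..[LIMFV]))")


def find_yxxphi(seq):
    """Return (1-based start, motif) for every Y-X-X-Phi candidate in seq."""
    return [(m.start() + 1, m.group(1)) for m in YXXPHI.finditer(seq.upper())]


# Hypothetical cytosolic tail, invented for illustration only.
example_tail = "KRSYQPLAAYTRYYVL"
print(find_yxxphi(example_tail))
```

Positions are reported 1-based so that they line up with the residue numbering used in the alignment blocks of this page.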


Pssm-ID: 271158  Cd Length: 272  Bit Score: 94.98  E-value: 5.89e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1437 RDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAV--EVNIQTlvFITGMPECVLGMNDKQIRGNEVVRRQDiipnKSDE 1514
Cdd:cd09250      7 RPEGIKYKKNEVFLDVIESVNLLVDLNGQVLRSEIvgAIKMRS--YLSGMPELKLGLNDKVLFEATGRSSKG----KAVE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1515 wikLLNPEFHSCVNKEHFISTRQIKFCPLDGnKFQLMRYKVKSNKKelPLV-VKARYINKGPHfelRCDATCTGYVNKKd 1593
Cdd:cd09250     81 ---LEDVKFHQCVRLSRFENDRTISFIPPDG-EFELMSYRLSTQVK--PLIwVEPTVERHSRS---RVEIMVKAKTQFK- 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1594 pEAVPCENIMIRLPVPDLwmkylrnetpvgplgtkslrsskkklpkgsstfeqYTSPRIEVSVGTAKYEYAFRAIVWKIS 1673
Cdd:cd09250    151 -RRSTANNVEIRIPVPPD-----------------------------------ADSPRFKCSAGSVVYAPEKDALLWKIK 194
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1674 RLPERNQvvpFTGNVPFtgkPLPFTASKEKAGAYStqtlicridmaselDIPTSlgphfeVDFTMPLTTASKSIVRSISV 1753
Cdd:cd09250    195 SFPGGKE---FSMRAEF---GLPSIESEEEQGTEK--------------KAPIQ------VKFEIPYFTVSGLQVRYLKI 248
                          330
                   ....*....|....*..
gi 1889988573 1754 SNPLAAPVIPekWVRYK 1770
Cdd:cd09250    249 IEKSGYQALP--WVRYI 263
AP-2_Mu2_Cterm cd09251
C-terminal domain of medium Mu2 subunit in ubiquitously expressed clathrin-associated adaptor ...
1443-1777 2.39e-20

C-terminal domain of medium Mu2 subunit in ubiquitously expressed clathrin-associated adaptor protein (AP) complex AP-2; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, -2, -3, and -4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from the different AP complexes exhibits similarity with each other. This family corresponds to the C-terminal domain of heterotetrameric clathrin-associated adaptor protein complex 2 (AP-2) medium mu2 subunit. Mu2 is ubiquitously expressed in mammals. In higher eukaryotes, AP-2 plays a critical role in clathrin-mediated endocytosis from the plasma membrane in different cells. The membrane-anchored cargo molecules can be linked to the outer lattice of CCVs by AP-2. Those cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytotic signals are one of the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. These kinds of sorting signals can be recognized by the C-terminal domain of AP-2 mu2 subunit, also known as Y-X-X-Phi signal-binding domain that contains two hydrophobic pockets, one for the tyrosine-binding and one for the bulky hydrophobic residue-binding. Since the Y-X-X-Phi binding site is buried in the core structure of AP-2, a phosphorylation induced conformational change is required when the cargo molecules binds to AP-2. In addition, the C-terminal domain of mu2 subunit has been shown to bind other molecules. 
For instance, it can bind phosphoinositides, in particular PI[4,5]P2, which might be involved in the recognition process of the tyrosine-based signals. It can also interact with synaptotagmins, a family of important modulators of calcium-dependent neurosecretion within the synaptic vesicle (SV) membrane. Because many other endocytic adaptors responsible for the biogenesis of synaptic vesicles exist, clathrin-mediated endocytosis can still occur in the absence of AP-2. However, the cells may not survive in the complete absence of clathrin as well as AP-2.
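
The Y-X-X-Phi pattern described above can be expressed as a simple regular expression. The following is a minimal sketch, not part of CDD: the function name `find_yxxphi` is hypothetical, and a sequence match alone says nothing about whether a site is actually recognized by a mu subunit.

```python
import re

# Y, then any two residues, then a bulky hydrophobic residue (Leu, Ile, Met, Phe, Val).
# The lookahead lets overlapping candidate motifs all be reported.
YXXPHI = re.compile(r"(?=(Y[A-Z]{2}[LIMFV]))")

def find_yxxphi(seq):
    """Return (1-based position, motif) for each Y-X-X-Phi candidate in seq."""
    seq = seq.upper()
    return [(m.start() + 1, m.group(1)) for m in YXXPHI.finditer(seq)]

if __name__ == "__main__":
    print(find_yxxphi("AAYQRLAA"))
```

Positions are reported 1-based to match the residue numbering used in the alignments on this page.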


Pssm-ID: 271159  Cd Length: 263  Bit Score: 93.04  E-value: 2.39e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1443 YIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRGNEVvrRQDIIPNKSDEWIKLLNPE 1522
Cdd:cd09251      1 YRKNEVFLDVVESVNLLMSPQGQVLRADVDGVIVMKTYLSGMPECKFGLNDKLVLESEG--KEKSGSKSGKGSVELDDCT 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1523 FHSCVNKEHFISTRQIKFCPLDGnKFQLMRYKVKSNKKeLPLVVKARYINKGPHfelRCDATctgyVNKKD--PEAVPCE 1600
Cdd:cd09251     79 FHQCVRLSKFDSERSISFIPPDG-EFELMRYRVTENIN-LPFRVIPLVKEVGRT---KLEYK----VKIKSnfPPKLLAT 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1601 NIMIRLPVPdlwmkylRNETPVgplgtkslrsskkklpkgsstfeqytspRIEVSVGTAKYEYAFRAIVWKISRlpernq 1680
Cdd:cd09251    150 NVVVRIPVP-------KNTAKV----------------------------TINVSKGKAKYDPEENAIVWKIKK------ 188
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1681 vvpftgnvpFTGKplpftaskekagaySTQTLICRIDMASELDIPTS-LGPHFEVDFTMPLTTASKSIVRSISVSNPLAA 1759
Cdd:cd09251    189 ---------FAGM--------------TESTLSAEVELLSTTSKKKKwSRPPISMDFEVPMFTASGLRVRYLKVFEKSNY 245
                          330
                   ....*....|....*....
gi 1889988573 1760 PVIpeKWVRYKAHY-SYKV 1777
Cdd:cd09251    246 KTV--KWVRYITRAgSYEI 262
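
A pair of alignment rows like the ones above can be reduced to an identity count. This sketch assumes that lowercase letters and dashes mark unaligned positions, which is how we read the CD-Search display; the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, model_row):
    """Return (identical, aligned) column counts for two equal-length rows.

    A column counts as aligned only when both rows show an uppercase residue.
    Lowercase letters and '-' are treated as unaligned (an assumption about
    the display convention, see the note above).
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned

if __name__ == "__main__":
    # Last block of the cd09251 alignment (query residues 1760-1777).
    print(aligned_identity("PVIpeKWVRYKAHY-SYKV", "KTV--KWVRYITRAgSYEI"))
```

Summing the two counts over every block of a hit gives an overall identity fraction for the aligned region.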
AP-1_Mu1B_Cterm cd09259
C-terminal domain of medium Mu1B subunit in epithelial cell-specific clathrin-associated ...
1435-1769 2.38e-16

C-terminal domain of medium Mu1B subunit in epithelial cell-specific clathrin-associated adaptor protein (AP) complex AP-1; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, AP-2, AP-3, and AP-4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from different AP complexes exhibits similarity with each other. This subfamily corresponds to the C-terminal domain of heterotetrameric clathrin-associated adaptor protein complex 1 (AP-1) medium mu1B subunit, encoded by the ap1m2 gene, which is expressed exclusively in polarized epithelial cells. Epithelial cell-specific AP-1 is used to sort proteins to the basolateral plasma membrane, which involves the formation of clathrin-coated vesicles (CCVs) from the trans-Golgi network (TGN). Recruitment of AP-1 to the TGN membrane is regulated by a small GTPase, ADP-ribosylation factor 1 (ARF1). Phosphorylation/dephosphorylation events can also regulate the function of AP-1. The membrane-anchored cargo molecules can be linked to the outer lattice of CCVs by AP-1. Those cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytotic signals are one of the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. These kinds of sorting signals can be recognized by the C-terminal domain of AP-1 mu1B subunit, also known as Y-X-X-Phi signal-binding domain that contains two hydrophobic pockets, one for the tyrosine-binding and one for the bulky hydrophobic residue-binding.
In addition, the AP-1 mu1B subunit mediates the basolateral recycling of low-density lipoprotein receptor (LDLR) and transferrin receptor (TfR) from the sorting endosomes, where the basolateral sorting signal does not belong to the tyrosine-based signals. Thus, the binding site in mu1B subunit of AP-1 for the signals of LDLR and TfR might be distinct from that for Y-X-X-Phi signals.


Pssm-ID: 271167  Cd Length: 268  Bit Score: 81.61  E-value: 2.38e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1435 AYRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIRgnEVVRRQDiipNKSde 1514
Cdd:cd09259      5 SWRSEGIKYKKNEVFIDVIESVNVLVNANGSVLSSEIVGCIKLKVFLSGMPELRLGLNDRVLF--ELTGRDK---NKT-- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1515 wIKLLNPEFHSCVNKEHFISTRQIKFCPLDGNkFQLMRYKVKSNKKelPLVVKARYINKGPHFELRCDATCTGYVNKKDp 1594
Cdd:cd09259     78 -VELEDVKFHQCVRLSRFENDRTISFIPPDGD-FELMSYRLNTQVK--PLIWIESVIEKFSHSRVEIMVKAKGQFKKQS- 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1595 eavPCENIMIRLPVPdlwmkylrnetpvgplgtkslrsskkklpkgsstfEQYTSPRIEVSVGTAKYEYAFRAIVWKISR 1674
Cdd:cd09259    153 ---VANNVEIRVPVP-----------------------------------SDADSPKFKTSVGSAKYVPEKNVVVWSIKS 194
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1675 LPERNQvvpFTGNVPFTgkpLPFTASKEKAGAystqtlicridmaseldiptslgPHFEVDFTMPLTTASKSIVRSISVS 1754
Cdd:cd09259    195 FPGGKE---YLMRAHFG---LPSVENEELEGK-----------------------PPITVKFEIPYFTVSGIQVRYMKII 245
                          330
                   ....*....|....*
gi 1889988573 1755 NPLAAPVIPekWVRY 1769
Cdd:cd09259    246 EKSGYQALP--WVRY 258
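
Each hit on this page carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). If these pages are processed as text, a small parser along the following lines could pull the numbers out. It is only a sketch that assumes the field labels appear exactly as printed here; `parse_summary` is a hypothetical helper, not an NCBI tool.

```python
import re

SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }

if __name__ == "__main__":
    print(parse_summary("Pssm-ID: 271167  Cd Length: 268  Bit Score: 81.61  E-value: 2.38e-16"))
```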
AP-3_Mu3_Cterm cd09252
C-terminal domain of medium Mu3 subunit in adaptor protein (AP) complex AP-3; AP complexes ...
1437-1769 1.48e-14

C-terminal domain of medium Mu3 subunit in adaptor protein (AP) complex AP-3; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, AP-2, AP-3, and AP-4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from the different AP complexes exhibits similarity with each other. This family corresponds to the C-terminal domain of heterotetrameric adaptor protein complex 3 (AP-3) medium mu3 subunit, which includes two closely related homologs, mu3A (P47A, encoded by ap3m1) and mu3B (P47B, encoded by ap3m2). Mu3A is ubiquitously expressed, but mu3B is specifically expressed in neurons and neuroendocrine cells. AP-3 is particularly important for targeting integral membrane proteins to lysosomes and lysosome-related organelles at the trans-Golgi network (TGN) and/or endosomes, such as the yeast vacuole, fly pigment granules and mammalian melanosomes, platelet dense bodies and the secretory lysosomes of cytotoxic T lymphocytes. Unlike AP-1 and AP-2, which function in conjunction with clathrin, a scaffolding protein participating in the formation of coated vesicles, the nature of the outer shell of AP-3-containing coats remains to be elucidated. Membrane-anchored cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytotic signals are one of the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val.
These kinds of sorting signals can be recognized by the C-terminal domain of AP-3 mu3 subunit, also known as Y-X-X-Phi signal-binding domain that contains two hydrophobic pockets, one for the tyrosine-binding and one for the bulky hydrophobic residue-binding.


Pssm-ID: 271160  Cd Length: 251  Bit Score: 75.70  E-value: 1.48e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1437 RDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQirgnevvrrqdiipnksdewi 1516
Cdd:cd09252      4 RRAGVKYTNNEIYFDVVEEIDAIVDKSGKPVSGEVRGEIDCNSRLSGMPDLLLSFNNPR--------------------- 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1517 KLLNPEFHSCVNKEHFISTRQIKFCPLDGnKFQLMRYKVKSNK-KELPLVVKARY-INKGP-HFELRcdatctgyVNKKD 1593
Cdd:cd09252     63 LLDDPSFHPCVRYSRWESERVLSFIPPDG-KFTLMSYRVDLNSlVSLPVYVKPQIsFSGSSgRFEIT--------VGSRQ 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1594 PEAVPCENIMIRLPVPDlwmkylrnetpvgplGTKSLRSSkkklpkgsstfeqytsprieVSVGTAKYEYAFRAIVWKIS 1673
Cdd:cd09252    134 NLGKSIENVVVEIPLPK---------------GVKSLRLT--------------------ASHGSFSFDSSTKTLVWNIG 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1674 RLPErnqvvpftgnvpftgKPLPftaskekagaystqTLICRIDMASELDIPTSLgPHFEVDFTMPLTTASKSIVRSISV 1753
Cdd:cd09252    179 KLTP---------------GKTP--------------TLRGSVSLSSGLEAPSES-PSISVQFKIPGYTPSGLKVDSLDI 228
                          330
                   ....*....|....*.
gi 1889988573 1754 SNplaAPVIPEKWVRY 1769
Cdd:cd09252    229 YN---EKYKPFKGVKY 241
AP-1_Mu1A_Cterm cd09258
C-terminal domain of medium Mu1A subunit in ubiquitously expressed clathrin-associated adaptor ...
1435-1769 2.02e-13

C-terminal domain of medium Mu1A subunit in ubiquitously expressed clathrin-associated adaptor protein (AP) complex AP-1; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, AP-2, AP-3, and AP-4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from the different AP complexes exhibits similarity with each other. This subfamily corresponds to the C-terminal domain of heterotetrameric clathrin-associated adaptor protein complex 1 (AP-1) medium mu1A subunit encoded by ap1m1 gene, which is ubiquitously expressed in all mammalian tissues and cells. AP-1 has been implicated in bidirectional transport between the trans-Golgi network (TGN) and endosomes. It is involved in the formation of clathrin-coated vesicles (CCVs) from the trans-Golgi network (TGN). The ubiquitous AP-1 is recruited to the TGN membrane, as well as to immature secretory granules. Recruitment of AP-1 to the TGN membrane is regulated by a small GTPase, ADP-ribosylation factor 1 (ARF1). Phosphorylation/dephosphorylation events can also regulate the function of AP-1. The membrane-anchored cargo molecules can be linked to the outer lattice of CCVs by AP-1. Those cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytotic signals are one of the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. 
These kinds of sorting signals can be recognized by the C-terminal domain of AP-1 mu1A subunit, also known as Y-X-X-Phi signal-binding domain that contains two hydrophobic pockets, one for the tyrosine-binding and one for the bulky hydrophobic residue-binding.


Pssm-ID: 271166  Cd Length: 270  Bit Score: 72.61  E-value: 2.02e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1435 AYRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKqirgnevVRRQDIIPNKSDE 1514
Cdd:cd09258      6 SWRSEGIKYRKNEVFLDVIESVNLLVSANGNVLRSEIVGSIKMRVYLSGMPELRLGLNDK-------VLFENTGRGKSKS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1515 wIKLLNPEFHSCVNKEHFISTRQIKFCPLDGnKFQLMRYKVKSNKKelPLVVKARYINKGPHFELRCDATCTGYVNKKDp 1594
Cdd:cd09258     79 -VELEDVKFHQCVRLSRFENDRTISFIPPDG-EFELMSYRLNTHVK--PLIWIESVIERHSHSRVEYMIKAKSQFKRRS- 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1595 eavPCENIMIRLPVPDlwmkylrnetpvgplgtkslrsskkklpkgsstfeQYTSPRIEVSVGTAKYEYAFRAIVWKISR 1674
Cdd:cd09258    154 ---TANNVEIHIPVPN-----------------------------------DADSPKFKTTVGSVKYVPENSEIVWSIKS 195
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1675 LPERNQvvpFTGNVPFTgkpLPFTASKEKAGAystqtlicridmaseldiptslgPHFEVDFTMPLTTASKSIVRSISVS 1754
Cdd:cd09258    196 FPGGKE---YLMRAHFG---LPSVESEEKEGR-----------------------PPISVKFEIPYFTTSGIQVRYLKII 246
                          330
                   ....*....|....*
gi 1889988573 1755 NPLAAPVIPekWVRY 1769
Cdd:cd09258    247 EKSGYQALP--WVRY 259
TOA1 COG5149
Transcription initiation factor IIA, large chain [Transcription];
1811-2154 3.89e-12

Transcription initiation factor IIA, large chain [Transcription];


Pssm-ID: 227478 [Multi-domain]  Cd Length: 293  Bit Score: 69.32  E-value: 3.89e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1811 SKLYRSVIDDVINNVREAFLDEQVDEQVLQELKQVWETKLLQSKAVeglnpdvhlvaggfshhaqahTASRIQQSPRGGL 1890
Cdd:COG5149      7 GEVYHHVILDVIANSRSDFEENGVDDATLRELQNLWQSKLVATDVA---------------------TFPWAQAFPIGQL 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1891 ptagptgwqvgtqqsNAITTCIHSLVWPSLVHQSLAYaaqlgngvdlTSAASSGSATVALPQGqvvyqqgqvmrtvapgl 1970
Cdd:COG5149     66 ---------------FGLRTDSLDVTAPAVANSPILN----------QSATNISFDSSAIPNV----------------- 103
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1971 tlpQGQTTMSFPGYQPGQVYIQQPggqtiQIQQPQTIVQQPSGSQTQPVQQQGQQAQGQAGQMpsivqvqSLATTTAHPI 2050
Cdd:COG5149    104 ---QSNNTAPFPSYSSTNQTADSP-----IINDHSTANLKIYGDIIAEVISLPNRLEQVEDEL-------SIGKSAITTL 168
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 2051 IQVDGA----NDTSDEDDDDDDDDKEDEEDKDDENDDAEGEEE---------------EPLNSEDDVSDDDPTDLFDID- 2110
Cdd:COG5149    169 RNTDWRerliDDTQSEWDGERMRRRDGKQGIHQYERLSEGPAHafkgkpttakdegmfSDLDDSDVDSGDSEIEGTKGSt 248
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1889988573 2111 NVVVCQYERINRSKNKWKFHLKDGIMNLGGKDYVFQKATGDAEW 2154
Cdd:COG5149    249 NCMLCLYDKVNMSKGKWKCTFKDGVVSINNIDYVFNKAQGELEW 292
PHA03247 PHA03247
large tegument protein UL36; Provisional
662-1308 9.49e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 9.49e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  662 QSPTAPGAAPptflpigimqseavEPEKESPhfsPPPDSLPTPIAPAPASMP--SPTEAV----IAPVASTPPLASPQTS 735
Cdd:PHA03247  2487 RFPFAAGAAP--------------DPGGGGP---PDPDAPPAPSRLAPAILPdePVGEPVhprmLTWIRGLEELASDDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  736 PPNPQPPAVTKPSTPKEKAPsakpsqPSKPSAQPS-PAFDAIEAAkllglPGVPVTSAVKkrvplnqlaqakvvtRSPVQ 814
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSVP------PPRPAPRPSePAVTSRARR-----PDAPPQSARP---------------RAPVD 2603
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  815 ETAKSPVEEAIGATPPEsksAHVEEPAASAAQPvdmfedafvpdelNASEKSAPTKSASahfedafvpgglmdtdeggpt 894
Cdd:PHA03247  2604 DRGDPRGPAPPSPLPPD---THAPDPPPPSPSP-------------AANEPDPHPPPTV--------------------- 2646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  895 ksfsanfedafvvqewPEPEQPLSQKSPSThfedsfVPMTEKRPAPEEAKKSTSdfdlfsdmettakPLSMPADPFAStg 974
Cdd:PHA03247  2647 ----------------PPPERPRDDPAPGR------VSRPRRARRLGRAAQASS-------------PPQRPRRRAAR-- 2689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  975 gdaktpelftsaakaatppaamtiDFFGSVAEDVPPPDllggsPEAELKPAMIPTNAFVPKPdeavkqdvlEGQAAAQPe 1054
Cdd:PHA03247  2690 ------------------------PTVGSLTSLADPPP-----PPPTPEPAPHALVSATPLP---------PGPAAARQ- 2730
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1055 tmaidllSSPSQPEAPTSNAVAPATLSAGEVfapsppdaaaaddlfASPGEPAAMPDDPFASKPGDEANTPDLFAGATGA 1134
Cdd:PHA03247  2731 -------ASPALPAAPAPPAVPAGPATPGGP---------------ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1135 AAMSPSIPSPPTAISPSRESLKLeeLDPFAPQKSASKEGTPVSSPPKKTEVVAPLASPEEELNLSPMQHISVADI---KP 1211
Cdd:PHA03247  2789 ASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrrRP 2866
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1212 TAQSPVEDAGLAMGPPLTTIDPFSPKAAVGSPATPPRQTKKKYNPFKAETPTTPPDEEASPFPTVKLPISVVQPLPASPD 1291
Cdd:PHA03247  2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                          650
                   ....*....|....*..
gi 1889988573 1292 TSPLDESFPPKCEEDGW 1308
Cdd:PHA03247  2947 TDPAGAGEPSGAVPQPW 2963
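
The first and last model positions in the Cdd rows, together with the Cd Length from the summary line, indicate how much of a domain model a hit spans. The helper below is a hypothetical sketch of that arithmetic; it ignores gaps inside the aligned range, so it is an upper bound on the fraction of model positions actually aligned.

```python
def model_coverage(first, last, cd_length):
    """Fraction of the domain model spanned by positions first..last (1-based, inclusive)."""
    if not (1 <= first <= last <= cd_length):
        raise ValueError("expected 1 <= first <= last <= cd_length")
    return (last - first + 1) / cd_length

if __name__ == "__main__":
    # cd09251: model positions 1-262 of 263.
    print(round(model_coverage(1, 262, 263), 3))
    # PHA03247: model positions 2487-2963 of 3151.
    print(round(model_coverage(2487, 2963, 3151), 3))
```

Under this measure the PHA03247 hit spans only a small part of a very long multi-domain model, whereas the cd09251 hit spans nearly the whole model.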
AP-4_Mu4_Cterm cd09253
C-terminal domain of medium Mu4 subunit in adaptor protein (AP) complex AP-4; AP complexes ...
1446-1777 1.90e-09

C-terminal domain of medium Mu4 subunit in adaptor protein (AP) complex AP-4; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. There are four AP complexes, AP-1, AP-2, AP-3, and AP-4, described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). Each of the four subunits from the different AP complexes exhibits similarity with each other. This family corresponds to the C-terminal domain of heterotetrameric adaptor protein complex 4 (AP-4) medium mu4 subunit. AP-4 plays a role in signal-mediated trafficking of integral membrane proteins in mammalian cells. Unlike other AP complexes, AP-4 is found only in mammals and plants. It is believed to be part of a nonclathrin coat, since it might function independently of clathrin, a scaffolding protein participating in the formation of coated vesicles. Recruitment of AP-4 to the trans-Golgi network (TGN) membrane is regulated by a small GTPase, ADP-ribosylation factor 1 (ARF1) or a related protein. Membrane-anchored cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Among the most important sorting signals binding to mu subunits of AP complexes are tyrosine-based endocytotic signals, which are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. However, AP-4 does not bind most canonical tyrosine-based signals except for two naturally occurring ones from the lysosomal membrane proteins CD63 and LAMP-2a. It binds the YX[FYL][FL]E motif, where X can be any residue, from the cytosolic tails of amyloid precursor protein (APP) family members in a distinct way.


Pssm-ID: 271161  Cd Length: 271  Bit Score: 60.66  E-value: 1.90e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1446 DEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDkqirgNEVVRRQDIIPNKSDEWIKLLNpeFHS 1525
Cdd:cd09253     11 NEIFVDVLERLSVVFNANGQVLNSEIDGSIQMKSYLPGNPELRLALNE-----DLVIGKRENRAYYSAVVLDDCN--FHE 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1526 CVNKEHFISTRQIKFCPLDGnKFQLMRYKVKSNKKeLPLVVKARYINKGPH-----FELRCDAtctgyvnkkdPEAVPCE 1600
Cdd:cd09253     84 SVDLEEFESDRTLSLTPPDG-EFTLMNYRISGEFK-PPFRVFPSVEETSPYklelvLKLRADF----------PPKSTAT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1601 NIMIRLPVpdlwmkylrnetpvgPLGTKSLRSSkkkLPKGSStfEQytsprievsvgTAKYEYAFRAIVWKISRLPERNQ 1680
Cdd:cd09253    152 NVVVRIPL---------------PKGTTSVSCE---LGSGAS--GQ-----------SAEYKEKEKLVLWNIKKFPGGTE 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1681 vvpftgnvpftgkplpftaskekagaystQTLICRIDMASELDIPT--SLGP---HFEVdftmPLTTASKSIVRSISVSN 1755
Cdd:cd09253    201 -----------------------------LTLRAKITLSSPVSSSVrkEIGPislSFEI----PMYNVSGLQVRYLRILE 247
                          330       340
                   ....*....|....*....|....*
gi 1889988573 1756 PlAAPVIPEKWVRYKAH---YSYKV 1777
Cdd:cd09253    248 R-SSSYNPHRWVRYVTQsssYVCRI 271
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
662-872 4.37e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 61.82  E-value: 4.37e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  662 QSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMP---SPTEAVIAPVASTPPLASPQTSPPN 738
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAParrSPAPEALAAARQASARGPGGAPAPA 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  739 PQPPAVTKPSTPkekaPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKvvtRSPVQETAK 818
Cdd:PRK12323   452 PAPAAAPAAAAR----PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPD---AAPAGWVAE 524
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1889988573  819 SPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPDELNASEKSAPTKSA 872
Cdd:PRK12323   525 SIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFD 578
PHA03247 PHA03247
large tegument protein UL36; Provisional
525-1083 5.42e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 5.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  525 PSPTNPFLADIAAKQQALSPTN--PLKTQEGAKKDNFDPFGTQQPSKAVDPfGAEFEGDSFEAEPVTMPDDPFAPKNGDT 602
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRpaPRPSEPAVTSRARRPDAPPQSARPRAP-VDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  603 AVSASKSKGGINDlfAISSSKPEQALNLFASSSLDAAEPGKDPFLSPKPAEPevdlfnIQSPTAPgAAPPtflPIGIMQS 682
Cdd:PHA03247  2630 SPSPAANEPDPHP--PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP------PQRPRRR-AARP---TVGSLTS 2697
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  683 EAVEPEKESPHFSPPP---DSLPTPIAPAPA--SMPSPTEAVIAPVASTPPlASPQTSPPNPQPPAvtkPSTPKEKAPSA 757
Cdd:PHA03247  2698 LADPPPPPPTPEPAPHalvSATPLPPGPAAArqASPALPAAPAPPAVPAGP-ATPGGPARPARPPT---TAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  758 KPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSAvkkrvplnqlaqAKVVTRSPVQETAKSPveeAIGATPPESKSAHV 837
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPP------------AAVLAPAAALPPAASP---AGPLPPPTSAQPTA 2838
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  838 EEPAASAAQPVDMFEDAFVP----DELNASEKSAPTKSASAHFEDAFVPGGLMDTdeggPTKSFSANfedafvvqewPEP 913
Cdd:PHA03247  2839 PPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSR----STESFALP----------PDQ 2904
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  914 EQPLSQKSPSTHFEDSFVPMTEKRPAPEEAKKSTSDFDLFSDMETTAKPLSMPADPFASTG----GDAKTPELFTSAAKA 989
Cdd:PHA03247  2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGalvpGRVAVPRFRVPQPAP 2984
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  990 atppaamtidffgSVAEDVPPPDLLGGSP---------------EAELKPAMIPTNAFVPKPDEAVKQDVLEGQAAAQPE 1054
Cdd:PHA03247  2985 -------------SREAPASSTPPLTGHSlsrvsswasslalheETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSD 3051
                          570       580
                   ....*....|....*....|....*....
gi 1889988573 1055 TMAIDLLssPSQPEAPTSNAVAPATLSAG 1083
Cdd:PHA03247  3052 LEALDPL--PPEPHDPFAHEPDPATPEAG 3078
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
695-848 3.39e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 58.72  E-value: 3.39e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  695 SPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPkeKAPSAKPSQPSKPSAQPSPAFD 774
Cdd:PRK07994   365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS--QLLAARQQLQRAQGATKAKKSE 442
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1889988573  775 AIEAAKllglpGVPVTSAVKKRVPLNQLAQA---KVVTRSPVQETAKSPVEEA--IGATPPESKSAHVEEPAASAAQPV 848
Cdd:PRK07994   443 PAAASR-----ARPVNSALERLASVRPAPSAlekAPAKKEAYRWKATNPVEVKkePVATPKALKKALEHEKTPELAAKL 516
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
649-851 4.37e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 58.35  E-value: 4.37e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  649 PKPAEPEVDLFNIQSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPP 728
Cdd:PRK12323   392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRP 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  729 LASPQTSPPNPQPPAvtkpsTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAkllglpgvpVTSAVKKRVPLNQLAQAKVV 808
Cdd:PRK12323   472 VAAAAAAAPARAAPA-----AAPAPADDDPPPWEELPPEFASPAPAQPDAA---------PAGWVAESIPDPATADPDDA 537
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1889988573  809 TRSPVQETAKSPVEEAIGATPPeskSAHVEEPAASAAQPVDMF 851
Cdd:PRK12323   538 FETLAPAPAAAPAPRAAAATEP---VVAPRPPRASASGLPDMF 577
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
670-779 4.52e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 58.28  E-value: 4.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  670 APPTFLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPST 749
Cdd:PRK14950   344 TSYGQLPLELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH 423
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1889988573  750 PKEKAP----SAKPSQPSKPSAQPSPAFDAIEAA 779
Cdd:PRK14950   424 TPESAPkltrAAIPVDEKPKYTPPAPPKEEEKAL 457
PHA03378 PHA03378
EBNA-3B; Provisional
496-829 7.33e-08

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 58.15  E-value: 7.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  496 DFLMKKGEKAKTPDTLDFLMMQSGLTPTQPSP-TNPFLADIAAKQQALSPTNPLKTQEGAKKDnfDPFGTQQPSKAVDPF 574
Cdd:PHA03378   545 DLDIESDEPASTEPVHDQLLPAPGLGPLQIQPlTSPTTSQLASSAPSYAQTPWPVPHPSQTPE--PPTTQSHIPETSAPR 622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  575 GAEFEGDSFEAEPVTMPDDPFAPKNGDTAvsaskskggindlfaissSKPEQALNLFASSSLdaAEPGKDPFlSPKPAEP 654
Cdd:PHA03378   623 QWPMPLRPIPMRPLRMQPITFNVLVFPTP------------------HQPPQVEITPYKPTW--TQIGHIPY-QPSPTGA 681
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  655 EVDLFNIQSPT-------APGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPteaviAPVASTP 727
Cdd:PHA03378   682 NTMLPIQWAPGtmqppprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAA-----APGRARP 756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  728 PLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPafDAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKV 807
Cdd:PHA03378   757 PAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP--QAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834
                          330       340
                   ....*....|....*....|..
gi 1889988573  808 VTRSPVQETAKSPVEEAIGATP 829
Cdd:PHA03378   835 RGRPSLKKPAALERQAAAGPTP 856
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
667-951 7.94e-08

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.78  E-value: 7.94e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  667 PGAA---PPTFLPIGIMQSEAVEPEKeSPHFSPPPDSLPTPIAPA-PASMPSPTEAVIAPVASTPPLASPQTSPPNPQPP 742
Cdd:PTZ00449   561 PGPAkehKPSKIPTLSKKPEFPKDPK-HPKDPEEPKKPKRPRSAQrPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPP 639
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  743 avTKPSTPkEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGlpgvpvtsavkkrVPLNQLAQAK-VVTRSPVQETAKSPV 821
Cdd:PTZ00449   640 --QRPSSP-ERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYD-------------DYLDAAAKSKeTKTTVVLDESFESIL 703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  822 EEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPD--ELNASEKSAPTKSASAHFEDAFVPGGLMDTdeggptksFSA 899
Cdd:PTZ00449   704 KETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDaeQPDDIEFFTPPEEERTFFHETPADTPLPDI--------LAE 775
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1889988573  900 NFEDAFVVQEWPEPEQPLSQ-KSPSTHFEDS---FVPMTEKRPAPEEAKKSTSDFD 951
Cdd:PTZ00449   776 EFKEEDIHAETGEPDEAMKRpDSPSEHEDKPpgdHPSLPKKRHRLDGLALSTTDLE 831
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
648-822 2.15e-07

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 56.26  E-value: 2.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  648 SPKPAEPEvdlfniqSPTAPGAAPPTFLPIGIMQ----SEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPV 723
Cdd:PRK14951   372 AAAPAEKK-------TPARPEAAAPAAAPVAQAAaapaPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  724 ASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSAVKkrvplnQLA 803
Cdd:PRK14951   445 VALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDVWHATVQQLAAAEAITALAR------ELA 518
                          170       180
                   ....*....|....*....|
gi 1889988573  804 -QAKVVTRSPVQETAKSPVE 822
Cdd:PRK14951   519 lQSELVARDGDQWLLRVERE 538
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
612-985 6.21e-07

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.18  E-value: 6.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  612 GINDLFAISSSKPEQALNLFASSSLDAAEPGKDPFLSPKPAEPEVDLFNI----QSPTAPGAAPPTFLPIGIMQSEAVEP 687
Cdd:PHA03307    38 GSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTptwsLSTLAPASPAREGSPTPPGPSSPDPP 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  688 EKESPHFSPPPDSLP---------TPIAPAPASMPSPTEAVIAPVASTP--------PLASPQTSPPNPQPPAVTKPSTP 750
Cdd:PHA03307   118 PPTPPPASPPPSPAPdlsemlrpvGSPGPPPAASPPAAGASPAAVASDAassrqaalPLSSPEETARAPSSPPAEPPPST 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  751 KEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKVVTR-SPVQETAKSPVEEAIGATP 829
Cdd:PHA03307   198 PPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLpRPAPITLPTRIWEASGWNG 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  830 PESKSAHVE------------EPAASAAQPVDMFEDAFVPDELNASEKSAPTKSASAHFEDAFVPGGLMDTDEGGPTKSf 897
Cdd:PHA03307   278 PSSRPGPASssssprerspspSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP- 356
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  898 sanfedafvvqewPEPEQPLS-QKSPSTHFEDSFVPMTEKRPAPEEAKKSTSDFDLFSD----METTAKPLSMPADPFAS 972
Cdd:PHA03307   357 -------------PPPADPSSpRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDatgrFPAGRPRPSPLDAGAAS 423
                          410
                   ....*....|...
gi 1889988573  973 TGGDAKTPELFTS 985
Cdd:PHA03307   424 GAFYARYPLLTPS 436
PRK10263 PRK10263
DNA translocase FtsK; Provisional
650-1200 1.04e-06

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.32  E-value: 1.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  650 KPAEPEVDLFNIQSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPL 729
Cdd:PRK10263   399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLY 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  730 ASPqtsPPNPQPPAVtkpstpkEKAPSAKPSQPSKPsaqPSPAFDAIEAAKL-----LGLPGVPVTSAVKKRVPLNQLAQ 804
Cdd:PRK10263   479 QQP---QPVEQQPVV-------EPEPVVEETKPARP---PLYYFEEVEEKRArereqLAAWYQPIPEPVKEPEPIKSSLK 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  805 AKVVTRSPVQET--AKSPVEEAI-GATPPESKSAHVEEPA---ASAAQPVDMFEDAFVPDELNASEKSAPTKSASAHFeD 878
Cdd:PRK10263   546 APSVAAVPPVEAaaAVSPLASGVkKATLATGAAATVAAPVfslANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASY-G 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  879 AFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPEPEQplsqkspsTHFEDSFVPMTEKRPAPE---EAKKSTSDFDLFSD 955
Cdd:PRK10263   625 IKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQ--------DELARQFAQTQQQRYGEQyqhDVPVNAEDADAAAE 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  956 METtakplsmpADPFASTGGDAKTPElftsaakAATPPAAMTIDFFgsvaEDVPPPDLLGGSPEAEL-KPAMIPTNAFVP 1034
Cdd:PRK10263   697 AEL--------ARQFAQTQQQRYSGE-------QPAGANPFSLDDF----EFSPMKALLDDGPHEPLfTPIVEPVQQPQQ 757
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1035 KPDEAVKQDVLEGQAAAQPEtmaidlLSSPSQPEAPTSNAVAPATLSAGEVFAPSPPDAAAADDLFASPGEPAA------ 1108
Cdd:PRK10263   758 PVAPQQQYQQPQQPVAPQPQ------YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVApqpqyq 831
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1109 MPDDPFASKPGDEANTPDLFAGATGAAAMSPSIPSPPTAISPSRESlKLEELDPFAPQKSASKEGTPVSSPPKKTEVVAP 1188
Cdd:PRK10263   832 QPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPS-EVEPVDTFALEQMARLVEARLADFRIKADVVNY 910
                          570
                   ....*....|....*
gi 1889988573 1189 LASP---EEELNLSP 1200
Cdd:PRK10263   911 SPGPvitRFELNLAP 925
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
666-854 1.65e-06

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 53.31  E-value: 1.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  666 APGAAPPTFLPIGIMQSEAVEP----EKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQP 741
Cdd:PRK07003   367 APGGGVPARVAGAVPAPGARAAaavgASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGD 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  742 PAVTK----PSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKL-LGLPGVPVTSAVKKRVplnqlAQAKVVTRSPVQET 816
Cdd:PRK07003   447 APVPAkanaRASADSRCDERDAQPPADSGSASAPASDAPPDAAFePAPRAAAPSAATPAAV-----PDARAPAAASREDA 521
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1889988573  817 AKSPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDA 854
Cdd:PRK07003   522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALDVLRNA 559
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
674-912 5.23e-06

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.80  E-value: 5.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  674 FLPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEK 753
Cdd:PRK12323   363 FRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASAR 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  754 APSAKPSQPSKPSAQPSPAfdaiEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKVVTRSPVQE----TAKSPVEEAIGATP 829
Cdd:PRK12323   443 GPGGAPAPAPAPAAAPAAA----ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEelppEFASPAPAQPDAAP 518
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  830 PESKSAHVEEPAASAAqpvdmfEDAFVPDELNASEKSAPTKSASAhfeDAFVPGGLMDTDEGGPTKSFSANfedafvvqe 909
Cdd:PRK12323   519 AGWVAESIPDPATADP------DDAFETLAPAPAAAPAPRAAAAT---EPVVAPRPPRASASGLPDMFDGD--------- 580

                   ...
gi 1889988573  910 WPE 912
Cdd:PRK12323   581 WPA 583
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
632-853 5.64e-06

Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 51.47  E-value: 5.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  632 ASSSLDAAEPGKDPFLSPKPAEPEVDLFNIQSPTAPGAAPPTFLPIGIMQSEAVEpEKESPHFSPPPDSLPTPIAPAPAS 711
Cdd:PLN03209   351 APSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVA-KPAEPDVVPSPGSASNVPEVEPAQ 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  712 MPSPTEAVIAPVASTP---PLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAfdaieaakllGLPGVP 788
Cdd:PLN03209   430 VEAKKTRPLSPYARYEdlkPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPAN----------MRPLSP 499
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573  789 VTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVeEPAASAAQPVDMFED 853
Cdd:PLN03209   500 YAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHA-QPKPRPLSPYTMYED 563
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
563-810 7.22e-06

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 7.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  563 GTQQPSKAVDPFGAEFEGDSFEAEPVTMPDDPFAPKNGDTAVSASKSKGGINDLfaiSSSKPEQALNLFASSSLDAAEPG 642
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA---RRSPAPEALAAARQASARGPGGA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  643 KDPFLSPKPAE-PEVDLFNIQSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPD-SLPTPIAPAPASMPSPTEAVI 720
Cdd:PRK12323   448 PAPAPAPAAAPaAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEfASPAPAQPDAAPAGWVAESIP 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  721 APVASTPPLASPQTSPPNPQPPAvtkPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLP--GVP---------- 788
Cdd:PRK12323   528 DPATADPDDAFETLAPAPAAAPA---PRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPvrGLAqqlarqsela 604
                          250       260
                   ....*....|....*....|....
gi 1889988573  789 --VTSAVKKRVPLNQLAQAKVVTR 810
Cdd:PRK12323   605 gvEGDTVRLRVPVPALAEAEVVER 628
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
706-879 9.43e-06

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.00  E-value: 9.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  706 APAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPP-AVTKPSTPKEKAPSAkPSQPSKPSAQPSPAFDAIEA-AKLLG 783
Cdd:PRK07003   367 APGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTgAAGAALAPKAAAAAA-ATRAEAPPAAPAPPATADRGdDAADG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  784 LPGVPVTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPDELNAS 863
Cdd:PRK07003   446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAA 525
                          170
                   ....*....|....*.
gi 1889988573  864 EKSAPTKSASAHFEDA 879
Cdd:PRK07003   526 APPAPEARPPTPAAAA 541
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
694-786 1.10e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 51.04  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  694 FSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPqtsPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAF 773
Cdd:PRK12270    36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAP---PAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAA 112
                           90
                   ....*....|...
gi 1889988573  774 DAIEAAKLLGLPG 786
Cdd:PRK12270   113 VEDEVTPLRGAAA 125
PRK10263 PRK10263
DNA translocase FtsK; Provisional
638-1278 1.31e-05

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.85  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  638 AAEPGKDPFLSPKPAEPEVD----LFNIQSPTAPGAApptflpigimqseAVEPEKESPHFSPPPDslptPIAPAPaSMP 713
Cdd:PRK10263   286 AADPDDVLFSGNRATQPEYDeydpLLNGAPITEPVAV-------------AAAATTATQSWAAPVE----PVTQTP-PVA 347
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  714 SPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSAV 793
Cdd:PRK10263   348 SVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQ 427
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  794 KKRVPLNQLAQAKVVTRSPVQ-ETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPDElnASEKSAPTKSA 872
Cdd:PRK10263   428 PAQQPYYAPAPEQPVAGNAWQaEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEP--VVEETKPARPP 505
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  873 SAHFEDafvpgglMDTDEGGPTKSFSANFedafvvQEWPEP-EQPLSQKSPSTHFEDSFVPMTEKRPAPEEAKKSTSDFD 951
Cdd:PRK10263   506 LYYFEE-------VEEKRAREREQLAAWY------QPIPEPvKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKAT 572
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  952 LFSdmeTTAKPLSMPADPFASTGGD-AKTPELFTSAAKAATPPAAMTIDFFGSVAEDVPPPDLLGGSPEAELKPAMIPTN 1030
Cdd:PRK10263   573 LAT---GAAATVAAPVFSLANSGGPrPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGD 649
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1031 AFVPKPDEAVKQDVLEGQAAAQPETMAIDLLSSPSQPEAPTSNAVAPATLSagEVFAPSPPDAAaaddlfaSPGEPAAMp 1110
Cdd:PRK10263   650 QYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELA--RQFAQTQQQRY-------SGEQPAGA- 719
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1111 dDPFASKPGDEANTPDLFAGATGAAAMSPSI-PSPPTAISPSRESLKLEELDPFAPQKSASKEGTPVSSPPKKTEVVAPL 1189
Cdd:PRK10263   720 -NPFSLDDFEFSPMKALLDDGPHEPLFTPIVePVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 798
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1190 AS------PEEELNLSPMQHISVADIKPTAQSPVEDAGLAMGPPLTTIDPFSPKAAVGSPATPPRqtkkkyNPFKAETPT 1263
Cdd:PRK10263   799 APqpqyqqPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPT------TPLPSLDLL 872
                          650
                   ....*....|....*
gi 1889988573 1264 TPPDEEASPFPTVKL 1278
Cdd:PRK10263   873 TPPPSEVEPVDTFAL 887
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
704-845 1.32e-05

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.48  E-value: 1.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  704 PIAPAPASMPSPTEAVIAPVAsTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLG 783
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEA-AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573  784 LPGVPVTSAV--KKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPP-ESKSAHVEEPAASAA 845
Cdd:PRK14951   445 VALAPAPPAQaaPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEgDVWHATVQQLAAAEA 509
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
648-980 5.14e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 48.76  E-value: 5.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  648 SPKPAEPEVDLFNIQSPT----APGAAPPTFLPigimQSEAVEPEKESPHFSPPPDSLPTPIAPAPAsmPSPTEAVIAPV 723
Cdd:pfam05109  460 APASTGPTVSTADVTSPTpagtTSGASPVTPSP----SPRDNGTESKAPDMTSPTSAVTTPTPNATS--PTPAVTTPTPN 533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  724 ASTPPLA--SPQTSPPNPQP------PAVTKPsTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAK-----------LLGL 784
Cdd:pfam05109  534 ATSPTLGktSPTSAVTTPTPnatsptPAVTTP-TPNATIPTLGKTSPTSAVTTPTPNATSPTVGEtspqanttnhtLGGT 612
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  785 PGVPVTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVdmfedafvpDELNASE 864
Cdd:pfam05109  613 SSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPT---------GGENITQ 683
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  865 KSAPTKSASAHFEDAFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPEPEQPLSQKSPSthfedsfvpmTEKRPAPEEAK 944
Cdd:pfam05109  684 VTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPS----------GQKTAVPTVTS 753
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1889988573  945 KSTSDFDLFSDMETTAKPLSMPADPFASTGGDAKTP 980
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTP 789
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
714-916 5.27e-05

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  714 SPTEAVIAPVASTPPLAS-PQTSPPNPQPPAVTKPSTPkekAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSA 792
Cdd:PRK12323   373 GPATAAAAPVAQPAPAAAaPAAAAPAPAAPPAAPAAAP---AAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  793 vkkrvplnqLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAhvEEPAASAAQPVDMFEDAFVPDELNASEKSAPTKSA 872
Cdd:PRK12323   450 ---------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA--APAAAPAPADDDPPPWEELPPEFASPAPAQPDAAP 518
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1889988573  873 SAHFEDAFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPEPEQP 916
Cdd:PRK12323   519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
620-882 5.40e-05

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.31  E-value: 5.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  620 SSSKPEQALNLFASSSLDAAEPGKDPFLSPKPAEPEVdlfnIQSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHFSPPPD 699
Cdd:PRK07003   402 VTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADR----GDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSA 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  700 SLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQtspPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAqPSPAFDAIEAA 779
Cdd:PRK07003   478 SAPASDAPPDAAFEPAPRAAAPSAATPAAVPDAR---APAAASREDAPAAAAPPAPEARPPTPAAAAP-AARAGGAAAAL 553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  780 KLLGLPGVPVTSAVKKRVPlnqlAQAKVVTRSPVQETAKSPVEEAIGATP--PESKSAHVEEPAASAAQPV--------- 848
Cdd:PRK07003   554 DVLRNAGMRVSSDRGARAA----AAAKPAAAPAAAPKPAAPRVAVQVPTPraRAATGDAPPNGAARAEQAAesrgapppw 629
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1889988573  849 -DMFEDAFVPdeLNASEKSAPTksasahfEDAFVP 882
Cdd:PRK07003   630 eDIPPDDYVP--LSADEGFGGP-------DDGFVP 655
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
713-1131 1.29e-04

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 1.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  713 PSPTEAVIAPVASTPPLASPQTSPPNPQPPAVtkpsTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSA 792
Cdd:PRK07764   387 VAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPA----AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  793 VKKRVPLNQLAQAkvvtrsPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPDELNASEKSAPTKSA 872
Cdd:PRK07764   463 SAQPAPAPAAAPE------PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEILAAVPKRSRKTWAILLP 536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  873 SAHFED--------AFVPGGLMD---------------TDEGGPTKSFSANFEDAFVVQEWPEPEQPLSQKSPSTHFEDS 929
Cdd:PRK07764   537 EATVLGvrgdtlvlGFSTGGLARrfaspgnaevlvtalAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPA 616
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  930 FVPMTEKRPAPEEAKKSTsdfdlfsdmeTTAKPLSMPADPFASTGGDAKTPELFTSAAKAATPPAAMTIDFFGSVAEDVP 1009
Cdd:PRK07764   617 APAAPAAPAAPAPAGAAA----------APAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPA 686
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1010 PPDLLGGSPEAELKPAMIPTNAFVPKP-DEAVKQDVLEGQAAAQPETMAIDLLSSPSQPEAPTSNAVAPATLSAGEVFAP 1088
Cdd:PRK07764   687 PAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1889988573 1089 SPPDAAAADDlfASPGEPAAMPDDPFASKPGDEANTPDLFAGA 1131
Cdd:PRK07764   767 AAAPAAAPPP--SPPSEEEEMAEDDAPSMDDEDRRDAEEVAME 807
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
702-850 1.48e-04

Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 46.73  E-value: 1.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  702 PTPIAPAPASMPSPteaviAPVASTP-PLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAak 780
Cdd:PRK14950   363 VPAPQPAKPTAAAP-----SPVRPTPaPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAA-- 435
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1889988573  781 llglpgVPVTSAVKKRVPLNQLAQAKVVTRS-PVQETAKSPVEEAIGATPPESKSahVEEPAASAAQPVDM 850
Cdd:PRK14950   436 ------IPVDEKPKYTPPAPPKEEEKALIADgDVLEQLEAIWKQILRDVPPRSPA--VQALLSSGVRPVSV 498
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
722-875 1.60e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 1.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  722 PVASTP-PLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAfdaieaakllGLPGVPVTSAVKKRVPL- 799
Cdd:PRK07994   361 PAAPLPePEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPA----------VPLPETTSQLLAARQQLq 430
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1889988573  800 NQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVDMFEDAFVPDELNASEKSAPTKSASAH 875
Cdd:PRK07994   431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEH 506
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
624-779 1.61e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.90  E-value: 1.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  624 PEQALNLFASSSLDAAEPGKDPFLSPKPAEPevdlfniQSPTAPGAAPPTFLPIGIMQSEAVEPEKESPhfsPPPDSLPT 703
Cdd:PRK07764   381 LERRLGVAGGAGAPAAAAPSAAAAAPAAAPA-------PAAAAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPA 450
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1889988573  704 PIAPAPASMPSPTeaviAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAA 779
Cdd:PRK07764   451 GGAPSPPPAAAPS----AQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEILAA 522
AP-3_Mu3B_Cterm cd09261
C-terminal domain of medium Mu3B subunit in neuron-specific adaptor protein (AP) complex AP-3; ...
1436-1567 1.67e-04

C-terminal domain of medium Mu3B subunit in neuron-specific adaptor protein (AP) complex AP-3; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. Four AP complexes, AP-1, AP-2, AP-3, and AP-4, have been described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). The corresponding subunits of the different AP complexes are similar to one another. This subfamily corresponds to the C-terminal domain of the heterotetrameric adaptor protein complex 3 (AP-3) medium mu3B subunit, encoded by the ap3m2 gene. Mu3B is specifically expressed in neurons and neuroendocrine cells. Neuron-specific AP-3 appears to be involved in synaptic vesicle biogenesis from endosomes in neurons and plays an important role in synaptic transmission in the central nervous system. AP-1 and AP-2 function in conjunction with clathrin, a scaffolding protein that participates in the formation of coated vesicles; by contrast, the nature of the outer shell of neuron-specific AP-3-containing coats remains to be elucidated. Membrane-anchored cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytic signals are among the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid, and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. These sorting signals can be recognized by the C-terminal domain of the AP-3 mu3B subunit, also known as the Y-X-X-Phi signal-binding domain, which contains two hydrophobic pockets: one that binds the tyrosine and one that binds the bulky hydrophobic residue.
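
The Y-X-X-Phi pattern described above can be expressed as a simple sequence search. The following is only a minimal sketch, not part of the CDD output: the function name find_yxxphi and the toy sequences are hypothetical, and a pattern match says nothing about whether a site is actually recognized by a mu subunit in vivo.

```python
import re

# Y-X-X-Phi: tyrosine, any two residues, then a bulky hydrophobic residue
# (Leu, Ile, Met, Phe, or Val). The lookahead lets overlapping matches be reported.
YXXPHI = re.compile(r"(?=(Y[A-Z]{2}[LIMFV]))")


def find_yxxphi(seq):
    """Return (1-based start position, motif) for every Y-X-X-Phi match in seq."""
    return [(m.start() + 1, m.group(1)) for m in YXXPHI.finditer(seq.upper())]


if __name__ == "__main__":
    # Toy sequence (hypothetical), not taken from the record above.
    print(find_yxxphi("AAYQRLGG"))
```

Gap characters and lowercase insert states from an alignment row would need to be stripped or normalized before searching; the sketch only uppercases its input.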


Pssm-ID: 211372  Cd Length: 254  Bit Score: 45.42  E-value: 1.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1436 YRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIrgnevvrrqdiipnksdew 1515
Cdd:cd09261      3 WRRTGVKYTNNEAYFDVIEEIDAIIDKSGSTITAEIQGVIDACVKLTGMPDLTLSFMNPRL------------------- 63
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1889988573 1516 ikLLNPEFHSCVNKEHFISTRQIKFCPLDGNkFQLMRYKVKS-NKKELPLVVK 1567
Cdd:cd09261     64 --LDDVSFHPCVRFKRWESERILSFIPPDGN-FRLLSYHVSAqNLVAIPVYVK 113
PHA02682 PHA02682
ORF080 virion core protein; Provisional
702-791 1.83e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 45.62  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  702 PTPIAPAPASM-PSPTEAVIAPVASTP---PLASPQTSPPNPQPPAVTKPSTPKEKAPSAK---PSQPSKPSAQPSPAFD 774
Cdd:PHA02682    84 PSPACAAPAPAcPACAPAAPAPAVTCPapaPACPPATAPTCPPPAVCPAPARPAPACPPSTrqcPPAPPLPTPKPAPAAK 163
                           90
                   ....*....|....*..
gi 1889988573  775 AIEAAKLLGLPGVPVTS 791
Cdd:PHA02682   164 PIFLHNQLPPPDYPAAS 180
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
638-849 1.91e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  638 AAEPGKDPFLSPKPAEPEVDLFNIQSPTAPGAAPPTFLPIGimQSEAVEPEKESPHfsPPPDSLPTPIAPAPASMPSPTE 717
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAP--AGAAAAPAEASAA--PAPGVAAPEHHPKHVAVPDASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  718 AViAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKlLGLPGVPVTSAVKKRV 797
Cdd:PRK07764   665 GG-DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ-GASAPSPAADDPVPLP 742
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1889988573  798 PLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVD 849
Cdd:PRK07764   743 PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMD 794
AP-3_Mu3A_Cterm cd09260
C-terminal domain of medium Mu3A subunit in ubiquitously expressed adaptor protein (AP) ...
1436-1567 2.32e-04

C-terminal domain of medium Mu3A subunit in ubiquitously expressed adaptor protein (AP) complex AP-3; AP complexes participate in the formation of intracellular coated transport vesicles and select cargo molecules for incorporation into the coated vesicles in the late secretory and endocytic pathways. Four AP complexes, AP-1, AP-2, AP-3, and AP-4, have been described in various eukaryotic organisms. Each AP complex consists of four subunits: two large chains (one each of gamma/alpha/delta/epsilon and beta1-4, respectively), a medium mu chain (mu1-4), and a small sigma chain (sigma1-4). The corresponding subunits of the different AP complexes are similar to one another. This subfamily corresponds to the C-terminal domain of the heterotetrameric adaptor protein complex 3 (AP-3) medium mu3A subunit, encoded by the ap3m1 gene. Mu3A is ubiquitously expressed in all mammalian tissues and cells. It appears to be localized to the trans-Golgi network (TGN) and/or endosomes and participates in trafficking to the vacuole/lysosome in yeast, flies, and mammals. AP-1 and AP-2 function in conjunction with clathrin, a scaffolding protein that participates in the formation of coated vesicles; by contrast, the nature of the outer shell of ubiquitous AP-3-containing coats remains to be elucidated. Membrane-anchored cargo molecules interact with adaptors through short sorting signals in their cytosolic segments. Tyrosine-based endocytic signals are among the most important sorting signals. They are of the form Y-X-X-Phi, where Y is tyrosine, X is any amino acid, and Phi is a bulky hydrophobic residue that can be Leu, Ile, Met, Phe, or Val. These sorting signals can be recognized by the C-terminal domain of the AP-3 mu3A subunit, also known as the Y-X-X-Phi signal-binding domain, which contains two hydrophobic pockets: one that binds the tyrosine and one that binds the bulky hydrophobic residue.


Pssm-ID: 211371  Cd Length: 254  Bit Score: 45.09  E-value: 2.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1436 YRDRGTTYIEDEITVEVTDEFRGIVARTGEIEKQAVEVNIQTLVFITGMPECVLGMNDKQIrgnevvrrqdiipnksdew 1515
Cdd:cd09260      3 WRRAGVKYTNNEAYFDVVEEIDAIIDKSGSTVFAEIQGVIDACIKLSGMPDLSLSFMNPRL------------------- 63
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1889988573 1516 ikLLNPEFHSCVNKEHFISTRQIKFCPLDGNkFQLMRYKVKS-NKKELPLVVK 1567
Cdd:cd09260     64 --LDDVSFHPCIRFKRWESERVLSFIPPDGN-FRLISYRVSSqNLVAIPVYVK 113
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
683-757 3.10e-04

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 44.83  E-value: 3.10e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573  683 EAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPlASPQTSPPNPQPPAVTKPSTPKEKAPSA 757
Cdd:PLN02983   132 ELVIRKKEALPQPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPA-SSPPPTPASPPPAKAPKSSHPPLKSPMA 205
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
699-832 3.25e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 3.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  699 DSLPTPIAPAPASMPSPTEaviaPVASTPPLASPQTSPPNPQPPAVTKPSTPkEKAPSAKPSQPSKPSAQPSPAFDAIEA 778
Cdd:PRK14971   367 DDASGGRGPKQHIKPVFTQ----PAAAPQPSAAAAASPSPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVPVNPPS 441
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1889988573  779 AKLLGLPgvPVTSAVKKRVPLNQLAQAKVVTRSPVQETAKSP---VEEAIGATPPES 832
Cdd:PRK14971   442 TAPQAVR--PAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQAtgnIKEAPTGTQKEI 496
PRK10819 PRK10819
transport protein TonB; Provisional
649-770 3.26e-04

transport protein TonB; Provisional


Pssm-ID: 236768 [Multi-domain]  Cd Length: 246  Bit Score: 44.67  E-value: 3.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  649 PKPAEP-EVDLFNIQSPTAPGAAPPTFLPIGIMQSEAvEPEKESPHFSPPPDSLPTPIaPAPASMPSPTeaviaPVASTP 727
Cdd:PRK10819    42 PAPAQPiSVTMVAPADLEPPQAVQPPPEPVVEPEPEP-EPIPEPPKEAPVVIPKPEPK-PKPKPKPKPK-----PVKKVE 114
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1889988573  728 PLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPS 770
Cdd:PRK10819   115 EQPKREVKPVEPRPASPFENTAPARPTSSTATAAASKPVTSVS 157
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
218-322 3.36e-04

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 45.12  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  218 SQSVPTTPSSPPVAEQVARSPSMQTTRnldSQDEKITQNPQKKQKSDTDTPPLQTSASLEEATRSFTekkapcsakTPTE 297
Cdd:PRK13335    63 TQAANTRQERTPKLEKAPNTNEEKTSA---SKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTEST---------TPKT 130
                           90       100
                   ....*....|....*....|....*
gi 1889988573  298 EPGLPEGSAVEQPAIPTVSVTPHTP 322
Cdd:PRK13335   131 KVTTPPSTNTPQPMQSTKSDTPQSP 155
PHA02682 PHA02682
ORF080 virion core protein; Provisional
687-798 4.19e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 44.47  E-value: 4.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  687 PEKESPHFSPPPDSLPTPIAPAPA-SMPSPTEAVIAPVASTPPLASPQTSPPNPQP---------PAVTKPSTPKEKAPS 756
Cdd:PHA02682    76 PSGQSPLAPSPACAAPAPACPACApAAPAPAVTCPAPAPACPPATAPTCPPPAVCPaparpapacPPSTRQCPPAPPLPT 155
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1889988573  757 AKPSQPSKPS----AQPSPAFDAIEAAKLLGLPGvpVTSAVKKRVP 798
Cdd:PHA02682   156 PKPAPAAKPIflhnQLPPPDYPAASCPTIETAPA--ASPVLEPRIP 199
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
664-781 4.33e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 4.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  664 PTAPGAAPPTFLPIGIMQSE---AVEPEKESPHFSPPPDSLPTPI-APAPASMPSPTEAVIAPVASTPPLASPQTSP--- 736
Cdd:PRK07764   386 GVAGGAGAPAAAAPSAAAAApaaAPAPAAAAPAAAAAPAPAAAPQpAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPsaq 465
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1889988573  737 --PNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAKL 781
Cdd:PRK07764   466 paPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512
flhF PRK06995
flagellar biosynthesis protein FlhF;
694-815 5.35e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 44.96  E-value: 5.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  694 FSPPPDSLPTPIAPAPASMPSPTEAVIAP-VASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPA 772
Cdd:PRK06995    51 LAPPAAAAPAAAQPPPAAAPAAVSRPAAPaAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAAPAERAAAENAAR 130
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1889988573  773 FDAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQakVVTRSPVQE 815
Cdd:PRK06995   131 RLARAAAAAPRPRVPADAAAAVADAVKARIER--IVNDTVMQE 171
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
691-772 8.65e-04

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 41.22  E-value: 8.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  691 SPHFSPPPdslptpiaPAPASMPSPTEAVIAPVASTPPLASP-QTSPPNPQPPAVTKPSTPkekaPSAKPSQPSKPSAQP 769
Cdd:pfam12526   26 SSCFSPPE--------SAHPDPPPPVGDPRPPVVDTPPPVSAvWVLPPPSEPAAPEPDLVP----PVTGPAGPPSPLAPP 93

                   ...
gi 1889988573  770 SPA 772
Cdd:pfam12526   94 APA 96
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
713-795 8.66e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.50  E-value: 8.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  713 PSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPsQPSKPSAQPSPAFDAIEAAKLLGLPGVPVTSA 792
Cdd:PRK12270    38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                   ...
gi 1889988573  793 VKK 795
Cdd:PRK12270   117 VTP 119
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
684-956 9.69e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.30  E-value: 9.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  684 AVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPlASPQTSPPNPQPPAVTK-PSTPKEKAPS------ 756
Cdd:PTZ00449   498 PIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDE-PKEGGKPGETKEGEVGKkPGPAKEHKPSkiptls 576
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  757 AKPSQPSKPSAQPSPafDAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKVVTRSPvqETAKSPVEEAIGATP--PES-K 833
Cdd:PTZ00449   577 KKPEFPKDPKHPKDP--EEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESP--KSPKRPPPPQRPSSPerPEGpK 652
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  834 SAHVEEPAASAAQPVD-MFEDAFVPDELNASEKSAPTKSASAhfEDAFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPE 912
Cdd:PTZ00449   653 IIKSPKPPKSPKPPFDpKFKEKFYDDYLDAAAKSKETKTTVV--LDESFESILKETLPETPGTPFTTPRPLPPKLPRDEE 730
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1889988573  913 -PEQPLSQ-KSPSTHFEDSFVPMTEKRPAPEEAKKSTSDFDLFSDM 956
Cdd:PTZ00449   731 fPFEPIGDpDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEE 776
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
691-811 1.10e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 1.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  691 SPHFSPPPDSLPTPIAPAPAsmPSPTEAVIAPVASTPPLASPQ--TSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQ 768
Cdd:PRK14971   380 KPVFTQPAAAPQPSAAAAAS--PSPSQSSAAAQPSAPQSATQPagTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKK 457
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1889988573  769 PSPAfdaieaakLLGLPGVPVTSAVKKRVPLNQLAQAKVVTRS 811
Cdd:PRK14971   458 IPVS--------KVSSLGPSTLRPIQEKAEQATGNIKEAPTGT 492
PHA03379 PHA03379
EBNA-3A; Provisional
587-830 1.13e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 44.28  E-value: 1.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  587 PVTMPDdPFAPKNGDTAVSASKSKGGINDLFAISSSKPEQALNLFASSSLDAAEPGKDPFLSPKPAEPEVdlfnIQSP-T 665
Cdd:PHA03379   419 PVEKPR-PEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGV----VQDGrP 493
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  666 APGAAPPTFLPI------GIMQSEAVEPEKESP-HFSPPPDSLPT-----PIAPAP--ASMPSPTEA----------VIA 721
Cdd:PHA03379   494 ACAPVPAPAGPIvrpweaSLSQVPGVAFAPVMPqPMPVEPVPVPTvalerPVCPAPplIAMQGPGETsgivrvrerwRPA 573
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  722 PVASTPPL-----------------ASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAakllgl 784
Cdd:PHA03379   574 PWTPNPPRspsqmsvrdrlarlraeAQPYQASVEVQPPQLTQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRA------ 647
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1889988573  785 PGVPVTSAVKKRVPLNQlaqaKVVTRSPVqetakSPVEEAIGATPP 830
Cdd:PHA03379   648 GGVPAMQPQYFDLPLQQ----PISQGAPL-----APLRASMGPVPP 684
PRK11633 PRK11633
cell division protein DedD; Provisional
695-772 1.27e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  695 SPPPDSLPtPIAPAPASMPSP--TEAVIAPVASTPPLASPQTSPPN------------------PQPPAVTKPSTPKEKA 754
Cdd:PRK11633    50 RDEPDMMP-AATQALPTQPPEgaAEAVRAGDAAAPSLDPATVAPPNtpvepepapveppkpkpvEKPKPKPKPQQKVEAP 128
                           90
                   ....*....|....*...
gi 1889988573  755 PSAKPSQPSKPSAQPSPA 772
Cdd:PRK11633   129 PAPKPEPKPVVEEKAAPT 146
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1062-1263 1.30e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1062 SSPSQPEAPTSNAVAPATLSAGEVFAPSPPDAAAADDLFASPGEPAAMPDDPFASKPGDEANTPDLFAGATGAAAMSPSI 1141
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1142 PSP--------PTAISPSRESLKLEELDPFAPQKSASKEGTPVSSPPKKTEVVAPLASPEEELNLSPMQHISVADIKP-T 1212
Cdd:PRK12323   452 PAPaaapaaaaRPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPaT 531
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1889988573 1213 AQSPVEDAGLAMGPPLTTIDPFSPKAAVGSPATPPRQTKKKYNP-FKAETPT 1263
Cdd:PRK12323   532 ADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDmFDGDWPA 583
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
705-769 1.57e-03

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 43.13  E-value: 1.57e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1889988573  705 IAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPsTPKEKAPSAKPSQPSKPSAQP 769
Cdd:PTZ00144   118 IDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKP-TPPAAAKPPEPAPAAKPPPTP 181
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
699-774 1.59e-03

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 42.99  E-value: 1.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1889988573  699 DSLPTPIAPAPA-SMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTkpstpkekaPSAKPSQPSKPSAQPSPAFD 774
Cdd:pfam07174   40 DPEPAPPPPSTAtAPPAPPPPPPAPAAPAPPPPPAAPNAPNAPPPPAD---------PNAPPPPPADPNAPPPPAVD 107
PRK10819 PRK10819
transport protein TonB; Provisional
676-772 1.62e-03

transport protein TonB; Provisional


Pssm-ID: 236768 [Multi-domain]  Cd Length: 246  Bit Score: 42.36  E-value: 1.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  676 PIGIMQseaVEPEKESPHFSPPPDSLPTPiAPAPASMPSPTEAVIAPVASTPPLASPQTSP-PNPQP------------- 741
Cdd:PRK10819    47 PISVTM---VAPADLEPPQAVQPPPEPVV-EPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPkPKPKPvkkveeqpkrevk 122
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1889988573  742 PAVTKPSTPKEKAPSAKPSQPSKPSAQPSPA 772
Cdd:PRK10819   123 PVEPRPASPFENTAPARPTSSTATAAASKPV 153
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
521-769 2.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  521 TPTQPSPT-NPFLADIAAKQQALSPTNPLktqegakkdnfdpfgTQQPSKAVDPFGAEFEGDSFEAEPVTMPDDPFAPKN 599
Cdd:pfam03154  145 SPSIPSPQdNESDSDSSAQQQILQTQPPV---------------LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQ 209
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  600 GDTAVSaskskggindlfaissSKPEQALNLFASSSLDAAEPGKDPFLSPKPaepevdlfniQSPTAPGAAPPTflpigi 679
Cdd:pfam03154  210 GSPATS----------------QPPNQTQSTAAPHTLIQQTPTLHPQRLPSP----------HPPLQPMTQPPP------ 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  680 mqseavePEKESPHFSPPPdSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKP 759
Cdd:pfam03154  258 -------PSQVSPQPLPQP-SLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP 329
                          250
                   ....*....|
gi 1889988573  760 SQPSKPSAQP 769
Cdd:pfam03154  330 SQSQLQSQQP 339
rne PRK10811
ribonuclease E; Reviewed
676-848 2.96e-03

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 42.72  E-value: 2.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  676 PIGIMQSEAVEPEKESPHFSPPPDSLPTP-IAPAPASMPSPT-EAVIAPVASTPPLASPQtsPPNPQPPAVTKPSTpkEK 753
Cdd:PRK10811   846 PVVRPQDVQVEEQREAEEVQVQPVVAEVPvAAAVEPVVSAPVvEAVAEVVEEPVVVAEPQ--PEEVVVVETTHPEV--IA 921
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  754 APSAKPSQPSKPSAQPSPAFDAIEAAkllglpgvPVTSAVKKRVPLNQLAQ-AKVVTRSPvqETAKSPVEEAIGATPPES 832
Cdd:PRK10811   922 APVTEQPQVITESDVAVAQEVAEHAE--------PVVEPQDETADIEEAAEtAEVVVAEP--EVVAQPAAPVVAEVAAEV 991
                          170
                   ....*....|....*.
gi 1889988573  833 KSAHVEEPAASAAQPV 848
Cdd:PRK10811   992 ETVTAVEPEVAPAQVP 1007
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
661-764 2.97e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  661 IQSPTAPGAAPPTflpigimqSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEaviaPVASTPPLASPQTSPPNPQ 740
Cdd:PRK14950   360 LVPVPAPQPAKPT--------AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETAT----PPPVPPRPVAPPVPHTPES 427
                           90       100
                   ....*....|....*....|....
gi 1889988573  741 PPAVTKPSTPKEKAPSAKPSQPSK 764
Cdd:PRK14950   428 APKLTRAAIPVDEKPKYTPPAPPK 451
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
700-772 3.06e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 42.64  E-value: 3.06e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1889988573  700 SLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAvTKPSTPKEKAPSAKPSQPSKPSAQPSPA 772
Cdd:PRK14948   516 SASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTA-TQASSNAPAQIPADSSPPPPIPEEPTPS 587
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
701-786 3.66e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 42.26  E-value: 3.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  701 LPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVtkPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEAAK 780
Cdd:PRK14948   360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATT--PSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437

                   ....*.
gi 1889988573  781 LLGLPG 786
Cdd:PRK14948   438 SLNLEE 443
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
636-782 3.78e-03



Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 42.10  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  636 LDAAEPGKDPFLSPKP------AEPEVDL-----------FNIQSPTAPGAAPPTFLPIGIMQSEAvepekesphfsPPP 698
Cdd:PRK12373   166 IDAFAAGKGPVVKPGPqigryaSEPAGGLtslteeagkarYNASKALAEDIGDTVKRIDGTEVPLL-----------APW 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  699 DSLPTPIAPAPASMPSPTEAviapvASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFDAIEA 778
Cdd:PRK12373   235 QGDAAPVPPSEAARPKSADA-----ETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPALED 309

                   ....
gi 1889988573  779 AKLL 782
Cdd:PRK12373   310 KPRP 313
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
657-742 4.22e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 4.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  657 DLFNIQSPTAPGAAPPtflPIGIMQSEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSP 736
Cdd:PRK12270    31 EFFADYGPGSTAAPTA---AAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107

                   ....*.
gi 1889988573  737 PNPQPP 742
Cdd:PRK12270   108 PAAAAV 113
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
695-766 4.92e-03



Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.87  E-value: 4.92e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1889988573  695 SPPPDSLPTPIAPAPASMPSPTEAVIAPVaSTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPSKPS 766
Cdd:PRK14948   525 PPPQKSPPPPAPTPPLPQPTATAPPPTPP-PPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPE 595
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
693-763 5.53e-03



Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 41.91  E-value: 5.53e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1889988573  693 HFSPPPDSLPTPIAPAPASMPSPTEAviAPVA-STPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPS 763
Cdd:COG5373     36 ELAEAAEAASAPAEPEPEAAAAATAA--APEAaPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAAS 105
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
634-772 5.71e-03



Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.84  E-value: 5.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  634 SSLDAAEPGKDPFLSPKPAEPEVDLFNIQSPTAPGAAPPTFLPIGIMQSEAVEPEKESP---------HFSPPPDSLPTP 704
Cdd:PLN03209   420 SNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTApataatdaaAPPPANMRPLSP 499
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1889988573  705 IAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAKPSQPS------KPSAQPSPA 772
Cdd:PLN03209   500 YAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYtmyedlKPPTSPTPS 573
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
687-769 5.92e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 5.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  687 PEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPstPKEKAPSAKPSQPSKPS 766
Cdd:PRK12270    39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAP--PAAAAAAAPAAAAVEDE 116

                   ...
gi 1889988573  767 AQP 769
Cdd:PRK12270   117 VTP 119
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
682-848 7.28e-03



Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 41.09  E-value: 7.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  682 SEAVEPEKESPHFSPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPQPPAVTKPSTPKEKAPSAkpsq 761
Cdd:PTZ00436   195 AAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAP---- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  762 PSKPSAQPSPAfdAIEAAKLLGLPGVPVTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPveeAIGATPPESKSAhveEPA 841
Cdd:PTZ00436   271 PAKAAAPPAKA--AAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPP---AKAATPPAKAAA---PPA 342

                   ....*..
gi 1889988573  842 ASAAQPV 848
Cdd:PTZ00436   343 KAAAAPV 349
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
662-1012 7.77e-03



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 7.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  662 QSPTAPGAAPPTFLPIGIMQSEAVEPEKESPHfsPPPDSLPtpiAPAPASMPSPTEAVIAPVASTPPLASPQTSP-PNPQ 740
Cdd:PRK07764   401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA--PAPAPAP---APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPaAAPE 475
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  741 PPAVTKPSTPKEKAPSAKPSQPSKPSAQPSP--------------------------------------------AFDAI 776
Cdd:PRK07764   476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAddaatlrerwpeilaavpkrsrktwaillpeatvlgvrgdtlvlGFSTG 555
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  777 EAAKLLGLPGVP--VTSAVKKRVPLNQLAQAKVVTRSPVQETAKSPVEEAIGATPPESKSAHVEEPAASAAQPVDmfEDA 854
Cdd:PRK07764   556 GLARRFASPGNAevLVTALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAA 633
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  855 FVPDElnASEKSAPTKSASAHFEDAFVPGGLMDTDEGGPTKSFSANFEDAFVVQEWPEPEQPLSQKSPSTHFEDSFVPMT 934
Cdd:PRK07764   634 AAPAE--ASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPA 711
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  935 EKRPAPEEAKKSTSDFDLfSDMETTAKPLSMPADPFASTGGDAK------TPELFTSAAKAATPPAAMTIDFFGSVAEDV 1008
Cdd:PRK07764   712 GQADDPAAQPPQAAQGAS-APSPAADDPVPLPPEPDDPPDPAGApaqpppPPAPAPAAAPAAAPPPSPPSEEEEMAEDDA 790

                   ....
gi 1889988573 1009 PPPD 1012
Cdd:PRK07764   791 PSMD 794
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1001-1301 8.61e-03



Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 8.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1001 FGSVAEDVPPPDLLGGSPeaelkpamiptNAFVPKPDEAVKQDVLEGQAAAQPETMAIDLLSSPSQPE-APTSNAVAPAT 1079
Cdd:PHA03307    23 RPPATPGDAADDLLSGSQ-----------GQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEApANESRSTPTWS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1080 LSAGEVFAPSPPDaaaaddlfaSPGEPAAMPDDPFASKPGDEANTPDlfaGATGAAAMSPSIPSPPTAISPSRESLKLEE 1159
Cdd:PHA03307    92 LSTLAPASPAREG---------SPTPPGPSSPDPPPPTPPPASPPPS---PAPDLSEMLRPVGSPGPPPAASPPAAGASP 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1160 LDPFAPQKSASKEGTPVSSPPkktEVVAPLASPEEELNLSPmqhisvADIKPTAQSPVEDAGLAMGPPLTTIDPfSPKAA 1239
Cdd:PHA03307   160 AAVASDAASSRQAALPLSSPE---ETARAPSSPPAEPPPST------PPAAASPRPPRRSSPISASASSPAPAP-GRSAA 229
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1889988573 1240 VGSPATPPRQTKKKYNPFKAETPTTPPDEEASPFPTVKLPISVVQPLPASPDTSPLDESFPP 1301
Cdd:PHA03307   230 DDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSP 291
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
645-741 9.04e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain, SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. Mutations in this protein were found in seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 40.57  E-value: 9.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  645 PFLSPKPAEPEVDlFNIQSPTAPGAAPPTFLPIGIMQ-SEAVEPEKESPHFSPPPdslPTPIAPAPASMPSPTEAVIAPV 723
Cdd:pfam15279  198 PFLRPPPSIPQPN-SPLSNPMLPGIGPPPKPPRNLGPpSNPMHRPPFSPHHPPPP---PTPPGPPPGLPPPPPRGFTPPF 273
                           90
                   ....*....|....*...
gi 1889988573  724 ASTPPlasPQTSPPNPQP 741
Cdd:pfam15279  274 GPPFP---PVNMMPNPPE 288
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1101-1305 9.48e-03



Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 9.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1101 ASPGEPAAMP--DDPFASKPGDEANTPDLFAGATGAAAMSPSIPSPPTAISPSRESLKLEELdpfAPQKSASKEGTPVSS 1178
Cdd:PRK12323   372 AGPATAAAAPvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL---AAARQASARGPGGAP 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573 1179 PPKKTEVVAPLASPeeelnlspmqhisvadiKPTAQSPVEDAGLAMGPPlttidpfspkaAVGSPATPPRQTKKKYNPFK 1258
Cdd:PRK12323   449 APAPAPAAAPAAAA-----------------RPAAAGPRPVAAAAAAAP-----------ARAAPAAAPAPADDDPPPWE 500
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1889988573 1259 aETPTTPPdeEASPFPTVKLPISVVQPLPASPDTSPLDESFPPKCEE 1305
Cdd:PRK12323   501 -ELPPEFA--SPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPA 544
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
695-774 9.89e-03



Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.10  E-value: 9.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1889988573  695 SPPPDSLPTPIAPAPASMPSPTEAVIAPVASTPPLASPQTSPPNPqPPAVTKPSTPKEKAPSAKPSQPSKPSAQPSPAFD 774
Cdd:PRK14948   362 SAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSP-PPAKASPPIPVPAEPTEPSPTPPANAANAPPSLN 440
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
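
The per-hit summary lines in this listing ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") follow a regular layout, so they can be pulled out of a saved plain-text copy of the page and compared against the E-value threshold stated above. The following is a minimal Python sketch under stated assumptions: the names `HitSummary`, `parse_summaries`, and `within_threshold` are hypothetical helpers, not part of any NCBI tool, and treating the threshold as inclusive (`<=`) is an assumption rather than documented CD-Search behavior.

```python
import re
from typing import List, NamedTuple


class HitSummary(NamedTuple):
    """One 'Pssm-ID ...' summary line from the hit listing (hypothetical type)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Matches the summary-line layout seen in this listing. The text between the
# Pssm-ID and "Cd Length" (for example "[Multi-domain]") is skipped.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*([\d.eE+-]+)"
)


def parse_summaries(text: str) -> List[HitSummary]:
    """Extract every summary line found in a plain-text copy of the page."""
    return [
        HitSummary(int(m.group(1)), int(m.group(2)),
                   float(m.group(3)), float(m.group(4)))
        for m in SUMMARY_RE.finditer(text)
    ]


def within_threshold(hits: List[HitSummary], threshold: float = 0.01) -> List[HitSummary]:
    """Keep hits whose E-value does not exceed the threshold (inclusive by assumption)."""
    return [h for h in hits if h.e_value <= threshold]


# Two summary lines copied from the listing above.
SAMPLE = (
    "Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.97e-03\n"
    "Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.10  E-value: 9.89e-03\n"
)

if __name__ == "__main__":
    for hit in within_threshold(parse_summaries(SAMPLE)):
        print(hit)
```

A stricter cutoff can be passed as the second argument, for example `within_threshold(hits, 0.005)`, to keep only the stronger of these low-complexity matches.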

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.