Conserved domains on  [gi|1987696148|ref|XP_039427727|]

THO complex subunit 2 isoform X1 [Corvus cornix cornix]

Protein Classification

Thoc2 and Tho2 domain-containing protein (domain architecture ID 11069020)

protein containing domains THOC2_N, Thoc2, and Tho2


List of domain hits

Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
870-1169 3.40e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO



Pssm-ID: 463251  Cd Length: 304  Bit Score: 401.61  E-value: 3.40e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  870 DDISPQFYATFWSLTMYDLAVPHSSYDREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQLEHV 945
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  946 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCAHFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1023
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1024 VASCTENEASRYGRFLCCMLETVTRWHSDRVIYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1100
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1987696148 1101 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1169
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
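Each alignment row above carries a label, a start coordinate, the aligned residues, and an end coordinate. As a minimal sketch, assuming that layout holds for every row (it is inferred from the text shown here, not from an NCBI format specification), the hypothetical helpers below parse one row and check that the coordinate span equals the number of non-gap residues.

```python
import re

# Hypothetical row pattern: label, start coordinate, aligned residues, end coordinate.
# The layout is inferred from the alignment text above and is an assumption.
ROW = re.compile(
    r"^(?P<label>\S.*?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def coordinates_consistent(line):
    """True when end - start + 1 equals the count of non-gap characters."""
    _, start, seq, end = parse_row(line)
    residues = sum(1 for c in seq if c != "-")
    return end - start + 1 == residues
```

For the first query row of the Tho2 hit, the span 870-945 covers 76 positions, which matches the 80 alignment columns minus the 4 gap characters.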
THOC2_N super family cl24644
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
8-562 1.03e-80

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


The actual alignment was detected with superfamily member pfam16134:

Pssm-ID: 465032  Cd Length: 614  Bit Score: 279.51  E-value: 1.03e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148    8 EWIKNWEKGGKSEfvqLCRALSENKNHDVGfRDIQQALYELAYHVVRGNLKHDQASNVLGDVI-EFREDMPSILADVFCi 86
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDE-DELSDLFQELIRSVLDGRLDPEDAGSFLKEIIkEEPTDSSEDVAKLFL- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148   87 lDIEtSCLEEKNKRDHFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREENE 166
Cdd:pfam16134   78 -DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREESE 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  167 GYAKLIAELgQDLSGNITSDLI---LENIKSLIGCFNLDPNRVLDIILEVYECR-PEYDDFFV---------PLIESYMY 233
Cdd:pfam16134  153 GYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVkflrasswwPRTEESDW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  234 MCEPQTLCH--------ILGFKFKFYQ-DPSGETPSSLYRVAAVLLQHNLIDLEDLYVHLLPGDNT---IIEEHKREIve 301
Cdd:pfam16134  232 ISSTKTLPPggnrvaaqLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDEEmeaLKEEYKKEL-- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  302 AKQIVRK----LTMV-VLS-------------SEKTEEKEKEKEKEEEKTEKPPDNQKLGLLEALLKIGDWHHSQSIMDQ 363
Cdd:pfam16134  310 EEESMEGganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  364 MpPFYATSHKPIAIALCQLVHVTVEPLYR-----------RVGVPKGAKgSPISSLPNKRA---PKQAESFEELRK---- 425
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNI-VRLDENPPRRLlrwPKTDKPFFDLGTkyrf 467
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  426 ----------------EVF----NMLCYLGPHLSHDPILFAKVVRLGKAFMKEfqsDGSKQEDKEKMETLFScllsitdQ 485
Cdd:pfam16134  468 yydewkdnlpvcqtvdDLFtlshEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRWIDYLR-------R 537
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1987696148  486 VLLPSLSLMDCNACMSEELWGMFKTFPYQYRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 562
Cdd:pfam16134  538 FIFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
564-638 3.13e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TREX complex comprises the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), THOC2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.



Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.31  E-value: 3.13e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1987696148  564 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 638
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
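The paired rows above can be compared column by column. The following is a rough sketch of a percent-identity calculation over aligned columns; the function name is hypothetical, and the choice to skip gap columns and to compare case-insensitively (lowercase residues appear to mark unaligned positions in this output) is an assumption, not NCBI's own scoring.

```python
def percent_identity(query, subject):
    """Percent of identical residues over columns where neither row has a gap.

    Assumptions: both strings come from the same alignment block and have
    equal length; '-' marks a gap; case is ignored.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    aligned = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not aligned:
        return 0.0
    matches = sum(1 for q, s in aligned if q == s)
    return 100.0 * matches / len(aligned)
```

Applied to the two Thoc2 rows (75 columns, no gaps), this gives a simple identity figure that is independent of the bit score and E-value reported by the search.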
U2AF_lg super family cl36941
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
1476-1573 1.60e-06

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 52.59  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1476 KSKEREVEKK---DLDKSRERSRERE--KKDEKDRKDRKRDHSNSDREVPQDSTKRRKEENGTTGSSKHSKSESP----S 1546
Cdd:TIGR01642    3 EEPDREREKSrgrDRDRSSERPRRRSrdRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRprrrS 82
                           90       100
                   ....*....|....*....|....*..
gi 1987696148 1547 DSPRLNEKEKDKNKSKSSGKEKGDSIK 1573
Cdd:TIGR01642   83 RSVRSIEQHRRRLRDRSPSNQWRKDDK 109
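The model coordinates in the Cdd rows, read against the reported Cd Length, show how much of a domain model a hit actually spans. A minimal sketch with a hypothetical helper, assuming 1-based inclusive model coordinates as they appear in the alignment rows:

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model covered by the aligned region.

    Assumes 1-based, inclusive coordinates taken from the Cdd rows and the
    'Cd Length' value from the Pssm-ID line.
    """
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("model coordinates must satisfy 1 <= start <= end <= length")
    return (model_end - model_start + 1) / cd_length
```

For the Thoc2 hit (model positions 1-75, Cd Length 75) this returns 1.0, whereas the U2AF_lg hit (model positions 3-109, Cd Length 509) covers 107 of 509 positions, roughly a fifth of the model, confined to its N-terminal arginine-rich low-complexity region.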
PTZ00121 super family cl31754
MAEBL; Provisional
1294-1620 1.28e-03

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1294 LGKDGKDKPKEERANKD-EKAREIKEKTPKSDKDKEKVKKEEKASKEEKSKTVVTIIESKSTAEKEREKEPSRErdlAKE 1372
Cdd:PTZ00121  1462 AKKKAEEAKKADEAKKKaEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK---ADE 1538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1373 MKSKENAKGGEKVPVAGSLKSPVPRSETSESEREQKRRKVdthsspshssTVKGTAVLPKVplvsenySSSRVISVHFLQ 1452
Cdd:PTZ00121  1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----------ALRKAEEAKKA-------EEARIEEVMKLY 1601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1453 DNLNELKDSSAKHYVNHTTPTLSKSKEREvEKKDLDKSRERSREREKKDEKDRKDRKRDHSNSDREV-PQDSTKRRKEEN 1531
Cdd:PTZ00121  1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEE-EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAkKAEEDKKKAEEA 1680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1532 GTTGSSKHSKSESPSDSPRLNEKEKDKNKSKSSGKEKGDSIKAEKMEKSSSGSKKESRHDKEKAEKKEKRDSTGGKEEKK 1611
Cdd:PTZ00121  1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760

                   ....*....
gi 1987696148 1612 HHKSSDKHR 1620
Cdd:PTZ00121  1761 HLKKEEEKK 1769
 
Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
870-1169 3.40e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


Pssm-ID: 463251  Cd Length: 304  Bit Score: 401.61  E-value: 3.40e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  870 DDISPQFYATFWSLTMYDLAVPHSSYDREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQLEHV 945
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  946 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCAHFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1023
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1024 VASCTENEASRYGRFLCCMLETVTRWHSDRVIYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1100
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1987696148 1101 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1169
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
THOC2_N pfam16134
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
8-562 1.03e-80

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


Pssm-ID: 465032  Cd Length: 614  Bit Score: 279.51  E-value: 1.03e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148    8 EWIKNWEKGGKSEfvqLCRALSENKNHDVGfRDIQQALYELAYHVVRGNLKHDQASNVLGDVI-EFREDMPSILADVFCi 86
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDE-DELSDLFQELIRSVLDGRLDPEDAGSFLKEIIkEEPTDSSEDVAKLFL- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148   87 lDIEtSCLEEKNKRDHFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREENE 166
Cdd:pfam16134   78 -DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREESE 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  167 GYAKLIAELgQDLSGNITSDLI---LENIKSLIGCFNLDPNRVLDIILEVYECR-PEYDDFFV---------PLIESYMY 233
Cdd:pfam16134  153 GYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVkflrasswwPRTEESDW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  234 MCEPQTLCH--------ILGFKFKFYQ-DPSGETPSSLYRVAAVLLQHNLIDLEDLYVHLLPGDNT---IIEEHKREIve 301
Cdd:pfam16134  232 ISSTKTLPPggnrvaaqLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDEEmeaLKEEYKKEL-- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  302 AKQIVRK----LTMV-VLS-------------SEKTEEKEKEKEKEEEKTEKPPDNQKLGLLEALLKIGDWHHSQSIMDQ 363
Cdd:pfam16134  310 EEESMEGganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  364 MpPFYATSHKPIAIALCQLVHVTVEPLYR-----------RVGVPKGAKgSPISSLPNKRA---PKQAESFEELRK---- 425
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNI-VRLDENPPRRLlrwPKTDKPFFDLGTkyrf 467
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  426 ----------------EVF----NMLCYLGPHLSHDPILFAKVVRLGKAFMKEfqsDGSKQEDKEKMETLFScllsitdQ 485
Cdd:pfam16134  468 yydewkdnlpvcqtvdDLFtlshEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRWIDYLR-------R 537
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1987696148  486 VLLPSLSLMDCNACMSEELWGMFKTFPYQYRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 562
Cdd:pfam16134  538 FIFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
564-638 3.13e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TREX complex comprises the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), THOC2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.31  E-value: 3.13e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1987696148  564 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 638
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
1476-1573 1.60e-06

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 52.59  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1476 KSKEREVEKK---DLDKSRERSRERE--KKDEKDRKDRKRDHSNSDREVPQDSTKRRKEENGTTGSSKHSKSESP----S 1546
Cdd:TIGR01642    3 EEPDREREKSrgrDRDRSSERPRRRSrdRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRprrrS 82
                           90       100
                   ....*....|....*....|....*..
gi 1987696148 1547 DSPRLNEKEKDKNKSKSSGKEKGDSIK 1573
Cdd:TIGR01642   83 RSVRSIEQHRRRLRDRSPSNQWRKDDK 109
PTZ00121 PTZ00121
MAEBL; Provisional
1294-1620 1.28e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1294 LGKDGKDKPKEERANKD-EKAREIKEKTPKSDKDKEKVKKEEKASKEEKSKTVVTIIESKSTAEKEREKEPSRErdlAKE 1372
Cdd:PTZ00121  1462 AKKKAEEAKKADEAKKKaEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK---ADE 1538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1373 MKSKENAKGGEKVPVAGSLKSPVPRSETSESEREQKRRKVdthsspshssTVKGTAVLPKVplvsenySSSRVISVHFLQ 1452
Cdd:PTZ00121  1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----------ALRKAEEAKKA-------EEARIEEVMKLY 1601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1453 DNLNELKDSSAKHYVNHTTPTLSKSKEREvEKKDLDKSRERSREREKKDEKDRKDRKRDHSNSDREV-PQDSTKRRKEEN 1531
Cdd:PTZ00121  1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEE-EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAkKAEEDKKKAEEA 1680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1532 GTTGSSKHSKSESPSDSPRLNEKEKDKNKSKSSGKEKGDSIKAEKMEKSSSGSKKESRHDKEKAEKKEKRDSTGGKEEKK 1611
Cdd:PTZ00121  1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760

                   ....*....
gi 1987696148 1612 HHKSSDKHR 1620
Cdd:PTZ00121  1761 HLKKEEEKK 1769
TCP2 PLN03106
Protein TCP2; Provisional
1473-1554 2.01e-03

Protein TCP2; Provisional


Pssm-ID: 215579 [Multi-domain]  Cd Length: 447  Bit Score: 42.76  E-value: 2.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1473 TLSKSKERevekkdlDKSRERSREREKKDEKDRKDRKRDHSNSDREVPQDS--TKRRKEENGTTGSSKHSKSESPSDSPR 1550
Cdd:PLN03106   200 SLSRSELR-------DKARERARERTAKEKEKEDHNHAAHHNNNNPISQNSsfTELLTGGIDPVNNNRQWMASAPSGQKA 272

                   ....
gi 1987696148 1551 LNEK 1554
Cdd:PLN03106   273 AAAR 276
DMP1 pfam07263
Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix ...
1477-1571 2.03e-03

Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21. DMP1 is a bone and teeth specific protein initially identified from mineralized dentin. DMP1 is primarily localized in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca(2+) surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralized matrix formation extracellularly, at later stages of osteoblast maturation. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown.


Pssm-ID: 462128 [Multi-domain]  Cd Length: 519  Bit Score: 42.61  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1477 SKEREVEKKDLDKSRERSREREKKDEKDRKDRKR------------DHSNSDREVPQDSTKRR-KEENGTTGSSKHSKSE 1543
Cdd:pfam07263  236 VKSKESKGDSEQASTQDSGDSQSVEYPSRKFFRKsriseeddrgelDDSNTMEEVKSDSTESTsSKEAGLSQSREDSKSE 315
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1987696148 1544 SPSDSPRLNEKEKDKN----KSKSSGKEKGDS 1571
Cdd:pfam07263  316 SQEDSEESQSQEDSQNsqdpSSESSQEADLPS 347
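Several of the weaker hits listed above (U2AF_lg, PTZ00121, TCP2, DMP1) fall on overlapping stretches of the same C-terminal region. As an illustrative sketch, the hypothetical function below merges overlapping query intervals so that distinct regions stand out; it uses a plain sort-and-sweep merge and is not how CD-Search itself groups hits.

```python
def merge_intervals(intervals):
    """Merge overlapping 1-based inclusive (start, end) intervals.

    Intervals that merely touch (for example 8-562 and 564-638) stay separate;
    only true overlaps are merged.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Query intervals copied from the hit list above.
hits = [
    (870, 1169),   # Tho2
    (8, 562),      # THOC2_N
    (564, 638),    # Thoc2
    (1476, 1573),  # U2AF_lg
    (1294, 1620),  # PTZ00121
    (1473, 1554),  # TCP2
    (1477, 1571),  # DMP1
]
print(merge_intervals(hits))
# → [(8, 562), (564, 638), (870, 1169), (1294, 1620)]
```

The four low-scoring multi-domain hits collapse into a single region, 1294-1620, while the three specific THO complex domains remain separate.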
 
Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
870-1169 3.40e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


Pssm-ID: 463251  Cd Length: 304  Bit Score: 401.61  E-value: 3.40e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  870 DDISPQFYATFWSLTMYDLAVPHSSYDREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQLEHV 945
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  946 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCAHFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1023
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1024 VASCTENEASRYGRFLCCMLETVTRWHSDRVIYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1100
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1987696148 1101 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1169
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
THOC2_N pfam16134
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
8-562 1.03e-80

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


Pssm-ID: 465032  Cd Length: 614  Bit Score: 279.51  E-value: 1.03e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148    8 EWIKNWEKGGKSEfvqLCRALSENKNHDVGfRDIQQALYELAYHVVRGNLKHDQASNVLGDVI-EFREDMPSILADVFCi 86
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDE-DELSDLFQELIRSVLDGRLDPEDAGSFLKEIIkEEPTDSSEDVAKLFL- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148   87 lDIEtSCLEEKNKRDHFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREENE 166
Cdd:pfam16134   78 -DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREESE 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  167 GYAKLIAELgQDLSGNITSDLI---LENIKSLIGCFNLDPNRVLDIILEVYECR-PEYDDFFV---------PLIESYMY 233
Cdd:pfam16134  153 GYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVkflrasswwPRTEESDW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  234 MCEPQTLCH--------ILGFKFKFYQ-DPSGETPSSLYRVAAVLLQHNLIDLEDLYVHLLPGDNT---IIEEHKREIve 301
Cdd:pfam16134  232 ISSTKTLPPggnrvaaqLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDEEmeaLKEEYKKEL-- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  302 AKQIVRK----LTMV-VLS-------------SEKTEEKEKEKEKEEEKTEKPPDNQKLGLLEALLKIGDWHHSQSIMDQ 363
Cdd:pfam16134  310 EEESMEGganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  364 MpPFYATSHKPIAIALCQLVHVTVEPLYR-----------RVGVPKGAKgSPISSLPNKRA---PKQAESFEELRK---- 425
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNI-VRLDENPPRRLlrwPKTDKPFFDLGTkyrf 467
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148  426 ----------------EVF----NMLCYLGPHLSHDPILFAKVVRLGKAFMKEfqsDGSKQEDKEKMETLFScllsitdQ 485
Cdd:pfam16134  468 yydewkdnlpvcqtvdDLFtlshEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRWIDYLR-------R 537
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1987696148  486 VLLPSLSLMDCNACMSEELWGMFKTFPYQYRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 562
Cdd:pfam16134  538 FIFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
564-638 3.13e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TRE complex is comprised of the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), Thoc2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.31  E-value: 3.13e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1987696148  564 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 638
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
1476-1573 1.60e-06

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 52.59  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1476 KSKEREVEKK---DLDKSRERSRERE--KKDEKDRKDRKRDHSNSDREVPQDSTKRRKEENGTTGSSKHSKSESP----S 1546
Cdd:TIGR01642    3 EEPDREREKSrgrDRDRSSERPRRRSrdRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRprrrS 82
                           90       100
                   ....*....|....*....|....*..
gi 1987696148 1547 DSPRLNEKEKDKNKSKSSGKEKGDSIK 1573
Cdd:TIGR01642   83 RSVRSIEQHRRRLRDRSPSNQWRKDDK 109
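Each hit carries a summary line (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`), and the alignment rows give the first and last model positions that were matched. A short sketch of how these could be combined to see how much of a model a hit actually covers is shown below. The helper names `parse_summary` and `model_coverage` are hypothetical, and the regular expression assumes the summary line looks exactly like the ones printed on this page.

```python
import re

# Pattern for summary lines as they appear in this report (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned positions."""
    return (model_end - model_start + 1) / cd_length


# Summary line and model positions (3 to 109) taken from the TIGR01642 hit above.
hit = parse_summary(
    "Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 52.59  E-value: 1.60e-06"
)
coverage = model_coverage(3, 109, hit["cd_length"])
print(f"model coverage: {coverage:.1%}")  # prints "model coverage: 21.0%"
```

A hit that spans only about a fifth of a 509-position multi-domain model, and does so in a lysine-, arginine- and serine-rich stretch of the query, is more plausibly explained by compositional similarity than by a shared domain, so coverage is worth checking before reading function into it.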
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1475-1568 8.59e-05

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of four H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 47.22  E-value: 8.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1475 SKSKEREvekKDLDKSRERSREREKkdEKDRkDRKRDHSnSDREVPQD-STKRRKEENGTTGSSKHSKSESPSDSPRLNE 1553
Cdd:TIGR01622    2 YRDRERE---RLRDSSSAGDRDRRR--DKGR-ERSRDRS-RDRERSRSrRRDRHRDRDYYRGRERRSRSRRPNRRYRPRE 74
                           90       100
                   ....*....|....*....|..
gi 1987696148 1554 KEKDK-------NKSKSSGKEK 1568
Cdd:TIGR01622   75 KRRRRgdsyrrrRDDRRSRREK 96
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1475-1557 8.24e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of four H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 8.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1475 SKSKEREVEKkdldkSRERSREREKKDEKDRKDRKRDHSNSDREVPQDSTKRRKEENGTTGSSKHSKSESPSDSPRLNEK 1554
Cdd:TIGR01622   32 DRSRDRERSR-----SRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDSYRRRRDDRRSRREKPRARDGTPEP 106

                   ...
gi 1987696148 1555 EKD 1557
Cdd:TIGR01622  107 LTE 109
PTZ00121 PTZ00121
MAEBL; Provisional
1294-1620 1.28e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1294 LGKDGKDKPKEERANKD-EKAREIKEKTPKSDKDKEKVKKEEKASKEEKSKTVVTIIESKSTAEKEREKEPSRErdlAKE 1372
Cdd:PTZ00121  1462 AKKKAEEAKKADEAKKKaEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK---ADE 1538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1373 MKSKENAKGGEKVPVAGSLKSPVPRSETSESEREQKRRKVdthsspshssTVKGTAVLPKVplvsenySSSRVISVHFLQ 1452
Cdd:PTZ00121  1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----------ALRKAEEAKKA-------EEARIEEVMKLY 1601
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1453 DNLNELKDSSAKHYVNHTTPTLSKSKEREvEKKDLDKSRERSREREKKDEKDRKDRKRDHSNSDREV-PQDSTKRRKEEN 1531
Cdd:PTZ00121  1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEE-EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAkKAEEDKKKAEEA 1680
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1532 GTTGSSKHSKSESPSDSPRLNEKEKDKNKSKSSGKEKGDSIKAEKMEKSSSGSKKESRHDKEKAEKKEKRDSTGGKEEKK 1611
Cdd:PTZ00121  1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760

                   ....*....
gi 1987696148 1612 HHKSSDKHR 1620
Cdd:PTZ00121  1761 HLKKEEEKK 1769
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
1476-1561 1.94e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.57  E-value: 1.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1476 KSKEREVEK-KDLDKSRERSRE----REKKDEKDRkDRKRDHSNSDREVPQDSTKR-----RKEENGTTGSSKHsKSESP 1545
Cdd:TIGR01642   19 RSSERPRRRsRDRSRFRDRHRRsrerSYREDSRPR-DRRRYDSRSPRSLRYSSVRRsrdrpRRRSRSVRSIEQH-RRRLR 96
                           90
                   ....*....|....*.
gi 1987696148 1546 SDSPRLNEKEKDKNKS 1561
Cdd:TIGR01642   97 DRSPSNQWRKDDKKRS 112
TCP2 PLN03106
Protein TCP2; Provisional
1473-1554 2.01e-03

Protein TCP2; Provisional


Pssm-ID: 215579 [Multi-domain]  Cd Length: 447  Bit Score: 42.76  E-value: 2.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1473 TLSKSKERevekkdlDKSRERSREREKKDEKDRKDRKRDHSNSDREVPQDS--TKRRKEENGTTGSSKHSKSESPSDSPR 1550
Cdd:PLN03106   200 SLSRSELR-------DKARERARERTAKEKEKEDHNHAAHHNNNNPISQNSsfTELLTGGIDPVNNNRQWMASAPSGQKA 272

                   ....
gi 1987696148 1551 LNEK 1554
Cdd:PLN03106   273 AAAR 276
DMP1 pfam07263
Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix ...
1477-1571 2.03e-03

Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21. DMP1 is a bone and teeth specific protein initially identified from mineralized dentin. DMP1 is primarily localized in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca(2+) surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralized matrix formation extracellularly, at later stages of osteoblast maturation. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown.


Pssm-ID: 462128 [Multi-domain]  Cd Length: 519  Bit Score: 42.61  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1477 SKEREVEKKDLDKSRERSREREKKDEKDRKDRKR------------DHSNSDREVPQDSTKRR-KEENGTTGSSKHSKSE 1543
Cdd:pfam07263  236 VKSKESKGDSEQASTQDSGDSQSVEYPSRKFFRKsriseeddrgelDDSNTMEEVKSDSTESTsSKEAGLSQSREDSKSE 315
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1987696148 1544 SPSDSPRLNEKEKDKN----KSKSSGKEKGDS 1571
Cdd:pfam07263  316 SQEDSEESQSQEDSQNsqdpSSESSQEADLPS 347
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
1452-1518 2.36e-03

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 38.99  E-value: 2.36e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1452 QDNLNELKDSSAKHyvNHTTPTLSKSKEREVEKKD---LDKSRERSREREKKDEKDRKDRKRDHSNSDRE 1518
Cdd:pfam12871   30 RASLSRKRRSRSRR--RSSTRDRSRSRSRSRSRDRrsrGTRDRRRDRDRDRYRSLRSRSRDRSRDRDRDR 97
SEEEED pfam14797
Serine-rich region of AP3B1, clathrin-adaptor complex; This short low-complexity, highly ...
1498-1575 3.11e-03

Serine-rich region of AP3B1, clathrin-adaptor complex; This short low-complexity, highly serine-rich region lies on clathrin-adaptor complex 3 beta-1 subunit proteins, between family Adaptin_N (pfam01602) and a C-terminal domain, AP3B1_C (pfam14796).


Pssm-ID: 434218 [Multi-domain]  Cd Length: 111  Bit Score: 39.14  E-value: 3.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1498 EKKDEKDRKDrkrdhSNSDREVPQDST--KRRKEENGTTGSSKHSKSESPSDSPRLNEKEK-DKNKSKSSGKEKGDSIKA 1574
Cdd:pfam14797    8 ESEEEEDSSD-----SSSDSESESGSEseEEGKEGSSSEDSSEDSSSEQESESGSESEKKRtAKRNSKAKGKSDSEDGEK 82

                   .
gi 1987696148 1575 E 1575
Cdd:pfam14797   83 K 83
PTZ00121 PTZ00121
MAEBL; Provisional
1303-1575 4.86e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 41.67  E-value: 4.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1303 KEERANKDEKAREIKEKTPKSDKDKEKVKKEEKASKEEKSKTVVTIIESKSTAEKEREKEPSRERDLAKEMKSKENAKGG 1382
Cdd:PTZ00121  1219 KAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1383 EKVPVAGSLKspvprsetsesEREQKRRKVDTHSSPSHSSTVKGTAVLPKVplvsenysssrvisvhflqdnlnELKDSS 1462
Cdd:PTZ00121  1299 EEKKKADEAK-----------KKAEEAKKADEAKKKAEEAKKKADAAKKKA-----------------------EEAKKA 1344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1987696148 1463 AKhyvnhttptlSKSKEREVEKKDLDKSRERSREREKKDEKDRKDRKRDHSNSDREVPQDSTKRRKEENGTTGSSKHSKS 1542
Cdd:PTZ00121  1345 AE----------AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1987696148 1543 ESPSDSPRLNEKEKDKNKSKSSGKEKGDSIKAE 1575
Cdd:PTZ00121  1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
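The E-value threshold of 0.01 determines which hits are reported at all. With the low complexity filter off, many of the weaker hits in this part of the list fall in the same charged, serine-rich region, so re-filtering at a stricter cutoff can be informative. The sketch below uses accessions, intervals and E-values copied from the hits shown in this section; the function name `filter_hits`, the tuple layout, and the inclusive comparison are assumptions made for illustration.

```python
# (accession, query_start, query_end, e_value) for the hits listed in this section.
HITS = [
    ("pfam11732", 564, 638, 3.13e-39),
    ("TIGR01642", 1476, 1573, 1.60e-06),
    ("TIGR01622", 1475, 1568, 8.59e-05),
    ("TIGR01622", 1475, 1557, 8.24e-04),
    ("PTZ00121", 1294, 1620, 1.28e-03),
    ("TIGR01642", 1476, 1561, 1.94e-03),
    ("PLN03106", 1473, 1554, 2.01e-03),
    ("pfam07263", 1477, 1571, 2.03e-03),
    ("pfam12871", 1452, 1518, 2.36e-03),
    ("pfam14797", 1498, 1575, 3.11e-03),
    ("PTZ00121", 1303, 1575, 4.86e-03),
]


def filter_hits(hits, threshold):
    """Keep hits with E-value at or below the threshold, best (lowest) first."""
    kept = [hit for hit in hits if hit[3] <= threshold]
    return sorted(kept, key=lambda hit: hit[3])


# A stricter cutoff than the page's preset of 0.01 (the value 1e-5 is arbitrary).
for accession, start, end, e_value in filter_hits(HITS, 1e-5):
    print(f"{accession}\t{start}-{end}\t{e_value:.2e}")
```

At the preset threshold every hit in the list passes; at the stricter illustrative cutoff only the pfam11732 hit and the first TIGR01642 hit remain.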
