Conserved domains on [gi|198470952|ref|XP_002133620|]
pre-rRNA-processing protein esf1 [Drosophila pseudoobscura]

Protein Classification

ESF1 family protein (domain architecture ID 1002617)

ESF1 family protein similar to Homo sapiens ESF1 pre-rRNA-processing protein homolog, which may constitute a novel regulatory system for basal transcription

Gene Ontology:  GO:0003723|GO:0006364

List of domain hits

Name                  Accession  Description                                           Interval  E-value
COG5638 super family  cl35032    Uncharacterized conserved protein [Function unknown]  44-783    3.58e-73

The actual alignment was detected with superfamily member COG5638:

Pssm-ID: 227925 [Multi-domain]  Cd Length: 622  Bit Score: 250.47  E-value: 3.58e-73
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952  44 DARFQHLLSDPRFKGVPKVQRKVKIDKRFQGmfTDEKFKVKYTVDKYGRPVNTSNA-EDLRKFYELDENDSDDGEKEaea 122
Cdd:COG5638   12 DPRFQSVHSDPRFSRLKRGNFKVKVDERFKK--EDKDFKTTASVDRYGRPLNQDKAtKEIDRLYELENESSESSEIT--- 86
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 123 eaevvakkeieeeeqkerraeelaiacvdvkedkfENDSLDSDGSEVPENLrerltnpnvDYARGEGRLMTDSSSDDDTD 202
Cdd:COG5638   87 -----------------------------------DNEEVASASSELTDEY---------DPARGEGIISTSESSDESRE 122
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 203 DDEGAEGPELQidhvwgELDNDAESTEVSTRRLAICNMDWDRIRAEDLMVLLSSFLPLGGSILSVKIYPSEFGKARLAEE 282
Cdd:COG5638  123 ESEEEKANEIS------EKAGAVPEEGNPTKRLAVVNMDWDRVDAKDLFKIFSSFLPYGGKLSKVKIYPSEFGKERMAAE 196
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 283 EIHGPAELVKRDEREQEEEDDSDEELVkEQDSDAEE--------GDDYHMEKLRQYQLNRLRYYYAVAECDSVATADKVY 354
Cdd:COG5638  197 HVQGPPRDIFTPADNQPSSQKFGDDNV-FSDRDAGEdaliegdrGNEFDMVKLRQYQLERLRYYYAVVECEDIETSKNIY 275
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 355 KECDGIEYESSATRVDLRFIPDDTSFEEDTpKDECFELPDasNYKPRQFTTTALQQAKVDLTWDETALDRRELGDKLSSG 434
Cdd:COG5638  276 SACDGVEYENSANVLDLRFVPDSLTFDDDS-REVCTKAPE--KYEPRDFVTDALQHSKVKLSWDAEDPHRKDLCKEAFTD 352
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 435 qvDKLTDKELRQIVAYSSEEDEDEDEEQpqaeekkeqqqpeqkqhqppkklSKQERIASYKNLLADILQKEKQEKEHKYE 514
Cdd:COG5638  353 --DGIRDKDFSAYTASKLSDEDDDSVME-----------------------SKMQKLFSEKEIDFGLNSELVDMSDDGEN 407
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 515 MEMSWNIKPAVEPKEEQEKTStatscqpSELTPIEKVIQKRSEKNKLRKELRRKKQSEARGGDLEDSDDSsvpdgidmnd 594
Cdd:COG5638  408 GEMEDTFTSHLPASNESESDD-------KLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKES---------- 470
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 595 ayfaeefangdyeppktkkQAKKKNNKQDQVEDSAAEQQhqqELALLLDDGDGLEEKHHFSLTKILKEEEQnsggskrkr 674
Cdd:COG5638  471 -------------------INKKNKKGKHAIERTAASKE---ELELIKADDEDDEQLDHFDMKSILKAEKF--------- 519
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 198470952 675 rkqLKKAKQQPENAKPDDDFRVDLNDTRFKAVYKSHEYNIDPTHSHYKATKGMQQIIGEKLKRRERQVDggETGDSPDES 754
Cdd:COG5638  520 ---KKNRKLKKKASNLEEGFVFDPKDPRFVAIFEDHNFAIDPTHPEFKKTGGMKKIMDEKRKRLKNNIE--QTQDGKPEL 594
                        730       740
                 ....*....|....*....|....*....
gi 198470952 755 lapKRSKQQLEQSALVKSLKRKLQQQPKV 783
Cdd:COG5638  595 ---KIKKRKAEKGDQRQELDRIVKSIKRS 620
 
Name     Accession  Description                                                             Interval  E-value
COG5638  COG5638    Uncharacterized conserved protein [Function unknown]                    44-783    3.58e-73
NUC153   pfam08159  NUC153 domain; This small domain is found in a novel nucleolar family.  700-728   4.70e-09

Pssm-ID: 462385 [Multi-domain]  Cd Length: 29  Bit Score: 51.95  E-value: 4.70e-09
                          10        20
                  ....*....|....*....|....*....
gi 198470952  700 DTRFKAVYKSHEYNIDPTHSHYKATKGMQ 728
Cdd:pfam08159   1 DPRFKALFEDHDFAIDPTSPEFKKTNPMK 29
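
As a rough check on the NUC153 hit, the fraction of identical residues in the ungapped 29-column alignment above can be counted directly. The following is a minimal Python sketch, not part of the CD-Search output: the function name `percent_identity` is hypothetical, and the two strings are copied from the alignment rows for residues 700-728.

```python
def percent_identity(query: str, subject: str) -> float:
    """Return the percentage of aligned columns with identical residues.

    Both rows must have the same length. Gap columns ('-') never count
    as matches, and letter case is ignored.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for q, s in zip(query.upper(), subject.upper())
        if q == s and q != "-"
    )
    return 100.0 * matches / len(query)


# Rows copied from the pfam08159 alignment (query residues 700-728).
QUERY = "DTRFKAVYKSHEYNIDPTHSHYKATKGMQ"
PFAM08159 = "DPRFKALFEDHDFAIDPTSPEFKKTNPMK"

if __name__ == "__main__":
    identity = percent_identity(QUERY, PFAM08159)
    print(f"{identity:.1f}% identity over {len(QUERY)} columns")
```

For these two rows, 13 of the 29 columns are identical, which is about 44.8%.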
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
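
The E-value threshold works as a cutoff on reported hits. The sketch below illustrates the idea in Python, assuming that hits at or below the threshold are kept; the `Hit` class and `filter_hits` function are hypothetical names, and the values are copied from the list of domain hits on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    name: str
    accession: str
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float


# Values copied from the domain hit list for gi 198470952.
HITS = [
    Hit("COG5638", "COG5638", 44, 783, 3.58e-73),
    Hit("NUC153", "pfam08159", 700, 728, 4.70e-09),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold (assumed rule)."""
    return [h for h in hits if h.evalue <= threshold]


if __name__ == "__main__":
    for hit in filter_hits(HITS):
        print(hit.name, hit.accession, f"{hit.start}-{hit.end}", hit.evalue)
```

Both hits on this page pass the 0.01 cutoff; a stricter hypothetical cutoff such as 1e-10 would keep only COG5638.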
