NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|118100253|ref|XP_415805|]
View 

pre-mRNA-processing-splicing factor 8 [Gallus gallus]

Protein Classification

PRP8 family protein( domain architecture ID 11474092)

PRP8 family protein similar to Homo sapiens pre-mRNA-processing-splicing factor 8 that plays role in pre-mRNA splicing as core component of precatalytic, catalytic and postcatalytic spliceosomal complexes, both of the predominant U2-type spliceosome and the minor U12-type spliceosome

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
10-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


:

Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3579.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   10 GCAPVPTPMAPLPDYMSEEKLQEK---------------ARKWQQLQ-AKRYAEKRKFGFVDAQKEDMPPEHVRKIIRDH 73
Cdd:COG5178    17 GFEPPSQPPPPPPPGVNVKKRSRKqlsivgdilghsgnpIYSLRVSDkPVKLGNKAKTLHVLTLKAPIPPEHLRKIQSPC 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   74 GDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIEPVYIAQWGSMWIMMRR 153
Cdd:COG5178    97 SDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIEPQLFAQWGLCWSPMRR 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  154 EKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLKDNRKyVNGSTYQRWQFTLP 233
Cdd:COG5178   177 EKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHPS-VNGAMYRRWKYMLP 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  234 MMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFNDINKIIIRQPIRTEYK 313
Cdd:COG5178   256 AMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFNGIVRIIRRPPIDDEYP 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  314 IAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLPD----DDEEFELPEFv 384
Cdd:COG5178   336 VAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNEEscvmERKLFSEPIF- 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  385 EPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPP 464
Cdd:COG5178   415 YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQKLLKNYVRNMLHKTRP 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  465 KAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAF 544
Cdd:COG5178   495 RPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRVGNSF 574
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  545 HLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGpVGKG 624
Cdd:COG5178   575 HLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDWKHLIYYAFNEG-VGKG 653
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  625 PGCGFWAPGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQN 704
Cdd:COG5178   654 PGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQVMADILSMLPEGIRQT 732
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  705 KARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYL 784
Cdd:COG5178   733 KVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVGKTKEKKNLGRLTRLWV 812
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  785 KAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKEAYSVKSRLNQSQREEL 864
Cdd:COG5178   813 KLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKSKYTGKIRLNESTREEL 892
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  865 GLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLFPPWIKPAD 944
Cdd:COG5178   893 ALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWYEADRRNLFPEWIKPSD 972
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  945 TEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNH 1024
Cdd:COG5178   973 SEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADYIIAKNNVVVVYKDMSH 1052
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1025 TNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEVAHPIRLFCRYIDRIHIFFRFTAD 1104
Cdd:COG5178  1053 TNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLYTRYMDDIYIVFRFQRK 1132
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1105 EARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVTTVQWENSFVSVYSKDN 1182
Cdd:COG5178  1133 EEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLTATRWEPSFGSVYSKIN 1212
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1183 PNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNK 1262
Cdd:COG5178  1213 PNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRHILMTSGSTTFTKVATK 1291
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1263 WNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRW 1342
Cdd:COG5178  1292 WNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLGMLSVGHILIPHSDLEW 1371
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1343 SKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLEDLEDSWDRGIPRINTL 1422
Cdd:COG5178  1372 SKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLEDLEMSWDRGIPRISTL 1451
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1423 FQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLF 1502
Cdd:COG5178  1452 FQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILEHTLFKATGFRSWEGLF 1531
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1503 WEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLW 1582
Cdd:COG5178  1532 WEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKIPTLKISLIQIFRNHLW 1611
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1583 QKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLADSKDVMDSTTTQKYWI 1662
Cdd:COG5178  1612 QKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLLEERDGGSNVRTNKLWI 1691
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1663 DIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLIQQAMAKIMKANPALYV 1742
Cdd:COG5178  1692 DVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLIQSSMERIMKANPALYV 1771
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1743 LRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKII 1822
Cdd:COG5178  1772 LRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGAIFVLNPATGNLFLKVI 1851
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1823 HTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKF 1902
Cdd:COG5178  1852 HTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRTCELALPFSAVMGIDKI 1931
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1903 GDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQ 1982
Cdd:COG5178  1932 RDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINHLWPGFSDSQWIKKEIQ 2011
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1983 LKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQTRTVNKHGDEIITSTTS 2060
Cdd:COG5178  2012 LRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVSTKTINGWGREYVVLDGM 2091
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2061 NYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQV 2139
Cdd:COG5178  2092 IYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQVAGLVYGKSGSDNPQI 2171
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2140 KEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKVMADnPSWDgekTIIIT 2215
Cdd:COG5178  2172 KEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKKIVD-PEWD---AVTLT 2244
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2216 CSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQGSWNYNFMGVRHDPNMKY 2294
Cdd:COG5178  2245 VSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNYNFAGPFFDDRLEY 2324
                        2330      2340      2350      2360
                  ....*....|....*....|....*....|....*....|.
gi 118100253 2295 ELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2325 TWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
10-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3579.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   10 GCAPVPTPMAPLPDYMSEEKLQEK---------------ARKWQQLQ-AKRYAEKRKFGFVDAQKEDMPPEHVRKIIRDH 73
Cdd:COG5178    17 GFEPPSQPPPPPPPGVNVKKRSRKqlsivgdilghsgnpIYSLRVSDkPVKLGNKAKTLHVLTLKAPIPPEHLRKIQSPC 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   74 GDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIEPVYIAQWGSMWIMMRR 153
Cdd:COG5178    97 SDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIEPQLFAQWGLCWSPMRR 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  154 EKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLKDNRKyVNGSTYQRWQFTLP 233
Cdd:COG5178   177 EKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHPS-VNGAMYRRWKYMLP 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  234 MMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFNDINKIIIRQPIRTEYK 313
Cdd:COG5178   256 AMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFNGIVRIIRRPPIDDEYP 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  314 IAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLPD----DDEEFELPEFv 384
Cdd:COG5178   336 VAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNEEscvmERKLFSEPIF- 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  385 EPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPP 464
Cdd:COG5178   415 YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQKLLKNYVRNMLHKTRP 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  465 KAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAF 544
Cdd:COG5178   495 RPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRVGNSF 574
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  545 HLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGpVGKG 624
Cdd:COG5178   575 HLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDWKHLIYYAFNEG-VGKG 653
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  625 PGCGFWAPGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQN 704
Cdd:COG5178   654 PGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQVMADILSMLPEGIRQT 732
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  705 KARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYL 784
Cdd:COG5178   733 KVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVGKTKEKKNLGRLTRLWV 812
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  785 KAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKEAYSVKSRLNQSQREEL 864
Cdd:COG5178   813 KLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKSKYTGKIRLNESTREEL 892
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  865 GLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLFPPWIKPAD 944
Cdd:COG5178   893 ALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWYEADRRNLFPEWIKPSD 972
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  945 TEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNH 1024
Cdd:COG5178   973 SEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADYIIAKNNVVVVYKDMSH 1052
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1025 TNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEVAHPIRLFCRYIDRIHIFFRFTAD 1104
Cdd:COG5178  1053 TNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLYTRYMDDIYIVFRFQRK 1132
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1105 EARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVTTVQWENSFVSVYSKDN 1182
Cdd:COG5178  1133 EEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLTATRWEPSFGSVYSKIN 1212
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1183 PNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNK 1262
Cdd:COG5178  1213 PNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRHILMTSGSTTFTKVATK 1291
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1263 WNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRW 1342
Cdd:COG5178  1292 WNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLGMLSVGHILIPHSDLEW 1371
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1343 SKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLEDLEDSWDRGIPRINTL 1422
Cdd:COG5178  1372 SKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLEDLEMSWDRGIPRISTL 1451
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1423 FQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLF 1502
Cdd:COG5178  1452 FQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILEHTLFKATGFRSWEGLF 1531
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1503 WEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLW 1582
Cdd:COG5178  1532 WEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKIPTLKISLIQIFRNHLW 1611
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1583 QKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLADSKDVMDSTTTQKYWI 1662
Cdd:COG5178  1612 QKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLLEERDGGSNVRTNKLWI 1691
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1663 DIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLIQQAMAKIMKANPALYV 1742
Cdd:COG5178  1692 DVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLIQSSMERIMKANPALYV 1771
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1743 LRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKII 1822
Cdd:COG5178  1772 LRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGAIFVLNPATGNLFLKVI 1851
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1823 HTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKF 1902
Cdd:COG5178  1852 HTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRTCELALPFSAVMGIDKI 1931
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1903 GDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQ 1982
Cdd:COG5178  1932 RDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINHLWPGFSDSQWIKKEIQ 2011
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1983 LKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQTRTVNKHGDEIITSTTS 2060
Cdd:COG5178  2012 LRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVSTKTINGWGREYVVLDGM 2091
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2061 NYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQV 2139
Cdd:COG5178  2092 IYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQVAGLVYGKSGSDNPQI 2171
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2140 KEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKVMADnPSWDgekTIIIT 2215
Cdd:COG5178  2172 KEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKKIVD-PEWD---AVTLT 2244
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2216 CSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQGSWNYNFMGVRHDPNMKY 2294
Cdd:COG5178  2245 VSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNYNFAGPFFDDRLEY 2324
                        2330      2340      2350      2360
                  ....*....|....*....|....*....|....*....|.
gi 118100253 2295 ELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2325 TWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 877.10  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAPGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 118100253   795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.45  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838     1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838    81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838   161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                         250
                  ....*....|.
gi 118100253 2006 EIRDIILGMEI 2016
Cdd:cd13838   241 EIRDIILGMEI 251
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 3.64e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 111.31  E-value: 3.64e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPGQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 118100253   2178 IHTQPNESPQLSPQDVTTHAKVMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
10-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3579.95  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   10 GCAPVPTPMAPLPDYMSEEKLQEK---------------ARKWQQLQ-AKRYAEKRKFGFVDAQKEDMPPEHVRKIIRDH 73
Cdd:COG5178    17 GFEPPSQPPPPPPPGVNVKKRSRKqlsivgdilghsgnpIYSLRVSDkPVKLGNKAKTLHVLTLKAPIPPEHLRKIQSPC 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   74 GDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIEPVYIAQWGSMWIMMRR 153
Cdd:COG5178    97 SDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIEPQLFAQWGLCWSPMRR 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  154 EKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLKDNRKyVNGSTYQRWQFTLP 233
Cdd:COG5178   177 EKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHPS-VNGAMYRRWKYMLP 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  234 MMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFNDINKIIIRQPIRTEYK 313
Cdd:COG5178   256 AMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFNGIVRIIRRPPIDDEYP 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  314 IAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLPD----DDEEFELPEFv 384
Cdd:COG5178   336 VAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNEEscvmERKLFSEPIF- 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  385 EPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPP 464
Cdd:COG5178   415 YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQKLLKNYVRNMLHKTRP 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  465 KAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAF 544
Cdd:COG5178   495 RPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRVGNSF 574
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  545 HLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGpVGKG 624
Cdd:COG5178   575 HLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDWKHLIYYAFNEG-VGKG 653
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  625 PGCGFWAPGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQN 704
Cdd:COG5178   654 PGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQVMADILSMLPEGIRQT 732
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  705 KARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYL 784
Cdd:COG5178   733 KVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVGKTKEKKNLGRLTRLWV 812
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  785 KAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKEAYSVKSRLNQSQREEL 864
Cdd:COG5178   813 KLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKSKYTGKIRLNESTREEL 892
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  865 GLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLFPPWIKPAD 944
Cdd:COG5178   893 ALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWYEADRRNLFPEWIKPSD 972
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  945 TEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNH 1024
Cdd:COG5178   973 SEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADYIIAKNNVVVVYKDMSH 1052
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1025 TNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEVAHPIRLFCRYIDRIHIFFRFTAD 1104
Cdd:COG5178  1053 TNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLYTRYMDDIYIVFRFQRK 1132
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1105 EARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVTTVQWENSFVSVYSKDN 1182
Cdd:COG5178  1133 EEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLTATRWEPSFGSVYSKIN 1212
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1183 PNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNK 1262
Cdd:COG5178  1213 PNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRHILMTSGSTTFTKVATK 1291
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1263 WNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRW 1342
Cdd:COG5178  1292 WNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLGMLSVGHILIPHSDLEW 1371
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1343 SKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLEDLEDSWDRGIPRINTL 1422
Cdd:COG5178  1372 SKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLEDLEMSWDRGIPRISTL 1451
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1423 FQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLF 1502
Cdd:COG5178  1452 FQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILEHTLFKATGFRSWEGLF 1531
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1503 WEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLW 1582
Cdd:COG5178  1532 WEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKIPTLKISLIQIFRNHLW 1611
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1583 QKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLADSKDVMDSTTTQKYWI 1662
Cdd:COG5178  1612 QKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLLEERDGGSNVRTNKLWI 1691
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1663 DIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLIQQAMAKIMKANPALYV 1742
Cdd:COG5178  1692 DVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLIQSSMERIMKANPALYV 1771
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1743 LRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKII 1822
Cdd:COG5178  1772 LRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGAIFVLNPATGNLFLKVI 1851
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1823 HTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKF 1902
Cdd:COG5178  1852 HTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRTCELALPFSAVMGIDKI 1931
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1903 GDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQ 1982
Cdd:COG5178  1932 RDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINHLWPGFSDSQWIKKEIQ 2011
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1983 LKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQTRTVNKHGDEIITSTTS 2060
Cdd:COG5178  2012 LRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVSTKTINGWGREYVVLDGM 2091
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2061 NYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQV 2139
Cdd:COG5178  2092 IYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQVAGLVYGKSGSDNPQI 2171
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2140 KEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKVMADnPSWDgekTIIIT 2215
Cdd:COG5178  2172 KEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKKIVD-PEWD---AVTLT 2244
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2216 CSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQGSWNYNFMGVRHDPNMKY 2294
Cdd:COG5178  2245 VSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNYNFAGPFFDDRLEY 2324
                        2330      2340      2350      2360
                  ....*....|....*....|....*....|....*....|.
gi 118100253 2295 ELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2325 TWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 877.10  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAPGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 118100253   795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.45  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838     1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838    81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838   161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                         250
                  ....*....|.
gi 118100253 2006 EIRDIILGMEI 2016
Cdd:cd13838   241 EIRDIILGMEI 251
MPN_PRP8 cd08056
Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor ...
2067-2317 1.24e-179

Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor Prp8; Members of this family are found in pre-mRNA-processing factor 8 (Prp8) which is a critical splicing factor, interacting with several other spliceosomal proteins, snRNAs, and the pre-mRNA, thus organizing and stabilizing the spliceosome catalytic core. Prp8 is one of the largest and most highly conserved of nuclear proteins, occupying a central position in the catalytic core of the spliceosome. Its C-terminal domain exhibits a JAB1/MPN-like core similar to deubiquitinating enzymes, but does not show catalytic isopeptidase activity, possibly because the putative isopeptidase center is covered by insertions and terminal appendices that are grafted onto this core, thus impairing the metal binding site. It is proposed that this domain is a protein interaction domain instead of a Zn(2+)-dependent metalloenzyme as proposed for some MPN proteins. The DEAD-box protein Brr2 and the GTPase Snu114 bind to the Prp8 C-terminus, a region where mutations in human Prp8 (hPrp8) cause a severe form of the genetic disorder retinitis pigmentosa, RP13, which leads to progressive photoreceptor degeneration in the retina and eventual blindness. At the N-terminus of Prp8, there are several domains, including a highly variable nuclear localization signal (NLS) motif rich in prolines, a conserved RNA recognition motif (RRM), and U5 and U6 snRNA binding sites.


Pssm-ID: 163687 [Multi-domain]  Cd Length: 252  Bit Score: 547.61  E-value: 1.24e-179
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2067 FSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETGYTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQVKEIRCIV 2146
Cdd:cd08056     1 FSSKTDWRVRAIAATNLHLRTKNIYVSSDDIKETGYTYILPKNLLKKFISISDLRTQIAGYLYGKSPPDNPQVKEIRCIV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2147 MVPQWGTHQTVHLPGQLPQHEYLKEMEPLGWIHTQPNESPQLSPQDVTTHAKVMADNPSWDGEKTIIITCSFTPGSCTLT 2226
Cdd:cd08056    81 LVPQLGTHQTVTLPQQLPQHEYLEDLEPLGWIHTQPNELPQLSPQDVTTHAKILADNPSWDGEKTVILTCSFTPGSCSLT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2227 AYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQGSWNYNFMGVRHDPNMKYELQLANPKEFY 2305
Cdd:cd08056   161 AYKLTPEGYEWGKQNKDLGNNtPKGYSPSFYEKVQLLLSDRFLGFFLVPEDGVWNYNFMGAKHSPNMKYDLKLDIPKEFY 240
                         250
                  ....*....|..
gi 118100253 2306 HEVHRPSHFLNF 2317
Cdd:cd08056   241 HELHRPTHFLQF 252
PRP8_domainIV pfam12134
PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. ...
1760-1989 1.19e-164

PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. It is found associated with pfam10597, pfam10596, pfam10598, pfam08083, pfam08082, pfam01398, pfam08084. There is a conserved LILR sequence motif. The domain is a selenomethionine domain in a subunit of the spliceosome. The function of PRP8 domain IV is believed to be interaction with the splicosomal core.


Pssm-ID: 432353  Cd Length: 230  Bit Score: 504.97  E-value: 1.19e-164
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  1760 EPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKW 1839
Cdd:pfam12134    1 EPFLNSQNYAELFSNQTQWFVDDTNVYRVTVHKTFEGNLTTKPVNGAVFILNPKTGQLFLKIIHTSVWAGQKRLSQLAKW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  1840 KTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNL 1919
Cdd:pfam12134   81 KTAEEVAALVRSLPPEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIRGSELHLPFPAALKIEKLGDLVLKATEPQMVLFNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  1920 YDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILA 1989
Cdd:pfam12134  161 YDDWLKSISPYTAFSRLILILRALHINPEKTKMILRPDPTVVTKPHHLWPTLSDQQWIEVEIQLRDLILS 230
PRO8NT pfam08082
PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of ...
58-209 3.67e-116

PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of pre-mRNA splicing factors of PRO8 family. The NLS or nuclear localization signal for these spliceosome proteins begins at the start and runs for 60 residues. N-terminal to this domain is a highly variable proline-rich region.


Pssm-ID: 462361 [Multi-domain]  Cd Length: 152  Bit Score: 363.92  E-value: 3.67e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253    58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:pfam08082    1 KEDMPPEHLRKIIKDHGDMSSKKFRSDKRSYLGALKYMPHAVLKLLENMPMPWEEVREVKVLYHVTGAITFVNEIPRVIE 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 118100253   138 PVYIAQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYD 209
Cdd:pfam08082   81 PVYIAQWSTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQMELDEEEDAAVIDWFYD 152
U6-snRNA_bdg pfam10596
U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the ...
1442-1600 2.64e-108

U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the U6-snRNA as part of the U4/U6.U5 tri-snRNPs complex of the spliceosome, and is the prime candidate for the role of cofactor for the spliceosome's RNA core. The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 431383  Cd Length: 159  Bit Score: 341.77  E-value: 2.64e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  1442 FKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNA 1521
Cdd:pfam10596    1 FKKYQLLKYNPFWWTHARHDGKLWNLERYRTDIIQALGGVEGILEHTLFKATGFPSWEGLFWDKASGFEESLKYKKLTNA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 118100253  1522 QRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQE 1600
Cdd:pfam10596   81 QRQGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGVFMHGKLPTLKISLIQIFRGHLWQKIHESVVMDLCQVLDQE 159
U5_2-snRNA_bdg pfam10597
U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and ...
1210-1343 1.58e-89

U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 402297  Cd Length: 134  Bit Score: 286.97  E-value: 1.58e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  1210 KDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLV 1289
Cdd:pfam10597    1 NEGVWDLINESTKERTAKAFLQVSEESINNFRNRIRQILMSSGSTTFTKIANKWNTALIALFTYFREAIVSTEELLDILV 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 118100253  1290 KCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRWS 1343
Cdd:pfam10597   81 KCETRVQNRVKLGLNSKMPSRFPPAVFYTPKELGGLGMLSAGHVLIPASDLRWS 134
PROCT pfam08084
PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors ...
2212-2321 2.87e-68

PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 462362  Cd Length: 111  Bit Score: 225.12  E-value: 2.87e-68
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  2212 IIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDK-GNNPKGYLPSHYERVQMLLSDRFLGFFMVPAQGSWNYNFMGVRHDP 2290
Cdd:pfam08084    1 ITLTVSFTPGSVSLSAYTLTPEGYEWGRQNKDLiSDNPQGFSPSFSEKVQLLLSDRILGFFLVPEDGVWNYSFMGASFNP 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 118100253  2291 NMKYELQLANPKEFYHEVHRPSHFLNFALLQ 2321
Cdd:pfam08084   81 NMKYDLKLDIPLEFYDELHRPTHFLNFNELE 111
RRM_4 pfam10598
RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the ...
986-1077 5.04e-53

RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the spliceosome catalyzes pre-mRNA splicing. One of the most conserved core proteins is PrP8 which occupies a central position in the catalytic core of the spliceosome, and has been implicated in several crucial molecular rearrangements that occur there, and has recently come under the spotlight for its role in the inherited human disease, Retinitis Pigmentosa. The RNA-recognition motif of PrP8 is highly conserved and provides a possible RNA binding centre for the 5-prime SS, BP, or 3-prime SS of pre-mRNA which are known to contact with Prp8. The most conserved regions of an RRM are defined as the RNP1 and RNP2 sequences. Recognition of RNA targets can also be modulated by a number of other factors, most notably the two loops beta1-alpha1, beta2-beta3 and the amino acid residues C-terminal to the RNP2 domain.


Pssm-ID: 463163  Cd Length: 92  Bit Score: 180.90  E-value: 5.04e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   986 EKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPP 1065
Cdd:pfam10598    1 EKIDLTLLNRLLRLIMDHNLADYITAKNNVVLTFKDMSHVNSYGLIRGLQFSSFVYQYYGLVLDLLILGLQRASEIAGPP 80
                           90
                   ....*....|..
gi 118100253  1066 QMPNDFLSFQDI 1077
Cdd:pfam10598   81 QMPNEFLQFKDI 92
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 3.64e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 111.31  E-value: 3.64e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253   2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPGQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 118100253   2178 IHTQPNESPQLSPQDVTTHAKVMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
JAB pfam01398
JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome ...
2099-2205 6.55e-23

JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain, JABP1 domain or JAMM domain. These are metalloenzymes that function as the ubiquitin isopeptidase/ deubiquitinase in the ubiquitin-based signalling and protein turnover pathways in eukaryotes. Versions of the domain in prokaryotic cognates of the ubiquitin-modification pathway are shown to have a similar role, and the archael protein from Haloferax volcanii is found to cleave ubiquitin-like small archaeal modifier proteins (SAMP1/2) from protein conjugates.


Pssm-ID: 396120  Cd Length: 117  Bit Score: 95.87  E-value: 6.55e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253  2099 ETGYTYILPKNVLKKFICISDLRAQI----AGYLYGVSPPDNpqVKEIRCIVMVPQWGTHQTVHLPGQLPQ-----HEYL 2169
Cdd:pfam01398    1 SSVRTVIIHPLVLLKILDHANRGGKIgeevMGVLLGKLEGDG--TIEITNSFALPQEETEDDVNAVALDQEymenmHEML 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 118100253  2170 KE----MEPLGWIHTQPNEsPQLSPQDVTTHAKVMADNPS 2205
Cdd:pfam01398   79 KKvnrkEEVVGWYHTHPGL-CWLSSVDVHTHALYQRMIPE 117
MPN cd07767
Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) ...
2111-2221 3.91e-22

Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) domains are found in the N-terminal termini of proteins with a variety of functions; they are components of the proteasome regulatory subunits, the signalosome (CSN), eukaryotic translation initiation factor 3 (eIF3) complexes, and regulators of transcription factors. These domains are isopeptidases that release ubiquitin from ubiquitinated proteins (thus having deubiquitinating (DUB) activity) that are tagged for degradation. Catalytically active MPN domains contain a metalloprotease signature known as the JAB1/MPN/Mov34 metalloenzyme (JAMM) motif. For example, Rpn11 (also known as POH1 or PSMD14), a subunit of the 19S proteasome lid is involved in the ATP-dependent degradation of ubiquitinated proteins, contains the conserved JAMM motif involved in zinc ion coordination. Poh1 is a regulator of c-Jun, an important regulator of cell proliferation, differentiation, survival and death. JAB1 is a component of the COP9 signalosome (CSN), a regulatory particle of the ubiquitin (Ub)/26S proteasome system occurring in all eukaryotic cells; it cleaves the ubiquitin-like protein NEDD8 from the cullin subunit of the SCF (Skp1, Cullins, F-box proteins) family of E3 ubiquitin ligases. AMSH (associated molecule with the SH3 domain of STAM, also known as STAMBP), a member of JAMM/MPN+ deubiquitinases (DUBs), specifically cleaves Lys 63-linked polyubiquitin (poly-Ub) chains, thus facilitating the recycling and subsequent trafficking of receptors to the cell surface. Similarly, BRCC36, part of the nuclear complex that includes BRCA1 protein and is targeted to DNA damage foci after irradiation, specifically disassembles K63-linked polyUb. BRCC36 is aberrantly expressed in sporadic breast tumors, indicative of a potential role in the pathogenesis of the disease. Some variants of the JAB1/MPN domains lack key residues in their JAMM motif and are unable to coordinate a metal ion. Comparisons of key catalytic and metal binding residues explain why the MPN-containing proteins Mov34/PSMD7, Rpn8, CSN6, Prp8p, and the translation initiation factor 3 subunits f (p47) and h (p40) do not show catalytic isopeptidase activity. It has been proposed that the MPN domain in these proteins has a primarily structural function.


Pssm-ID: 163686 [Multi-domain]  Cd Length: 116  Bit Score: 93.73  E-value: 3.91e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118100253 2111 LKKFI--CISDLRAQIAGYLYGVSPPDnpqVKEIRCIVMVPQWGTHQTVHLP--GQLPQHEYLKEMEPLGWIHTQPNESP 2186
Cdd:cd07767     1 LKMFLdaAKSINGKEVIGLLYGSKTKK---VLDVDEVIAVPFDEGDKDDNVWflMYLDFKKLNAGLRIVGWYHTHPKPSC 77
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 118100253 2187 QLSPQDVTTHAKVMAdnpswDGEKTIIITCSFTPG 2221
Cdd:cd07767    78 FLSPNDLATHELFQR-----YFPEKVMIIVDVKPK 107
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH