|
Name |
Accession |
Description |
Interval |
E-value |
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
53-684 |
5.31e-168 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 503.23 E-value: 5.31e-168
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 53 PDIRTLRTVHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGFYGEGVKVFGTMC 132
Cdd:PLN03077 201 PDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMR 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 133 GCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDVVSWNSL 212
Cdd:PLN03077 281 ELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAM 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 213 VVGYAQNQRFDDALEVCREMESVKISHDAGTMASLLPA---------------------------VSNTTTEnvMYVK-- 263
Cdd:PLN03077 361 ISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSAcaclgdldvgvklhelaerkglisyvvVANALIE--MYSKck 438
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 264 ------DMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRMEADgFEPDAVSITSVLPACGDTSALSLGKKIHGYIER 337
Cdd:PLN03077 439 cidkalEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLR 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 338 KKLIPNLLLENALIDMYAKCGCLEKARDVFeNMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTL 417
Cdd:PLN03077 518 TGIGFDGFLPNALLDLYVRCGRMNYAWNQF-NSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLL 596
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 418 AACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNERVWGALLGACRVHSDTDIG 497
Cdd:PLN03077 597 CACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELG 676
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 498 LLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRE 577
Cdd:PLN03077 677 ELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTV 756
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 578 LDVLVKKMKELGYVPDSESALHDVEEEDKEThLAVHSEKLAIVFALMNTKeeeedSNNTIRITKNLRICGDCHVAAKLIS 657
Cdd:PLN03077 757 LEGFYEKMKASGLAGSESSSMDEIEVSKDDI-FCGHSERLAIAFGLINTV-----PGMPIWVTKNLYMCENCHNTVKFIS 830
|
650 660
....*....|....*....|....*..
gi 334185836 658 QITSREIIIRDTNRFHVFRFGVCSCGD 684
Cdd:PLN03077 831 KIVRREISVRDTEQFHHFKDGECSCGD 857
|
|
| DYW_deaminase |
pfam14432 |
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ... |
589-686 |
1.36e-40 |
|
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.
Pssm-ID: 464172 [Multi-domain] Cd Length: 93 Bit Score: 143.33 E-value: 1.36e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 589 GYVPDSESALHDVEEEDKETHLAVHSEKLAIVFALMNTKEeeedsNNTIRITKNLRICGDCHVAAKLISQITSREIIIRD 668
Cdd:pfam14432 1 GYVPDLKFVLHDVDEEEKKQLLHGHSEKLALAYGLLTTPD-----GATIRITKNLRVCGDCHTAFKYISKIFGREIVVRD 75
|
90
....*....|....*...
gi 334185836 669 TNRFHVFRFGVCSCGDYW 686
Cdd:pfam14432 76 ASRFHHFKNGLCSCGDFW 93
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
275-309 |
2.75e-05 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 41.29 E-value: 2.75e-05
10 20 30
....*....|....*....|....*....|....*
gi 334185836 275 VSWNVMIGVYMKNAMPVEAVELYSRMEADGFEPDA 309
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
449-530 |
1.52e-03 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 39.40 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 449 LACMVDLLGRAGKVKEAYRFIQD-MSMEPNERVWGALLGACRVHS-DTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAG 526
Cdd:COG4783 41 FALLGEILLQLGDLDEAIVLLHEaLELDPDEPEARLNLGLALLKAgDYDEALALLEKALKLDPEHPEAYLRLARAYRALG 120
|
....
gi 334185836 527 RWEE 530
Cdd:COG4783 121 RPDE 124
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
53-684 |
5.31e-168 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 503.23 E-value: 5.31e-168
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 53 PDIRTLRTVHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGFYGEGVKVFGTMC 132
Cdd:PLN03077 201 PDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMR 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 133 GCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDVVSWNSL 212
Cdd:PLN03077 281 ELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAM 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 213 VVGYAQNQRFDDALEVCREMESVKISHDAGTMASLLPA---------------------------VSNTTTEnvMYVK-- 263
Cdd:PLN03077 361 ISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSAcaclgdldvgvklhelaerkglisyvvVANALIE--MYSKck 438
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 264 ------DMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRMEADgFEPDAVSITSVLPACGDTSALSLGKKIHGYIER 337
Cdd:PLN03077 439 cidkalEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLR 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 338 KKLIPNLLLENALIDMYAKCGCLEKARDVFeNMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTL 417
Cdd:PLN03077 518 TGIGFDGFLPNALLDLYVRCGRMNYAWNQF-NSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLL 596
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 418 AACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNERVWGALLGACRVHSDTDIG 497
Cdd:PLN03077 597 CACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELG 676
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 498 LLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRE 577
Cdd:PLN03077 677 ELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTV 756
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 578 LDVLVKKMKELGYVPDSESALHDVEEEDKEThLAVHSEKLAIVFALMNTKeeeedSNNTIRITKNLRICGDCHVAAKLIS 657
Cdd:PLN03077 757 LEGFYEKMKASGLAGSESSSMDEIEVSKDDI-FCGHSERLAIAFGLINTV-----PGMPIWVTKNLYMCENCHNTVKFIS 830
|
650 660
....*....|....*....|....*..
gi 334185836 658 QITSREIIIRDTNRFHVFRFGVCSCGD 684
Cdd:PLN03077 831 KIVRREISVRDTEQFHHFKDGECSCGD 857
|
|
| PLN03081 |
PLN03081 |
pentatricopeptide (PPR) repeat-containing protein; Provisional |
79-686 |
4.06e-154 |
|
pentatricopeptide (PPR) repeat-containing protein; Provisional
Pssm-ID: 215563 [Multi-domain] Cd Length: 697 Bit Score: 462.42 E-value: 4.06e-154
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 79 KLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGFYGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVI 158
Cdd:PLN03081 163 RVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARA 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 159 GRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREMESVKIS 238
Cdd:PLN03081 243 GQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVS 322
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 239 HDAGTMAsllpavsntttenvmyvkdmffkmgkkslvswnVMIGVYMKNAMPVEAVELYSRMEADGFEPDAVSITsvlpa 318
Cdd:PLN03081 323 IDQFTFS---------------------------------IMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANT----- 364
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 319 cgdtsalslgkkihgyierkklipnlllenALIDMYAKCGCLEKARDVFENMKSRDVVSWTAMISAYGFSGRGCDAVALF 398
Cdd:PLN03081 365 ------------------------------ALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMF 414
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 399 SKLQDSGLVPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNE 478
Cdd:PLN03081 415 ERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTV 494
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 479 RVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVEVNRII 558
Cdd:PLN03081 495 NMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQD 574
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 559 HTFLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSESALHDVEEEDKETHLAVHSEKLAIVFALMNTKEEeedsnNTIR 638
Cdd:PLN03081 575 HSFFSGDRLHPQSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEW-----TPLQ 649
|
570 580 590 600
....*....|....*....|....*....|....*....|....*...
gi 334185836 639 ITKNLRICGDCHVAAKLISQITSREIIIRDTNRFHVFRFGVCSCGDYW 686
Cdd:PLN03081 650 ITQSHRICKDCHKVIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
|
|
| DYW_deaminase |
pfam14432 |
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ... |
589-686 |
1.36e-40 |
|
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.
Pssm-ID: 464172 [Multi-domain] Cd Length: 93 Bit Score: 143.33 E-value: 1.36e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 589 GYVPDSESALHDVEEEDKETHLAVHSEKLAIVFALMNTKEeeedsNNTIRITKNLRICGDCHVAAKLISQITSREIIIRD 668
Cdd:pfam14432 1 GYVPDLKFVLHDVDEEEKKQLLHGHSEKLALAYGLLTTPD-----GATIRITKNLRVCGDCHTAFKYISKIFGREIVVRD 75
|
90
....*....|....*...
gi 334185836 669 TNRFHVFRFGVCSCGDYW 686
Cdd:pfam14432 76 ASRFHHFKNGLCSCGDFW 93
|
|
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
265-541 |
7.99e-29 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 123.04 E-value: 7.99e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 265 MFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNL 344
Cdd:PLN03077 143 VFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDV 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 345 LLENALIDMYAKCGCLEKARDVFENMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLAACSHAG 424
Cdd:PLN03077 223 DVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLG 302
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 425 LLEEGRS----------------CFKL---------MTDHYKITPRLE-----HLACMVDLLGRAG---KVKEAYRFIQD 471
Cdd:PLN03077 303 DERLGREmhgyvvktgfavdvsvCNSLiqmylslgsWGEAEKVFSRMEtkdavSWTAMISGYEKNGlpdKALETYALMEQ 382
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 334185836 472 MSMEPNERVWGALLGACRVHSDTDIGLlaadKLFQLApEQSGY--YVLLSN----IYAKAGRWEEVTNIRNIMKSK 541
Cdd:PLN03077 383 DNVSPDEITIASVLSACACLGDLDVGV----KLHELA-ERKGLisYVVVANalieMYSKCKCIDKALEVFHNIPEK 453
|
|
| E_motif |
pfam20431 |
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ... |
491-553 |
9.33e-18 |
|
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.
Pssm-ID: 466580 [Multi-domain] Cd Length: 63 Bit Score: 77.58 E-value: 9.33e-18
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334185836 491 HSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIMKSKGLKKNPGASNVE 553
Cdd:pfam20431 1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
187-546 |
1.18e-11 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 68.36 E-value: 1.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 187 KCGFLSEARLVLDEMSRRDVVSWNSLvvgyaQNQRFddaLEVCREMESVKishDAGTMASLLPAVSNTTTENVMYV---- 262
Cdd:PLN03218 382 RDGRIKDCIDLLEDMEKRGLLDMDKI-----YHAKF---FKACKKQRAVK---EAFRFAKLIRNPTLSTFNMLMSVcass 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 263 KDMFFKMGKKSLVS----------WNVMIGVYMKnAMPVEAV-ELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKI 331
Cdd:PLN03218 451 QDIDGALRVLRLVQeaglkadcklYTTLISTCAK-SGKVDAMfEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGA 529
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 332 HGYIERKKLIPNLLLENALIDMYAKCGCLEKARDVFENMKSR------DVVSWTAMISAYGFSGRGCDAVALFSKLQDSG 405
Cdd:PLN03218 530 YGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEthpidpDHITVGALMKACANAGQVDRAKEVYQMIHEYN 609
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 406 LVPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHyKITPRLEHLACMVDLLGRAGKVKEAYRFIQDM---SMEPNERVWG 482
Cdd:PLN03218 610 IKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKK-GVKPDEVFFSALVDVAGHAGDLDKAFEILQDArkqGIKLGTVSYS 688
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 334185836 483 ALLGACRVHSDTDIGL-LAAD-KLFQLAPEQSGYYVLLSNIyAKAGRWEEVTNIRNIMKSKGLKKN 546
Cdd:PLN03218 689 SLMGACSNAKNWKKALeLYEDiKSIKLRPTVSTMNALITAL-CEGNQLPKALEVLSEMKRLGLCPN 753
|
|
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
292-430 |
1.97e-08 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 57.55 E-value: 1.97e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 292 EAVELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCGCLEKARDVFENMK 371
Cdd:PLN03077 69 QALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMP 148
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 334185836 372 SRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLAACSHAGLLEEGR 430
Cdd:PLN03077 149 ERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGR 207
|
|
| Eplus_motif |
pfam20430 |
E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins ... |
558-585 |
5.23e-08 |
|
E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins which contain a C-terminal DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E motif, precedes the DYW domain and, although their role is not clear, they are essential in th RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.
Pssm-ID: 466579 [Multi-domain] Cd Length: 28 Bit Score: 49.19 E-value: 5.23e-08
10 20
....*....|....*....|....*...
gi 334185836 558 IHTFLVGDRSHPQSDEIYRELDVLVKKM 585
Cdd:pfam20430 1 THTFFAGDKSHPESKQIYEKLSDLTQRI 28
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
272-319 |
9.45e-07 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 45.82 E-value: 9.45e-07
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 334185836 272 KSLVSWNVMIGVYMKNAMPVEAVELYSRMEADGFEPDAVSITSVLPAC 319
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
299-512 |
5.31e-06 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 49.88 E-value: 5.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 299 RMEADGFEPDAVSITSVLPACGDTsalslgkkihGYIERKKLIPNLLLEN----------ALIDMYAKCGCLEKARDVFE 368
Cdd:PLN03218 569 KAETHPIDPDHITVGALMKACANA----------GQVDRAKEVYQMIHEYnikgtpevytIAVNSCSQKGDWDFALSIYD 638
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 369 NMKSRDV----VSWTAMISAYGFSGrgcDAVALFSKLQD---SGLVPDSIAFVTTLAACSHAGLLEEGRSCF---KLMtd 438
Cdd:PLN03218 639 DMKKKGVkpdeVFFSALVDVAGHAG---DLDKAFEILQDarkQGIKLGTVSYSSLMGACSNAKNWKKALELYediKSI-- 713
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334185836 439 hyKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMS---MEPNERVWGALLGACRVHSDTDIGLlaadKLFQLAPEQS 512
Cdd:PLN03218 714 --KLRPTVSTMNALITALCEGNQLPKALEVLSEMKrlgLCPNTITYSILLVASERKDDADVGL----DLLSQAKEDG 784
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
348-385 |
6.28e-06 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 43.51 E-value: 6.28e-06
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 334185836 348 NALIDMYAKCGCLEKARDVFENMKSR----DVVSWTAMISAY 385
Cdd:pfam13041 7 NTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGL 48
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
136-379 |
1.68e-05 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 48.33 E-value: 1.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 136 VRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDV----VSWNS 211
Cdd:PLN03218 575 IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVkpdeVFFSA 654
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 212 LVVGYAQNQRFDDALEVCREMESVKISHDAGTMASLLPAVSNTTT-ENVMYVKDMFFKMGKKSLVS-WNVMIGVYMKNAM 289
Cdd:PLN03218 655 LVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNwKKALELYEDIKSIKLRPTVStMNALITALCEGNQ 734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 290 PVEAVELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMyakcgCLEKardvFEN 369
Cdd:PLN03218 735 LPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITGL-----CLRR----FEK 805
|
250
....*....|
gi 334185836 370 MKSRDVVSWT 379
Cdd:PLN03218 806 ACALGEPVVS 815
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
373-422 |
2.44e-05 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 41.97 E-value: 2.44e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 334185836 373 RDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLAACSH 422
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
275-309 |
2.75e-05 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 41.29 E-value: 2.75e-05
10 20 30
....*....|....*....|....*....|....*
gi 334185836 275 VSWNVMIGVYMKNAMPVEAVELYSRMEADGFEPDA 309
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
401-624 |
3.18e-05 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 47.56 E-value: 3.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 401 LQDSGLVPDSIAFVTTLAACSHAGLLEEGRSCFKLMtDHYKITPRLEHLACMVDLLGRAGKVKE---AYRFIQDMSMEPN 477
Cdd:PLN03218 463 VQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM-VNAGVEANVHTFGALIDGCARAGQVAKafgAYGIMRSKNVKPD 541
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 478 ERVWGALLGACrvhsdTDIGllAADKLFQ-LA----------PEQSGYYVLLSNIyAKAGRWEEVTNIRNIMKSKGLKKN 546
Cdd:PLN03218 542 RVVFNALISAC-----GQSG--AVDRAFDvLAemkaethpidPDHITVGALMKAC-ANAGQVDRAKEVYQMIHEYNIKGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 547 PGASNVEVNriihtflvgdrSHPQSDEIYRELDVLvKKMKELGYVPDSE--SALHDVeeedkethlAVHSEKLAIVFALM 624
Cdd:PLN03218 614 PEVYTIAVN-----------SCSQKGDWDFALSIY-DDMKKKGVKPDEVffSALVDV---------AGHAGDLDKAFEIL 672
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
103-150 |
9.46e-05 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 40.42 E-value: 9.46e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 334185836 103 RNVIIINVMIRSYVNNGFYGEGVKVFGTMCGCNVRPDHYTFPCVLKAC 150
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
|
|
| PPR |
pfam01535 |
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ... |
275-305 |
1.57e-04 |
|
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.
Pssm-ID: 366695 [Multi-domain] Cd Length: 31 Bit Score: 39.37 E-value: 1.57e-04
10 20 30
....*....|....*....|....*....|.
gi 334185836 275 VSWNVMIGVYMKNAMPVEAVELYSRMEADGF 305
Cdd:pfam01535 1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
|
|
| PPR_3 |
pfam13812 |
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ... |
300-356 |
1.91e-04 |
|
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.
Pssm-ID: 316342 [Multi-domain] Cd Length: 63 Bit Score: 40.03 E-value: 1.91e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 334185836 300 MEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAK 356
Cdd:pfam13812 6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
|
|
| PPR |
pfam01535 |
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ... |
348-375 |
3.77e-04 |
|
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.
Pssm-ID: 366695 [Multi-domain] Cd Length: 31 Bit Score: 38.22 E-value: 3.77e-04
10 20
....*....|....*....|....*...
gi 334185836 348 NALIDMYAKCGCLEKARDVFENMKSRDV 375
Cdd:pfam01535 4 NSLISGYCKNGKLEEALELFKEMKEKGI 31
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
348-376 |
9.35e-04 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 37.05 E-value: 9.35e-04
10 20
....*....|....*....|....*....
gi 334185836 348 NALIDMYAKCGCLEKARDVFENMKSRDVV 376
Cdd:TIGR00756 4 NTLIDGLCKAGRVEEALELFKEMKERGIE 32
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
207-241 |
9.44e-04 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 37.05 E-value: 9.44e-04
10 20 30
....*....|....*....|....*....|....*
gi 334185836 207 VSWNSLVVGYAQNQRFDDALEVCREMESVKISHDA 241
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
449-530 |
1.52e-03 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 39.40 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334185836 449 LACMVDLLGRAGKVKEAYRFIQD-MSMEPNERVWGALLGACRVHS-DTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAG 526
Cdd:COG4783 41 FALLGEILLQLGDLDEAIVLLHEaLELDPDEPEARLNLGLALLKAgDYDEALALLEKALKLDPEHPEAYLRLARAYRALG 120
|
....
gi 334185836 527 RWEE 530
Cdd:COG4783 121 RPDE 124
|
|
| PPR |
pfam01535 |
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ... |
207-234 |
2.72e-03 |
|
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.
Pssm-ID: 366695 [Multi-domain] Cd Length: 31 Bit Score: 35.90 E-value: 2.72e-03
10 20
....*....|....*....|....*...
gi 334185836 207 VSWNSLVVGYAQNQRFDDALEVCREMES 234
Cdd:pfam01535 1 VTYNSLISGYCKNGKLEEALELFKEMKE 28
|
|
| PPR_1 |
pfam12854 |
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ... |
205-233 |
3.77e-03 |
|
PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.
Pssm-ID: 403914 [Multi-domain] Cd Length: 34 Bit Score: 35.40 E-value: 3.77e-03
10 20
....*....|....*....|....*....
gi 334185836 205 DVVSWNSLVVGYAQNQRFDDALEVCREME 233
Cdd:pfam12854 6 DVVTYNTLINGLCRAGRVDEAFELLDEME 34
|
|
|