NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|795345956|ref|XP_011783426|]
View 

PREDICTED: methenyltetrahydrofolate synthase domain-containing protein isoform X5 [Colobus angolensis palliatus]

Protein Classification

methenyltetrahydrofolate synthase domain-containing protein( domain architecture ID 10221107)

methenyltetrahydrofolate synthase domain-containing protein (MTHFSD) is a novel RNA-binding protein abnormally regulated in amyotrophic lateral sclerosis

Gene Symbol:  mthfsd
Gene Ontology:  GO:0003723
PubMed:  26525917

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RRM_MTHFSD cd12270
RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase ...
307-378 2.10e-33

RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase domain-containing proteins; This subfamily corresponds to methenyltetrahydrofolate synthetase domain (MTHFSD), a putative RNA-binding protein found in various vertebrate species. It contains an N-terminal 5-formyltetrahydrofolate cyclo-ligase domain and a C-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of MTHFSD remains unclear.


:

Pssm-ID: 409713 [Multi-domain]  Cd Length: 72  Bit Score: 118.96  E-value: 2.10e-33
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 307 DVYVGNLPRDARVSDLKRALRELGSVPLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12270    1 TVYVGNIPRSVRVSDLKSALRERGINPLRITWQGARGKAFLHYSDMADADSAVSSLQGLRIGGNTLTVELAK 72
5-FTHF_cyc-lig super family cl00360
5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or ...
9-207 6.59e-31

5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyzes the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin.


The actual alignment was detected with superfamily member pfam01812:

Pssm-ID: 444864 [Multi-domain]  Cd Length: 186  Bit Score: 116.25  E-value: 6.59e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956    9 KQDIREQI---WDHMESQNLADFPRPVHHRIPNFKGAShaaeqlprlQAFKTARTIKVNPDAPQKNARFFVLESKKTLLV 85
Cdd:pfam01812   1 KQELRKQLlarRRALSEEERAAQSEALHQRLISLPEYQ---------KAKRVAAYVSVGGEIDTRELIDLLLEEGKRVLL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956   86 PTPRLRTGLFN--KITPPPGATKDILRKCATSQGVRNCSVPIGLDsrvLVDLVVVGSVAVSeKDGWRIGKGEGYADLEYA 163
Cdd:pfam01812  72 PVPRPGSGHLDmvRFTPYYPEDSLPRGAWGLKEPVEEELRELALG---QLDLVLVPGVAFD-RQGYRLGRGGGYYDRYLA 147
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 795345956  164 MMVSMGAvsNETPVVTIVHDCQVVDIPeelLEEHDITVDYILTP 207
Cdd:pfam01812 148 RLQGHGA--KPYTVGLAFDEQLVERLP---VEPHDVPVDEVVTE 186
 
Name Accession Description Interval E-value
RRM_MTHFSD cd12270
RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase ...
307-378 2.10e-33

RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase domain-containing proteins; This subfamily corresponds to methenyltetrahydrofolate synthetase domain (MTHFSD), a putative RNA-binding protein found in various vertebrate species. It contains an N-terminal 5-formyltetrahydrofolate cyclo-ligase domain and a C-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of MTHFSD remains unclear.


Pssm-ID: 409713 [Multi-domain]  Cd Length: 72  Bit Score: 118.96  E-value: 2.10e-33
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 307 DVYVGNLPRDARVSDLKRALRELGSVPLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12270    1 TVYVGNIPRSVRVSDLKSALRERGINPLRITWQGARGKAFLHYSDMADADSAVSSLQGLRIGGNTLTVELAK 72
5-FTHF_cyc-lig pfam01812
5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or ...
9-207 6.59e-31

5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyzes the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin.


Pssm-ID: 396398 [Multi-domain]  Cd Length: 186  Bit Score: 116.25  E-value: 6.59e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956    9 KQDIREQI---WDHMESQNLADFPRPVHHRIPNFKGAShaaeqlprlQAFKTARTIKVNPDAPQKNARFFVLESKKTLLV 85
Cdd:pfam01812   1 KQELRKQLlarRRALSEEERAAQSEALHQRLISLPEYQ---------KAKRVAAYVSVGGEIDTRELIDLLLEEGKRVLL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956   86 PTPRLRTGLFN--KITPPPGATKDILRKCATSQGVRNCSVPIGLDsrvLVDLVVVGSVAVSeKDGWRIGKGEGYADLEYA 163
Cdd:pfam01812  72 PVPRPGSGHLDmvRFTPYYPEDSLPRGAWGLKEPVEEELRELALG---QLDLVLVPGVAFD-RQGYRLGRGGGYYDRYLA 147
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 795345956  164 MMVSMGAvsNETPVVTIVHDCQVVDIPeelLEEHDITVDYILTP 207
Cdd:pfam01812 148 RLQGHGA--KPYTVGLAFDEQLVERLP---VEPHDVPVDEVVTE 186
FAU1 COG0212
5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism];
133-210 1.31e-09

5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism];


Pssm-ID: 439982 [Multi-domain]  Cd Length: 186  Bit Score: 57.09  E-value: 1.31e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 133 VDLVVVGSVAVSeKDGWRIGKGEGYAD--LEyammvsmgAVSNETPVVTIVHDCQVVD-IPeelLEEHDITVDYILTPTR 209
Cdd:COG0212  118 IDLVLVPLLAFD-RRGYRLGYGGGYYDrtLA--------RLRPRPLTIGLAFDCQLVDeLP---VEPHDVPLDAIVTEKG 185

                 .
gi 795345956 210 V 210
Cdd:COG0212  186 V 186
RRM smart00360
RNA recognition motif;
308-374 1.74e-08

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 51.06  E-value: 1.74e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956   308 VYVGNLPRDARVSDLKRALRELGSV-----PLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:smart00360   2 LFVGNLPPDTTEEELRELFSKFGKVesvrlVRDKETGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKV 73
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
308-373 1.69e-07

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 48.00  E-value: 1.69e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 795345956  308 VYVGNLPRDARVSDLKRALRELGSV-----PLRLTWQgPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLR 373
Cdd:pfam00076   1 LFVGNLPPDTTEEDLKDLFSKFGPIksirlVRDETGR-SKGFAFVEFEDEEDAEKAIEALNGKELGGRELK 70
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
309-383 8.05e-06

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 43.55  E-value: 8.05e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 309 YVGNLPRDARVSDLKRALRELGSV------PLRLTwQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALARQQRD 382
Cdd:COG0724    5 YVGNLPYSVTEEDLRELFSEYGEVtsvkliTDRET-GRSRGFGFVEMPDDEEAQAAIEALNGAELMGRTLKVNEARPREE 83

                 .
gi 795345956 383 K 383
Cdd:COG0724   84 R 84
PLN02812 PLN02812
5-formyltetrahydrofolate cyclo-ligase
133-211 2.94e-03

5-formyltetrahydrofolate cyclo-ligase


Pssm-ID: 178408  Cd Length: 211  Bit Score: 38.47  E-value: 2.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 133 VDLVVVGSVAVSEKdGWRIGKGEGYADLEYAMMVSMGAVSNETP--VVTIVHDCQVVDIPEELLEEHDITVDYILTPTRV 210
Cdd:PLN02812 132 LDLLLLPGLAFDRS-GRRLGRGGGYYDTFLSKYQELAKEKGWKQplLVALSYSPQILDEGSVPVDETDVLVDALVTPSGV 210

                 .
gi 795345956 211 I 211
Cdd:PLN02812 211 I 211
 
Name Accession Description Interval E-value
RRM_MTHFSD cd12270
RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase ...
307-378 2.10e-33

RNA recognition motif (RRM) found in vertebrate methenyltetrahydrofolate synthetase domain-containing proteins; This subfamily corresponds to methenyltetrahydrofolate synthetase domain (MTHFSD), a putative RNA-binding protein found in various vertebrate species. It contains an N-terminal 5-formyltetrahydrofolate cyclo-ligase domain and a C-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of MTHFSD remains unclear.


Pssm-ID: 409713 [Multi-domain]  Cd Length: 72  Bit Score: 118.96  E-value: 2.10e-33
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 307 DVYVGNLPRDARVSDLKRALRELGSVPLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12270    1 TVYVGNIPRSVRVSDLKSALRERGINPLRITWQGARGKAFLHYSDMADADSAVSSLQGLRIGGNTLTVELAK 72
5-FTHF_cyc-lig pfam01812
5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or ...
9-207 6.59e-31

5-formyltetrahydrofolate cyclo-ligase family; 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyzes the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin.


Pssm-ID: 396398 [Multi-domain]  Cd Length: 186  Bit Score: 116.25  E-value: 6.59e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956    9 KQDIREQI---WDHMESQNLADFPRPVHHRIPNFKGAShaaeqlprlQAFKTARTIKVNPDAPQKNARFFVLESKKTLLV 85
Cdd:pfam01812   1 KQELRKQLlarRRALSEEERAAQSEALHQRLISLPEYQ---------KAKRVAAYVSVGGEIDTRELIDLLLEEGKRVLL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956   86 PTPRLRTGLFN--KITPPPGATKDILRKCATSQGVRNCSVPIGLDsrvLVDLVVVGSVAVSeKDGWRIGKGEGYADLEYA 163
Cdd:pfam01812  72 PVPRPGSGHLDmvRFTPYYPEDSLPRGAWGLKEPVEEELRELALG---QLDLVLVPGVAFD-RQGYRLGRGGGYYDRYLA 147
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 795345956  164 MMVSMGAvsNETPVVTIVHDCQVVDIPeelLEEHDITVDYILTP 207
Cdd:pfam01812 148 RLQGHGA--KPYTVGLAFDEQLVERLP---VEPHDVPVDEVVTE 186
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
308-374 1.05e-09

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 54.21  E-value: 1.05e-09
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSV----PLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:cd00590    1 LFVGNLPPDTTEEDLRELFSKFGEVvsvrIVRDRDGKSKGFAFVEFESPEDAEKALEALNGTELGGRPLKV 71
FAU1 COG0212
5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism];
133-210 1.31e-09

5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism];


Pssm-ID: 439982 [Multi-domain]  Cd Length: 186  Bit Score: 57.09  E-value: 1.31e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 133 VDLVVVGSVAVSeKDGWRIGKGEGYAD--LEyammvsmgAVSNETPVVTIVHDCQVVD-IPeelLEEHDITVDYILTPTR 209
Cdd:COG0212  118 IDLVLVPLLAFD-RRGYRLGYGGGYYDrtLA--------RLRPRPLTIGLAFDCQLVDeLP---VEPHDVPLDAIVTEKG 185

                 .
gi 795345956 210 V 210
Cdd:COG0212  186 V 186
RRM smart00360
RNA recognition motif;
308-374 1.74e-08

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 51.06  E-value: 1.74e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956   308 VYVGNLPRDARVSDLKRALRELGSV-----PLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:smart00360   2 LFVGNLPPDTTEEELRELFSKFGKVesvrlVRDKETGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKV 73
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
308-373 1.69e-07

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 48.00  E-value: 1.69e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 795345956  308 VYVGNLPRDARVSDLKRALRELGSV-----PLRLTWQgPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLR 373
Cdd:pfam00076   1 LFVGNLPPDTTEEDLKDLFSKFGPIksirlVRDETGR-SKGFAFVEFEDEEDAEKAIEALNGKELGGRELK 70
RRM_SRSF3_like cd12373
RNA recognition motif (RRM) found in serine/arginine-rich splicing factor 3 (SRSF3) and ...
308-378 4.33e-07

RNA recognition motif (RRM) found in serine/arginine-rich splicing factor 3 (SRSF3) and similar proteins; This subfamily corresponds to the RRM of two serine/arginine (SR) proteins, serine/arginine-rich splicing factor 3 (SRSF3) and serine/arginine-rich splicing factor 7 (SRSF7). SRSF3, also termed pre-mRNA-splicing factor SRp20, modulates alternative splicing by interacting with RNA cis-elements in a concentration- and cell differentiation-dependent manner. It is also involved in termination of transcription, alternative RNA polyadenylation, RNA export, and protein translation. SRSF3 is critical for cell proliferation, and tumor induction and maintenance. It can shuttle between the nucleus and cytoplasm. SRSF7, also termed splicing factor 9G8, plays a crucial role in both constitutive splicing and alternative splicing of many pre-mRNAs. Its localization and functions are tightly regulated by phosphorylation. SRSF7 is predominantly present in the nuclear and can shuttle between nucleus and cytoplasm. It cooperates with the export protein, Tap/NXF1, helps mRNA export to the cytoplasm, and enhances the expression of unspliced mRNA. Moreover, SRSF7 inhibits tau E10 inclusion through directly interacting with the proximal downstream intron of E10, a clustering region for frontotemporal dementia with Parkinsonism (FTDP) mutations. Both SRSF3 and SRSF7 contain a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal RS domain rich in serine-arginine dipeptides. The RRM domain is involved in RNA binding, and the RS domain has been implicated in protein shuttling and protein-protein interactions.


Pssm-ID: 409808 [Multi-domain]  Cd Length: 73  Bit Score: 46.85  E-value: 4.33e-07
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGsvPLRLTW--QGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12373    2 VYVGNLGPRVTKRELEDAFEKYG--PLRNVWvaRNPPGFAFVEFEDPRDAEDAVRALDGRRICGSRVRVELSR 72
RRM_CSTF2_RNA15_like cd12398
RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ...
308-377 6.27e-06

RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins; This subfamily corresponds to the RRM domain of CSTF2, its tau variant and eukaryotic homologs. CSTF2, also termed cleavage stimulation factor 64 kDa subunit (CstF64), is the vertebrate conterpart of yeast mRNA 3'-end-processing protein RNA15. It is expressed in all somatic tissues and is one of three cleavage stimulatory factor (CstF) subunits required for polyadenylation. CstF64 contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a CstF77-binding domain, a repeated MEARA helical region and a conserved C-terminal domain reported to bind the transcription factor PC-4. During polyadenylation, CstF interacts with the pre-mRNA through the RRM of CstF64 at U- or GU-rich sequences within 10 to 30 nucleotides downstream of the cleavage site. CSTF2T, also termed tauCstF64, is a paralog of the X-linked cleavage stimulation factor CstF64 protein that supports polyadenylation in most somatic cells. It is expressed during meiosis and subsequent haploid differentiation in a more limited set of tissues and cell types, largely in meiotic and postmeiotic male germ cells, and to a lesser extent in brain. The loss of CSTF2T will cause male infertility, as it is necessary for spermatogenesis and fertilization. Moreover, CSTF2T is required for expression of genes involved in morphological differentiation of spermatids, as well as for genes having products that function during interaction of motile spermatozoa with eggs. It promotes germ cell-specific patterns of polyadenylation by using its RRM to bind to different sequence elements downstream of polyadenylation sites than does CstF64. The family also includes yeast ortholog mRNA 3'-end-processing protein RNA15 and similar proteins. RNA15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3'-end processing factor from Saccharomyces cerevisiae. RNA recognition by CFIA is mediated by an N-terminal RRM, which is contained in the RNA15 subunit of the complex. The RRM of RNA15 has a strong preference for GU-rich RNAs, mediated by a binding pocket that is entirely conserved in both yeast and vertebrate RNA15 orthologs.


Pssm-ID: 409832 [Multi-domain]  Cd Length: 77  Bit Score: 43.66  E-value: 6.27e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSV-PLRL-----TWQgPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALA 377
Cdd:cd12398    3 VFVGNIPYDATEEQLKEIFSEVGPVvSFRLvtdreTGK-PKGYGFCEFRDAETALSAVRNLNGYELNGRPLRVDFA 77
RRM COG0724
RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];
309-383 8.05e-06

RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 440488 [Multi-domain]  Cd Length: 85  Bit Score: 43.55  E-value: 8.05e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 309 YVGNLPRDARVSDLKRALRELGSV------PLRLTwQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALARQQRD 382
Cdd:COG0724    5 YVGNLPYSVTEEDLRELFSEYGEVtsvkliTDRET-GRSRGFGFVEMPDDEEAQAAIEALNGAELMGRTLKVNEARPREE 83

                 .
gi 795345956 383 K 383
Cdd:COG0724   84 R 84
RRM_SRSF3 cd12645
RNA recognition motif (RRM) found in vertebrate serine/arginine-rich splicing factor 3 (SRSF3); ...
303-381 1.47e-04

RNA recognition motif (RRM) found in vertebrate serine/arginine-rich splicing factor 3 (SRSF3); This subgroup corresponds to the RRM of SRSF3, also termed pre-mRNA-splicing factor SRp20, a splicing regulatory serine/arginine (SR) protein that modulates alternative splicing by interacting with RNA cis-elements in a concentration- and cell differentiation-dependent manner. It is also involved in termination of transcription, alternative RNA polyadenylation, RNA export, and protein translation. SRSF3 is critical for cell proliferation and tumor induction and maintenance. SRSF3 can shuttle between the nucleus and cytoplasm. It contains a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal RS domain rich in serine-arginine dipeptides. The RRM domain is involved in RNA binding, and the RS domain has been implicated in protein shuttling and protein-protein interactions.


Pssm-ID: 241089 [Multi-domain]  Cd Length: 81  Bit Score: 40.02  E-value: 1.47e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 303 PLAADVYVGNLPRDARVSDLKRALRELGsvPLRLTW--QGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALARQQ 380
Cdd:cd12645    2 PLDCKVYVGNLGNNGNKTELERAFGYYG--PLRSVWvaRNPPGFAFVEFEDPRDAADAVRELDGRTLCGCRVRVELSNGE 79

                 .
gi 795345956 381 R 381
Cdd:cd12645   80 K 80
RRM_HP0827_like cd12399
RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; ...
308-374 1.50e-04

RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; This subfamily corresponds to the RRM of H. pylori HP0827, a putative ssDNA-binding protein 12rnp2 precursor, containing one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The ssDNA binding may be important in activation of HP0827.


Pssm-ID: 409833 [Multi-domain]  Cd Length: 75  Bit Score: 39.81  E-value: 1.50e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSV-----PLRLTWQGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:cd12399    1 LYVGNLPYSASEEQLKSLFGQFGAVfdvklPMDRETKRPRGFGFVELQEEESAEKAIAKLDGTDFMGRTIRV 72
RRM_CSTF2_CSTF2T cd12671
RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), cleavage ...
308-374 1.86e-04

RNA recognition motif (RRM) found in cleavage stimulation factor subunit 2 (CSTF2), cleavage stimulation factor subunit 2 tau variant (CSTF2T) and similar proteins; This subgroup corresponds to the RRM domain of CSTF2, its tau variant and eukaryotic homologs. CSTF2, also termed cleavage stimulation factor 64 kDa subunit (CstF64), is the vertebrate conterpart of yeast mRNA 3'-end-processing protein RNA15. It is expressed in all somatic tissues and is one of three cleavage stimulatory factor (CstF) subunits required for polyadenylation. CstF64 contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a CstF77-binding domain, a repeated MEARA helical region and a conserved C-terminal domain reported to bind the transcription factor PC-4. During polyadenylation, CstF interacts with the pre-mRNA through the RRM of CstF64 at U- or GU-rich sequences within 10 to 30 nucleotides downstream of the cleavage site. CSTF2T, also termed tauCstF64, is a paralog of the X-linked cleavage stimulation factor CstF64 protein that supports polyadenylation in most somatic cells. It is expressed during meiosis and subsequent haploid differentiation in a more limited set of tissues and cell types, largely in meiotic and postmeiotic male germ cells, and to a lesser extent in brain. The loss of CSTF2T will cause male infertility, as it is necessary for spermatogenesis and fertilization. Moreover, CSTF2T is required for expression of genes involved in morphological differentiation of spermatids, as well as for genes having products that function during interaction of motile spermatozoa with eggs. It promotes germ cell-specific patterns of polyadenylation by using its RRM to bind to different sequence elements downstream of polyadenylation sites than does CstF64.


Pssm-ID: 410072 [Multi-domain]  Cd Length: 85  Bit Score: 39.80  E-value: 1.86e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSV-PLRLTWQ----GPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:cd12671    9 VFVGNIPYEATEEQLKDIFSEVGPVvSFRLVYDretgKPKGYGFCEYQDQETALSAMRNLNGYELNGRALRV 80
RRM_NOL8 cd12226
RNA recognition motif (RRM) found in nucleolar protein 8 (NOL8) and similar proteins; This ...
308-378 1.93e-04

RNA recognition motif (RRM) found in nucleolar protein 8 (NOL8) and similar proteins; This model corresponds to the RRM of NOL8 (also termed Nop132) encoded by a novel NOL8 gene that is up-regulated in the majority of diffuse-type, but not intestinal-type, gastric cancers. Thus, NOL8 may be a good molecular target for treatment of diffuse-type gastric cancer. Also, NOL8 is a phosphorylated protein that contains an N-terminal RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), suggesting NOL8 is likely to function as a novel RNA-binding protein. It may be involved in regulation of gene expression at the post-transcriptional level or in ribosome biogenesis in cancer cells.


Pssm-ID: 409673 [Multi-domain]  Cd Length: 77  Bit Score: 39.48  E-value: 1.93e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSV-PLRL---TWQGPRRRAFLRY-PDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12226    2 LFVGGLSPSITEDDLERRFSRFGTVsDVEIirkKDAPDRGFAYIDLrTSEAALQKCLSTLNGVKWKGSRLKIQLAK 77
RRM1_SRSF1_like cd12338
RNA recognition motif 1 (RRM1) found in serine/arginine-rich splicing factor 1 (SRSF1) and ...
308-376 6.39e-04

RNA recognition motif 1 (RRM1) found in serine/arginine-rich splicing factor 1 (SRSF1) and similar proteins; This subgroup corresponds to the RRM1 in three serine/arginine (SR) proteins: serine/arginine-rich splicing factor 1 (SRSF1 or ASF-1), serine/arginine-rich splicing factor 9 (SRSF9 or SRp30C), and plant pre-mRNA-splicing factor SF2 (SR1). SRSF1 is a shuttling SR protein involved in constitutive and alternative splicing, nonsense-mediated mRNA decay (NMD), mRNA export and translation. It also functions as a splicing-factor oncoprotein that regulates apoptosis and proliferation to promote mammary epithelial cell transformation. SRSF9 has been implicated in the activity of many elements that control splice site selection, the alternative splicing of the glucocorticoid receptor beta in neutrophils and in the gonadotropin-releasing hormone pre-mRNA. It can also interact with other proteins implicated in alternative splicing, including YB-1, rSLM-1, rSLM-2, E4-ORF4, Nop30, and p32. Both, SRSF1 and SRSF9, contain two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a C-terminal RS domains rich in serine-arginine dipeptides. In contrast, SF2 contains two N-terminal RRMs and a C-terminal PSK domain rich in proline, serine and lysine residues.


Pssm-ID: 409775 [Multi-domain]  Cd Length: 72  Bit Score: 38.11  E-value: 6.39e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSVpLRLTWQGPRRR---AFLRYPDSAAAQQAVSCLQGLRLGTDTLRVAL 376
Cdd:cd12338    2 IYVGNLPGDIRERDIEDLFYKYGPI-LAIDLKNRRRGppfAFVEFEDPRDAEDAIRGRDGYDFDGYRLRVEF 72
RRM2_SXL cd12651
RNA recognition motif 2 (RRM2) found in Drosophila sex-lethal (SXL) and similar proteins; This ...
306-377 7.82e-04

RNA recognition motif 2 (RRM2) found in Drosophila sex-lethal (SXL) and similar proteins; This subfamily corresponds to the RRM2 of the sex-lethal protein (SXL) which governs sexual differentiation and X chromosome dosage compensation in Drosophila melanogaster. It induces female-specific alternative splicing of the transformer (tra) pre-mRNA by binding to the tra uridine-rich polypyrimidine tract at the non-sex-specific 3' splice site during the sex-determination process. SXL binds also to its own pre-mRNA and promotes female-specific alternative splicing. SXL contains an N-terminal Gly/Asn-rich domain that may be responsible for the protein-protein interaction, and tandem RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), that show high preference to bind single-stranded, uridine-rich target RNA transcripts.


Pssm-ID: 410054 [Multi-domain]  Cd Length: 81  Bit Score: 37.95  E-value: 7.82e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 795345956 306 ADVYVGNLPRDARVSDLKRALRELGSVP----LRLTWQG-PRRRAFLRYPDSAAAQQAVSCLQGLRL--GTDTLRVALA 377
Cdd:cd12651    3 TNLYVTNLPRTITEDELDTIFGAYGNIVqknlLRDKLTGrPRGVAFVRYDKREEAQAAISALNGTIPegGTQPLSVRLA 81
RRM_SRSF7 cd12646
RNA recognition motif (RRM) found in vertebrate serine/arginine-rich splicing factor 7 (SRSF7); ...
308-377 2.07e-03

RNA recognition motif (RRM) found in vertebrate serine/arginine-rich splicing factor 7 (SRSF7); This subgroup corresponds to the RRM of SRSF7, also termed splicing factor 9G8, is a splicing regulatory serine/arginine (SR) protein that plays a crucial role in both constitutive splicing and alternative splicing of many pre-mRNAs. Its localization and functions are tightly regulated by phosphorylation. SRSF7 is predominantly present in the nuclear and can shuttle between nucleus and cytoplasm. It cooperates with the export protein, Tap/NXF1, helps mRNA export to the cytoplasm, and enhances the expression of unspliced mRNA. SRSF7 inhibits tau E10 inclusion through directly interacting with the proximal downstream intron of E10, a clustering region for frontotemporal dementia with Parkinsonism (FTDP) mutations. SRSF7 contains a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a CCHC-type zinc knuckle motif in its median region, and a C-terminal RS domain rich in serine-arginine dipeptides. The RRM domain is involved in RNA binding, and the RS domain has been implicated in protein shuttling and protein-protein interactions.


Pssm-ID: 410050 [Multi-domain]  Cd Length: 77  Bit Score: 36.86  E-value: 2.07e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGsvPLRLTW--QGPRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALA 377
Cdd:cd12646    2 VYVGNLGTGAGKGELERAFSYYG--PLRTVWiaRNPPGFAFVEFEDPRDAEDAVRGLDGKVICGSRVRVELS 71
RRM2_NsCP33_like cd21608
RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ...
308-377 2.71e-03

RNA recognition motif 2 (RRM2) found in Nicotiana sylvestris chloroplastic 33 kDa ribonucleoprotein (NsCP33) and similar proteins; The family includes NsCP33, Arabidopsis thaliana chloroplastic 31 kDa ribonucleoprotein (CP31A) and mitochondrial glycine-rich RNA-binding protein 2 (AtGR-RBP2). NsCP33 may be involved in splicing and/or processing of chloroplast RNA's. AtCP31A, also called RNA-binding protein 1/2/3 (AtRBP33), or RNA-binding protein CP31A, or RNA-binding protein RNP-T, or RNA-binding protein cp31, is required for specific RNA editing events in chloroplasts and stabilizes specific chloroplast mRNAs, as well as for normal chloroplast development under cold stress conditions by stabilizing transcripts of numerous mRNAs under these conditions. CP31A may modulate telomere replication through RNA binding domains. AtGR-RBP2, also called AtRBG2, or glycine-rich protein 2 (AtGRP2), or mitochondrial RNA-binding protein 1a (At-mRBP1a), plays a role in RNA transcription or processing during stress. It binds RNAs and DNAs sequence with a preference to single-stranded nucleic acids. AtGR-RBP2 displays strong affinity to poly(U) sequence. It exerts cold and freezing tolerance, probably by exhibiting an RNA chaperone activity during the cold and freezing adaptation process. Some members in this family contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the second RRM motif.


Pssm-ID: 410187 [Multi-domain]  Cd Length: 76  Bit Score: 36.38  E-value: 2.71e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 795345956 308 VYVGNLPRDARVSDLKRALRELG---SVPL---RLTwqGpRRR--AFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALA 377
Cdd:cd21608    2 LYVGNLSWDTTEDDLRDLFSEFGeveSAKVitdRET--G-RSRgfGFVTFSTAEAAEAAIDALNGKELDGRSIVVNEA 76
PLN02812 PLN02812
5-formyltetrahydrofolate cyclo-ligase
133-211 2.94e-03

5-formyltetrahydrofolate cyclo-ligase


Pssm-ID: 178408  Cd Length: 211  Bit Score: 38.47  E-value: 2.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795345956 133 VDLVVVGSVAVSEKdGWRIGKGEGYADLEYAMMVSMGAVSNETP--VVTIVHDCQVVDIPEELLEEHDITVDYILTPTRV 210
Cdd:PLN02812 132 LDLLLLPGLAFDRS-GRRLGRGGGYYDTFLSKYQELAKEKGWKQplLVALSYSPQILDEGSVPVDETDVLVDALVTPSGV 210

                 .
gi 795345956 211 I 211
Cdd:PLN02812 211 I 211
RRM2_SART3 cd12392
RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells ...
308-378 4.31e-03

RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells 3 (SART3) and similar proteins; This subfamily corresponds to the RRM2 of SART3, also termed Tat-interacting protein of 110 kDa (Tip110), is an RNA-binding protein expressed in the nucleus of the majority of proliferating cells, including normal cells and malignant cells, but not in normal tissues except for the testes and fetal liver. It is involved in the regulation of mRNA splicing probably via its complex formation with RNA-binding protein with a serine-rich domain (RNPS1), a pre-mRNA-splicing factor. SART3 has also been identified as a nuclear Tat-interacting protein that regulates Tat transactivation activity through direct interaction and functions as an important cellular factor for HIV-1 gene expression and viral replication. In addition, SART3 is required for U6 snRNP targeting to Cajal bodies. It binds specifically and directly to the U6 snRNA, interacts transiently with the U6 and U4/U6 snRNPs, and promotes the reassembly of U4/U6 snRNPs after splicing in vitro. SART3 contains an N-terminal half-a-tetratricopeptide repeat (HAT)-rich domain, a nuclearlocalization signal (NLS) domain, and two C-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409826 [Multi-domain]  Cd Length: 81  Bit Score: 35.77  E-value: 4.31e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795345956 308 VYVGNLPRDARVSDLKRALRELGSVP-LRL-TWQG--PRRRAFLRYPDSAAAQQAVSCLQGLRLGTDTLRVALAR 378
Cdd:cd12392    5 LFVKGLPFSCTKEELEELFKQHGTVKdVRLvTYRNgkPKGLAYVEYENEADASQAVLKTDGTEIKDHTISVAISN 79
RRM1_VICKZ cd12358
RNA recognition motif 1 (RRM1) found in the VICKZ family proteins; Thid subfamily corresponds ...
309-367 4.38e-03

RNA recognition motif 1 (RRM1) found in the VICKZ family proteins; Thid subfamily corresponds to the RRM1 of IGF2BPs (or IMPs) found in the VICKZ family that have been implicated in the post-transcriptional regulation of several different RNAs and in subcytoplasmic localization of mRNAs during embryogenesis. IGF2BPs are composed of two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and four hnRNP K homology (KH) domains.


Pssm-ID: 240804 [Multi-domain]  Cd Length: 73  Bit Score: 35.81  E-value: 4.38e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 795345956 309 YVGNLPRDARVSDLKRALRELGSVPlrltwQGP--RRR--AFLRYPDSAAAQQAVSCLQGLRL 367
Cdd:cd12358    2 YIGNLSSDVNESDLRQLFEEHKIPV-----SSVlvKKGgyAFVDCPDQSWADKAIEKLNGKIL 59
RRM1_SRSF9 cd12598
RNA recognition motif 1 (RRM1) found in vertebrate serine/arginine-rich splicing factor 9 ...
308-374 6.90e-03

RNA recognition motif 1 (RRM1) found in vertebrate serine/arginine-rich splicing factor 9 (SRSF9); This subgroup corresponds to the RRM1 of SRSF9, also termed pre-mRNA-splicing factor SRp30C. SRSF9 is an essential splicing regulatory serine/arginine (SR) protein that has been implicated in the activity of many elements that control splice site selection, the alternative splicing of the glucocorticoid receptor beta in neutrophils and in the gonadotropin-releasing hormone pre-mRNA. SRSF9 can also interact with other proteins implicated in alternative splicing, including YB-1, rSLM-1, rSLM-2, E4-ORF4, Nop30, and p32. SRSF9 contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by an unusually short C-terminal RS domains rich in serine-arginine dipeptides.


Pssm-ID: 241042 [Multi-domain]  Cd Length: 72  Bit Score: 35.16  E-value: 6.90e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 795345956 308 VYVGNLPRDARVSDLKR------ALREL------GSVPLrltwqgprrrAFLRYPDSAAAQQAVSCLQGLRLGTDTLRV 374
Cdd:cd12598    2 IYVGNLPSDVREKDLEDlfykygRIRDIelknrrGLVPF----------AFVRFEDPRDAEDAVFGRNGYDFGQCRLRV 70
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH