Conserved domains on  [gi|807066338|ref|NP_001293018|]
zinc finger protein 236 isoform 1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
197-707 3.27e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.64  E-value: 3.27e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  197 FTYSCPHCGKTFQKPSQLTRHIRIHTGERPFKCSECGKAFNQK--GALQTHMIKHTGEKPHACAfcpaafsqKGNLQSHV 274
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSDLNS--------KSLPLSNS 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  275 QRVHSEVKNGPTYNCTECSCVFKSLG------SLNTHISKMHMGGPQNSTSSTETAHVLTATLFQ-TLPLQQTEAQATSA 347
Cdd:COG5048   104 KASSSSLSSSSSNSNDNNLLSSHSLPpssrdpQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHpPLPANSLSKDPSSN 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  348 SSQPSSQAVSDVIQQLLELSEPAPVESGQSPQPGQQLSitvgiNQDILQQALENSGLSSIPAAAHPNDSCHAKTSAPHAQ 427
Cdd:COG5048   184 LSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLE-----NSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSAS 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  428 NPDVSSVSNEQTdptdaeqekeqespekldkkekkmiKKKSPFLPGSIREENGVRWHVCPYCAKEFRKPSDLVRHIRIHT 507
Cdd:COG5048   259 ESPRSSLPTASS-------------------------QSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVN 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  508 HE----KPFKCPQcfrafavkstltahikthtgikafkcQYCMKSFSTSGSLKVHIRLHTGVRPFACP--HCDKKFrtsg 581
Cdd:COG5048   314 HSgeslKPFSCPY--------------------------SLCGKLFSRNDALKRHILLHTSISPAKEKllNSSSKF---- 363
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  582 hrkthiaSHFKHTELRKMRHQRKpakvrvgktnipvpdIPLQEPILITDLGLIQPIPKNQFFQSYFNNNFVNEADRPYKC 661
Cdd:COG5048   364 -------SPLLNNEPPQSLQQYK---------------DLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKN 421
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338  662 FYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRgFVSAGVLKAHIR 707
Cdd:COG5048   422 PPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS-FRRDLDLSNHGK 466
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1339-1615 7.47e-08

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 57.85  E-value: 7.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1339 ASVSAGGDLTVSL--TDGSLaTLEGIQLQLAANLVGPNVQISGIDAASINNITLQIDPSILQQTLQQGNLLAQQLTGEPG 1416
Cdd:COG3210   802 GTITAAGTTAINVtgSGGTI-TINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVG 880
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1417 LAPQNSSLQTSDSTVPASVVIQPISGLSLQPTVTSANLTIGPLSEQDSVLTTNSSGTQDLTQVMTSQGLVSPSGGPHEIT 1496
Cdd:COG3210   881 SGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASAS 960
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1497 LTINNSSLSQVLAQAAGPTATSSSGSPQEITLTISELNTTSGSLPSTTPMSPSAISTQNLVMSSSGVGGDASVTLTLADT 1576
Cdd:COG3210   961 DGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATA 1040
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 807066338 1577 QGMLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGS 1615
Cdd:COG3210  1041 GGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGG 1079
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1183-1207 3.33e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.05  E-value: 3.33e-06
                           10        20
                   ....*....|....*....|....*
gi 807066338  1183 DLVRHVRIHTGEKPYKCDECGKSFT 1207
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
52-234 3.79e-06

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338   52 SQFQRHMRDHERNDKPHRCDQCPQTFNVEFNLTLHK--CTHSGEDPT---CPV--CNKKFSRVASLKAHIMLHEKEENLI 124
Cdd:COG5048   274 SPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLrsVNHSGESLKpfsCPYslCGKLFSRNDALKRHILLHTSISPAK 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  125 CSECGDEFTlqsqlavHMEEHRQELAGTRQHACKACK-KEFETS-SELKEHMKTHYKIRVSSTRSYNRNIDRsgftYSCP 202
Cdd:COG5048   354 EKLLNSSSK-------FSPLLNNEPPQSLQQYKDLKNdKKSETLsNSCIRNFKRDSNLSLHIITHLSFRPYN----CKNP 422
                         170       180       190
                  ....*....|....*....|....*....|..
gi 807066338  203 HCGKTFQKPSQLTRHIRIHTGERPFKCSECGK 234
Cdd:COG5048   423 PCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS 454
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
953-1315 3.99e-06

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  953 QGSQFLEDNEDQSRRSYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRGFVSSGVLKSH---EKTHTGVKAFSCSV 1029
Cdd:COG5048    18 STPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSrhlRTHHNNPSDLNSKS 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1030 CNASFTTNGSLTRHMATHMSMKPYKCPFCEEGFRTTV--------HCKKHMKRHQTV-------PSAVSATGETEGGDIC 1094
Cdd:COG5048    98 LPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDpqlpdllsISNLRNNPLPGNnsssvntPQSNSLHPPLPANSLS 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1095 MEEEEEHSDRNASRKSRPEVITFTEEETAQLAKIRPQESATVSEKVLV-QSAAEKDRISELRDKQAELQDEPKHANcCTY 1173
Cdd:COG5048   178 KDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSsLPLTTNSQLSPKSLLSQSPSSLSSSDS-SSS 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1174 CPKSFKKPSDLVRHVRIH----------TGEKPYKCDECGKSFTVKSTLDCHVKT--HTGQKL--FSCHV--CSNAFSTK 1237
Cdd:COG5048   257 ASESPRSSLPTASSQSSSpnesdsssekGFSLPIKSKQCNISFSRSSPLTRHLRSvnHSGESLkpFSCPYslCGKLFSRN 336
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1238 GSLKVHMRLHTGAKPFKCP--HCELRFRTSGRRKTHMQFHYKPDPKKARKPMTrsSSEGLQPVNLLNSSSTDPNVFIMNN 1315
Cdd:COG5048   337 DALKRHILLHTSISPAKEKllNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSET--LSNSCIRNFKRDSNLSLHIITHLSF 414
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1739-1763 2.98e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.36  E-value: 2.98e-05
                           10        20
                   ....*....|....*....|....*
gi 807066338  1739 LERHSRIHTGERPFHCTLCEKAFNQ 1763
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
653-1057 8.95e-05

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 47.00  E-value: 8.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  653 NEADRPYKCFYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRG--FVSAGVLKAHIRTHTGLKSFKCLICNG-AFTTGGS 729
Cdd:COG5048    28 SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDksFSRPLELSRHLRTHHNNPSDLNSKSLPlSNSKASS 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  730 LRRHMGIHNDLRPYMCPYCQKTFKTSLNCKKHMKTHRYELAQQLQQHQQAASIDDSTvdqqsmqASTQMQVEIESDELPQ 809
Cdd:COG5048   108 SSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQS-------NSLHPPLPANSLSKDP 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  810 TAEVVAANPEAMLDLEPQHvvgtEEAGLGQQLADQPLEADEDGFVAPQDPL-RGHVDQFEEQSPAQQSfepaglPQGFTV 888
Cdd:COG5048   181 SSNLSLLISSNVSTSIPSS----SENSPLSSSYSIPSSSSDQNLENSSSSLpLTTNSQLSPKSLLSQS------PSSLSS 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  889 TDTYHQQPQFPPVQQLQDSSTLESQALS-TSFHQQSLLQAPSSDGMNVTTRLIQESSQEELDLqaqgsqflEDNEDQSRR 967
Cdd:COG5048   251 SDSSSSASESPRSSLPTASSQSSSPNESdSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVN--------HSGESLKPF 322
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  968 SYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRGFVSSGVLKS-------HEKTHTGVKAFSCSV--CNASFTTNG 1038
Cdd:COG5048   323 SCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNeppqslqQYKDLKNDKKSETLSnsCIRNFKRDS 402
                         410
                  ....*....|....*....
gi 807066338 1039 SLTRHMATHMSMKPYKCPF 1057
Cdd:COG5048   403 NLSLHIITHLSFRPYNCKN 421
SUF4-like super family cl41227
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1661-1709 4.37e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


The actual alignment was detected with superfamily member cd20908:

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.62  E-value: 4.37e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 807066338 1661 CLECDRAFSSAAVLMHHSKEVHGRerihgCPVCRKAFKRATHLKEHMQT 1709
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
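
The SUF4-like description states that OsSUF4 recognizes the 7-bp element 5'-CGGAAAT-3' and AtSUF4 a 15-bp element. As a rough illustration of what "contained in the promoter regions" means in practice, the sketch below scans a DNA string for exact matches of such an element on both strands. The function names and the example sequence are hypothetical and not part of the CDD record; real binding-site prediction would normally tolerate mismatches, which this sketch does not.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    table = str.maketrans("ACGT", "TGCA")
    return seq.upper().translate(table)[::-1]


def find_element(promoter: str, element: str) -> list[tuple[int, str]]:
    """Locate exact matches of `element` on either strand of `promoter`.

    Returns (0-based start on the forward strand, strand) pairs, where
    strand is "+" for a direct match and "-" for a match to the reverse
    complement. Overlapping matches are reported.
    """
    promoter = promoter.upper()
    element = element.upper()
    hits = []
    for strand, pattern in (("+", element), ("-", reverse_complement(element))):
        start = promoter.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = promoter.find(pattern, start + 1)
    return sorted(hits)


# Element taken from the domain description above.
OSSUF4_ELEMENT = "CGGAAAT"

# Hypothetical promoter fragment, made up for illustration only.
example_promoter = "TTCGGAAATGGATTTCCGAA"
hits = find_element(example_promoter, OSSUF4_ELEMENT)
```

Note that a palindromic element would be reported once per strand at the same position; the 7-bp element used here is not palindromic, so that case does not arise.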
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1766-1791 5.08e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 5.08e-04
                           10        20
                   ....*....|....*....|....*.
gi 807066338  1766 ALQVHMKKHTGERPYKCAYCVMGFTQ 1791
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
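
Each hit above carries an "Interval E-value" line and a "Pssm-ID: ..." summary line. For readers who want to tabulate the hits, here is a minimal parsing sketch, assuming the lines keep exactly the layout shown in this listing. The names `HitSummary`, `parse_summary`, and `parse_interval` are illustrative and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Layout assumed from the listing, e.g.
# "Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.64  E-value: 3.27e-09"
_SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*Bit Score:\s*([\d.]+)\s*E-value:\s*(\S+)"
)

# Layout assumed from the listing, e.g. "197-707 3.27e-09"
_INTERVAL = re.compile(r"^\s*(\d+)-(\d+)\s+(\S+)\s*$")


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ...' summary line into a HitSummary."""
    match = _SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match.group(1)),
        multi_domain=match.group(2) is not None,
        cd_length=int(match.group(3)),
        bit_score=float(match.group(4)),
        e_value=float(match.group(5)),
    )


def parse_interval(line: str) -> tuple[int, int, float]:
    """Parse an 'start-end e-value' line into (start, end, e_value)."""
    match = _INTERVAL.match(line)
    if match is None:
        raise ValueError(f"not an interval line: {line!r}")
    return int(match.group(1)), int(match.group(2)), float(match.group(3))


summary = parse_summary(
    "Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.64  E-value: 3.27e-09"
)
start, end, e_value = parse_interval("197-707 3.27e-09")
covered_residues = end - start + 1  # query residues spanned by the hit
```

Sorting the parsed hits by `e_value` reproduces the order used in the listing, since the page lists hits from the most to the least significant.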
 
Name Accession Description Interval E-value
zf-H2C2_2 pfam13465
Zinc-finger double domain;
214-238 8.27e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.90  E-value: 8.27e-06
                           10        20
                   ....*....|....*....|....*
gi 807066338   214 LTRHIRIHTGERPFKCSECGKAFNQ 238
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
983-1008 2.25e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.74  E-value: 2.25e-05
                           10        20
                   ....*....|....*....|....*.
gi 807066338   983 HLKQHVRSHTGEKPYKCKLCGRGFVS 1008
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1661-1709 4.37e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.62  E-value: 4.37e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 807066338 1661 CLECDRAFSSAAVLMHHSKEVHGRerihgCPVCRKAFKRATHLKEHMQT 1709
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1395-1597 4.59e-04

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 45.42  E-value: 4.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1395 SILQQTLQQGNLLAQQLTGEPG--LAPQNSSLQTSDSTVPASVVIQPISGLSLQPTVTSANLTIGPLSEQDSVLTTNSSG 1472
Cdd:NF033176    6 SIVWNHSRQAWVVASELARGHGfvLAKNTLLVLAVASTIGNAFAQNISSGVVSGGVVSSGETQVVYSNGQTSNATVNSGG 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1473 TQDLTqvmtsqglvspSGGPHEITlTINNSSLSQVLAQAAGPTATSSSGSPQEItltiselntTSGSLPSTTPMSPSAIS 1552
Cdd:NF033176   86 IQNVN-----------NGGKTTST-TVNSSGAQNVGNSGTAISTIVNSGGVQRV---------SSGGVTSATSLSGGAQN 144
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338 1553 TQNLvmsssgvgGDASVTLTL-ADTQGMLSGGLDTVTlNITSQGQQ 1597
Cdd:NF033176  145 IYNL--------GHASNTVIFnGGNQTIFSGGISDDT-NISSGGQQ 181
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
685-734 6.10e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36me3 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.23  E-value: 6.10e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 807066338  685 KPFkCSQCGRGFVSAGVLKAHIRTHTglksFKCLICNGAFTTGGSLRRHM 734
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
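
The coordinates in an alignment block like the one above can be sanity-checked: the number of residues on a row should equal end minus start plus one, with dashes counted as gaps. The following is a minimal sketch of that check in Python. The helper names (`parse_alignment_row`, `residue_count`, `row_is_consistent`) are hypothetical and not part of any NCBI tool, and the row layout (label, start, aligned text, end, separated by whitespace) is assumed from the rows shown on this page.

```python
def parse_alignment_row(row: str):
    """Split one alignment row into (label, start, aligned_text, end).

    Assumes the layout seen on this page: a label (one or two tokens),
    the start coordinate, the aligned text, and the end coordinate.
    """
    fields = row.split()
    start, aligned, end = fields[-3], fields[-2], fields[-1]
    label = " ".join(fields[:-3])
    return label, int(start), aligned, int(end)


def residue_count(aligned: str) -> int:
    """Count residues; '-' marks a gap and is not a residue.

    Upper- and lower-case letters are both counted as residues.
    """
    return sum(ch.isalpha() for ch in aligned)


def row_is_consistent(row: str) -> bool:
    """True when the residue count matches end - start + 1."""
    _, start, aligned, end = parse_alignment_row(row)
    return residue_count(aligned) == end - start + 1


# Rows copied from the SUF4-like hit (interval 685-734) above.
QUERY_ROW = "gi 807066338  685 KPFkCSQCGRGFVSAGVLKAHIRTHTglksFKCLICNGAFTTGGSLRRHM 734"
MODEL_ROW = "Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45"

if __name__ == "__main__":
    for row in (QUERY_ROW, MODEL_ROW):
        label, start, aligned, end = parse_alignment_row(row)
        print(label, start, end, residue_count(aligned), row_is_consistent(row))
```

Both rows span 50 alignment columns; the model row carries five gap characters, which is why it covers only positions 1-45 of the 82-residue model.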
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
971-1015 9.57e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 methylation deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 39.85  E-value: 9.57e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 807066338  971 CDYCNKGFKKSSHLKQHVRSHTgekpYKCKLCGRGFVSSGVLKSH 1015
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
201-246 2.07e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 methylation deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 2.07e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338  201 CPHCGKTFQKPSQLTRHIRIHTgerpFKCSECGKAFNQKGALQTHM 246
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-H2C2_2 pfam13465
Zinc-finger double domain;
729-754 5.16e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.20  E-value: 5.16e-03
                           10        20
                   ....*....|....*....|....*.
gi 807066338   729 SLRRHMGIHNDLRPYMCPYCQKTFKT 754
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PHA00733 PHA00733
hypothetical protein
537-587 7.87e-03

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.32  E-value: 7.87e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 807066338  537 IKAFKCQYCMKSFSTSGSLKVHIRL--HTGVrpfaCPHCDKKFRTSGHRKTHI 587
Cdd:PHA00733   71 VSPYVCPLCLMPFSSSVSLKQHIRYteHSKV----CPVCGKEFRNTDSTLDHV 119
PHA00733 PHA00733
hypothetical protein
96-142 8.66e-03

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.32  E-value: 8.66e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 807066338   96 TCPVCNKKFSRVASLKAHImlHEKEENLICSECGDEFTLQSQLAVHM 142
Cdd:PHA00733   75 VCPLCLMPFSSSVSLKQHI--RYTEHSKVCPVCGKEFRNTDSTLDHV 119
SP3_N cd22537
N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins ...
1290-1575 9.48e-03

N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP1 and SP3 can interact with and recruit a large number of proteins including the transcription initiation complex, histone modifying enzymes, and chromatin remodeling complexes, which strongly suggest that SP1 and SP3 are important transcription factors in remodeling chromatin and the regulation of gene expression. SP3 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions.
This model represents the N-terminal domain of SP3.


Pssm-ID: 411774 [Multi-domain]  Cd Length: 574  Bit Score: 40.70  E-value: 9.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1290 SSSEGLQPVNLLNSSSTDPNVFIMNNSvlTGQFDQNLLQPGLVGQAILPASVSAGG----DLTVSLTDGSLATLEGIQLQ 1365
Cdd:cd22537   259 NSGESGKVSPDINETNTNADLFVPTSS--SSQLPVTIDSTGILQQNASSLTTVSGQvhtsDLQGNYIQAPVSDETQAQNI 336
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1366 LAANLVGPNVQISGIDAASinnitlQIDPSILQQTLQQGNLLAQQLTGE---PGLAPQNSSLQTSDstvPASVVIQpisg 1442
Cdd:cd22537   337 QVSTAQPSVQQIQLHESQQ------PTSQAQIVQGITQQAIQGVQALGAqaiPQQALQNLQLQLLN---PGTFLIQ---- 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1443 lslQPTVTSanltigplSEQDSVLTTNSSGTQDLtqvmtsQGLVSPSGGPHEITLT-INNSSLSQVLAQAAG---PTATS 1518
Cdd:cd22537   404 ---AQTVTP--------SGQITWQTFQVQGVQNL------QNLQIQNAPAQQITLTpVQTLTLGQVGAGGAItstPVSLS 466
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 807066338 1519 SSGSPQEITLTISELNTTSGSL-PSTTPMSPSAISTQNLVMSSSGVGGDASVTLTLAD 1575
Cdd:cd22537   467 TGQLPNLQTVTVNSIDSAGIQLqQSENADSPADIQIKEEEPDSEEWQLSGDSTLNTND 524
 
Name Accession Description Interval E-value
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
197-707 3.27e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.64  E-value: 3.27e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  197 FTYSCPHCGKTFQKPSQLTRHIRIHTGERPFKCSECGKAFNQK--GALQTHMIKHTGEKPHACAfcpaafsqKGNLQSHV 274
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSDLNS--------KSLPLSNS 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  275 QRVHSEVKNGPTYNCTECSCVFKSLG------SLNTHISKMHMGGPQNSTSSTETAHVLTATLFQ-TLPLQQTEAQATSA 347
Cdd:COG5048   104 KASSSSLSSSSSNSNDNNLLSSHSLPpssrdpQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHpPLPANSLSKDPSSN 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  348 SSQPSSQAVSDVIQQLLELSEPAPVESGQSPQPGQQLSitvgiNQDILQQALENSGLSSIPAAAHPNDSCHAKTSAPHAQ 427
Cdd:COG5048   184 LSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLE-----NSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSAS 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  428 NPDVSSVSNEQTdptdaeqekeqespekldkkekkmiKKKSPFLPGSIREENGVRWHVCPYCAKEFRKPSDLVRHIRIHT 507
Cdd:COG5048   259 ESPRSSLPTASS-------------------------QSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVN 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  508 HE----KPFKCPQcfrafavkstltahikthtgikafkcQYCMKSFSTSGSLKVHIRLHTGVRPFACP--HCDKKFrtsg 581
Cdd:COG5048   314 HSgeslKPFSCPY--------------------------SLCGKLFSRNDALKRHILLHTSISPAKEKllNSSSKF---- 363
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  582 hrkthiaSHFKHTELRKMRHQRKpakvrvgktnipvpdIPLQEPILITDLGLIQPIPKNQFFQSYFNNNFVNEADRPYKC 661
Cdd:COG5048   364 -------SPLLNNEPPQSLQQYK---------------DLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKN 421
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338  662 FYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRgFVSAGVLKAHIR 707
Cdd:COG5048   422 PPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS-FRRDLDLSNHGK 466
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1339-1615 7.47e-08

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 57.85  E-value: 7.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1339 ASVSAGGDLTVSL--TDGSLaTLEGIQLQLAANLVGPNVQISGIDAASINNITLQIDPSILQQTLQQGNLLAQQLTGEPG 1416
Cdd:COG3210   802 GTITAAGTTAINVtgSGGTI-TINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVG 880
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1417 LAPQNSSLQTSDSTVPASVVIQPISGLSLQPTVTSANLTIGPLSEQDSVLTTNSSGTQDLTQVMTSQGLVSPSGGPHEIT 1496
Cdd:COG3210   881 SGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASAS 960
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1497 LTINNSSLSQVLAQAAGPTATSSSGSPQEITLTISELNTTSGSLPSTTPMSPSAISTQNLVMSSSGVGGDASVTLTLADT 1576
Cdd:COG3210   961 DGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATA 1040
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 807066338 1577 QGMLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAGS 1615
Cdd:COG3210  1041 GGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGG 1079
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1299-1651 2.25e-07

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 56.31  E-value: 2.25e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1299 NLLNSSSTDPNVFIMNNSVLTGQFDQNLLQ--PGLVGQAILPASVSAGGDLTVSLTDGSLATLEGIQLQLAANLVGPNVQ 1376
Cdd:COG3210   639 VGAALSGTGSGTTGTASANGSNTTGVNTAGgtGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQ 718
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1377 ISGIDAASINNITLQIDPSILQQTLQQGNLLAqqlTGEPGlapqNSSLQTSDSTVpasvviqpISGLSLQPTVTSANLTI 1456
Cdd:COG3210   719 IGALANANGDTVTFGNLGTGATLTLNAGVTIT---SGNAG----TLSIGLTANTT--------ASGTTLTLANANGNTSA 783
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1457 GplseqdsvlTTNSSGTQDLTQVMTSQGLVSpSGGPHEITLTINNSSLSQVLAQAAGPTATSSSGSPQEITLTISELNTT 1536
Cdd:COG3210   784 G---------ATLDNAGAEISIDITADGTIT-AAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1537 SGSLPSTTPMSPSAISTQNLVMSSSGVGGDASVT--LTLADTQGMLSGGLDTVTLNITSQGQQFPALLTDPSLSGQGGAG 1614
Cdd:COG3210   854 SDGASGGGTAGANSGSLAATAASITVGSGGVATStgTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 807066338 1615 SPQVILVSHTPQSASAaceeIAYQVAGVSGNLAPGNQ 1651
Cdd:COG3210   934 GGTGAGNGTTALSGTQ----GNAGLSAASASDGAGDT 966
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1183-1207 3.33e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 45.05  E-value: 3.33e-06
                           10        20
                   ....*....|....*....|....*
gi 807066338  1183 DLVRHVRIHTGEKPYKCDECGKSFT 1207
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFK 25
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
52-234 3.79e-06

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338   52 SQFQRHMRDHERNDKPHRCDQCPQTFNVEFNLTLHK--CTHSGEDPT---CPV--CNKKFSRVASLKAHIMLHEKEENLI 124
Cdd:COG5048   274 SPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLrsVNHSGESLKpfsCPYslCGKLFSRNDALKRHILLHTSISPAK 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  125 CSECGDEFTlqsqlavHMEEHRQELAGTRQHACKACK-KEFETS-SELKEHMKTHYKIRVSSTRSYNRNIDRsgftYSCP 202
Cdd:COG5048   354 EKLLNSSSK-------FSPLLNNEPPQSLQQYKDLKNdKKSETLsNSCIRNFKRDSNLSLHIITHLSFRPYN----CKNP 422
                         170       180       190
                  ....*....|....*....|....*....|..
gi 807066338  203 HCGKTFQKPSQLTRHIRIHTGERPFKCSECGK 234
Cdd:COG5048   423 PCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS 454
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
953-1315 3.99e-06

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 51.62  E-value: 3.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  953 QGSQFLEDNEDQSRRSYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRGFVSSGVLKSH---EKTHTGVKAFSCSV 1029
Cdd:COG5048    18 STPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSrhlRTHHNNPSDLNSKS 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1030 CNASFTTNGSLTRHMATHMSMKPYKCPFCEEGFRTTV--------HCKKHMKRHQTV-------PSAVSATGETEGGDIC 1094
Cdd:COG5048    98 LPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDpqlpdllsISNLRNNPLPGNnsssvntPQSNSLHPPLPANSLS 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1095 MEEEEEHSDRNASRKSRPEVITFTEEETAQLAKIRPQESATVSEKVLV-QSAAEKDRISELRDKQAELQDEPKHANcCTY 1173
Cdd:COG5048   178 KDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSsLPLTTNSQLSPKSLLSQSPSSLSSSDS-SSS 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1174 CPKSFKKPSDLVRHVRIH----------TGEKPYKCDECGKSFTVKSTLDCHVKT--HTGQKL--FSCHV--CSNAFSTK 1237
Cdd:COG5048   257 ASESPRSSLPTASSQSSSpnesdsssekGFSLPIKSKQCNISFSRSSPLTRHLRSvnHSGESLkpFSCPYslCGKLFSRN 336
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1238 GSLKVHMRLHTGAKPFKCP--HCELRFRTSGRRKTHMQFHYKPDPKKARKPMTrsSSEGLQPVNLLNSSSTDPNVFIMNN 1315
Cdd:COG5048   337 DALKRHILLHTSISPAKEKllNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSET--LSNSCIRNFKRDSNLSLHIITHLSF 414
zf-H2C2_2 pfam13465
Zinc-finger double domain;
214-238 8.27e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.90  E-value: 8.27e-06
                           10        20
                   ....*....|....*....|....*
gi 807066338   214 LTRHIRIHTGERPFKCSECGKAFNQ 238
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
673-698 2.10e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.74  E-value: 2.10e-05
                           10        20
                   ....*....|....*....|....*.
gi 807066338   673 HLKQHIRSHTGEKPFKCSQCGRGFVS 698
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
983-1008 2.25e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.74  E-value: 2.25e-05
                           10        20
                   ....*....|....*....|....*.
gi 807066338   983 HLKQHVRSHTGEKPYKCKLCGRGFVS 1008
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1739-1763 2.98e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 42.36  E-value: 2.98e-05
                           10        20
                   ....*....|....*....|....*
gi 807066338  1739 LERHSRIHTGERPFHCTLCEKAFNQ 1763
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
653-1057 8.95e-05

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 47.00  E-value: 8.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  653 NEADRPYKCFYCHRAYKKSCHLKQHIRSHTGEKPFKCSQCGRG--FVSAGVLKAHIRTHTGLKSFKCLICNG-AFTTGGS 729
Cdd:COG5048    28 SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDksFSRPLELSRHLRTHHNNPSDLNSKSLPlSNSKASS 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  730 LRRHMGIHNDLRPYMCPYCQKTFKTSLNCKKHMKTHRYELAQQLQQHQQAASIDDSTvdqqsmqASTQMQVEIESDELPQ 809
Cdd:COG5048   108 SSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQS-------NSLHPPLPANSLSKDP 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  810 TAEVVAANPEAMLDLEPQHvvgtEEAGLGQQLADQPLEADEDGFVAPQDPL-RGHVDQFEEQSPAQQSfepaglPQGFTV 888
Cdd:COG5048   181 SSNLSLLISSNVSTSIPSS----SENSPLSSSYSIPSSSSDQNLENSSSSLpLTTNSQLSPKSLLSQS------PSSLSS 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  889 TDTYHQQPQFPPVQQLQDSSTLESQALS-TSFHQQSLLQAPSSDGMNVTTRLIQESSQEELDLqaqgsqflEDNEDQSRR 967
Cdd:COG5048   251 SDSSSSASESPRSSLPTASSQSSSPNESdSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVN--------HSGESLKPF 322
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  968 SYRCDYCNKGFKKSSHLKQHVRSHTGEKPYKCKLCGRGFVSSGVLKS-------HEKTHTGVKAFSCSV--CNASFTTNG 1038
Cdd:COG5048   323 SCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNeppqslqQYKDLKNDKKSETLSnsCIRNFKRDS 402
                         410
                  ....*....|....*....
gi 807066338 1039 SLTRHMATHMSMKPYKCPF 1057
Cdd:COG5048   403 NLSLHIITHLSFRPYNCKN 421
zf-H2C2_2 pfam13465
Zinc-finger double domain;
554-579 2.51e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.66  E-value: 2.51e-04
                           10        20
                   ....*....|....*....|....*.
gi 807066338   554 SLKVHIRLHTGVRPFACPHCDKKFRT 579
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
199-221 2.78e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.59  E-value: 2.78e-04
                           10        20
                   ....*....|....*....|...
gi 807066338   199 YSCPHCGKTFQKPSQLTRHIRIH 221
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
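
The pattern quoted in the zf-C2H2 description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], translates almost token for token into a regular expression. The sketch below is one possible translation in Python, not the model Pfam or CDD actually uses for scoring (those are profile-based). It assumes that the '#' positions, like 'X', accept any residue; `C2H2_PATTERN` and `is_c2h2` are hypothetical names introduced here. The two test strings are the query and model rows of the alignment above.

```python
import re

# Token-for-token translation of
#   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' (fold-stabilising position) and 'X' both match any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)

QUERY = "YSCPHCGKTFQKPSQLTRHIRIH"      # query residues 199-221
CONSENSUS = "YKCPDCGKSFSRKSNLKRHLRTH"  # pfam00096 positions 1-23


def is_c2h2(segment: str) -> bool:
    """True when the whole segment fits the C2H2 pattern."""
    return C2H2_PATTERN.fullmatch(segment) is not None


if __name__ == "__main__":
    print(is_c2h2(QUERY), is_c2h2(CONSENSUS))
```

A regular expression of this kind only tests spacing of the Cys and His residues; it gives no score and will accept segments that a profile search would reject, so it is best treated as a quick filter.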
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1661-1709 4.37e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 methylation deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.62  E-value: 4.37e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 807066338 1661 CLECDRAFSSAAVLMHHSKEVHGRerihgCPVCRKAFKRATHLKEHMQT 1709
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKHFK-----CHICHKKLYTAGGLAVHCLQ 47
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1039-1064 4.43e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 4.43e-04
                           10        20
                   ....*....|....*....|....*.
gi 807066338  1039 SLTRHMATHMSMKPYKCPFCEEGFRT 1064
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1395-1597 4.59e-04

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 45.42  E-value: 4.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1395 SILQQTLQQGNLLAQQLTGEPG--LAPQNSSLQTSDSTVPASVVIQPISGLSLQPTVTSANLTIGPLSEQDSVLTTNSSG 1472
Cdd:NF033176    6 SIVWNHSRQAWVVASELARGHGfvLAKNTLLVLAVASTIGNAFAQNISSGVVSGGVVSSGETQVVYSNGQTSNATVNSGG 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1473 TQDLTqvmtsqglvspSGGPHEITlTINNSSLSQVLAQAAGPTATSSSGSPQEItltiselntTSGSLPSTTPMSPSAIS 1552
Cdd:NF033176   86 IQNVN-----------NGGKTTST-TVNSSGAQNVGNSGTAISTIVNSGGVQRV---------SSGGVTSATSLSGGAQN 144
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338 1553 TQNLvmsssgvgGDASVTLTL-ADTQGMLSGGLDTVTlNITSQGQQ 1597
Cdd:NF033176  145 IYNL--------GHASNTVIFnGGNQTIFSGGISDDT-NISSGGQQ 181
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1766-1791 5.08e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 5.08e-04
                           10        20
                   ....*....|....*....|....*.
gi 807066338  1766 ALQVHMKKHTGERPYKCAYCVMGFTQ 1791
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
685-734 6.10e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 methylation deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 40.23  E-value: 6.10e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 807066338  685 KPFkCSQCGRGFVSAGVLKAHIRTHTglksFKCLICNGAFTTGGSLRRHM 734
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-H2C2_2 pfam13465
Zinc-finger double domain;
1239-1264 6.89e-04

Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.51  E-value: 6.89e-04
                           10        20
                   ....*....|....*....|....*.
gi 807066338  1239 SLKVHMRLHTGAKPFKCPHCELRFRT 1264
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
971-1015 9.57e-04

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 39.85  E-value: 9.57e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 807066338  971 CDYCNKGFKKSSHLKQHVRSHTgekpYKCKLCGRGFVSSGVLKSH 1015
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
969-991 1.55e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
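
The pattern quoted above can be read as a regular expression. The following is a minimal sketch rather than the method Pfam or CDD uses (those rely on profile models, not a regex). It assumes that both the `#` and `X` positions accept any residue, since the description does not say which residues `#` allows; the names `C2H2_PATTERN` and `find_c2h2` are hypothetical.

```python
import re

# Pattern from the description:
#   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' and 'X' are both treated as "any residue".
C2H2_PATTERN = re.compile(
    r"."       # '#' fold-stabilising position
    r"."       # X
    r"C"       # first zinc-coordinating cysteine
    r".{1,5}"  # X(1-5)
    r"C"       # second zinc-coordinating cysteine
    r".{3}"    # X3
    r"."       # '#'
    r".{5}"    # X5
    r"."       # '#'
    r".{2}"    # X2
    r"H"       # first zinc-coordinating histidine
    r".{3,6}"  # X(3-6)
    r"[HC]"    # final position: His or Cys
)


def find_c2h2(seq):
    """Return (start, end, match) tuples, 1-based inclusive, for
    non-overlapping matches of the pattern in a protein sequence."""
    return [(m.start() + 1, m.end(), m.group())
            for m in C2H2_PATTERN.finditer(seq.upper())]


if __name__ == "__main__":
    # Query segment 969-991 and the pfam00096 consensus from this page.
    for fragment in ("YRCDYCNKGFKKSSHLKQHVRSH", "YKCPDCGKSFSRKSNLKRHLRTH"):
        print(fragment, find_c2h2(fragment))
```

Because every non-anchor position is a wildcard, this sketch will over-predict on long sequences; it is only meant to make the spacing of the Cys and His residues concrete.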


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 37.66  E-value: 1.55e-03
                           10        20
                   ....*....|....*....|...
gi 807066338   969 YRCDYCNKGFKKSSHLKQHVRSH 991
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
201-246 2.07e-03

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 2.07e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 807066338  201 CPHCGKTFQKPSQLTRHIRIHTgerpFKCSECGKAFNQKGALQTHM 246
Cdd:cd20908     4 CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
227-249 2.81e-03

Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.89  E-value: 2.81e-03
                           10        20
                   ....*....|....*....|...
gi 807066338   227 FKCSECGKAFNQKGALQTHMIKH 249
Cdd:pfam00096    1 YKCPDCGKSFSRKSNLKRHLRTH 23
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
508-765 3.83e-03

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 41.99  E-value: 3.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  508 HEKPFKCPQCFRAFAVKSTLTAHIKTHTGIKAFKCQYCM--KSFSTSGSLKVHIRLHTG--------------------- 564
Cdd:COG5048    30 APRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGcdKSFSRPLELSRHLRTHHNnpsdlnskslplsnskassss 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  565 --------VRPFACPHCDKKFRT------SGHRKTHIASHFKHTELRKMRHQRKPAKVRVGKTNIPVPDIPLQEPILITD 630
Cdd:COG5048   110 lsssssnsNDNNLLSSHSLPPSSrdpqlpDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLIS 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  631 LGLIQPIPK------NQFFQSYFNNNFVNEADRPYKCFYCHRAYK-KSCHLKQHIRSHTGEKPFKCS--------QCGRG 695
Cdd:COG5048   190 SNVSTSIPSssenspLSSSYSIPSSSSDQNLENSSSSLPLTTNSQlSPKSLLSQSPSSLSSSDSSSSasesprssLPTAS 269
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 807066338  696 FVSAGVLKAHIRTHTG-LKSFKCLICNGAFTTGGSLRRHM--GIHN--DLRPYMCPY--CQKTFKTSLNCKKHMKTH 765
Cdd:COG5048   270 SQSSSPNESDSSSEKGfSLPIKSKQCNISFSRSSPLTRHLrsVNHSgeSLKPFSCPYslCGKLFSRNDALKRHILLH 346
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
510-558 3.85e-03

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.92  E-value: 3.85e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 807066338  510 KPFkCPQCFRAFAVKSTLTAHIKTHTgikaFKCQYCMKSFSTSGSLKVH 558
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-H2C2_2 pfam13465
Zinc-finger double domain;
729-754 5.16e-03

Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.20  E-value: 5.16e-03
                           10        20
                   ....*....|....*....|....*.
gi 807066338   729 SLRRHMGIHNDLRPYMCPYCQKTFKT 754
Cdd:pfam13465    1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
1195-1244 5.16e-03

Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.54  E-value: 5.16e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 807066338 1195 KPYkCDECGKSFTVKSTLDCHVKTHTgqklFSCHVCSNAFSTKGSLKVHM 1244
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
1707-1775 6.74e-03

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 41.22  E-value: 6.74e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 807066338 1707 MQTHQAGPSLSSQKPRVFKCDTCEKAFAKPSQLERHSRIHTGERPFHCTLCEKAFNQK--SALQVHMKKHT 1775
Cdd:COG5048    17 SSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHH 87
PHA00733 PHA00733
hypothetical protein
537-587 7.87e-03

Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.32  E-value: 7.87e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 807066338  537 IKAFKCQYCMKSFSTSGSLKVHIRL--HTGVrpfaCPHCDKKFRTSGHRKTHI 587
Cdd:PHA00733   71 VSPYVCPLCLMPFSSSVSLKQHIRYteHSKV----CPVCGKEFRNTDSTLDHV 119
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
993-1073 8.23e-03

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 40.86  E-value: 8.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338  993 GEKPYKCKL--CGRGFVSSGVLKSHEKT-HtgvkafscsvCNASFTTNGSLTRHMATHMSMKPYKCPFCEEGFRTTVHCK 1069
Cdd:COG5189   346 DGKPYKCPVegCNKKYKNQNGLKYHMLHgH----------QNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLK 415

                  ....
gi 807066338 1070 KHMK 1073
Cdd:COG5189   416 YHRK 419
PHA00733 PHA00733
hypothetical protein
96-142 8.66e-03

Pssm-ID: 177301  Cd Length: 128  Bit Score: 38.32  E-value: 8.66e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 807066338   96 TCPVCNKKFSRVASLKAHImlHEKEENLICSECGDEFTLQSQLAVHM 142
Cdd:PHA00733   75 VCPLCLMPFSSSVSLKQHI--RYTEHSKVCPVCGKEFRNTDSTLDHV 119
zf-H2C2_2 pfam13465
Zinc-finger double domain;
527-551 9.12e-03

Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.42  E-value: 9.12e-03
                           10        20
                   ....*....|....*....|....*
gi 807066338   527 LTAHIKTHTGIKAFKCQYCMKSFST 551
Cdd:pfam13465    2 LKRHMRTHTGEKPYKCPECGKSFKS 26
SP3_N cd22537
N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins ...
1290-1575 9.48e-03

N-terminal domain of transcription factor Specificity Protein (SP) 3; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP1 and SP3 can interact with and recruit a large number of proteins including the transcription initiation complex, histone modifying enzymes, and chromatin remodeling complexes, which strongly suggests that SP1 and SP3 are important transcription factors in remodeling chromatin and the regulation of gene expression. SP3 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP3.


Pssm-ID: 411774 [Multi-domain]  Cd Length: 574  Bit Score: 40.70  E-value: 9.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1290 SSSEGLQPVNLLNSSSTDPNVFIMNNSvlTGQFDQNLLQPGLVGQAILPASVSAGG----DLTVSLTDGSLATLEGIQLQ 1365
Cdd:cd22537   259 NSGESGKVSPDINETNTNADLFVPTSS--SSQLPVTIDSTGILQQNASSLTTVSGQvhtsDLQGNYIQAPVSDETQAQNI 336
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1366 LAANLVGPNVQISGIDAASinnitlQIDPSILQQTLQQGNLLAQQLTGE---PGLAPQNSSLQTSDstvPASVVIQpisg 1442
Cdd:cd22537   337 QVSTAQPSVQQIQLHESQQ------PTSQAQIVQGITQQAIQGVQALGAqaiPQQALQNLQLQLLN---PGTFLIQ---- 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 807066338 1443 lslQPTVTSanltigplSEQDSVLTTNSSGTQDLtqvmtsQGLVSPSGGPHEITLT-INNSSLSQVLAQAAG---PTATS 1518
Cdd:cd22537   404 ---AQTVTP--------SGQITWQTFQVQGVQNL------QNLQIQNAPAQQITLTpVQTLTLGQVGAGGAItstPVSLS 466
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 807066338 1519 SSGSPQEITLTISELNTTSGSL-PSTTPMSPSAISTQNLVMSSSGVGGDASVTLTLAD 1575
Cdd:cd22537   467 TGQLPNLQTVTVNSIDSAGIQLqQSENADSPADIQIKEEEPDSEEWQLSGDSTLNTND 524
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.