NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1610158347|ref|NP_001356401|]
View 

nuclear factor 1 B-type isoform 19 [Homo sapiens]

Protein Classification

nuclear factor I( domain architecture ID 12106891)

nuclear factor I (NFI) is a CCAAT-box-binding protein active in transcription and DNA replication

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CTF_NFI super family cl25839
CTF/NF-I family transcription modulation region;
205-411 2.45e-86

CTF/NF-I family transcription modulation region;


The actual alignment was detected with superfamily member pfam00859:

Pssm-ID: 459967 [Multi-domain]  Cd Length: 288  Bit Score: 264.47  E-value: 2.45e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 205 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 282
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 283 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 361
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1610158347 362 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQ 411
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQ 206
NfI_DNAbd_pre-N pfam10524
Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...
4-43 1.25e-17

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organizms. The N-terminal 200 residues contains the DNA-binding and dimerization domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerization-ability and the stimulation of adenovirus DNA replication.


:

Pssm-ID: 463134  Cd Length: 41  Bit Score: 76.11  E-value: 1.25e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 1610158347   4 SAMDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 43
Cdd:pfam10524   2 YQQEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41
MH1 super family cl45991
MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...
65-169 2.96e-17

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localization signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.


The actual alignment was detected with superfamily member pfam03165:

Pssm-ID: 460833  Cd Length: 103  Bit Score: 76.64  E-value: 2.96e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347  65 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 132
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1610158347 133 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 169
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103
 
Name Accession Description Interval E-value
CTF_NFI pfam00859
CTF/NF-I family transcription modulation region;
205-411 2.45e-86

CTF/NF-I family transcription modulation region;


Pssm-ID: 459967 [Multi-domain]  Cd Length: 288  Bit Score: 264.47  E-value: 2.45e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 205 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 282
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 283 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 361
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1610158347 362 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQ 411
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQ 206
NfI_DNAbd_pre-N pfam10524
Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...
4-43 1.25e-17

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organizms. The N-terminal 200 residues contains the DNA-binding and dimerization domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerization-ability and the stimulation of adenovirus DNA replication.


Pssm-ID: 463134  Cd Length: 41  Bit Score: 76.11  E-value: 1.25e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 1610158347   4 SAMDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 43
Cdd:pfam10524   2 YQQEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41
MH1 pfam03165
MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...
65-169 2.96e-17

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localization signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.


Pssm-ID: 460833  Cd Length: 103  Bit Score: 76.64  E-value: 2.96e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347  65 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 132
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1610158347 133 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 169
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103
DWA smart00523
Domain A in dwarfin family proteins;
64-172 1.30e-16

Domain A in dwarfin family proteins;


Pssm-ID: 214708  Cd Length: 109  Bit Score: 75.11  E-value: 1.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347   64 IKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHPC--CVLSNPDQKGKirridcLRQADKVWRLDLVMVILFKGIPLES 141
Cdd:smart00523   1 VEEKWAKKATESLLKKLKKKQLEELLQAVESKGGPPtrCVLIPRSLDGR------LQVAHRKGLPHVLYCRLFRWPDLQS 74
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1610158347  142 tdGERLMKSPHC------TNPALCVQPHHITVSVKEL 172
Cdd:smart00523  75 --PHELKALPTCehafesKSDEVCCNPYHYSRVERPE 109
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
261-412 5.28e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.37  E-value: 5.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 261 SPPSSKRPKTISIDEnmepSPTGDFYP-SPSSPAAGSR-TWHERDQDMSSPTTMKKPEKPlfssASPQDSSPRLSTFPQH 338
Cdd:PTZ00449  608 RPKSPKLPELLDIPK----SPKRPESPkSPKRPPPPQRpSSPERPEGPKIIKSPKPPKSP----KPPFDPKFKEKFYDDY 679
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1610158347 339 hhpgIPGVAHSVISTRTPPPPSPLPFPTQAILPPAPSSYFSHPtiRYPPHLNPQDTLKNYVPSYDPSSPQTSQS 412
Cdd:PTZ00449  680 ----LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP--RPLPPKLPRDEEFPFEPIGDPDAEQPDDI 747
 
Name Accession Description Interval E-value
CTF_NFI pfam00859
CTF/NF-I family transcription modulation region;
205-411 2.45e-86

CTF/NF-I family transcription modulation region;


Pssm-ID: 459967 [Multi-domain]  Cd Length: 288  Bit Score: 264.47  E-value: 2.45e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 205 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 282
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 283 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 361
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1610158347 362 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQ 411
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQ 206
NfI_DNAbd_pre-N pfam10524
Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...
4-43 1.25e-17

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organizms. The N-terminal 200 residues contains the DNA-binding and dimerization domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerization-ability and the stimulation of adenovirus DNA replication.


Pssm-ID: 463134  Cd Length: 41  Bit Score: 76.11  E-value: 1.25e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 1610158347   4 SAMDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 43
Cdd:pfam10524   2 YQQEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41
MH1 pfam03165
MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...
65-169 2.96e-17

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localization signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.


Pssm-ID: 460833  Cd Length: 103  Bit Score: 76.64  E-value: 2.96e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347  65 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 132
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1610158347 133 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 169
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103
DWA smart00523
Domain A in dwarfin family proteins;
64-172 1.30e-16

Domain A in dwarfin family proteins;


Pssm-ID: 214708  Cd Length: 109  Bit Score: 75.11  E-value: 1.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347   64 IKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHPC--CVLSNPDQKGKirridcLRQADKVWRLDLVMVILFKGIPLES 141
Cdd:smart00523   1 VEEKWAKKATESLLKKLKKKQLEELLQAVESKGGPPtrCVLIPRSLDGR------LQVAHRKGLPHVLYCRLFRWPDLQS 74
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1610158347  142 tdGERLMKSPHC------TNPALCVQPHHITVSVKEL 172
Cdd:smart00523  75 --PHELKALPTCehafesKSDEVCCNPYHYSRVERPE 109
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
261-412 5.28e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.37  E-value: 5.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1610158347 261 SPPSSKRPKTISIDEnmepSPTGDFYP-SPSSPAAGSR-TWHERDQDMSSPTTMKKPEKPlfssASPQDSSPRLSTFPQH 338
Cdd:PTZ00449  608 RPKSPKLPELLDIPK----SPKRPESPkSPKRPPPPQRpSSPERPEGPKIIKSPKPPKSP----KPPFDPKFKEKFYDDY 679
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1610158347 339 hhpgIPGVAHSVISTRTPPPPSPLPFPTQAILPPAPSSYFSHPtiRYPPHLNPQDTLKNYVPSYDPSSPQTSQS 412
Cdd:PTZ00449  680 ----LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP--RPLPPKLPRDEEFPFEPIGDPDAEQPDDI 747
Enamelin pfam15362
Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It ...
285-325 5.79e-04

Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It is necessary for the extension of enamel during the secretory stage of dental enamel formation. The proteins are expressed in teeth, particularly in odontoblasts, ameloblasts and cementoblasts.


Pssm-ID: 464672 [Multi-domain]  Cd Length: 907  Bit Score: 42.12  E-value: 5.79e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1610158347 285 FYPSPSSPAAGSRTWHERDQdmsSPTTMKKPEKPLFSSASP 325
Cdd:pfam15362 393 YDPRENSPYLRSNTWDERDD---SPNTMGQPENPLYPMNTP 430
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH