NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568908631|ref|XP_006529438|]
View 

neuron navigator 1 isoform X6 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
27-277 1.64e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   27 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 104
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  105 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 179
Cdd:PHA03247 2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  180 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 259
Cdd:PHA03247 2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                         250
                  ....*....|....*....
gi 568908631  260 EPGPVP-QESWVLDGYTSP 277
Cdd:PHA03247 2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1320-1404 3.47e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  1320 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1399
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 568908631  1400 VIQGA 1404
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA super family cl41901
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1750-1903 3.51e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


The actual alignment was detected with superfamily member NF038214:

Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1750 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1822
Cdd:NF038214   41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1823 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1892
Cdd:NF038214  121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                         170
                  ....*....|.
gi 568908631 1893 PYIIgTTNQPV 1903
Cdd:NF038214  185 STII-TSNLPF 194
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
27-277 1.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   27 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 104
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  105 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 179
Cdd:PHA03247 2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  180 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 259
Cdd:PHA03247 2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                         250
                  ....*....|....*....
gi 568908631  260 EPGPVP-QESWVLDGYTSP 277
Cdd:PHA03247 2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1320-1404 3.47e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  1320 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1399
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 568908631  1400 VIQGA 1404
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1750-1903 3.51e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1750 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1822
Cdd:NF038214   41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1823 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1892
Cdd:NF038214  121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                         170
                  ....*....|.
gi 568908631 1893 PYIIgTTNQPV 1903
Cdd:NF038214  185 STII-TSNLPF 194
AAA smart00382
ATPases associated with a variety of cellular activities; AAA - ATPases associated with a ...
1791-1902 1.43e-04

ATPases associated with a variety of cellular activities; AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.


Pssm-ID: 214640 [Multi-domain]  Cd Length: 148  Bit Score: 44.29  E-value: 1.43e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   1791 KHRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTdgIVSTFNMHQQSCKDLQLYLSNLANQIDRETGIGDV-------- 1862
Cdd:smart00382    1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIY--IDGEDILEEVLDQLLLIIVGGKKASGSGELRLRLAlalarklk 78
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 568908631   1863 PLVILLDDLSEAGS---------ISELVNGALTCKYHKCPyIIGTTNQP 1902
Cdd:smart00382   79 PDVLILDEITSLLDaeqeallllLEELRLLLLLKSEKNLT-VILTTNDE 126
McrB COG1401
5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB ...
1786-1816 4.73e-04

5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB [Defense mechanisms];


Pssm-ID: 441011 [Multi-domain]  Cd Length: 477  Bit Score: 45.15  E-value: 4.73e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 568908631 1786 ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL 1816
Cdd:COG1401   214 FLAALKTKKNViLAGPPGTGKTYLARRLAEAL 245
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1322-1420 5.68e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1322 EQIRKLRRELESSQEKVATLTSQLSANANL---------VAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKK 1392
Cdd:COG1579    59 KEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyealqkeIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEA 138
                          90       100
                  ....*....|....*....|....*...
gi 568908631 1393 KNSEAQAVIQGALNASEATPKELRIKRQ 1420
Cdd:COG1579   139 ELEEKKAELDEELAELEAELEELEAERE 166
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
1795-1902 6.01e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 42.52  E-value: 6.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1795 LVLSGPSGTGKTYLTNRLAEYLVERSGRevtdgiVSTFNMHQqscKDLQLYLSNLANQIDRETGIGDV----PLVILLDd 1870
Cdd:cd00009    22 LLLYGPPGTGKTTLARAIANELFRPGAP------FLYLNASD---LLEGLVVAELFGHFLVRLLFELAekakPGVLFID- 91
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 568908631 1871 lsEAGSISELVNGAL----------TCKYHKCPyIIGTTNQP 1902
Cdd:cd00009    92 --EIDSLSRGAQNALlrvletlndlRIDRENVR-VIGATNRP 130
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1316-1410 1.14e-03

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 40.67  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  1316 EERMQS--EQIRKLRRELESSQEKVATLTSQLS---ANANLVAAFEQSLVNMTSRLRHLAE-TAEEKD---TELLDLRET 1386
Cdd:pfam20492   12 EERLKQyeEETKKAQEELEESEETAEELEEERRqaeEEAERLEQKRQEAEEEKERLEESAEmEAEEKEqleAELAEAQEE 91
                           90       100
                   ....*....|....*....|....*...
gi 568908631  1387 IDFL----KKKNSEAQAVIQGALNASEA 1410
Cdd:pfam20492   92 IARLeeevERKEEEARRLQEELEEAREE 119
AAA_28 pfam13521
AAA domain;
1794-1826 5.32e-03

AAA domain;


Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 5.32e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 568908631  1794 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1826
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
52-237 9.39e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631    52 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 131
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   132 QAGSRRAGETQ----STQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 207
Cdd:pfam03546  321 SEESESEEETApaaaVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 568908631   208 EASGTSSPVCSPVQSMRSSATPGVISFSSA 237
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
27-277 1.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   27 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 104
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  105 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 179
Cdd:PHA03247 2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  180 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 259
Cdd:PHA03247 2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                         250
                  ....*....|....*....
gi 568908631  260 EPGPVP-QESWVLDGYTSP 277
Cdd:PHA03247 2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1320-1404 3.47e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  1320 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1399
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 568908631  1400 VIQGA 1404
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1750-1903 3.51e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1750 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1822
Cdd:NF038214   41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1823 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1892
Cdd:NF038214  121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                         170
                  ....*....|.
gi 568908631 1893 PYIIgTTNQPV 1903
Cdd:NF038214  185 STII-TSNLPF 194
AAA smart00382
ATPases associated with a variety of cellular activities; AAA - ATPases associated with a ...
1791-1902 1.43e-04

ATPases associated with a variety of cellular activities; AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.


Pssm-ID: 214640 [Multi-domain]  Cd Length: 148  Bit Score: 44.29  E-value: 1.43e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   1791 KHRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTdgIVSTFNMHQQSCKDLQLYLSNLANQIDRETGIGDV-------- 1862
Cdd:smart00382    1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIY--IDGEDILEEVLDQLLLIIVGGKKASGSGELRLRLAlalarklk 78
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 568908631   1863 PLVILLDDLSEAGS---------ISELVNGALTCKYHKCPyIIGTTNQP 1902
Cdd:smart00382   79 PDVLILDEITSLLDaeqeallllLEELRLLLLLKSEKNLT-VILTTNDE 126
PHA03247 PHA03247
large tegument protein UL36; Provisional
8-286 3.92e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 3.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631    8 PSAALPLLLTMSGMANVNSASRPHYASSIP--VPRASSQTRIHTPGASPQL-----RPRQQADLALSPQRGASPRRGKAA 80
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPAPpaVPAGPATPGGPARPARPPTtagppAPAPPAAPAAGPPRRLTRPAVASL 2791
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   81 VSSRNSSPKAY-----------RGRGTPRAAGPARELAGSVESLPSSPWNSPRVTPKT-----ALSSQAGSRRAGETQST 144
Cdd:PHA03247 2792 SESRESLPSPWdpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggSVAPGGDVRRRPPSRSP 2871
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  145 QRKKTQEGIPVRHTRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQrniiskPPRCLEPDEGEASGTSSPVCSPVQSMR 224
Cdd:PHA03247 2872 AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQ------PQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568908631  225 SSATPGVISFSSAHPqsQPITATVAPFQYRLQTDQEPGPVPQeswVLDGYTSPPSRTEDSFS 286
Cdd:PHA03247 2946 TTDPAGAGEPSGAVP--QPWLGALVPGRVAVPRFRVPQPAPS---REAPASSTPPLTGHSLS 3002
PHA03378 PHA03378
EBNA-3B; Provisional
22-253 4.12e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 4.12e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   22 ANVNSASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQADLALSPQRGASPRRGK--AAVSSRNSSPKAYRGRGTPRA 99
Cdd:PHA03378  679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARppAAAPGRARPPAAAPGRARPPA 758
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  100 AGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGSrraGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQIPGPPE 179
Cdd:PHA03378  759 AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAP---TPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR 835
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  180 GRMPPGCQGKDQRniisKPPRCLEPDEGeaSGTSS----------PVCSPVQSMRSSATPGVISFSsahpqsqpiTATVA 249
Cdd:PHA03378  836 GRPSLKKPAALER----QAAAGPTPSPG--SGTSDkivqapvfypPVLQPIQVMRQLGSVRAAAAS---------TVTQA 900

                  ....
gi 568908631  250 PFQY 253
Cdd:PHA03378  901 PTEY 904
McrB COG1401
5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB ...
1786-1816 4.73e-04

5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB [Defense mechanisms];


Pssm-ID: 441011 [Multi-domain]  Cd Length: 477  Bit Score: 45.15  E-value: 4.73e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 568908631 1786 ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL 1816
Cdd:COG1401   214 FLAALKTKKNViLAGPPGTGKTYLARRLAEAL 245
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1322-1420 5.68e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1322 EQIRKLRRELESSQEKVATLTSQLSANANL---------VAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKK 1392
Cdd:COG1579    59 KEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyealqkeIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEA 138
                          90       100
                  ....*....|....*....|....*...
gi 568908631 1393 KNSEAQAVIQGALNASEATPKELRIKRQ 1420
Cdd:COG1579   139 ELEEKKAELDEELAELEAELEELEAERE 166
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
1795-1902 6.01e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 42.52  E-value: 6.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1795 LVLSGPSGTGKTYLTNRLAEYLVERSGRevtdgiVSTFNMHQqscKDLQLYLSNLANQIDRETGIGDV----PLVILLDd 1870
Cdd:cd00009    22 LLLYGPPGTGKTTLARAIANELFRPGAP------FLYLNASD---LLEGLVVAELFGHFLVRLLFELAekakPGVLFID- 91
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 568908631 1871 lsEAGSISELVNGAL----------TCKYHKCPyIIGTTNQP 1902
Cdd:cd00009    92 --EIDSLSRGAQNALlrvletlndlRIDRENVR-VIGATNRP 130
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1317-1420 8.80e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 8.80e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1317 ERMQsEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETidflKKKNSE 1396
Cdd:COG4372    62 EQLE-EELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQ----RKQLEA 136
                          90       100
                  ....*....|....*....|....
gi 568908631 1397 AQAVIQGALNASEATPKELRIKRQ 1420
Cdd:COG4372   137 QIAELQSEIAEREEELKELEEQLE 160
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1316-1410 1.14e-03

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 40.67  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  1316 EERMQS--EQIRKLRRELESSQEKVATLTSQLS---ANANLVAAFEQSLVNMTSRLRHLAE-TAEEKD---TELLDLRET 1386
Cdd:pfam20492   12 EERLKQyeEETKKAQEELEESEETAEELEEERRqaeEEAERLEQKRQEAEEEKERLEESAEmEAEEKEqleAELAEAQEE 91
                           90       100
                   ....*....|....*....|....*...
gi 568908631  1387 IDFL----KKKNSEAQAVIQGALNASEA 1410
Cdd:pfam20492   92 IARLeeevERKEEEARRLQEELEEAREE 119
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1321-1416 2.80e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631 1321 SEQIRKLRRELESSQEKVATLTSQLSANANL--VAAFEQSLVNMTSRLRHLaetaEEKDTELLDLRETIDFLKKKNSEAQ 1398
Cdd:COG4717   101 EEELEELEAELEELREELEKLEKLLQLLPLYqeLEALEAELAELPERLEEL----EERLEELRELEEELEELEAELAELQ 176
                          90
                  ....*....|....*....
gi 568908631 1399 AVIQGALN-ASEATPKELR 1416
Cdd:COG4717   177 EELEELLEqLSLATEEELQ 195
AAA_28 pfam13521
AAA domain;
1794-1826 5.32e-03

AAA domain;


Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 5.32e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 568908631  1794 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1826
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
PHA03247 PHA03247
large tegument protein UL36; Provisional
24-278 5.94e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   24 VNSASRPHYASSIP---VPRA--------SSQTRIHTPGASPQLRPR-----------QQADLALSPQRGASP------- 74
Cdd:PHA03247 2667 ARRLGRAAQASSPPqrpRRRAarptvgslTSLADPPPPPPTPEPAPHalvsatplppgPAAARQASPALPAAPappavpa 2746
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   75 ---------RRGKAAVSSRNSSPKAYRGR-GTPRAAGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQST 144
Cdd:PHA03247 2747 gpatpggpaRPARPPTTAGPPAPAPPAAPaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG 2826
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  145 QRKKTQEGIPVRHTRGRSPPQSSCygetqipgPPEGRMPPGcqGKDQRNIISKPPrclepdegeASGTSSPVCSPVQSM- 223
Cdd:PHA03247 2827 PLPPPTSAQPTAPPPPPGPPPPSL--------PLGGSVAPG--GDVRRRPPSRSP---------AAKPAAPARPPVRRLa 2887
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568908631  224 RSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQEPGPVPQESWVLDGYTSPP 278
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
PHA03247 PHA03247
large tegument protein UL36; Provisional
49-265 7.33e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   49 TPGASPQLRPrQQADLALSPQRGAsPRRGKAAVSSRNSSPKAYRGRGTPRAAG-PARELAGSVESLPSSPWNSPRVTPKT 127
Cdd:PHA03247 2552 PPPLPPAAPP-AAPDRSVPPPRPA-PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  128 ALSSQAGSRRAGETQSTQRKKTQEGIPV-----RHTRGRSPPQSscygeTQIPGPPEGRMPPGCQGK--DQRNIISKPPR 200
Cdd:PHA03247 2630 SPSPAANEPDPHPPPTVPPPERPRDDPApgrvsRPRRARRLGRA-----AQASSPPQRPRRRAARPTvgSLTSLADPPPP 2704
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568908631  201 CLEPDEGEASGTSS-PVCSPVQSMRSSATPGVISFSSAHPQSQPIT-ATVAPFQYRLQTDQEPGPVP 265
Cdd:PHA03247 2705 PPTPEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAPAP 2771
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
5-277 9.04e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 9.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631    5 TQDPSAALPLLLTMSGMANVNSASRPHYASSIPVPRASSQTRIHTPGASPQLRPR----QQADLALSPQRGASPRRGKAA 80
Cdd:PHA03307  155 AGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRsspiSASASSPAPAPGRSAADDAGA 234
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   81 VSSRNSSPKAYRGRGTPRAAGPaRELAGSVESLPS----SPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVR 156
Cdd:PHA03307  235 SSSDSSSSESSGCGWGPENECP-LPRPAPITLPTRiweaSGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPR 313
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631  157 HTRGRSPPQSSCYGETQIPGPPEGRMPPGcqgkdqrniiSKPPRCLEPDEGEASGtSSPVCSPVQSMRSSATPGVISFSS 236
Cdd:PHA03307  314 ASSSSSSSRESSSSSTSSSSESSRGAAVS----------PGPSPSRSPSPSRPPP-PADPSSPRKRPRPSRAPSSPAASA 382
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 568908631  237 AHPQSQPITATVAPFQYRLQ-TDQEPGPVPQESWVLDGYTSP 277
Cdd:PHA03307  383 GRPTRRRARAAVAGRARRRDaTGRFPAGRPRPSPLDAGAASG 424
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
52-237 9.39e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631    52 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 131
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568908631   132 QAGSRRAGETQ----STQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 207
Cdd:pfam03546  321 SEESESEEETApaaaVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 568908631   208 EASGTSSPVCSPVQSMRSSATPGVISFSSA 237
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH