Conserved domains on [gi|1720354674|ref|XP_030108822|]
neuron navigator 1 isoform X11 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
10-260 1.10e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   10 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRR--GKAAVSSRNSSPKAYRGRGTPRAAGPAR 87
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRarRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   88 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 162
Cdd:PHA03247  2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  163 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 242
Cdd:PHA03247  2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                          250
                   ....*....|....*....
gi 1720354674  243 EPGPVP-QESWVLDGYTSP 260
Cdd:PHA03247  2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1093-1177 3.06e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1093 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1172
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 1720354674 1173 VIQGA 1177
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA super family cl41901
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1523-1676 3.11e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


The actual alignment was detected with superfamily member NF038214:

Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1523 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1595
Cdd:NF038214    41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1596 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1665
Cdd:NF038214   121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                          170
                   ....*....|.
gi 1720354674 1666 PYIIgTTNQPV 1676
Cdd:NF038214   185 STII-TSNLPF 194
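
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be parsed into typed fields. The names `HitSummary` and `parse_hit_summary` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool. The regular expression assumes the field order and spelling shown in this report.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order seen in this report; "[Multi-domain]" is optional.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


if __name__ == "__main__":
    line = ("Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  "
            "Bit Score: 50.71  E-value: 1.10e-05")
    print(parse_hit_summary(line))
```

Parsing the E-value as a float makes it straightforward to sort or filter hits numerically, for example keeping only those below a chosen threshold.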
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-260 1.10e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   10 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRR--GKAAVSSRNSSPKAYRGRGTPRAAGPAR 87
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRarRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   88 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 162
Cdd:PHA03247  2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  163 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 242
Cdd:PHA03247  2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                          250
                   ....*....|....*....
gi 1720354674  243 EPGPVP-QESWVLDGYTSP 260
Cdd:PHA03247  2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1093-1177 3.06e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1093 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1172
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 1720354674 1173 VIQGA 1177
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1523-1676 3.11e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1523 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1595
Cdd:NF038214    41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1596 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1665
Cdd:NF038214   121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                          170
                   ....*....|.
gi 1720354674 1666 PYIIgTTNQPV 1676
Cdd:NF038214   185 STII-TSNLPF 194
AAA smart00382
ATPases associated with a variety of cellular activities; AAA - ATPases associated with a ...
1564-1675 1.27e-04

ATPases associated with a variety of cellular activities; AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.


Pssm-ID: 214640 [Multi-domain]  Cd Length: 148  Bit Score: 44.29  E-value: 1.27e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  1564 KHRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTdgIVSTFNMHQQSCKDLQLYLSNLANQIDRETGIGDV-------- 1635
Cdd:smart00382    1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIY--IDGEDILEEVLDQLLLIIVGGKKASGSGELRLRLAlalarklk 78
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 1720354674  1636 PLVILLDDLSEAGS---------ISELVNGALTCKYHKCPyIIGTTNQP 1675
Cdd:smart00382   79 PDVLILDEITSLLDaeqeallllLEELRLLLLLKSEKNLT-VILTTNDE 126
McrB COG1401
5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB ...
1559-1589 4.18e-04

5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB [Defense mechanisms];


Pssm-ID: 441011 [Multi-domain]  Cd Length: 477  Bit Score: 45.15  E-value: 4.18e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1720354674 1559 ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL 1589
Cdd:COG1401    214 FLAALKTKKNViLAGPPGTGKTYLARRLAEAL 245
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1089-1183 4.25e-04

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of these proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 41.83  E-value: 4.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1089 EERMQS--EQIRKLRRELESSQEKVATLTSQLS---ANANLVAAFEQSLVNMTSRLRHLAE-TAEEKD---TELLDLRET 1159
Cdd:pfam20492   12 EERLKQyeEETKKAQEELEESEETAEELEEERRqaeEEAERLEQKRQEAEEEKERLEESAEmEAEEKEqleAELAEAQEE 91
                           90       100
                   ....*....|....*....|....*...
gi 1720354674 1160 IDFL----KKKNSEAQAVIQGALNASEA 1183
Cdd:pfam20492   92 IARLeeevERKEEEARRLQEELEEAREE 119
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1095-1193 5.03e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1095 EQIRKLRRELESSQEKVATLTSQLSANANL---------VAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKK 1165
Cdd:COG1579     59 KEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyealqkeIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEA 138
                           90       100
                   ....*....|....*....|....*...
gi 1720354674 1166 KNSEAQAVIQGALNASEATPKELRIKRQ 1193
Cdd:COG1579    139 ELEEKKAELDEELAELEAELEELEAERE 166
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
1568-1675 5.34e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperones, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 42.52  E-value: 5.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1568 LVLSGPSGTGKTYLTNRLAEYLVERSGRevtdgiVSTFNMHQqscKDLQLYLSNLANQIDRETGIGDV----PLVILLDd 1643
Cdd:cd00009     22 LLLYGPPGTGKTTLARAIANELFRPGAP------FLYLNASD---LLEGLVVAELFGHFLVRLLFELAekakPGVLFID- 91
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720354674 1644 lsEAGSISELVNGAL----------TCKYHKCPyIIGTTNQP 1675
Cdd:cd00009     92 --EIDSLSRGAQNALlrvletlndlRIDRENVR-VIGATNRP 130
AAA_28 pfam13521
AAA domain;
1567-1599 4.73e-03

AAA domain;


Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 4.73e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720354674 1567 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1599
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
35-220 4.85e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 41.60  E-value: 4.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   35 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 114
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  115 QAGSRRAGET----QSTQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 190
Cdd:pfam03546  321 SEESESEEETapaaAVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720354674  191 EASGTSSPVCSPVQSMRSSATPGVISFSSA 220
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
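
The alignment rows in the hit list above pair a query row ("gi ...") with a domain-model row ("Cdd:..."). As a rough sketch, the number of identical residues over the aligned columns can be counted as shown below. This assumes that dashes mark gaps and that lowercase letters mark residues not aligned to the domain model; that reading of the display is an assumption stated here, not something this report spells out. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, compared) over columns where both rows are aligned.

    Columns holding a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    compared = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        compared += 1
        if q == s:
            identical += 1
    return identical, compared


if __name__ == "__main__":
    # Rows copied from the McrB (COG1401) hit, query residues 1559-1589.
    query = "ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL"
    model = "FLAALKTKKNViLAGPPGTGKTYLARRLAEAL"
    identical, compared = aligned_identity(query, model)
    print(f"{identical}/{compared} identical aligned residues")
```

For multi-block alignments, the rows of each block would need to be concatenated in order before calling the function.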
 
Name Accession Description Interval E-value
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
10-363 3.79e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.01  E-value: 3.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   10 ASRPHYASSIPVPRASSQTRiHTPGASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAyrGRGTPRAAGPArel 89
Cdd:PHA03307    51 AAVTVVAGAAACDRFEPPTG-PPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSS--PDPPPPTPPPA--- 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   90 agsvESLPSSPwnsPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQIPGPPEGRMPPGC 169
Cdd:PHA03307   125 ----SPPPSPA---PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  170 QGkdqrniISKPPRCLEPDEGEASGTSSPVCSPVQSMRSSAtpGVISFSSAHPQSQ-----PITATVAPfqyRLQTDQEP 244
Cdd:PHA03307   198 PP------AAASPRPPRRSSPISASASSPAPAPGRSAADDA--GASSSDSSSSESSgcgwgPENECPLP---RPAPITLP 266
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  245 GPVPQES-WVLDGYTSPPSRTEDSFSCMDPESQRKRTvqnvldlrqnleetMSSLRGSQVTHSSLEMPCYDSDDANPRSV 323
Cdd:PHA03307   267 TRIWEASgWNGPSSRPGPASSSSSPRERSPSPSPSSP--------------GSGPAPSSPRASSSSSSSRESSSSSTSSS 332
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1720354674  324 SSLSNRSSPLSWRYGQSSPRLQAGDAPSVGGSCRSEGPPA 363
Cdd:PHA03307   333 SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPS 372
PHA03378 PHA03378
EBNA-3B; Provisional
5-236 3.86e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.44  E-value: 3.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674    5 ANVNSASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQADLALSPQRGASPRRGK--AAVSSRNSSPKAYRGRGTPRA 82
Cdd:PHA03378   679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARppAAAPGRARPPAAAPGRARPPA 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   83 AGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGsrrAGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQIPGPPE 162
Cdd:PHA03378   759 AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR 835
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  163 GRMPPGCQGKDQRniisKPPRCLEPDEGeaSGTSS----------PVCSPVQSMRSSATPGVISFSsahpqsqpiTATVA 232
Cdd:PHA03378   836 GRPSLKKPAALER----QAAAGPTPSPG--SGTSDkivqapvfypPVLQPIQVMRQLGSVRAAAAS---------TVTQA 900

                   ....
gi 1720354674  233 PFQY 236
Cdd:PHA03378   901 PTEY 904
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1090-1193 7.78e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 7.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1090 ERMQsEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETidflKKKNSE 1169
Cdd:COG4372     62 EQLE-EELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQ----RKQLEA 136
                           90       100
                   ....*....|....*....|....
gi 1720354674 1170 AQAVIQGALNASEATPKELRIKRQ 1193
Cdd:COG4372    137 QIAELQSEIAEREEELKELEEQLE 160
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1094-1189 2.48e-03

Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674 1094 SEQIRKLRRELESSQEKVATLTSQLSANANL--VAAFEQSLVNMTSRLRHLaetaEEKDTELLDLRETIDFLKKKNSEAQ 1171
Cdd:COG4717    101 EEELEELEAELEELREELEKLEKLLQLLPLYqeLEALEAELAELPERLEEL----EERLEELRELEEELEELEAELAELQ 176
                           90
                   ....*....|....*....
gi 1720354674 1172 AVIQGALN-ASEATPKELR 1189
Cdd:COG4717    177 EELEELLEqLSLATEEELQ 195
PHA03247 PHA03247
large tegument protein UL36; Provisional
13-269 2.55e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   13 PHYASSIPVPRASSQTRIHTPGASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAY-----------RGRGTPR 81
Cdd:PHA03247  2741 PPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpadppaavlaPAAALPP 2820
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   82 AAGPARELAGSVESLPSSPWNSPRVTPKT-----ALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQ 156
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL 2900
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  157 IPGPPEGRMPPGCQGKDQrniiskPPRCLEPDEGEASGTSSPVCSPVQSMRSSATPGVISFSSAHPqsQPITATVAPFQY 236
Cdd:PHA03247  2901 PPDQPERPPQPQAPPPPQ------PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP--QPWLGALVPGRV 2972
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1720354674  237 RLQTDQEPGPVPQeswVLDGYTSPPSRTEDSFS 269
Cdd:PHA03247  2973 AVPRFRVPQPAPS---REAPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-261 4.19e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 4.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674    7 VNSASRPHYASSIP---VPRA--------SSQTRIHTPGASPQLRPR-----------QQADLALSPQRGASP------- 57
Cdd:PHA03247  2667 ARRLGRAAQASSPPqrpRRRAarptvgslTSLADPPPPPPTPEPAPHalvsatplppgPAAARQASPALPAAPappavpa 2746
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   58 ---------RRGKAAVSSRNSSPKAYRGR-GTPRAAGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQST 127
Cdd:PHA03247  2747 gpatpggpaRPARPPTTAGPPAPAPPAAPaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG 2826
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  128 QRKKTQEGIPVRHTRGRSPPQSSCygetqipgPPEGRMPPGcqGKDQRNIISKPPrclepdegeASGTSSPVCSPVQSM- 206
Cdd:PHA03247  2827 PLPPPTSAQPTAPPPPPGPPPPSL--------PLGGSVAPG--GDVRRRPPSRSP---------AAKPAAPARPPVRRLa 2887
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720354674  207 RSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQEPGPVPQESWVLDGYTSPP 261
Cdd:PHA03247  2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
AAA_28 pfam13521
AAA domain;
1567-1599 4.73e-03

Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 4.73e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720354674 1567 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1599
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
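As a rough illustration of how an aligned pair such as the one above can be summarized, the following Python sketch counts identical residues over the aligned columns. It assumes the usual CD-Search display convention that dashes are gaps and lowercase letters are unaligned residues; the function name `count_identities` is hypothetical and not part of any NCBI tool.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) for two equal-length alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        # Only uppercase/uppercase columns count as aligned; '-' and
        # lowercase residues are skipped (assumed display convention).
        if q.isupper() and s.isupper():
            aligned += 1
            identical += q == s
    return identical, aligned


# Rows copied from the pfam13521 alignment above (query residues 1567-1599).
QUERY_ROW = "RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD"
SUBJECT_ROW = "RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE"

identical, aligned = count_identities(QUERY_ROW, SUBJECT_ROW)
print(f"{identical}/{aligned} identical ({identical / aligned:.0%})")
```

Percent identity computed this way is only a descriptive statistic; the Bit Score and E-value reported for each hit come from the position-specific scoring matrix, not from identity counts.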
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
35-220 4.85e-03

Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 41.60  E-value: 4.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   35 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 114
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  115 QAGSRRAGET----QSTQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 190
Cdd:pfam03546  321 SEESESEEETapaaAVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720354674  191 EASGTSSPVCSPVQSMRSSATPGVISFSSA 220
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
PHA03247 PHA03247
large tegument protein UL36; Provisional
32-264 5.30e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674   32 TPGASPQLRPrQQADLALSPQRGAsPRRGKAAVSSRNSSPKAYRGRGTPRAAG-PARELAGSVESLPSSPWNSPRVTPKT 110
Cdd:PHA03247  2552 PPPLPPAAPP-AAPDRSVPPPRPA-PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  111 ALSSQAGSRRAGETQSTQRKKTQEGIPV-----RHTRGRSPPQSscygeTQIPGPPEGRMPPGCQGK--DQRNIISKPPR 183
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPApgrvsRPRRARRLGRA-----AQASSPPQRPRRRAARPTvgSLTSLADPPPP 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720354674  184 CLEPDEGEASGTSS-PVCSPVQSMRSSATPGVISFSSAHPQSQPIT-ATVAPFQYRLQTDQEPGPVPQESWVldgyTSPP 261
Cdd:PHA03247  2705 PPTPEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAPAPPAAPA----AGPP 2780

                   ...
gi 1720354674  262 SRT 264
Cdd:PHA03247  2781 RRL 2783
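Each hit above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). A minimal Python sketch for pulling those fields out of such a line is shown here; the regular expression and the `HitSummary` name are illustrative assumptions based only on the lines in this report, not an official NCBI parser.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    pssm_id: int
    hit_type: str      # e.g. "Multi-domain"
    cd_length: int     # length of the domain model
    bit_score: float
    e_value: float


# Pattern inferred from the summary lines in this report (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*\[([^\]]+)\]\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Parse one summary line; return None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(int(m[1]), m[2], int(m[3]), float(m[4]), float(m[5]))


line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.30e-03"
print(parse_summary(line))
```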
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
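The E-value threshold listed above presumably acts as a cutoff on which hits are reported. As a hedged Python sketch, the accessions, query intervals, and E-values below are copied from the hits in this part of the report (the interval for cd00009 is read from its alignment rows); the `HITS` list and `filter_hits` helper are hypothetical names used only to show how the hits could be re-filtered at a stricter cutoff.

```python
# (accession, query_start, query_end, e_value), transcribed from the report.
HITS = [
    ("cd00009", 1568, 1675, 5.34e-04),
    ("COG4372", 1090, 1193, 7.78e-04),
    ("COG4717", 1094, 1189, 2.48e-03),
    ("PHA03247", 13, 269, 2.55e-03),
    ("PHA03247", 7, 261, 4.19e-03),
    ("pfam13521", 1567, 1599, 4.73e-03),
    ("pfam03546", 35, 220, 4.85e-03),
    ("PHA03247", 32, 264, 5.30e-03),
]


def filter_hits(hits, threshold):
    """Keep hits whose E-value is at or below the given threshold."""
    return [h for h in hits if h[3] <= threshold]


# All listed hits pass the report's 0.01 cutoff; fewer pass a stricter one.
for accession, start, end, e_value in filter_hits(HITS, 1e-3):
    print(f"{accession}\t{start}-{end}\t{e_value:.2e}")
```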
