NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039728316|ref|XP_017175298|]
View 

neuron navigator 1 isoform X5 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
10-260 1.81e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   10 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 87
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   88 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 162
Cdd:PHA03247  2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  163 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 242
Cdd:PHA03247  2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                          250
                   ....*....|....*....
gi 1039728316  243 EPGPVP-QESWVLDGYTSP 260
Cdd:PHA03247  2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1360-1444 3.54e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1360 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1439
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 1039728316 1440 VIQGA 1444
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA super family cl41901
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1790-1943 3.58e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


The actual alignment was detected with superfamily member NF038214:

Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1790 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1862
Cdd:NF038214    41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1863 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1932
Cdd:NF038214   121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                          170
                   ....*....|.
gi 1039728316 1933 PYIIgTTNQPV 1943
Cdd:NF038214   185 STII-TSNLPF 194
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-260 1.81e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   10 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 87
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   88 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 162
Cdd:PHA03247  2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  163 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 242
Cdd:PHA03247  2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                          250
                   ....*....|....*....
gi 1039728316  243 EPGPVP-QESWVLDGYTSP 260
Cdd:PHA03247  2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1360-1444 3.54e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1360 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1439
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 1039728316 1440 VIQGA 1444
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1790-1943 3.58e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1790 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1862
Cdd:NF038214    41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1863 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1932
Cdd:NF038214   121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                          170
                   ....*....|.
gi 1039728316 1933 PYIIgTTNQPV 1943
Cdd:NF038214   185 STII-TSNLPF 194
AAA smart00382
ATPases associated with a variety of cellular activities; AAA - ATPases associated with a ...
1831-1942 1.46e-04

ATPases associated with a variety of cellular activities; AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.


Pssm-ID: 214640 [Multi-domain]  Cd Length: 148  Bit Score: 44.29  E-value: 1.46e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  1831 KHRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTdgIVSTFNMHQQSCKDLQLYLSNLANQIDRETGIGDV-------- 1902
Cdd:smart00382    1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIY--IDGEDILEEVLDQLLLIIVGGKKASGSGELRLRLAlalarklk 78
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 1039728316  1903 PLVILLDDLSEAGS---------ISELVNGALTCKYHKCPyIIGTTNQP 1942
Cdd:smart00382   79 PDVLILDEITSLLDaeqeallllLEELRLLLLLKSEKNLT-VILTTNDE 126
McrB COG1401
5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB ...
1826-1856 4.82e-04

5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB [Defense mechanisms];


Pssm-ID: 441011 [Multi-domain]  Cd Length: 477  Bit Score: 45.15  E-value: 4.82e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1039728316 1826 ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL 1856
Cdd:COG1401    214 FLAALKTKKNViLAGPPGTGKTYLARRLAEAL 245
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1362-1460 5.79e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1362 EQIRKLRRELESSQEKVATLTSQLSANANL---------VAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKK 1432
Cdd:COG1579     59 KEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyealqkeIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEA 138
                           90       100
                   ....*....|....*....|....*...
gi 1039728316 1433 KNSEAQAVIQGALNASEATPKELRIKRQ 1460
Cdd:COG1579    139 ELEEKKAELDEELAELEAELEELEAERE 166
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
1835-1942 6.13e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 42.52  E-value: 6.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1835 LVLSGPSGTGKTYLTNRLAEYLVERSGRevtdgiVSTFNMHQqscKDLQLYLSNLANQIDRETGIGDV----PLVILLDd 1910
Cdd:cd00009     22 LLLYGPPGTGKTTLARAIANELFRPGAP------FLYLNASD---LLEGLVVAELFGHFLVRLLFELAekakPGVLFID- 91
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1039728316 1911 lsEAGSISELVNGAL----------TCKYHKCPyIIGTTNQP 1942
Cdd:cd00009     92 --EIDSLSRGAQNALlrvletlndlRIDRENVR-VIGATNRP 130
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1356-1450 9.67e-04

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 41.06  E-value: 9.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1356 EERMQS--EQIRKLRRELESSQEKVATLTSQLS---ANANLVAAFEQSLVNMTSRLRHLAE-TAEEKD---TELLDLRET 1426
Cdd:pfam20492   12 EERLKQyeEETKKAQEELEESEETAEELEEERRqaeEEAERLEQKRQEAEEEKERLEESAEmEAEEKEqleAELAEAQEE 91
                           90       100
                   ....*....|....*....|....*...
gi 1039728316 1427 IDFL----KKKNSEAQAVIQGALNASEA 1450
Cdd:pfam20492   92 IARLeeevERKEEEARRLQEELEEAREE 119
AAA_28 pfam13521
AAA domain;
1834-1866 5.43e-03

AAA domain;


Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 5.43e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1039728316 1834 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1866
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
35-220 9.99e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   35 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 114
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  115 QAGSRRAGETQ----STQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 190
Cdd:pfam03546  321 SEESESEEETApaaaVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039728316  191 EASGTSSPVCSPVQSMRSSATPGVISFSSA 220
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-260 1.81e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   10 ASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQAdlalSPQRGASPRRG--KAAVSSRNSSPKAYRGRGTPRAAGPAR 87
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP----APGRVSRPRRArrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   88 ELA-----GSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSScygetqipGPPE 162
Cdd:PHA03247  2697 SLAdppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA--------GPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  163 GRMPpgcqgkdqRNIISKPPRCLEPDEGeASGTSSPVCSPVQSMRSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQ 242
Cdd:PHA03247  2769 PAPP--------AAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                          250
                   ....*....|....*....
gi 1039728316  243 EPGPVP-QESWVLDGYTSP 260
Cdd:PHA03247  2840 PPPPGPpPPSLPLGGSVAP 2858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1360-1444 3.54e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.28  E-value: 3.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1360 QSEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKKKNSEAQA 1439
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

                   ....*
gi 1039728316 1440 VIQGA 1444
Cdd:TIGR02168  930 RLEGL 934
IS21_help_AAA NF038214
IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was ...
1790-1943 3.58e-05

IS21-like element helper ATPase IstB; This protein family model resembles PF01695, but was built to hit full-length AAA+ ATPases of IS21 family IS (insertion sequence) elements.


Pssm-ID: 439516  Cd Length: 232  Bit Score: 47.47  E-value: 3.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1790 RRGVNNISVALK--GLK-EKCVDSLVFETL--IPKPMMQHYISL--LLKHRRLVLSGPSGTGKTYLTNRLAEYLVERSGR 1862
Cdd:NF038214    41 ERENRRIERRLKraRFPaAKTLEDFDFTAApgLDKAQIRELATLdfIERAENVLLLGPPGTGKTHLAIALGYAACRQGYR 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1863 evtdgiVSTFNMHqqsckDL--QLYLSNLANQIDRE-TGIGDVPLVIlLDDL-------SEAGSISELVNgaltCKYHKC 1932
Cdd:NF038214   121 ------VRFTTAA-----DLveQLAQARADGRLGRLlRRLARYDLLI-IDELgylpfsrEGANLLFELIA----DRYERG 184
                          170
                   ....*....|.
gi 1039728316 1933 PYIIgTTNQPV 1943
Cdd:NF038214   185 STII-TSNLPF 194
AAA smart00382
ATPases associated with a variety of cellular activities; AAA - ATPases associated with a ...
1831-1942 1.46e-04

ATPases associated with a variety of cellular activities; AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.


Pssm-ID: 214640 [Multi-domain]  Cd Length: 148  Bit Score: 44.29  E-value: 1.46e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  1831 KHRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTdgIVSTFNMHQQSCKDLQLYLSNLANQIDRETGIGDV-------- 1902
Cdd:smart00382    1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVIY--IDGEDILEEVLDQLLLIIVGGKKASGSGELRLRLAlalarklk 78
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 1039728316  1903 PLVILLDDLSEAGS---------ISELVNGALTCKYHKCPyIIGTTNQP 1942
Cdd:smart00382   79 PDVLILDEITSLLDaeqeallllLEELRLLLLLKSEKNLT-VILTTNDE 126
PHA03378 PHA03378
EBNA-3B; Provisional
5-236 4.20e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 4.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316    5 ANVNSASRPHYASSIPVPRASSQTRIHTPGASPQLRPRQQADLALSPQRGASPRRGK--AAVSSRNSSPKAYRGRGTPRA 82
Cdd:PHA03378   679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARppAAAPGRARPPAAAPGRARPPA 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   83 AGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGSrraGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQIPGPPE 162
Cdd:PHA03378   759 AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAP---TPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR 835
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  163 GRMPPGCQGKDQRniisKPPRCLEPDEGeaSGTSS----------PVCSPVQSMRSSATPGVISFSsahpqsqpiTATVA 232
Cdd:PHA03378   836 GRPSLKKPAALER----QAAAGPTPSPG--SGTSDkivqapvfypPVLQPIQVMRQLGSVRAAAAS---------TVTQA 900

                   ....
gi 1039728316  233 PFQY 236
Cdd:PHA03378   901 PTEY 904
McrB COG1401
5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB ...
1826-1856 4.82e-04

5-methylcytosine-specific restriction endonuclease McrBC, GTP-binding regulatory subunit McrB [Defense mechanisms];


Pssm-ID: 441011 [Multi-domain]  Cd Length: 477  Bit Score: 45.15  E-value: 4.82e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1039728316 1826 ISLLLKHRRLV-LSGPSGTGKTYLTNRLAEYL 1856
Cdd:COG1401    214 FLAALKTKKNViLAGPPGTGKTYLARRLAEAL 245
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1362-1460 5.79e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1362 EQIRKLRRELESSQEKVATLTSQLSANANL---------VAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETIDFLKK 1432
Cdd:COG1579     59 KEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyealqkeIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEA 138
                           90       100
                   ....*....|....*....|....*...
gi 1039728316 1433 KNSEAQAVIQGALNASEATPKELRIKRQ 1460
Cdd:COG1579    139 ELEEKKAELDEELAELEAELEELEAERE 166
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
1835-1942 6.13e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 42.52  E-value: 6.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1835 LVLSGPSGTGKTYLTNRLAEYLVERSGRevtdgiVSTFNMHQqscKDLQLYLSNLANQIDRETGIGDV----PLVILLDd 1910
Cdd:cd00009     22 LLLYGPPGTGKTTLARAIANELFRPGAP------FLYLNASD---LLEGLVVAELFGHFLVRLLFELAekakPGVLFID- 91
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1039728316 1911 lsEAGSISELVNGAL----------TCKYHKCPyIIGTTNQP 1942
Cdd:cd00009     92 --EIDSLSRGAQNALlrvletlndlRIDRENVR-VIGATNRP 130
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1357-1460 8.98e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 8.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1357 ERMQsEQIRKLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRETidflKKKNSE 1436
Cdd:COG4372     62 EQLE-EELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQ----RKQLEA 136
                           90       100
                   ....*....|....*....|....
gi 1039728316 1437 AQAVIQGALNASEATPKELRIKRQ 1460
Cdd:COG4372    137 QIAELQSEIAEREEELKELEEQLE 160
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1356-1450 9.67e-04

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 41.06  E-value: 9.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1356 EERMQS--EQIRKLRRELESSQEKVATLTSQLS---ANANLVAAFEQSLVNMTSRLRHLAE-TAEEKD---TELLDLRET 1426
Cdd:pfam20492   12 EERLKQyeEETKKAQEELEESEETAEELEEERRqaeEEAERLEQKRQEAEEEKERLEESAEmEAEEKEqleAELAEAQEE 91
                           90       100
                   ....*....|....*....|....*...
gi 1039728316 1427 IDFL----KKKNSEAQAVIQGALNASEA 1450
Cdd:pfam20492   92 IARLeeevERKEEEARRLQEELEEAREE 119
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1361-1456 2.86e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316 1361 SEQIRKLRRELESSQEKVATLTSQLSANANL--VAAFEQSLVNMTSRLRHLaetaEEKDTELLDLRETIDFLKKKNSEAQ 1438
Cdd:COG4717    101 EEELEELEAELEELREELEKLEKLLQLLPLYqeLEALEAELAELPERLEEL----EERLEELRELEEELEELEAELAELQ 176
                           90
                   ....*....|....*....
gi 1039728316 1439 AVIQGALN-ASEATPKELR 1456
Cdd:COG4717    177 EELEELLEqLSLATEEELQ 195
PHA03247 PHA03247
large tegument protein UL36; Provisional
13-269 3.49e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   13 PHYASSIPVPRASSQTRIHTPGASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAY-----------RGRGTPR 81
Cdd:PHA03247  2741 PPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpadppaavlaPAAALPP 2820
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   82 AAGPARELAGSVESLPSSPWNSPRVTPKT-----ALSSQAGSRRAGETQSTQRKKTQEGIPVRHTRGRSPPQSSCYGETQ 156
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL 2900
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  157 IPGPPEGRMPPGCQGKDQrniiskPPRCLEPDEGEASGTSSPVCSPVQSMRSSATPGVISFSSAHPqsQPITATVAPFQY 236
Cdd:PHA03247  2901 PPDQPERPPQPQAPPPPQ------PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP--QPWLGALVPGRV 2972
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1039728316  237 RLQTDQEPGPVPQeswVLDGYTSPPSRTEDSFS 269
Cdd:PHA03247  2973 AVPRFRVPQPAPS---REAPASSTPPLTGHSLS 3002
AAA_28 pfam13521
AAA domain;
1834-1866 5.43e-03

AAA domain;


Pssm-ID: 433278 [Multi-domain]  Cd Length: 164  Bit Score: 39.94  E-value: 5.43e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1039728316 1834 RLVLSGPSGTGKTYLTNRLAEYL----VERSGREVTD 1866
Cdd:pfam13521    1 RIVITGGPSTGKTTLAEALAARFgypvVPEAAREILE 37
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-261 6.17e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 6.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316    7 VNSASRPHYASSIP---VPRA--------SSQTRIHTPGASPQLRPR-----------QQADLALSPQRGASP------- 57
Cdd:PHA03247  2667 ARRLGRAAQASSPPqrpRRRAarptvgslTSLADPPPPPPTPEPAPHalvsatplppgPAAARQASPALPAAPappavpa 2746
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   58 ---------RRGKAAVSSRNSSPKAYRGR-GTPRAAGPARELAGSVESLPSSPWNSPRVTPKTALSSQAGSRRAGETQST 127
Cdd:PHA03247  2747 gpatpggpaRPARPPTTAGPPAPAPPAAPaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG 2826
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  128 QRKKTQEGIPVRHTRGRSPPQSSCygetqipgPPEGRMPPGcqGKDQRNIISKPPrclepdegeASGTSSPVCSPVQSM- 206
Cdd:PHA03247  2827 PLPPPTSAQPTAPPPPPGPPPPSL--------PLGGSVAPG--GDVRRRPPSRSP---------AAKPAAPARPPVRRLa 2887
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039728316  207 RSSATPGVISFSSAHPQSQPITATVAPFQYRLQTDQEPGPVPQESWVLDGYTSPP 261
Cdd:PHA03247  2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
PHA03247 PHA03247
large tegument protein UL36; Provisional
32-248 7.93e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 7.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   32 TPGASPQLRPrQQADLALSPQRGAsPRRGKAAVSSRNSSPKAYRGRGTPRAAG-PARELAGSVESLPSSPWNSPRVTPKT 110
Cdd:PHA03247  2552 PPPLPPAAPP-AAPDRSVPPPRPA-PRPSEPAVTSRARRPDAPPQSARPRAPVdDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  111 ALSSQAGSRRAGETQSTQRKKTQEGIPV-----RHTRGRSPPQSscygeTQIPGPPEGRMPPGCQGK--DQRNIISKPPR 183
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPApgrvsRPRRARRLGRA-----AQASSPPQRPRRRAARPTvgSLTSLADPPPP 2704
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039728316  184 CLEPDEGEASGTSS-PVCSPVQSMRSSATPGVISFSSAHPQSQPIT-ATVAPFQYRLQTDQEPGPVP 248
Cdd:PHA03247  2705 PPTPEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAPAP 2771
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
35-220 9.99e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 9.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316   35 ASPQLRPRQQADLALSPQRGASPRRGKAAVSSRNSSPKAYRGRGTPRAAGPARELAGSveSLPSSPWNSPRvtPKTALSS 114
Cdd:pfam03546  245 APAAATPAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVGTPAPWKAGTVTSPACA--SSPAVARGAQR--PEEDSSS 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039728316  115 QAGSRRAGETQ----STQRKKTQEGIPVrhtRGRSPPQSSCYGETQIPGPPEGRMPPGCQGKDQRNIISKPPRcLEPDEG 190
Cdd:pfam03546  321 SEESESEEETApaaaVGQAKSVGKGLQG---KAASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDSESSE-EESDSE 396
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039728316  191 EASGTSSPVCSPVQSMRSSATPGVISFSSA 220
Cdd:pfam03546  397 EAAATPAQVKASGKTPQAKANPAPTKASSA 426
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH