NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|335929974|gb|AEH60515|]
View 

conserved hypothetical protein [Methanosalsum zhilinae DSM 4017]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
sarcinarray TIGR04209
sarcinarray family protein; Members of this protein family are exclusive to archaea, probably ...
16-159 1.57e-81

sarcinarray family protein; Members of this protein family are exclusive to archaea, probably all of which have S-layer surface protein arrays. All member proteins have an N-terminal signal sequence. The majority of known members belong to codirectional tandem arrays in the genus Methanosarcina (nine in M. barkeri str. Fusaro). Nearly all members have an additional 50 residues, (trimmed from the seed alignment for this model), consisting of low-complexity sequence rich in E,N,Q,T,S, and P, followed by a variant (PAF) form of the PGF-CTERM putative archaeal surface glycoprotein sorting signal. The coined name, sarcinarray family protein, evokes the predicted archaeal surface layer localization, the taxonomic bias of known members, and the tandem organization of most members.


:

Pssm-ID: 275055  Cd Length: 144  Bit Score: 238.82  E-value: 1.57e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 335929974   16 LIFLCTPISSSSNKPGDIQVYYNGVHLPGDNVAEPTLKIGEPFNVRINFTVYGDYKIYVKLDDYGDNYFVVQDGPSPVGI 95
Cdd:TIGR04209   1 FLLAFVSLVSASSDYGSIDVYYNGKLLPGAEVAKPVLKIGEPFTVKINMTVYQKSTVSVKLSELGGGSFEIISGPSPMNI 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 335929974   96 YSDVILRSDECHAFEWTVYPTDKWAGGFIPLDFHYSMVERGNHVPVAHGAFTVAYPYISHEHYD 159
Cdd:TIGR04209  81 YSSRILEKNETHTFEWTVAPTDNWAGGSLPVDFHYQINDFGTPEPLVNGSFTVAYPYISNEHYE 144
 
Name Accession Description Interval E-value
sarcinarray TIGR04209
sarcinarray family protein; Members of this protein family are exclusive to archaea, probably ...
16-159 1.57e-81

sarcinarray family protein; Members of this protein family are exclusive to archaea, probably all of which have S-layer surface protein arrays. All member proteins have an N-terminal signal sequence. The majority of known members belong to codirectional tandem arrays in the genus Methanosarcina (nine in M. barkeri str. Fusaro). Nearly all members have an additional 50 residues, (trimmed from the seed alignment for this model), consisting of low-complexity sequence rich in E,N,Q,T,S, and P, followed by a variant (PAF) form of the PGF-CTERM putative archaeal surface glycoprotein sorting signal. The coined name, sarcinarray family protein, evokes the predicted archaeal surface layer localization, the taxonomic bias of known members, and the tandem organization of most members.


Pssm-ID: 275055  Cd Length: 144  Bit Score: 238.82  E-value: 1.57e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 335929974   16 LIFLCTPISSSSNKPGDIQVYYNGVHLPGDNVAEPTLKIGEPFNVRINFTVYGDYKIYVKLDDYGDNYFVVQDGPSPVGI 95
Cdd:TIGR04209   1 FLLAFVSLVSASSDYGSIDVYYNGKLLPGAEVAKPVLKIGEPFTVKINMTVYQKSTVSVKLSELGGGSFEIISGPSPMNI 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 335929974   96 YSDVILRSDECHAFEWTVYPTDKWAGGFIPLDFHYSMVERGNHVPVAHGAFTVAYPYISHEHYD 159
Cdd:TIGR04209  81 YSSRILEKNETHTFEWTVAPTDNWAGGSLPVDFHYQINDFGTPEPLVNGSFTVAYPYISNEHYE 144
 
Name Accession Description Interval E-value
sarcinarray TIGR04209
sarcinarray family protein; Members of this protein family are exclusive to archaea, probably ...
16-159 1.57e-81

sarcinarray family protein; Members of this protein family are exclusive to archaea, probably all of which have S-layer surface protein arrays. All member proteins have an N-terminal signal sequence. The majority of known members belong to codirectional tandem arrays in the genus Methanosarcina (nine in M. barkeri str. Fusaro). Nearly all members have an additional 50 residues, (trimmed from the seed alignment for this model), consisting of low-complexity sequence rich in E,N,Q,T,S, and P, followed by a variant (PAF) form of the PGF-CTERM putative archaeal surface glycoprotein sorting signal. The coined name, sarcinarray family protein, evokes the predicted archaeal surface layer localization, the taxonomic bias of known members, and the tandem organization of most members.


Pssm-ID: 275055  Cd Length: 144  Bit Score: 238.82  E-value: 1.57e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 335929974   16 LIFLCTPISSSSNKPGDIQVYYNGVHLPGDNVAEPTLKIGEPFNVRINFTVYGDYKIYVKLDDYGDNYFVVQDGPSPVGI 95
Cdd:TIGR04209   1 FLLAFVSLVSASSDYGSIDVYYNGKLLPGAEVAKPVLKIGEPFTVKINMTVYQKSTVSVKLSELGGGSFEIISGPSPMNI 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 335929974   96 YSDVILRSDECHAFEWTVYPTDKWAGGFIPLDFHYSMVERGNHVPVAHGAFTVAYPYISHEHYD 159
Cdd:TIGR04209  81 YSSRILEKNETHTFEWTVAPTDNWAGGSLPVDFHYQINDFGTPEPLVNGSFTVAYPYISNEHYE 144
MAST_ArtA_sort TIGR04204
MAST domain; This model describes a domain (or in most cases the full length) of archaeal ...
33-161 1.81e-47

MAST domain; This model describes a domain (or in most cases the full length) of archaeal surface proteins that are putative targets for C-terminal processing by archaeosortase A (TIGR04125). Most members of this family belong to proteins encoded by tandem genes in the genus Methanosarcina. The putative processing signal, PGF-CTERM (TIGR04126), included within the domain definition, takes a variant form, with consensus motif PAF instead of PGF. We suggest the name MAST domain: Methanosarcina Archaeosortase-Sorted Tandem gene family domain.


Pssm-ID: 275051  Cd Length: 182  Bit Score: 153.78  E-value: 1.81e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 335929974   33 IQVYYNGVHLPGDNVAEPTLKIGEPFNVRINFTVYGDYKIYVKLDDYGDNYFVVQDG-PSPVGIYS-DVILRSDECHAFE 110
Cdd:TIGR04204   2 IDVYYNDKLLPGKEVAKPTLKIGEPFKVKINITVYQKSRVSVEVSSIGPGRFEIINGdTSKMNLYSpDRILDRNSGKVYE 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 335929974  111 WTVYPTDKWAGGFIPLDFHYSMVERGNHVPVAHGAFTVAYPYISHEHYDEE 161
Cdd:TIGR04204  82 WTVKPTELWAGGSLPLNFVYQINETGTDEPLVPGEFTIAYPIISNEHYEGE 132
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH