NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|81869787|sp|Q9WTS5|]
View 

RecName: Full=Teneurin-2; Short=Ten-2; AltName: Full=Protein Odd Oz/ten-m homolog 2; AltName: Full=Tenascin-M2; Short=Ten-m2; AltName: Full=Teneurin transmembrane protein 2; Contains: RecName: Full=Ten-2, soluble form; Contains: RecName: Full=Ten-2 intracellular domain; Short=Ten-2 ICD

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 645.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLVHRESDEFSRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    169 RPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQ 245
Cdd:pfam06484  161 PPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    246 STLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKT 322
Cdd:pfam06484  236 SRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKT 315
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 81869787    323 SSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  316 GTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1230-1560 8.20e-47

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 171.94  E-value: 8.20e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1303
Cdd:cd14953   25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1304 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1381
Cdd:cd14953  104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1382 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1459
Cdd:cd14953  170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1539
Cdd:cd14953  237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                        330       340
                 ....*....|....*....|.
gi 81869787 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:cd14953  303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2680-2757 7.72e-36

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 131.58  E-value: 7.72e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 81869787   2680 EEKARVLDQAGQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2757
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1521-2458 2.06e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 135.65  E-value: 2.06e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1521 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1597
Cdd:COG3209  105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1598 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1677
Cdd:COG3209  185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1678 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1757
Cdd:COG3209  265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1758 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1837
Cdd:COG3209  345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1838 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1917
Cdd:COG3209  425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1918 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1989
Cdd:COG3209  505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1990 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 2069
Cdd:COG3209  585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2070 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2149
Cdd:COG3209  665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2150 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2223
Cdd:COG3209  742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2224 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2303
Cdd:COG3209  822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2304 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVYSINGLMIKQLQYTA 2383
Cdd:COG3209  891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
                        890       900       910       920       930       940       950
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 81869787 2384 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2458
Cdd:COG3209  951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
833-863 4.04e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


:

Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 4.04e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 81869787   833 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 863
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
C_rich_MXAN6577 super family cl49352
MXAN_6577-like cysteine-rich domain;
644-789 5.60e-07

MXAN_6577-like cysteine-rich domain;


The actual alignment was detected with superfamily member NF041328:

Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 51.30  E-value: 5.60e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   644 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328   13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   714 gtylpdsgLCSCDPNWMGpdcsvVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcECR 787
Cdd:NF041328   75 --------DTASDPAHCG-----ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SCR 140

                  ..
gi 81869787   788 EG 789
Cdd:NF041328  141 GG 142
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
580-602 7.49e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


:

Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 7.49e-04
                           10        20
                   ....*....|....*....|....*
gi 81869787    580 CHGNGECVS--GLCHCFPGFLGADC 602
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
797-830 4.17e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.17e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 81869787  797 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830
Cdd:cd00054    2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 645.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLVHRESDEFSRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    169 RPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQ 245
Cdd:pfam06484  161 PPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    246 STLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKT 322
Cdd:pfam06484  236 SRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKT 315
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 81869787    323 SSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  316 GTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1230-1560 8.20e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 171.94  E-value: 8.20e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1303
Cdd:cd14953   25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1304 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1381
Cdd:cd14953  104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1382 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1459
Cdd:cd14953  170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1539
Cdd:cd14953  237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                        330       340
                 ....*....|....*....|.
gi 81869787 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:cd14953  303 AVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2680-2757 7.72e-36

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 131.58  E-value: 7.72e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 81869787   2680 EEKARVLDQAGQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2757
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1521-2458 2.06e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 135.65  E-value: 2.06e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1521 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1597
Cdd:COG3209  105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1598 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1677
Cdd:COG3209  185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1678 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1757
Cdd:COG3209  265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1758 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1837
Cdd:COG3209  345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1838 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1917
Cdd:COG3209  425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1918 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1989
Cdd:COG3209  505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1990 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 2069
Cdd:COG3209  585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2070 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2149
Cdd:COG3209  665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2150 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2223
Cdd:COG3209  742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2224 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2303
Cdd:COG3209  822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2304 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVYSINGLMIKQLQYTA 2383
Cdd:COG3209  891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
                        890       900       910       920       930       940       950
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 81869787 2384 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2458
Cdd:COG3209  951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1230-1560 2.95e-12

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 69.66  E-value: 2.95e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1305
Cdd:COG4257   19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1306 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1382
Cdd:COG4257   86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1383 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1459
Cdd:COG4257  139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1539
Cdd:COG4257  184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                        330       340
                 ....*....|....*....|.
gi 81869787 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:COG4257  237 AVDGDGRVWFAESGANRIVRF 257
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2381-2458 7.60e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 57.51  E-value: 7.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   2381 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2456
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   ..
gi 81869787   2457 SN 2458
Cdd:TIGR03696   70 NW 71
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
833-863 4.04e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 4.04e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 81869787   833 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 863
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
644-789 5.60e-07

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 51.30  E-value: 5.60e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   644 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328   13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   714 gtylpdsgLCSCDPNWMGpdcsvVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcECR 787
Cdd:NF041328   75 --------DTASDPAHCG-----ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SCR 140

                  ..
gi 81869787   788 EG 789
Cdd:NF041328  141 GG 142
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
575-734 6.72e-07

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 53.47  E-value: 6.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    575 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 725
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 81869787    726 ------------DPN---WMGPDC 734
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1672-1708 6.61e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.20  E-value: 6.61e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 81869787   1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1708
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
580-602 7.49e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 7.49e-04
                           10        20
                   ....*....|....*....|....*
gi 81869787    580 CHGNGECVS--GLCHCFPGFLGADC 602
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
671-700 2.60e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.60e-03
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 81869787  671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054    4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
797-830 4.17e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.17e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 81869787  797 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830
Cdd:cd00054    2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
738-810 4.32e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.13  E-value: 4.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   738 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 805
Cdd:NF041328   45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121

                  ....*
gi 81869787   806 GNGRC 810
Cdd:NF041328  122 DPLHC 126
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1533-1624 8.60e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 40.74  E-value: 8.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1533 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1603
Cdd:cd14963   55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
                         90       100
                 ....*....|....*....|....*....
gi 81869787 1604 LYNF--------TYSADNDVTelIDNNGN 1624
Cdd:cd14963  134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
10-374 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 645.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLVHRESDEFSRQGTNFTLAELGICEP 89
Cdd:pfam06484    1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787     90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNG 168
Cdd:pfam06484   81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    169 RPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQ 245
Cdd:pfam06484  161 PPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    246 STLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKT 322
Cdd:pfam06484  236 SRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKT 315
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 81869787    323 SSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484  316 GTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1230-1560 8.20e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 171.94  E-value: 8.20e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1303
Cdd:cd14953   25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1304 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1381
Cdd:cd14953  104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1382 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1459
Cdd:cd14953  170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1539
Cdd:cd14953  237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
                        330       340
                 ....*....|....*....|.
gi 81869787 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:cd14953  303 AVDAAGNLYVADTGNNRIRKI 323
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1281-1561 1.84e-40

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 153.84  E-value: 1.84e-40
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1281 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGEqclpfdEARCGDGGKAvdATLMSPRGIAVDKNGLMY 1360
Cdd:cd14953   28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1361 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1436
Cdd:cd14953   92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1437 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1516
Cdd:cd14953  165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 81869787 1517 ncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1561
Cdd:cd14953  229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2680-2757 7.72e-36

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 131.58  E-value: 7.72e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 81869787   2680 EEKARVLDQAGQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2757
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1318-1561 5.97e-32

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 128.80  E-value: 5.97e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1318 VVAGTGeqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1390
Cdd:cd14953    3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1391 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1468
Cdd:cd14953   72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1469 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIY 1548
Cdd:cd14953  136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGGA-----------GYAGDGPATAAQFNNPTGVAVDAAGNLY 201
                        250
                 ....*....|...
gi 81869787 1549 IADLGNIRIRAVS 1561
Cdd:cd14953  202 VADRGNHRIRKIT 214
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1521-2458 2.06e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 135.65  E-value: 2.06e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1521 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1597
Cdd:COG3209  105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1598 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1677
Cdd:COG3209  185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1678 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1757
Cdd:COG3209  265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1758 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1837
Cdd:COG3209  345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1838 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1917
Cdd:COG3209  425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1918 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1989
Cdd:COG3209  505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1990 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 2069
Cdd:COG3209  585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2070 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2149
Cdd:COG3209  665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2150 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2223
Cdd:COG3209  742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2224 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2303
Cdd:COG3209  822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 2304 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVYSINGLMIKQLQYTA 2383
Cdd:COG3209  891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
                        890       900       910       920       930       940       950
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 81869787 2384 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2458
Cdd:COG3209  951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1281-1579 1.26e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 85.45  E-value: 1.26e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1281 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMY 1360
Cdd:cd05819   13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1361 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1436
Cdd:cd05819   70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1437 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1516
Cdd:cd05819  135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 81869787 1517 ncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1579
Cdd:cd05819  187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1200-1370 2.08e-17

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 86.04  E-value: 2.08e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1200 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 1272
Cdd:cd14953  163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1273 ---NSPghkYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeQCLPfdearcGDGGKAVDATLMSPR 1349
Cdd:cd14953  239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
                        170       180
                 ....*....|....*....|...
gi 81869787 1350 GIAVDKNGLMYFVDAT--MIRKV 1370
Cdd:cd14953  301 GVAVDAAGNLYVADTGnnRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1225-1558 2.56e-16

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 81.60  E-value: 2.56e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1225 NKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1302
Cdd:cd05819    5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEP---AGVAVDS-DGNLYVADTGNHRIQK 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1303 VkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 1380
Cdd:cd05819   81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1381 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1458
Cdd:cd05819  139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1459 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncicysgdDAYATDAILNSPSS 1538
Cdd:cd05819  194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
                        330       340
                 ....*....|....*....|
gi 81869787 1539 LAVAPDGTIYIADLGNIRIR 1558
Cdd:cd05819  248 LAVDSDGNLYVADTGNNRIQ 267
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1219-1430 3.30e-15

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 78.51  E-value: 3.30e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1219 NGLAEGNkLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVS 1293
Cdd:cd05819   94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQF---NGP---TGVAVDS-DGNIYVA 165
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1294 DTNSRRIYRVKSlsgakdlagNSEVVAGTGEQClpfdearcgdggkAVDATLMSPRGIAVDKNGLMYFVDATM--IRKVD 1371
Cdd:cd05819  166 DTGNHRIQVFDP---------DGNFLTTFGSTG-------------TGPGQFNYPTGIAVDSDGNIYVADSGNnrVQVFD 223
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 81869787 1372 QNGIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1430
Cdd:cd05819  224 PDGAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1342-1573 8.03e-15

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 77.36  E-value: 8.03e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1342 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1419
Cdd:cd05819    4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1420 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1497
Cdd:cd05819   69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 81869787 1498 GEIcllagaasdcdckndVNCICYSGddayATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1573
Cdd:cd05819  132 GEF---------------LTTFGSGG----SGPGQFNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1226-1491 1.12e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 73.89  E-value: 1.12e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1226 KLLAPVALAVGIDGSLFVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRI 1300
Cdd:cd05819   53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1301 YRVkslsgakDLAGNSEVVAGtgeqclpfdearcgdGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 1378
Cdd:cd05819  126 QKF-------DPDGEFLTTFG---------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1379 LLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1456
Cdd:cd05819  184 TFGST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 81869787 1457 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1491
Cdd:cd05819  235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1230-1560 2.95e-12

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 69.66  E-value: 2.95e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1305
Cdd:COG4257   19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1306 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1382
Cdd:COG4257   86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1383 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1459
Cdd:COG4257  139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1539
Cdd:COG4257  184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
                        330       340
                 ....*....|....*....|.
gi 81869787 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:COG4257  237 AVDGDGRVWFAESGANRIVRF 257
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2381-2458 7.60e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 57.51  E-value: 7.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   2381 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2456
Cdd:TIGR03696    1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69

                   ..
gi 81869787   2457 SN 2458
Cdd:TIGR03696   70 NW 71
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1267-1590 1.89e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 61.19  E-value: 1.89e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1267 KEFKHSNSPGHKYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAgnsevvagTGEqclpFDEARCGDGGkavdatlm 1346
Cdd:COG4257    8 TEYPVPAPGSGPRDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS-------- 59
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1347 SPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV-- 1421
Cdd:COG4257   60 GPHGIAVDPDGNLWFTDngNNRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtd 119
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1422 LENNVILRIT-ENHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGE 1499
Cdd:COG4257  120 QGGNRIGRLDpATGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGT 177
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1500 IcllagaasdcdckndvncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPG 1579
Cdd:COG4257  178 L------------------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGG 229
                        330
                 ....*....|...
gi 81869787 1580 EQELY--VFNADG 1590
Cdd:COG4257  230 GARPYgvAVDGDG 242
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1225-1500 1.95e-08

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 58.11  E-value: 1.95e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1225 NKLLAPVALAVGIDGSLFVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1302
Cdd:COG4257   56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1303 VkslsgakDLAGNsEVVAGTgeqcLPFDEARcgdggkavdatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 1379
Cdd:COG4257  128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1380 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1457
Cdd:COG4257  182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|...
gi 81869787 1458 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1500
Cdd:COG4257  224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
833-863 4.04e-08

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 50.98  E-value: 4.04e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 81869787   833 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 863
Cdd:NF033662    2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1281-1557 6.85e-08

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 56.06  E-value: 6.85e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1281 LAVDPvTGSLYVSDTNSRRIYRvkslsgakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLM 1359
Cdd:cd14952   15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1360 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1437
Cdd:cd14952   66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1438 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTT-------NGeicLLAGAASDC 1510
Cdd:cd14952  122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTtqtvlpfTG---LNSPSGVAV 185
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 81869787 1511 DCKNDVncicYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1557
Cdd:cd14952  186 DTAGNV----YVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
644-789 5.60e-07

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 51.30  E-value: 5.60e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   644 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328   13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   714 gtylpdsgLCSCDPNWMGpdcsvVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcECR 787
Cdd:NF041328   75 --------DTASDPAHCG-----ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SCR 140

                  ..
gi 81869787   788 EG 789
Cdd:NF041328  141 GG 142
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
575-734 6.72e-07

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 53.47  E-value: 6.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    575 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232   11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232   90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 725
Cdd:pfam19232  167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
                          250       260
                   ....*....|....*....|....
gi 81869787    726 ------------DPN---WMGPDC 734
Cdd:pfam19232  242 nidfsghnscgdDNNctsWTGPRC 265
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1345-1624 6.90e-06

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 50.34  E-value: 6.90e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1345 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1422
Cdd:cd14957   17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1423 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1499
Cdd:cd14957   82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1500 ICllagaasdcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1579
Cdd:cd14957  144 FS-------------------YSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*.
gi 81869787 1580 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYsadnDVTelIDNNGN 1624
Cdd:cd14957  184 -----VFTSSGTFQYTFgSSGSGPGQFSDPY----GIA--VDSDGN 218
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1522-1563 2.02e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 49.45  E-value: 2.02e-05
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|..
gi 81869787 1522 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1563
Cdd:cd14953   11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1230-1558 2.10e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.80  E-value: 2.10e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIyRVK 1304
Cdd:cd14957   20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1305 SLSGAKDLAgnsevVAGTGEQCLPFDEarcgdggkavdatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 1382
Cdd:cd14957   92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1383 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1459
Cdd:cd14957  151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1460 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncicySGDDAYA------TDAIL 1533
Cdd:cd14957  201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
                        330       340
                 ....*....|....*....|....*
gi 81869787 1534 NSPSSLAVAPDGTIYIADLGNIRIR 1558
Cdd:cd14957  253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1672-1708 6.61e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.20  E-value: 6.61e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 81869787   1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1708
Cdd:pfam05593    1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1227-1421 7.45e-05

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 46.82  E-value: 7.45e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1227 LLAPVALAVGIDGSLFVGDFNYIR--RIFPSRNVTSILELRNKEFKHSnspghkyyLAVDPVtGSLYVSDTNSRRIYRVK 1304
Cdd:cd14952   51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLPFTGLNDPTG--------VAVDAA-GNVYVADTGNNRVLKLA 121
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1305 S------------LSGAKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKN 1356
Cdd:cd14952  122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 81869787 1357 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1421
Cdd:cd14952  189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1233-1393 3.88e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 44.88  E-value: 3.88e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1233 LAVGIDGSLFVGDFNY------IRRIFPSRNVTSILElrnkEFKHSNSpghkyyLAVDPVTGSLYVSDTNSRRIYRVksl 1306
Cdd:COG3386   98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF--- 164
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1307 sgakDLAGNSEVVAGTgeqclPFDEARCGDGGkavdatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 1384
Cdd:COG3386  165 ----DLDADGTLGNRR-----VFADLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223

                 ....*....
gi 81869787 1385 LTAVRPLSC 1393
Cdd:COG3386  224 LPERRPTNV 232
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
667-784 4.73e-04

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 43.24  E-value: 4.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    667 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCeLARVQCPDQCSGHGTYLPDSGLCSCDPNWMGPDCSVVCSVDCGTHG 746
Cdd:pfam01500    4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC-CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQTSSCQT 80
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 81869787    747 VCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKC 784
Cdd:pfam01500   81 GCGGIGYGQEGSSGAVSSRTRWCRPDCRVEGTCLPPCC 118
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
580-602 7.49e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 7.49e-04
                           10        20
                   ....*....|....*....|....*
gi 81869787    580 CHGNGECVS--GLCHCFPGFLGADC 602
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
755-797 1.83e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 38.37  E-value: 1.83e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 81869787    755 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 797
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1220-1384 2.50e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 42.28  E-value: 2.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1220 GLAEGnKLLAPVALAVGIDGSLFVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNSPGHkyyLAVDpvT 1287
Cdd:cd14963   49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1288 GSLYVSDTNSRRIYrvkslsgakdlagnseVVAGTGEQCLPFDEARCGDGgkavdaTLMSPRGIAVDKNGLMYFVDATMI 1367
Cdd:cd14963  112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
                        170       180
                 ....*....|....*....|
gi 81869787 1368 R-KV-DQNG-IISTLLGSND 1384
Cdd:cd14963  170 RiQVfDKNGkFIKELNGSPD 189
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
671-700 2.60e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 2.60e-03
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 81869787  671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054    4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
645-667 2.60e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.33  E-value: 2.60e-03
                           10        20
                   ....*....|....*....|....*
gi 81869787    645 CGGHGSCID--GNCVCAAGYKGEHC 667
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
725-766 2.85e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 37.60  E-value: 2.85e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 81869787    725 CDPNWMGPDCSVVCSV--DCGTHGVC-IGGACRCEEGWTGAACDQ 766
Cdd:pfam01414    1 CDENYYGSTCSKFCRPrdDKFGHYTCdANGNKVCLPGWTGPYCDK 45
NHL_like_6 cd14962
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1279-1483 3.15e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271332 [Multi-domain]  Cd Length: 271  Bit Score: 42.19  E-value: 3.15e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1279 YYLAVDPvTGSLYVSDTNSRRIYRvkslsgakdlagnsevvagtgeqclpFDEARcgdG-----GKAVDATLMSPRGIAV 1353
Cdd:cd14962   15 YGVAADG-RGRIYVADTGRGAVFV--------------------------FDLPN---GkvfviGNAGPNRFVSPIGVAI 64
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1354 DKNGLMYFVDAT--MIRKVDQNGIISTLLGSNDLtavrplscdssmdvaQVRlewPTDLAVNPMDNSLYVLEnnvilriT 1431
Cdd:cd14962   65 DANGNLYVSDAElgKVFVFDRDGKFLRAIGAGAL---------------FKR---PTGIAVDPAGKRLYVVD-------T 119
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|..
gi 81869787 1432 ENHQVSIIAGRPMHCQVPGIDYSlsklaIHSALESASAIAISHTGVLYITET 1483
Cdd:cd14962  120 LAHKVKVFDLDGRLLFDIGKRGS-----GPGEFNLPTDLAVDRDGNLYVTDT 166
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
710-734 3.60e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.60e-03
                           10        20
                   ....*....|....*....|....*
gi 81869787    710 CSGHGTYLPDSGLCSCDPNWMGPDC 734
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
638-668 3.71e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.71e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 81869787  638 NQCIDPS-CGGHGSCIDG----NCVCAAGYKGEHCE 668
Cdd:cd00054    3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1672-1714 3.77e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 37.18  E-value: 3.77e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 81869787   1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1714
Cdd:TIGR01643    1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
797-830 4.17e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 4.17e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 81869787  797 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830
Cdd:cd00054    2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
738-810 4.32e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.13  E-value: 4.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787   738 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 805
Cdd:NF041328   45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121

                  ....*
gi 81869787   806 GNGRC 810
Cdd:NF041328  122 DPLHC 126
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
185-349 5.76e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 41.72  E-value: 5.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    185 LPSSHNPPPVScQMPLLDSNTSHQIMDTNPDEEF----SPNSYLLRACSGPQQ---ASSSGPPNHHSQSTLRPPLPPPhn 257
Cdd:pfam15279  127 APKPHEPPSLP-PPPLPPKKGRRHRPGLHPPLGRppgsPPMSMTPRGLLGKPQqhpPPSPLPAFMEPSSMPPPFLRPP-- 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787    258 htlsHHHSSANSlnrnSLTNRRSQIHAPAPAP-NDLATTPEsvqlqdswvlnsnvPLEtRHFLFKTSSGSTPLFSSSSPG 336
Cdd:pfam15279  204 ----PSIPQPNS----PLSNPMLPGIGPPPKPpRNLGPPSN--------------PMH-RPPFSPHHPPPPPTPPGPPPG 260
                          170
                   ....*....|...
gi 81869787    337 YPLTSGTVYTPPP 349
Cdd:pfam15279  261 LPPPPPRGFTPPF 273
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1230-1363 6.92e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.04  E-value: 6.92e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNSPGHkyyLAVDPvTGSLYVSDTNSRRIYRvksls 1307
Cdd:cd14952  138 PDGVAVDGAGNVYVTDTgnNRVLKLAAGSTTQTVLP-----FTGLNSPSG---VAVDT-AGNVYVTDHGNNRVLK----- 203
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 81869787 1308 gakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLMYFVD 1363
Cdd:cd14952  204 ----LA------AGSTTPTvLPFTG-------------LNGPLGVAVDAAGNVYVAD 237
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1475-1560 8.30e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 41.02  E-value: 8.30e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1475 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncicySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1549
Cdd:cd14951  206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
                         90
                 ....*....|.
gi 81869787 1550 ADLGNIRIRAV 1560
Cdd:cd14951  264 ADTNNHRIRRL 274
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1533-1624 8.60e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 40.74  E-value: 8.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81869787 1533 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1603
Cdd:cd14963   55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
                         90       100
                 ....*....|....*....|....*....
gi 81869787 1604 LYNF--------TYSADNDVTelIDNNGN 1624
Cdd:cd14963  134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH