Conserved domains on  [gi|2462627980|ref|XP_054182321|]

teneurin-1 isoform X1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Ten_N super family cl24184
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 3.39e-79

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out some transcriptional regulatory activity. The length of this region and the conservation suggest that there may be two structural domains here (personal obs: C Yeats).


The actual alignment was detected with superfamily member pfam06484:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 267.23  E-value: 3.39e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   23 YTSSSDESEDGRKPRQ-SYNSRETLHEYNQELRMNYNSQSRK----------RKEVEKSTQEMEFCETSHTLCSGYQTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDM- 170
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSs 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  171 -------------EAQAG------STQDVQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 pveqhsppppslnENQRPllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  211 -----PAADSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 2462627980  282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2681-2758 2.53e-34

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.



Pssm-ID: 464783  Cd Length: 78  Bit Score: 126.96  E-value: 2.53e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462627980 2681 EEKNHVLEIARQRAVAQAWTKEQRRLQEGEEGIRAWTEGEKQQLLSTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2758
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1168-1561 2.29e-32

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 129.96  E-value: 2.29e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1168 ISTIMGNGhqrsVACTNcNGPAHNNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELRNRDTRHSTSPAHK 1245
Cdd:cd14953      1 VSTVAGSG----TAGFS-GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1246 YY----LAMDPvSESLYLSDTNTRKVYKLkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGRASEASLNSPRDdssp 1321
Cdd:cd14953     76 FNtpsgVAVDA-AGNLYVADTGNHRIRKI-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTG---- 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1322 rrsvivlmvdngllppehvtgdllkgqyrITVDRHGFIYFVDGT--MIRKIDENAVITTVIGsnglTSTQPLSCD-SGmd 1398
Cdd:cd14953    137 -----------------------------VAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG----TGGAGYAGDgPA-- 181
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1399 iTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvskvAIHSTLESARAISVSHS 1476
Cdd:cd14953    182 -TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG-------ATAAQLNNPTGVAVDAA 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1477 GLLFIAETDErkvNRIQQVTTNGEIYIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNV 1556
Cdd:cd14953    253 GNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGVAVDAAGNLYVADTGNN 318

                   ....*
gi 2462627980 1557 RIRTI 1561
Cdd:cd14953    319 RIRKI 323
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1524-2457 1.12e-29

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];



Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 129.88  E-value: 1.12e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1524 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISRNQAHLNDMNIYEIASPADQELYQFTVNGTHLHTLNLITRD 1603
Cdd:COG3209    119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1604 YVYNFTYNSEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1683
Cdd:COG3209    199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1684 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVELDTSNRENVLMSTNLTATSTIYILKQENTQSTYRVNPD 1763
Cdd:COG3209    279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLG 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1764 GSLRVTFASGMEIGLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSIDF-- 1841
Cdd:COG3209    359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGtg 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1842 DHITRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEkMEYDQSGKIISRTWADGKI 1921
Cdd:COG3209    439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGT-DTTLDDTLGGTTTTTAGAR 517
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1922 WSYTYLEKSVMLLLHSQRRYIFEYDQPDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTLH 2001
Cdd:COG3209    518 GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTT 597
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2002 LGTGRRVLYKYTKQARLSEVLYDTTQVTLTYEESSGVIKTIHLMHDGFICTIRYRQTGPLIGRQIFRFSEEGLVNARFDY 2081
Cdd:COG3209    598 TTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLA 677
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2082 SYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFSANGQVIEVQYEILKAIAY 2161
Cdd:COG3209    678 TGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGA 757
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2162 WmTIQYDNVGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKTQWRYSYDLNGNINLLSHGKSARLTPL-----RY 2236
Cdd:COG3209    758 L-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyTY 836
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2237 DLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAynKASGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLtn 2316
Cdd:COG3209    837 DAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTTTYTYDALGR-- 904
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2317 PIRVTHlynhTSSEITSLYYDLQGHliamelssgeeyyvaCDNTGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFQVI 2396
Cdd:COG3209    905 LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANP 965
                          890       900       910       920       930       940
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462627980 2397 IGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2457
Cdd:COG3209    966 LRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
803-828 2.46e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.



Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.46e-09
                           10        20
                   ....*....|....*....|....*.
gi 2462627980  803 CGDNLDNDGDGLTDCVDPDCCQQSNC 828
Cdd:NF033662     7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 super family cl44670
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
530-721 1.03e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


The actual alignment was detected with superfamily member pfam19232:

Pssm-ID: 437064  Cd Length: 265  Bit Score: 52.70  E-value: 1.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  530 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 578
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  579 VCRHGW-KGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 625
Cdd:pfam19232   84 SADHPCpSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  626 EDcLDPMC---------------SNHGICVK-------------GECHCSTGWGGVNCETplpvcQEQCSGHGTFLLDAG 677
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2462627980  678 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 721
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
712-755 1.88e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 41.07  E-value: 1.88e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462627980  712 CEEGWVGPTCEeRSC--------HSHCTEHGQCkdgkcECSPGWEGDHCTIA 755
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
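
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch that is not part of the CD-Search output, such lines could be turned into records with a regular expression. The function name `parse_summary` and the dictionary keys are hypothetical names chosen for illustration, and the pattern is inferred only from the summary lines shown in this report, so other CD-Search layouts may need adjustments.

```python
import re

# Pattern inferred from summary lines in this report, for example:
# "Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 267.23  E-value: 3.39e-79"
# The bracketed flag is optional; it is absent on some hits (e.g. Pssm-ID 464783).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict for one summary line, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    example = (
        "Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  "
        "Bit Score: 267.23  E-value: 3.39e-79"
    )
    print(parse_summary(example))
```

Lines that are not summary lines (alignment rows, descriptions, interval rows) return None, so the function can be applied to every line of a saved report and the non-matching results filtered out.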
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 3.39e-79

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out some transcriptional regulatory activity. The length of this region and the conservation suggest that there may be two structural domains here (personal obs: C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 267.23  E-value: 3.39e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   23 YTSSSDESEDGRKPRQ-SYNSRETLHEYNQELRMNYNSQSRK----------RKEVEKSTQEMEFCETSHTLCSGYQTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDM- 170
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSs 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  171 -------------EAQAG------STQDVQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 pveqhsppppslnENQRPllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  211 -----PAADSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 2462627980  282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2681-2758 2.53e-34

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 126.96  E-value: 2.53e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462627980 2681 EEKNHVLEIARQRAVAQAWTKEQRRLQEGEEGIRAWTEGEKQQLLSTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2758
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1168-1561 2.29e-32

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 129.96  E-value: 2.29e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1168 ISTIMGNGhqrsVACTNcNGPAHNNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELRNRDTRHSTSPAHK 1245
Cdd:cd14953      1 VSTVAGSG----TAGFS-GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1246 YY----LAMDPvSESLYLSDTNTRKVYKLkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGRASEASLNSPRDdssp 1321
Cdd:cd14953     76 FNtpsgVAVDA-AGNLYVADTGNHRIRKI-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTG---- 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1322 rrsvivlmvdngllppehvtgdllkgqyrITVDRHGFIYFVDGT--MIRKIDENAVITTVIGsnglTSTQPLSCD-SGmd 1398
Cdd:cd14953    137 -----------------------------VAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG----TGGAGYAGDgPA-- 181
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1399 iTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvskvAIHSTLESARAISVSHS 1476
Cdd:cd14953    182 -TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG-------ATAAQLNNPTGVAVDAA 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1477 GLLFIAETDErkvNRIQQVTTNGEIYIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNV 1556
Cdd:cd14953    253 GNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGVAVDAAGNLYVADTGNN 318

                   ....*
gi 2462627980 1557 RIRTI 1561
Cdd:cd14953    319 RIRKI 323
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1524-2457 1.12e-29

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 129.88  E-value: 1.12e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1524 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISRNQAHLNDMNIYEIASPADQELYQFTVNGTHLHTLNLITRD 1603
Cdd:COG3209    119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1604 YVYNFTYNSEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1683
Cdd:COG3209    199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1684 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVELDTSNRENVLMSTNLTATSTIYILKQENTQSTYRVNPD 1763
Cdd:COG3209    279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLG 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1764 GSLRVTFASGMEIGLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSIDF-- 1841
Cdd:COG3209    359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGtg 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1842 DHITRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEkMEYDQSGKIISRTWADGKI 1921
Cdd:COG3209    439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGT-DTTLDDTLGGTTTTTAGAR 517
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1922 WSYTYLEKSVMLLLHSQRRYIFEYDQPDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTLH 2001
Cdd:COG3209    518 GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTT 597
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2002 LGTGRRVLYKYTKQARLSEVLYDTTQVTLTYEESSGVIKTIHLMHDGFICTIRYRQTGPLIGRQIFRFSEEGLVNARFDY 2081
Cdd:COG3209    598 TTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLA 677
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2082 SYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFSANGQVIEVQYEILKAIAY 2161
Cdd:COG3209    678 TGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGA 757
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2162 WmTIQYDNVGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKTQWRYSYDLNGNINLLSHGKSARLTPL-----RY 2236
Cdd:COG3209    758 L-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyTY 836
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2237 DLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAynKASGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLtn 2316
Cdd:COG3209    837 DAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTTTYTYDALGR-- 904
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2317 PIRVTHlynhTSSEITSLYYDLQGHliamelssgeeyyvaCDNTGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFQVI 2396
Cdd:COG3209    905 LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANP 965
                          890       900       910       920       930       940
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462627980 2397 IGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2457
Cdd:COG3209    966 LRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
803-828 2.46e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.46e-09
                           10        20
                   ....*....|....*....|....*.
gi 2462627980  803 CGDNLDNDGDGLTDCVDPDCCQQSNC 828
Cdd:NF033662     7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
530-721 1.03e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 52.70  E-value: 1.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  530 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 578
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  579 VCRHGW-KGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 625
Cdd:pfam19232   84 SADHPCpSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  626 EDcLDPMC---------------SNHGICVK-------------GECHCSTGWGGVNCETplpvcQEQCSGHGTFLLDAG 677
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2462627980  678 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 721
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2379-2457 1.70e-05

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 45.18  E-value: 1.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2379 YTPYGDIYHDTYPDFQVIiGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkPF------NLYSFENNY 2452
Cdd:TIGR03696    1 YDPYGEVLSESGAAPNPL-RFTGQYYDAETGLYYNGARYYDPELGRFLSPD------------PIglggglNLYAYVGNN 67

                   ....*
gi 2462627980 2453 PVGKI 2457
Cdd:TIGR03696   68 PVNWV 72
RHS_core NF041261
RHS element core protein;
1608-1725 3.63e-05

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 49.62  E-value: 3.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1608 FTYNSEGDLGAITSSNGNSVHIRRDAGGMPLwlvvpggqvywltISSNGVLKRvsAQGYNLAlmtypgntGLLATKSNEN 1687
Cdd:NF041261   602 YEYNAAGDLTAVITPDGNRSETQYDAWGKAV-------------STTQGGLTR--SMEYDAA--------GRITTLTNEN 658
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 2462627980 1688 GWTTVYEYDPEGHLTNATFPTGEVSSFHSDLE-KLTKVE 1725
Cdd:NF041261   659 GSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTgKLTQSE 697
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
712-755 1.88e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 41.07  E-value: 1.88e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462627980  712 CEEGWVGPTCEeRSC--------HSHCTEHGQCkdgkcECSPGWEGDHCTIA 755
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
652-795 2.26e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.90  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  652 GVNCETPLPVCQEQCsghgtFLLDAgvcscDPkwtgSDCSTelCTMECGSHGVCSRGICQCEEGWV--GPTC-EERSCHS 728
Cdd:NF041328    18 GAVCPEGLSVCGGAC-----VDLRS-----DP----SNCGA--CGVACGAGQTCVAGACGCGPGTVacGGACvDTASDPA 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462627980  729 HCtehGQCkdGKCeCSPGW--EGDHCtiahyldavRDGCP-GLCFGNGRCT-LDQNGWHCvcqvGWSGTGC 795
Cdd:NF041328    82 HC---GAC--GAA-CAPGQvcEGGAC---------REACSeGLTRCGGACVdLATDPLHC----GACGVAC 133
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1351-1561 3.95e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 41.93  E-value: 3.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1351 ITVDRHGFIYFVDGT--MIRKID-ENAVITTVIGSNGLTStqplscdsgmditqvrlewPTDLAVNPmDNSLYVLDNNiv 1427
Cdd:COG4257     64 IAVDPDGNLWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFTDQG-- 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1428 lqiseNRRVRII---AGRpihcqvpgidhflVSKVAIHSTLESARAISVSHSGLLFIAEtdeRKVNRIQqvttngeiyii 1504
Cdd:COG4257    122 -----GNRIGRLdpaTGE-------------VTEFPLPTGGAGPYGIAVDPDGNLWVTD---FGANAIG----------- 169
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462627980 1505 agaptdcdcKIDPncdcfsgDGG----YAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTI 1561
Cdd:COG4257    170 ---------RIDP-------DTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRF 214
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 3.39e-79

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. There it probably carries out some transcriptional regulatory activity. The length of this region and the conservation suggest that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 267.23  E-value: 3.39e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   23 YTSSSDESEDGRKPRQ-SYNSRETLHEYNQELRMNYNSQSRK----------RKEVEKSTQEMEFCETSHTLCSGYQTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980   92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDM- 170
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSs 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  171 -------------EAQAG------STQDVQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 pveqhsppppslnENQRPllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  211 -----PAADSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 2462627980  282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2681-2758 2.53e-34

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 126.96  E-value: 2.53e-34
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462627980 2681 EEKNHVLEIARQRAVAQAWTKEQRRLQEGEEGIRAWTEGEKQQLLSTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2758
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
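
The bit score and E-value above summarize the alignment, but it is sometimes useful to count identical positions directly from the two rows. The following is a minimal Python sketch of that count, not part of the CD-Search output. The function name `aligned_identity` is hypothetical, and the sketch assumes that "-" marks a gap and that lowercase residues mark positions not aligned to the model, so both are skipped.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) counts for two equal-length alignment rows.

    Assumption: '-' is a gap and lowercase letters are unaligned residues;
    columns containing either are excluded from both counts.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gap columns and unaligned residues
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the Tox-GHH (pfam15636) alignment above.
query = "EEKNHVLEIARQRAVAQAWTKEQRRLQEGEEGIRAWTEGEKQQLLSTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR"
subject = "EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK"

identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Because this alignment has no gaps, every one of its columns is counted; for gapped alignments elsewhere in this list, the skipped columns reduce the denominator.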
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1168-1561 2.29e-32

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 129.96  E-value: 2.29e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1168 ISTIMGNGhqrsVACTNcNGPAHNNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELRNRDTRHSTSPAHK 1245
Cdd:cd14953      1 VSTVAGSG----TAGFS-GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1246 YY----LAMDPvSESLYLSDTNTRKVYKLkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGRASEASLNSPRDdssp 1321
Cdd:cd14953     76 FNtpsgVAVDA-AGNLYVADTGNHRIRKI-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTG---- 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1322 rrsvivlmvdngllppehvtgdllkgqyrITVDRHGFIYFVDGT--MIRKIDENAVITTVIGsnglTSTQPLSCD-SGmd 1398
Cdd:cd14953    137 -----------------------------VAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG----TGGAGYAGDgPA-- 181
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1399 iTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvskvAIHSTLESARAISVSHS 1476
Cdd:cd14953    182 -TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG-------ATAAQLNNPTGVAVDAA 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1477 GLLFIAETDErkvNRIQQVTTNGEIYIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNV 1556
Cdd:cd14953    253 GNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGVAVDAAGNLYVADTGNN 318

                   ....*
gi 2462627980 1557 RIRTI 1561
Cdd:cd14953    319 RIRKI 323
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
1524-2457 1.12e-29

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 129.88  E-value: 1.12e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1524 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISRNQAHLNDMNIYEIASPADQELYQFTVNGTHLHTLNLITRD 1603
Cdd:COG3209    119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1604 YVYNFTYNSEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1683
Cdd:COG3209    199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1684 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVELDTSNRENVLMSTNLTATSTIYILKQENTQSTYRVNPD 1763
Cdd:COG3209    279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLG 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1764 GSLRVTFASGMEIGLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSIDF-- 1841
Cdd:COG3209    359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGtg 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1842 DHITRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEkMEYDQSGKIISRTWADGKI 1921
Cdd:COG3209    439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGT-DTTLDDTLGGTTTTTAGAR 517
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1922 WSYTYLEKSVMLLLHSQRRYIFEYDQPDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTLH 2001
Cdd:COG3209    518 GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTT 597
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2002 LGTGRRVLYKYTKQARLSEVLYDTTQVTLTYEESSGVIKTIHLMHDGFICTIRYRQTGPLIGRQIFRFSEEGLVNARFDY 2081
Cdd:COG3209    598 TTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLA 677
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2082 SYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFSANGQVIEVQYEILKAIAY 2161
Cdd:COG3209    678 TGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGA 757
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2162 WmTIQYDNVGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKTQWRYSYDLNGNINLLSHGKSARLTPL-----RY 2236
Cdd:COG3209    758 L-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyTY 836
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2237 DLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAynKASGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLtn 2316
Cdd:COG3209    837 DAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTTTYTYDALGR-- 904
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2317 PIRVTHlynhTSSEITSLYYDLQGHliamelssgeeyyvaCDNTGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFQVI 2396
Cdd:COG3209    905 LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANP 965
                          890       900       910       920       930       940
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462627980 2397 IGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2457
Cdd:COG3209    966 LRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1248-1564 1.48e-26

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 113.01  E-value: 1.48e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1248 LAMDPvSESLYLSDTNTRKVYKL--KSLVETkdlsknfevVAGTGDQclpfdqshcG-DGGRASEASLNSPRDdssprrs 1324
Cdd:cd14953     28 VAVDA-AGNLYVADRGNHRIRKItpDGVVTT---------VAGTGTA---------GfADGGGAAAQFNTPSG------- 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1325 vivlmvdngllppehvtgdllkgqyrITVDRHGFIYFVDGT--MIRKIDENAVITTVIGsnglTSTQPLSCDSGMdiTQV 1402
Cdd:cd14953     82 --------------------------VAVDAAGNLYVADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGA--TAA 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1403 RLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPihcqVPGidhFLVSKVAIHSTLESARAISVSHSGLLF 1480
Cdd:cd14953    130 QFNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVVTTVAGTG----GAG---YAGDGPATAAQFNNPTGVAVDAAGNLY 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1481 IAETDErkvNRIQQVTTNGEIYIIAGAPTDCdckidpncdcFSGDGGyAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRT 1560
Cdd:cd14953    202 VADRGN---HRIRKITPDGVVTTVAGTGTAG----------FSGDGG-ATAAQLNNPTGVAVDAAGNLYVADSGNHRIRK 267

                   ....
gi 2462627980 1561 ISRN 1564
Cdd:cd14953    268 ITPA 271
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1349-1562 5.75e-23

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 102.61  E-value: 5.75e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1349 YRITVDRHGFIYFVDGT--MIRKIDENAVITTVIGsnglTSTQPLSCDSGmdiTQVRLEWPTDLAVNPMDNsLYVLD--N 1424
Cdd:cd14953     26 SGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAG----TGTAGFADGGG---AAAQFNTPSGVAVDAAGN-LYVADtgN 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1425 NIVLQISENRRVRIIAGRpihcqvpGIDHFLVSKVAIHSTLESARAISVSHSGLLFIAETderKVNRIQQVTTNGEIYII 1504
Cdd:cd14953     98 HRIRKITPDGVVSTLAGT-------GTAGFSDDGGATAAQFNYPTGVAVDAAGNLYVADT---GNHRIRKITPDGVVTTV 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462627980 1505 AGAPTDcdckidpncdcFSGDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTIS 1562
Cdd:cd14953    168 AGTGGA-----------GYAGDGPATAAQFNNPTGVAVDAAGNLYVADRGNHRIRKIT 214
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1191-1561 4.24e-16

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 80.83  E-value: 4.24e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1191 NNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELRNRDTRHSTSPAHkyyLAMDPvSESLYLSDTNTRKVY 1268
Cdd:cd05819      4 PGELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQ 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1269 KLkslvetkDLSKNFEVVAGTGDQclpfdqshcGDGGraseasLNSPRDdssprrsvivlmvdngllppehvtgdllkgq 1348
Cdd:cd05819     80 KF-------DPDGNFLASFGGSGD---------GDGE------FNGPRG------------------------------- 106
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1349 yrITVDRHGFIYFVDgTM---IRKIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLDnn 1425
Cdd:cd05819    107 --IAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTTFGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-- 166
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1426 ivlqiSENRRVRIIA--GRPIhcqvpgidhFLV-SKVAIHSTLESARAISVSHSGLLFIAETDErkvNRIQqvttngeiy 1502
Cdd:cd05819    167 -----TGNHRIQVFDpdGNFL---------TTFgSTGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQ--------- 220
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1503 iiagaptdcdcKIDPNCDCFSGDGGYA-KDAKMKAPSSLAVSPDGTLYVADLGNVRIRTI 1561
Cdd:cd05819    221 -----------VFDPDGAGFGGNGNFLgSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1351-1569 5.62e-16

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 80.83  E-value: 5.62e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1351 ITVDRHGFIYFVDGTM--IRKIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLDnnivl 1428
Cdd:cd05819     13 IAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFGSGDGQ--------------FNEPAGVAVDS-DGNLYVAD----- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1429 qiSENRRVRII--AGRPI-HCQVPGIDhflvskvaiHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEIYIIA 1505
Cdd:cd05819     73 --TGNHRIQKFdpDGNFLaSFGGSGDG---------DGEFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPDGEFLTTF 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1506 GAPTDCDCK--------IDPN-----CDC-------FSGDGGY--------AKDAKMKAPSSLAVSPDGTLYVADLGNVR 1557
Cdd:cd05819    139 GSGGSGPGQfngptgvaVDSDgniyvADTgnhriqvFDPDGNFlttfgstgTGPGQFNYPTGIAVDSDGNIYVADSGNNR 218
                          250
                   ....*....|..
gi 2462627980 1558 IRTISRNQAHLN 1569
Cdd:cd05819    219 VQVFDPDGAGFG 230
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1167-1430 2.44e-11

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 67.56  E-value: 2.44e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1167 VISTIMGNGhqrSVACTNCNGPAhNNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGnSVSIL---ELRNRDTRHSTS 1241
Cdd:cd14953    108 VVSTLAGTG---TAGFSDDGGAT-AAQFNYPTGVAVDAAGNLYVADTgnHRIRKITPDG-VVTTVagtGGAGYAGDGPAT 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1242 PAHKYY---LAMDPvSESLYLSDTNTRKVYKLKS--LVETkdlsknfevVAGTGDQclPFdqshcGDGGRASEASLNSPR 1316
Cdd:cd14953    183 AAQFNNptgVAVDA-AGNLYVADRGNHRIRKITPdgVVTT---------VAGTGTA--GF-----SGDGGATAAQLNNPT 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1317 DdssprrsvivlmvdngllppehvtgdllkgqyrITVDRHGFIYFVD---GTmIRKIDENAVITTVIGSnglTSTQPLSC 1393
Cdd:cd14953    246 G---------------------------------VAVDAAGNLYVADsgnHR-IRKITPAGVVTTVAGG---GAGFSGDG 288
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 2462627980 1394 DSGmdiTQVRLEWPTDLAVNPmDNSLYVLD--NNIVLQI 1430
Cdd:cd14953    289 GPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
803-828 2.46e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.46e-09
                           10        20
                   ....*....|....*....|....*.
gi 2462627980  803 CGDNLDNDGDGLTDCVDPDCCQQSNC 828
Cdd:NF033662     7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1351-1558 2.54e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 54.60  E-value: 2.54e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1351 ITVDRHGFIYFVD--GTMIRKIDENaviTTVIGSNGLTSTQPLScdsgmditqvrLEWPTDLAVNPmDNSLYVLDnnivl 1428
Cdd:cd14956    112 VAVDADGNLYVADfgNQRIQKFDPD---GSFLRQWGGTGIEPGS-----------FNYPRGVAVDP-DGTLYVAD----- 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1429 qiSENRRVriiagrpihcQVPGIDHFLVSKVAIHST----LESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEIYII 1504
Cdd:cd14956    172 --TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGTFLTS 236
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462627980 1505 AGAPTdcdckidpncdcfSGDGgyakdaKMKAPSSLAVSPDGTLYVADLGNVRI 1558
Cdd:cd14956    237 WGSPG-------------TGPG------QFKNPWGVVVDADGTVYVADSNNNRV 271
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1350-1558 4.88e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 53.83  E-value: 4.88e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1350 RITVDRHGFIYFVDGT--MIRKIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLD--NN 1425
Cdd:cd14956     64 GLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSSGSGPGQ--------------FNAPRGVAVDA-DGNLYVADfgNQ 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1426 IVLQISENRR-VRIIAGRPIHcqvPGidHFLvskvaihstleSARAISVSHSGLLFIAETderKVNRIQQVTTNGEIYII 1504
Cdd:cd14956    129 RIQKFDPDGSfLRQWGGTGIE---PG--SFN-----------YPRGVAVDPDGTLYVADT---YNDRIQVFDNDGAFLRK 189
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462627980 1505 AGAPtdcdckidpncdcFSGDGgyakdaKMKAPSSLAVSPDGTLYVADLGNVRI 1558
Cdd:cd14956    190 WGGR-------------GTGPG------QFNYPYGIAIDPDGNVFVADFGNNRI 224
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
530-721 1.03e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 52.70  E-value: 1.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  530 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 578
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  579 VCRHGW-KGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 625
Cdd:pfam19232   84 SADHPCpSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  626 EDcLDPMC---------------SNHGICVK-------------GECHCSTGWGGVNCETplpvcQEQCSGHGTFLLDAG 677
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2462627980  678 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 721
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1469-1564 1.96e-06

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 51.94  E-value: 1.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1469 RAISVSHSGLLFIAETDErkvNRIQQVTTNGEIYIIAGAptdcdckidpncdcfSGDGgyakDAKMKAPSSLAVSPDGTL 1548
Cdd:cd05819     11 QGIAVDSSGNIYVADTGN---NRIQVFDPDGNFITSFGS---------------FGSG----DGQFNEPAGVAVDSDGNL 68
                           90
                   ....*....|....*.
gi 2462627980 1549 YVADLGNVRIRTISRN 1564
Cdd:cd05819     69 YVADTGNHRIQKFDPD 84
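
Each hit on this page carries a one-line summary (Pssm-ID, optional [Multi-domain] tag, Cd Length, Bit Score, E-value) directly above its alignment. The following is a minimal sketch of how such a line could be parsed from a saved text copy of the page. The function name `parse_summary` and the regular expression are illustrative assumptions, not part of any NCBI tool, and the pattern is only known to fit the line layout seen in this listing.

```python
import re

# Hypothetical pattern fitted to the summary lines as they appear on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)(?:\s*\[(.*?)\])?\s+"
    r"Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+"
    r"E-value:\s*(\S+)"
)

def parse_summary(line):
    """Return the fields of a hit summary line as a dict, or None if no match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    pssm_id, tag, cd_length, bit_score, evalue = m.groups()
    return {
        "pssm_id": int(pssm_id),
        "multi_domain": tag == "Multi-domain",  # tag is absent on some hits
        "cd_length": int(cd_length),
        "bit_score": float(bit_score),
        "evalue": float(evalue),
    }

# Two summary lines copied from this page: one with the tag, one without.
nhl = parse_summary(
    "Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 51.94  E-value: 1.96e-06"
)
dsl = parse_summary(
    "Pssm-ID: 460202  Cd Length: 46  Bit Score: 41.07  E-value: 1.88e-04"
)
```
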
NHL_TRIM71_like cd14954
NHL repeat domain of the tripartite motif-containing protein 71 (TRIM71) and related proteins; ...
1403-1558 9.87e-06

NHL repeat domain of the tripartite motif-containing protein 71 (TRIM71) and related proteins; The E3 ubiquitin-protein ligase TRIM71 (LIN-41) is a RING-finger domain containing protein that has been associated with a variety of activities. The NHL repeat domain appears responsible for targeting TRIM71 to mRNAs, and TRIM71 appears responsible for translational repression and mRNA decay. Together with BRAT, TRIM71 may be part of a family of mRNA repressors that regulate proliferation and differentiation. TRIM71 has been shown to negatively regulate stability of Lin28B, which inhibits the pre-let-7 miRNA precursor from maturing by recruiting the terminal uridylyltransferase TUT4. This family also contains the Caenorhabditis elegans NHL repeat containing 1 (NHL-1), a RING-finger-containing protein that was shown to interact with E2 ubiquitin conjugating enzymes in two-hybrid screens. Its domain architecture resembles that of the E3 ubiquitin protein ligases TRIM2, TRIM32, and TRIM71. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271324 [Multi-domain]  Cd Length: 285  Bit Score: 49.85  E-value: 9.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1403 RLEWPTDLAVNPMDNsLYVLDnnivlqiSENRRVRIIA--GRPIH---CQVPGIDHFlvskvaihstlESARAISVSHSG 1477
Cdd:cd14954    116 QFNYPWGVAVDSEGR-IYVSD-------TRNHRVQVFDsdGQFIRkfgFEGAGPGQL-----------DSPRGVAVNPDG 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1478 LLFIAETDERKV-----------------------NRIQQVTTNGEIYIIAgaptdCDCKiDPNCDCFSGDGGYAK---- 1530
Cdd:cd14954    177 NIVVSDFNNHRLqvfdpdgqflrffgsegsgngqfKRPRGVAVDDEGNIIV-----ADSG-NHRVQVFSPDGEFLCsfgt 250
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2462627980 1531 ----DAKMKAPSSLAVSPDGTLYVADLGNVRI 1558
Cdd:cd14954    251 egngEGQFDRPSGVAVTPDGRIVVVDRGNHRI 282
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2379-2457 1.70e-05

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 45.18  E-value: 1.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 2379 YTPYGDIYHDTYPDFQVIiGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkPF------NLYSFENNY 2452
Cdd:TIGR03696    1 YDPYGEVLSESGAAPNPL-RFTGQYYDAETGLYYNGARYYDPELGRFLSPD------------PIglggglNLYAYVGNN 67

                   ....*
gi 2462627980 2453 PVGKI 2457
Cdd:TIGR03696   68 PVNWV 72
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1885-1925 2.36e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 43.35  E-value: 2.36e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 2462627980 1885 YSPSG-LVTFIQRGTWNEKMEYDQSGKIISRTWADGKIWSYT 1925
Cdd:TIGR01643    1 YDAAGrLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
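
The two rows of an alignment block like the one above can be compared column by column. The sketch below counts identical residues over the aligned columns of this YD-repeat alignment. It assumes that a dash marks a gap and that lowercase letters mark residues the search did not align, which is how the display on this page appears to behave; the function name `count_identities` is hypothetical.

```python
def count_identities(query, subject):
    """Count identical residues over aligned columns of two alignment rows.

    Assumption: a column is aligned only when both rows show an uppercase
    letter; dashes (gaps) and lowercase (unaligned) residues are skipped.
    Returns (identical, aligned).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned

# Rows copied from the TIGR01643 alignment above (query residues 1885-1925).
query_row = "YSPSG-LVTFIQRGTWNEKMEYDQSGKIISRTWADGKIWSYT"
model_row = "YDAAGrLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE"

identical, aligned = count_identities(query_row, model_row)
print(f"{identical} identical residues over {aligned} aligned columns")
```

The number of aligned columns should equal the length of the query interval reported for the hit, which gives a quick consistency check on a copied alignment.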
RHS_core NF041261
RHS element core protein;
1608-1725 3.63e-05

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 49.62  E-value: 3.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1608 FTYNSEGDLGAITSSNGNSVHIRRDAGGMPLwlvvpggqvywltISSNGVLKRvsAQGYNLAlmtypgntGLLATKSNEN 1687
Cdd:NF041261   602 YEYNAAGDLTAVITPDGNRSETQYDAWGKAV-------------STTQGGLTR--SMEYDAA--------GRITTLTNEN 658
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 2462627980 1688 GWTTVYEYDPEGHLTNATFPTGEVSSFHSDLE-KLTKVE 1725
Cdd:NF041261   659 GSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTgKLTQSE 697
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1522-1564 4.70e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 48.30  E-value: 4.70e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2462627980 1522 FSGDGGyaKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISRN 1564
Cdd:cd14953     12 FSGGGG--TAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
712-755 1.88e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 41.07  E-value: 1.88e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462627980  712 CEEGWVGPTCEeRSC--------HSHCTEHGQCkdgkcECSPGWEGDHCTIA 755
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1407-1560 6.33e-04

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 44.20  E-value: 6.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1407 PTDLAVNPMDNSLYVLDNNIVLQI--SENRRVRIIAGRPihcqvPGIDHFlvskvaihstlESARAISVSHSGLLFIAET 1484
Cdd:cd14956     15 PRGIAVDADDNVYVADARNGRIQVfdKDGTFLRRFGTTG-----DGPGQF-----------GRPRGLAVDKDGWLYVADY 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1485 DErkvNRIQQVTTNGEIYIIAG----------APTDCDCKIDPN---CDC-------FSGDGGYAKD--------AKMKA 1536
Cdd:cd14956     79 WG---DRIQVFTLTGELQTIGGssgsgpgqfnAPRGVAVDADGNlyvADFgnqriqkFDPDGSFLRQwggtgiepGSFNY 155
                          170       180
                   ....*....|....*....|....
gi 2462627980 1537 PSSLAVSPDGTLYVADLGNVRIRT 1560
Cdd:cd14956    156 PRGVAVDPDGTLYVADTYNDRIQV 179
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
730-752 1.63e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.10  E-value: 1.63e-03
                           10        20
                   ....*....|....*....|....*
gi 2462627980  730 CTEHGQCKD--GKCECSPGWEGDHC 752
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
698-721 1.88e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.71  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*.
gi 2462627980  698 ECGSHGVCSR--GICQCEEGWVGPTC 721
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
652-795 2.26e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.90  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  652 GVNCETPLPVCQEQCsghgtFLLDAgvcscDPkwtgSDCSTelCTMECGSHGVCSRGICQCEEGWV--GPTC-EERSCHS 728
Cdd:NF041328    18 GAVCPEGLSVCGGAC-----VDLRS-----DP----SNCGA--CGVACGAGQTCVAGACGCGPGTVacGGACvDTASDPA 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462627980  729 HCtehGQCkdGKCeCSPGW--EGDHCtiahyldavRDGCP-GLCFGNGRCT-LDQNGWHCvcqvGWSGTGC 795
Cdd:NF041328    82 HC---GAC--GAA-CAPGQvcEGGAC---------REACSeGLTRCGGACVdLATDPLHC----GACGVAC 133
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
536-558 3.43e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.43e-03
                           10        20
                   ....*....|....*....|....*
gi 2462627980  536 CNGNGECIS--GHCHCFPGFLGPDC 558
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1530-1564 3.49e-03

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 41.92  E-value: 3.49e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2462627980 1530 KDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISRN 1564
Cdd:cd05819      3 GPGELNNPQGIAVDSSGNIYVADTGNNRIQVFDPD 37
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1351-1561 3.95e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 41.93  E-value: 3.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1351 ITVDRHGFIYFVDGT--MIRKID-ENAVITTVIGSNGLTStqplscdsgmditqvrlewPTDLAVNPmDNSLYVLDNNiv 1427
Cdd:COG4257     64 IAVDPDGNLWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFTDQG-- 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1428 lqiseNRRVRII---AGRpihcqvpgidhflVSKVAIHSTLESARAISVSHSGLLFIAEtdeRKVNRIQqvttngeiyii 1504
Cdd:COG4257    122 -----GNRIGRLdpaTGE-------------VTEFPLPTGGAGPYGIAVDPDGNLWVTD---FGANAIG----------- 169
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462627980 1505 agaptdcdcKIDPncdcfsgDGG----YAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTI 1561
Cdd:COG4257    170 ---------RIDP-------DTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRF 214
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
596-732 4.26e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 40.54  E-value: 4.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980  596 CIDPTCFGHGTCIMGVCicvpgykGEICEEEDCLDPMCSNHGIC--VKGECHCSTGWGGVNCETPL--PVC------QEQ 665
Cdd:pfam01500    9 CGFPTCSTGGTCGSGCC-------QPCCCQSSCCRPSCCQTSCCqpTTFQSSCCRPTCQPCCQTSCcqPTCcqtsscQTG 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462627980  666 CSGHGTFLL-DAGVCSCDPKWTGSDCSTE-LCTMECGSHGVCSRGICQ--------CEEGWVGPTCEERSCHSHCTE 732
Cdd:pfam01500   82 CGGIGYGQEgSSGAVSSRTRWCRPDCRVEgTCLPPCCVVSCTPPTCCQlhhaqascCRPSYCGQSCCRPACCCQCSE 158
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1192-1270 5.16e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 41.54  E-value: 5.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1192 NKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELrnrdtrhSTSPAHKYYLAMDPvSESLYLSDTNTRKVYK 1269
Cdd:COG4257    185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPKTGTVTEYPL-------PGGGARPYGVAVDG-DGRVWFAESGANRIVR 256

                   .
gi 2462627980 1270 L 1270
Cdd:COG4257    257 F 257
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1466-1615 5.26e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 41.48  E-value: 5.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1466 ESARAISVSHSGLLFIAETDErkvNRIQQVTTNGE-IYIIAGaptdcdckidpncdcfSGDGgyakDAKMKAPSSLAVSP 1544
Cdd:cd14957     65 NSPYGIAVDSNGNIYVADTDN---NRIQVFNSSGVyQYSIGT----------------GGSG----DGQFNGPYGIAVDS 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462627980 1545 DGTLYVADLGNVRIR----------TISRNQAHLNDMNI-YEIASPADQELYqftVNGTHLHTLNLITRDYVYNFTYNSE 1613
Cdd:cd14957    122 NGNIYVADTGNHRIQvftssgtfsySIGSGGTGPGQFNGpQGIAVDSDGNIY---VADTGNHRIQVFTSSGTFQYTFGSS 198

                   ..
gi 2462627980 1614 GD 1615
Cdd:cd14957    199 GS 200
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
633-655 7.91e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.17  E-value: 7.91e-03
                           10        20
                   ....*....|....*....|....*
gi 2462627980  633 CSNHGICVK--GECHCSTGWGGVNC 655
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
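
The E-value threshold of 0.01 listed above explains why every hit in the list has an E-value below that value. As a rough sketch, the "Interval E-value" rows of the hit list can be parsed and checked against the threshold as follows. The helper names `parse_row` and `passes_threshold` are hypothetical, and treating the cutoff as inclusive is an assumption, since the page does not say how a hit exactly at the threshold is handled.

```python
import re

# Matches an "Interval E-value" row such as "1469-1564 1.96e-06".
ROW = re.compile(r"^(\d+)-(\d+)\s+([0-9.]+e[+-]?\d+)$")

def parse_row(line):
    """Return (start, end, evalue) for an interval row, or None if it does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    start, end, evalue = m.groups()
    return int(start), int(end), float(evalue)

def passes_threshold(evalue, threshold=0.01):
    """Assumed rule: a hit is reported when its E-value is at or below the threshold."""
    return evalue <= threshold

# Rows copied from the hit list on this page (strongest and weakest hits shown here).
rows = ["1469-1564 1.96e-06", "2379-2457 1.70e-05", "633-655 7.91e-03"]
hits = [parse_row(r) for r in rows]
reported = [h for h in hits if h is not None and passes_threshold(h[2])]
```
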
