Conserved domains on  [gi|767987301|ref|XP_011520830|]

polycystin-1 isoform X3 [Homo sapiens]

Protein Classification

CLECT and PLAT_polycystin domain-containing protein (domain architecture ID 13202464)

protein containing domains CLECT, GPS, PLAT_polycystin, and PKD_channel

List of domain hits

Name Accession Description Interval E-value
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
115-2746 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4864.69  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   115 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 194
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   195 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 270
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   271 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 346
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   347 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 426
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   427 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 504
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   505 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 584
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   585 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 660
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   661 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSVTLHGQDVLMLPGDLVGLQ 736
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   737 HDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH------------------------LPAQLEGTWA----CPACALR 788
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppdnlagdgadplpdpeldLKALLEGTRAswleCAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   789 LLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSCSFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVD 868
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   869 SGANATATARWPGGSVSARFENVCPALVA-------TFVPGCPWETNDTLFSVVALPWLSEGEHV----VDVVVENSASR 937
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   938 ANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSDMVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSLTAS 1017
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSLTAM 960
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1018 NHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPNATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPD 1097
Cdd:TIGR00864  961 NHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPD 1040
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1098 PSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLY 1177
Cdd:TIGR00864 1041 PSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHY 1120
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1178 TWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGD 1257
Cdd:TIGR00864 1121 EWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGD 1200
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1258 NITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTG 1337
Cdd:TIGR00864 1201 NITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSG 1280
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1338 NPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAW 1417
Cdd:TIGR00864 1281 NAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQ 1360
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1418 LVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG- 1496
Cdd:TIGR00864 1361 FQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGn 1440
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1497 -LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASR 1575
Cdd:TIGR00864 1441 nLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASL 1520
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1576 TVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQV 1655
Cdd:TIGR00864 1521 TNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQI 1600
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1656 VGG-------------GRYFPTNHTVQLQAVVRDGTNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATN 1719
Cdd:TIGR00864 1601 LGEtaegggggvqeldGCYFETNHTVQFHAGFKDGTNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAAN 1680
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1720 MLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTM 1799
Cdd:TIGR00864 1681 LLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTM 1760
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1800 TAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS-FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAG 1878
Cdd:TIGR00864 1761 KAFNELGSANASEEVDVQEPISGLKIRAADAGEQnFFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAG 1840
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1879 TFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHS 1957
Cdd:TIGR00864 1841 TFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHS 1920
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1958 FPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGLQVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLV 2037
Cdd:TIGR00864 1921 FPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLV 2000
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2038 ILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFG 2115
Cdd:TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFG 2080
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2116 DGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRD 2195
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2196 CVTYQTEYRWEVYRTASCQRPGRPARVALPG-------------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQS 2262
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2263 IQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTV 2341
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2342 TIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRG 2421
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2422 RWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG- 2500
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGe 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2501 -------------AVHALTTKVHFECTGWHDAEDAGAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFE 2566
Cdd:TIGR00864 2481 tgqehgdkedevwAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2567 VGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATGLTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDV 2646
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDS 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2647 AAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQIAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVT 2726
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 767987301  2727 PTAIGDSILNITGDLIHLAS 2746
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
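
Each alignment row above carries a label, a start coordinate, the aligned residues, and an end coordinate. The following is a minimal Python sketch, assuming that layout holds for every row, that parses one row and checks that the residue count (gaps excluded) agrees with the coordinates. The names `Row`, `parse_row`, and `is_consistent` are hypothetical helpers introduced for illustration; they are not part of any NCBI tool.

```python
import re
from typing import NamedTuple

# Assumed row layout: "<label> <start> <aligned residues> <end>",
# where the label is either "gi <number>" or "Cdd:<accession>".
ROW_PATTERN = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+([A-Za-z-]+)\s+(\d+)$")


class Row(NamedTuple):
    label: str
    start: int
    seq: str
    end: int


def parse_row(line: str) -> Row:
    """Split one alignment row into label, start, residues, and end."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, seq, end = match.groups()
    return Row(label, int(start), seq, int(end))


def is_consistent(row: Row) -> bool:
    """True when the non-gap residue count equals end - start + 1."""
    residues = len(row.seq.replace("-", ""))
    return residues == row.end - row.start + 1


row = parse_row("gi 767987301  2727 PTAIGDSILNITGDLIHLAS 2746")
print(row.start, row.end, is_consistent(row))
```

A check like this is a cheap way to catch rows that were truncated or merged when the page was copied as text.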
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3136-3255 1.00e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause of autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 1.00e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3213
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 767987301 3214 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3255
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
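
The paired rows above can be compared column by column. The following is a minimal Python sketch, not the method CDD itself uses for scoring, that counts identical residues over the columns where neither row has a gap. The function name `identity` is hypothetical. It also assumes that lowercase letters are ordinary residues and compares them case-insensitively; in the block used here they only occur opposite gaps, so they do not affect the count.

```python
def identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for two aligned rows.

    Columns where either row has a gap ('-') are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Final block of the PLAT_polycystin alignment, copied from the rows above.
query = "FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA"
subject = "YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA"

same, total = identity(query, subject)
print(f"{same}/{total} identical columns ({100 * same / total:.1f}%)")
```

Identity computed this way is only a rough descriptive figure; the bit score and E-value reported for each hit come from the position-specific scoring model, not from a simple residue match count.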
Polycystin_dom super family cl48672
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3731-3909 5.37e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


The actual alignment was detected with superfamily member pfam20519:

Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.37e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3731 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3796
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3797 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3870
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 767987301  3871 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3909
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel super family cl37568
Polycystin cation channel; This family contains the cation channel region from group II of ...
3910-4131 6.03e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


The actual alignment was detected with superfamily member pfam08016:

Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.91  E-value: 6.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3910 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3982
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3983 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4062
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301  4063 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4131
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
GPS super family cl02559
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
3029-3078 3.02e-10

GPCR proteolysis site (GPS) motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


The actual alignment was detected with superfamily member smart00303:

Pssm-ID: 470616  Cd Length: 49  Bit Score: 58.17  E-value: 3.02e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 767987301   3029 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3078
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
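
The hits in the list above are not given in sequence order (GPS, at 3029-3078, appears last). The following is a minimal Python sketch that re-sorts the reported intervals from N-terminus to C-terminus and measures the unannotated stretch between neighbouring hits. The intervals are copied from the list above; the names `by_position` and `linker_lengths` are hypothetical helpers introduced for illustration.

```python
# (name, start, end) as reported in the list of domain hits.
hits = [
    ("PCC", 115, 2746),
    ("PLAT_polycystin", 3136, 3255),
    ("Polycystin_dom", 3731, 3909),
    ("PKD_channel", 3910, 4131),
    ("GPS", 3029, 3078),
]


def by_position(domain_hits):
    """Order hits by their start residue (N-terminus first)."""
    return sorted(domain_hits, key=lambda hit: hit[1])


def linker_lengths(ordered_hits):
    """Residues left uncovered between each pair of neighbouring hits."""
    return [
        nxt[1] - cur[2] - 1
        for cur, nxt in zip(ordered_hits, ordered_hits[1:])
    ]


ordered = by_position(hits)
for name, start, end in ordered:
    print(f"{name:16s} {start:5d}-{end:<5d} ({end - start + 1} residues)")
print("gaps between neighbours:", linker_lengths(ordered))
```

Sorting by start coordinate is sufficient here because none of the reported intervals overlap; overlapping hits would need an explicit tie-breaking rule.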
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
115-2746 0e+00

                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2116 DGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRD 2195
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2196 CVTYQTEYRWEVYRTASCQRPGRPARVALPG-------------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQS 2262
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2263 IQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTV 2341
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2342 TIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRG 2421
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2422 RWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG- 2500
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGe 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2501 -------------AVHALTTKVHFECTGWHDAEDAGAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFE 2566
Cdd:TIGR00864 2481 tgqehgdkedevwAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2567 VGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATGLTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDV 2646
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDS 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2647 AAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQIAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVT 2726
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 767987301  2727 PTAIGDSILNITGDLIHLAS 2746
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
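
The paired query/model rows in the alignment above can be compared column by column. A minimal sketch, assuming two rows of equal length copied from one alignment block and `-` as the gap character; the helper name `alignment_identity` is hypothetical and not part of any NCBI tool. It compares residues case-insensitively and skips only gap columns.

```python
def alignment_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for two aligned rows.

    Assumption: both rows come from the same alignment block, so they
    have equal length, and '-' marks a gap.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Final block of the TIGR00864 alignment (query 2727-2746, model 2721-2740).
identical, aligned = alignment_identity("PTAIGDSILNITGDLIHLAS",
                                        "PTAIADNILNIMGDLIHLAS")
print(identical, aligned)
```

For this final block the sketch counts 17 identical residues over 20 aligned columns.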
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2189-2632 5.11e-133

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1 and in the sperm receptor for egg jelly (Swiss:Q26627). The function of this domain is unknown. The domain is 600 amino acids long, so it is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 426.15  E-value: 5.11e-133
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2189 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2263
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2264 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2332
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2333 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2412
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2413 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2488
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2489 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2566
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301  2567 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2632
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
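
Each hit record carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) ahead of its alignment. A minimal sketch of pulling those fields out with a regular expression, assuming the line layout seen on this page; the names `SUMMARY_RE` and `parse_summary` are hypothetical and do not correspond to any NCBI API.

```python
import re

# Pattern for the per-hit summary line as it appears in this page dump.
# The bracketed "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Parse one summary line into a dict; raise ValueError if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a Pssm-ID summary line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


rej = parse_summary(
    "Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 426.15  E-value: 5.11e-133"
)
print(rej["pssm_id"], rej["cd_length"], rej["bit_score"])
```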
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3136-3255 1.00e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of multi-domain membrane proteins found in fish, invertebrates, and mammals, including humans; they are widely expressed in various cell types, and their biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 1.00e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3213
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 767987301 3214 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3255
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3731-3909 5.37e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.37e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3731 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3796
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3797 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3870
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 767987301  3871 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3909
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3910-4131 6.03e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.91  E-value: 6.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3910 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3982
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3983 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4062
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301  4063 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4131
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
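
The "Interval E-value" line of each record gives the query residues covered by the hit, which can be compared with the model's Cd Length. A minimal sketch, assuming the line holds a `start-end` interval followed by an E-value separated by whitespace; the function names are hypothetical.

```python
def parse_interval_line(line):
    """Split a line such as '3910-4131 6.03e-44' into (start, end, e_value)."""
    interval, e_value = line.split()
    start, end = interval.split("-")
    return int(start), int(end), float(e_value)


def span_length(start, end):
    """Number of query residues in an inclusive interval."""
    return end - start + 1


start, end, e_value = parse_interval_line("3910-4131 6.03e-44")
print(span_length(start, end))
```

For the PKD_channel hit this gives a span of 222 query residues, against a Cd Length of 225 for the model.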
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
195-289 2.09e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 100.24  E-value: 2.09e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301    195 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 269
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 767987301    270 ppPPAPTCRGPTLLQHVFPA 289
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3138-3241 2.85e-23

PLAT/LH2 domain; This domain is found in a variety of membrane- or lipid-associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase (pfam01114), which mediates membrane association. It therefore appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.85e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3138 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3216
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 767987301  3217 HVIV-RDLQTARSAFFLVNDWLSVET 3241
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3136-3238 1.96e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.96e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3213
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 767987301   3214 FLQHVIVRDLQTARSAFFLVNDWLS 3238
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
436-549 1.01e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is composed chiefly of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate-binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls, enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent, including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar-bound form of the domain in the acid environment of the endosome. Lineage-specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  436 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 513
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 767987301  514 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 549
Cdd:cd00037    78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1487-1793 1.05e-12

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 1.05e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1487 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1565
Cdd:COG3291     2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1566 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1645
Cdd:COG3291    78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1646 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1725
Cdd:COG3291   157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301 1726 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1793
Cdd:COG3291   237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3029-3078 3.02e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.02e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 767987301   3029 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3078
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
514-809 3.68e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  514 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 582
Cdd:PHA03247 2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  583 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 644
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  645 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYALWREFLFSVPAGPP----------- 713
Cdd:PHA03247 2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagpprrltrpa 2787
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  714 -AQYSVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPHLPAQLEGTWACPACALRLLAA 792
Cdd:PHA03247 2788 vASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         330
                  ....*....|....*..
gi 767987301  793 TEQLTVLLGLRPNPGLR 809
Cdd:PHA03247 2868 SRSPAAKPAAPARPPVR 2884
 
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1800 TAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS-FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAG 1878
Cdd:TIGR00864 1761 KAFNELGSANASEEVDVQEPISGLKIRAADAGEQnFFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAG 1840
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1879 TFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHS 1957
Cdd:TIGR00864 1841 TFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHS 1920
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1958 FPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGLQVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLV 2037
Cdd:TIGR00864 1921 FPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLV 2000
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2038 ILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFG 2115
Cdd:TIGR00864 2001 IHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFG 2080
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2116 DGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRD 2195
Cdd:TIGR00864 2081 DGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKD 2160
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2196 CVTYQTEYRWEVYRTASCQRPGRPARVALPG-------------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQS 2262
Cdd:TIGR00864 2161 CLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKA 2240
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2263 IQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTV 2341
Cdd:TIGR00864 2241 ACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTL 2320
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2342 TIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRG 2421
Cdd:TIGR00864 2321 GIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRG 2400
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2422 RWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG- 2500
Cdd:TIGR00864 2401 RWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGe 2480
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2501 -------------AVHALTTKVHFECTGWHDAEDAGAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFE 2566
Cdd:TIGR00864 2481 tgqehgdkedevwAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFE 2560
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2567 VGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATGLTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDV 2646
Cdd:TIGR00864 2561 VGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDS 2640
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2647 AAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQIAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVT 2726
Cdd:TIGR00864 2641 AAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVT 2720
                         2730      2740
                   ....*....|....*....|
gi 767987301  2727 PTAIGDSILNITGDLIHLAS 2746
Cdd:TIGR00864 2721 PTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2189-2632 5.11e-133

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 426.15  E-value: 5.11e-133
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2189 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2263
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2264 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2332
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2333 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2412
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2413 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2488
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  2489 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2566
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301  2567 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2632
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3136-3255 1.00e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 1.00e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3213
Cdd:cd01752     1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 767987301 3214 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3255
Cdd:cd01752    81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3731-3909 5.37e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.37e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3731 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3796
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3797 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3870
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 767987301  3871 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3909
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3910-4131 6.03e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.91  E-value: 6.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3910 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3982
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3983 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4062
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301  4063 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4131
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
3136-3255 6.28e-28

PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 consists of an eight stranded beta-barrel and its proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 111.11  E-value: 6.28e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD---GDRAFHRNSLDIFRIATPhSLGSVWKIRVWHDNKGLSPA 3212
Cdd:cd01756     1 VTYEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKksnNKNKFERGQTDKFTVEAV-DLGKLKKIRIGHDNSGLGAG 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 767987301 3213 WFLQHVIVRDLQTARSAFFLVNDWLSveTEANGGLVEKEVLAA 3255
Cdd:cd01756    80 WFLDKVEIREPGTGDEYTFPCNRWLD--KDEDDGQIVRELYPS 120
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
195-289 2.09e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 100.24  E-value: 2.09e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301    195 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 269
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 767987301    270 ppPPAPTCRGPTLLQHVFPA 289
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3138-3241 2.85e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.85e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  3138 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3216
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 767987301  3217 HVIV-RDLQTARSAFFLVNDWLSVET 3241
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3136-3238 1.96e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.96e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   3136 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3213
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 767987301   3214 FLQHVIVRDLQTARSAFFLVNDWLS 3238
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1741-1810 1.31e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 79.74  E-value: 1.31e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1741 VAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANA 1810
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
424-549 1.51e-17

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 81.49  E-value: 1.51e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301    424 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFL---VSRVTRSLDVWIGFSTVQGVEVGPAP 500
Cdd:smart00034    1 CPSGWISY--GGKCYKFSTEKKTWEDAQAFCQS-LGGHLASIHSEAENDFVaslLKNSGSSDYYWIGLSDPDSNGSWQWS 77
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 767987301    501 QGeaFSLESCQNWLPGEPhPATAEHCVRLGPTGWC-NTDLCSAPHSYVCE 549
Cdd:smart00034   78 DG--SGPVSYSNWAPGEP-NNSSGDCVVLSTSGGKwNDVSCTSKLPFVCE 124
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1146-1227 6.40e-17

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 78.26  E-value: 6.40e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1146 SVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVR 1225
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    ..
gi 767987301   1226 VF 1227
Cdd:smart00089   78 VQ 79
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
436-549 1.01e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  436 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 513
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 767987301  514 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 549
Cdd:cd00037    78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1910-1979 1.44e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.66  E-value: 1.44e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1910 LWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQA 1979
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1655-1726 1.93e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.27  E-value: 1.93e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767987301  1655 VVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRdrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWA 1726
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
3137-3238 2.23e-16

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight stranded beta-barrel. The domain can be found in various domain architectures, in case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats. The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 77.76  E-value: 2.23e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3137 KYEILVKTGWGRGSGTTAHVGIMLYGVD-SRSGHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFL 3215
Cdd:cd00113     2 RYTVTIKTGDKKGAGTDSNISLALYGENgNSSDIPILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWYC 81
                          90       100
                  ....*....|....*....|...
gi 767987301 3216 QHVIVRDLQTARSAFFLVNDWLS 3238
Cdd:cd00113    82 ESITVQALGTKKVYTFPVNRWVL 104
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1571-1640 2.87e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.89  E-value: 2.87e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1571 VNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD 1640
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
2086-2153 3.85e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 3.85e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301  2086 SGPCFTNRSAQFEAaTSPSPRRVAYHWDFGDgSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVA 2153
Cdd:pfam00801    5 GTVVAAGQPVTFTA-TLADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1149-1220 4.68e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 4.68e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767987301  1149 VGVSDGVLVAGRPVTFYPHpLPSPGGVLYTWDFGDgSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAA 1220
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTAT-LADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1403-1474 9.53e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 74.35  E-value: 9.53e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767987301  1403 LQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGteEAAPTRARGPEVTFIYRDPGSYLVTVTASNNISAAND 1474
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFG--DSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
2078-2160 2.95e-15

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 73.25  E-value: 2.95e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   2078 AVQYVALQSGPcfTNRSAQFEAATSPSPRRVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATV 2157
Cdd:smart00089    2 ADVSASPTVGV--AGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 767987301   2158 TVQ 2160
Cdd:smart00089   77 VVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1238-1303 7.46e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 72.03  E-value: 7.46e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301  1238 SLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLAR 1303
Cdd:pfam00801    5 GTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1830-1895 9.43e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 71.65  E-value: 9.43e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301  1830 PGGSFVAAGSSVPFWGQLATGTNVSWCWAVPG--GSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSA 1895
Cdd:pfam00801    3 ASGTVVAAGQPVTFTATLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1147-1221 3.39e-14

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 70.60  E-value: 3.39e-14
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301 1147 VAVGVSDGVLV-AGRPVTFYPHPLPSPGGVLYTWDFGDGSpVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQ 1221
Cdd:cd00146     1 PTASVSAPPVAeLGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1568-1646 1.08e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 69.02  E-value: 1.08e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1568 GLVVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRctPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYV 1646
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTaTSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1486-1563 2.64e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.64e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1486 VTSIKVNGSLGLeLQQPYLFSAV--GRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVK 1563
Cdd:smart00089    1 VADVSASPTVGV-AGESVTFTATssDDGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1487-1793 1.05e-12

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 1.05e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1487 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1565
Cdd:COG3291     2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1566 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1645
Cdd:COG3291    78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1646 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1725
Cdd:COG3291   157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301 1726 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1793
Cdd:COG3291   237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1491-1556 2.25e-12

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 64.72  E-value: 2.25e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767987301  1491 VNGSLGLELQQPYLFSA-VGRGRPASYLWDLGD--GGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEA 1556
Cdd:pfam00801    2 SASGTVVAAGQPVTFTAtLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1743-1817 2.32e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 65.17  E-value: 2.32e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   1743 ASPNPAAVNTSVTLSAELAG-GSGVVYTWSLEEGLSWetSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1817
Cdd:smart00089    6 ASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSS--TGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1837-1896 3.64e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 64.78  E-value: 3.64e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767987301   1837 AGSSVPFWGQLAT-GTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1896
Cdd:smart00089   13 AGESVTFTATSSDdGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASAT 73
WSC pfam01822
WSC domain; This domain is involved in carbohydrate binding.
198-268 6.05e-12

WSC domain; This domain is involved in carbohydrate binding.


Pssm-ID: 460348  Cd Length: 82  Bit Score: 64.02  E-value: 6.05e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767987301   198 YVACLPDNSsGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFA----CLSLCSG 268
Cdd:pfam01822    1 YLGCYSDGT-GGRRLLLGSSGDYDDMTPEKCIAFCSAAGYTYAGLEYGGECYCGNSLPSGSALAdssdCNTPCPG 74
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1998-2069 1.80e-11

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 62.40  E-value: 1.80e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767987301  1998 CEPGIATGTERNFTARVQRGSRVAYAWYFslqkvqGDSLV-ILSGRDVTYTPVAAGLLEIQVRAFNALGSENR 2069
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF------GDSPGtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
2091-2161 1.87e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 62.51  E-value: 1.87e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767987301 2091 TNRSAQFEAATSPSPRRVAYHWDFGDGSpGQDTDEPRAEHSYLRPGDYRVQVNASNLVSfFVAQATVTVQV 2161
Cdd:cd00146    13 LGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRRCT smart00082
Leucine rich repeat C-terminal domain;
143-195 2.14e-11

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 61.68  E-value: 2.14e-11
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 767987301    143 NPFECDCGLAWLPRWAeEQQVRVVQPEAATCAGPGSLAGqPLLGIPLLDSGCG 195
Cdd:smart00082    1 NPFICDCELRWLLRWL-QANEHLQDPVDLRCASPSSLRG-PLLELLHSEFKCP 51
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1570-1646 5.00e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 61.36  E-value: 5.00e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767987301 1570 VVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQ-DSIFVYV 1646
Cdd:cd00146     3 ASVSAPPVAELGASVTFSaSDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
CLECT_DC-SIGN_like cd03590
C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific ...
424-549 5.40e-11

C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR); This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and human langerin. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues as well as on pathogens, including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV, enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor that binds and internalizes desialylated glycoproteins having terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit-mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2,6GalNAc and Sia alpha2,6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC).
Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glycoconjugates containing mannose and related sugars, mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled coil of alpha helices.


Pssm-ID: 153060 [Multi-domain]  Cd Length: 126  Bit Score: 62.71  E-value: 5.40e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  424 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLVSRVTRSLDVWIGFStVQGVE-----V-G 497
Cdd:cd03590     1 CPTNWKSF--QSSCYFFSTEKKSWEESRQFCED-MGAHLVIINSQEEQEFISKILSGNRSYWIGLS-DEETEgewkwVdG 76
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301  498 PAPqgeafsLESCQNWLPGEP--HPATAEHCVRLGPT--GWcNTDLCSAPHSYVCE 549
Cdd:cd03590    77 TPL------NSSKTFWHPGEPnnWGGGGEDCAELVYDsgGW-NDVPCNLEYRWICE 125
LRR_8 pfam13855
Leucine rich repeat;
91-145 8.95e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 59.85  E-value: 8.95e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 767987301    91 DVSHNLLRALDVGLLANLSALAELDISNNKISTLEEGIFANLFNLSEINLSGNPF 145
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
295-362 1.01e-10

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 60.09  E-value: 1.01e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767987301   295 LVGPHGPLASGQLAAFHIA-APLPVTATRWDFGDgSAEVDAAGPAASHRYVLPGRYHVTAVLALGAGSA 362
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATlADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSA 68
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1314-1395 1.86e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 59.77  E-value: 1.86e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1314 VLRVEPAACIPTQP-DARLTAYVTGNPAHYLFDWTFGDGssnTTVRGcPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSI 1392
Cdd:smart00089    1 VADVSASPTVGVAGeSVTFTATSSDDGSIVSYTWDFGDG---TSSTG-PTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 767987301   1393 CVE 1395
Cdd:smart00089   77 VVQ 79
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3029-3078 3.02e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.02e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 767987301   3029 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3078
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1425-1481 4.48e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 58.62  E-value: 4.48e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301   1425 PFPYRYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEVQ 1481
Cdd:smart00089   27 GSIVSYTWDFGDG----TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1663-1730 3.52e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 56.31  E-value: 3.52e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767987301   1663 PTNHTVQLQAVVR-DGTNVSYSWtawrDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTM 1730
Cdd:smart00089   12 VAGESVTFTATSSdDGSIVSYTW----DFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1240-1299 4.97e-09

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 55.97  E-value: 4.97e-09
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767987301 1240 AVEQGAPVVVSAAVQ-TGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAG 1299
Cdd:cd00146    10 VAELGASVTFSASDSsGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVG 70
CLECT_REG-1_like cd03594
C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and ...
424-549 7.39e-09

C-type lectin-like domain (CTLD) of the type found in human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1 (SCA-1), and -2 (SCA-2); CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration, including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps tissue regeneration of damaged heart. REG-1 may play a role in the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora, and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is upregulated in pancreatic, gastric, hepatocellular, and prostate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin and SCA-1 and -2 are found at high concentration in the calcified eggshell layer of goose and ostrich, respectively, and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.


Pssm-ID: 153064 [Multi-domain]  Cd Length: 129  Bit Score: 57.00  E-value: 7.39e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  424 CPSDTeiFPGNGHCYRLVVEKAAWLQAQEQCQAW-AGAALAMVDSPAVQRFLVSRV----TRSLDVWIGFSTVQGVEVGP 498
Cdd:cd03594     1 CPKGW--LPYKGNCYGYFRQPLSWSDAELFCQKYgPGAHLASIHSPAEAAAIASLIssyqKAYQPVWIGLHDPQQSRGWE 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767987301  499 APQGEAFSLEScqnWLPGEPHPaTAEHCVRL-GPTG---WcNTDLCSAPHSYVCE 549
Cdd:cd03594    79 WSDGSKLDYRS---WDRNPPYA-RGGYCAELsRSTGflkW-NDANCEERNPFICK 128
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1244-1310 8.31e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 55.15  E-value: 8.31e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301   1244 GAPVVVSAAVQT-GDNITWTFDMGDGTVLSGPeaTVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVF 1310
Cdd:smart00089   14 GESVTFTATSSDdGSIVSYTWDFGDGTSSTGP--TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1158-1368 2.16e-08

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 59.30  E-value: 2.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1158 AGRPVTFYPHplpSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTV-SGAAAQADVRVFEELRGLSV 1235
Cdd:COG3291    10 APLTVQFTDT---SSGNATsYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTT 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1236 DMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVL 1315
Cdd:COG3291    84 VTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTD 163
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 767987301 1316 RVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFT 1368
Cdd:COG3291   164 VTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATS 216
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1498-1562 2.83e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.65  E-value: 2.83e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1498 ELQQPYLFSAV--GRGRPASYLWDLGDGGWL--EGPEVTHAYNSTGDFTVRVAGWNEVSRSEA-WLNVTV 1562
Cdd:cd00146    12 ELGASVTFSASdsSGGSIVSYKWDFGDGEVSssGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTkTTTVVV 81
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1825-1896 3.38e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.27  E-value: 3.38e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767987301 1825 IRASEPGGSFVAAGSSVPFWGQLATG---TNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1896
Cdd:cd00146     1 PTASVSAPPVAELGASVTFSASDSSGgsiVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1042-1134 4.19e-08

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 52.77  E-value: 4.19e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1042 VSTVPAVLSPNATLALTAGVLvdSAVEVAFLWTFGDgeqalhqfqppynesfpvpdpSVAQVLVEHNVMHTYAAPGEYLL 1121
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLA--DGSNVTYTWDFGD---------------------SPGTSGSGPTVTHTYLSPGTYTV 57
                           90
                   ....*....|...
gi 767987301  1122 TVLASNAFENLTQ 1134
Cdd:pfam00801   58 TLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1741-1814 7.52e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 52.50  E-value: 7.52e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767987301 1741 VAASPNPAA-VNTSVTLSAE-LAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSAN---ATVEV 1814
Cdd:cd00146     3 ASVSAPPVAeLGASVTFSASdSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStktTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1039-1140 8.60e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 52.07  E-value: 8.60e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1039 GLQVSTVPAVLSPNATLALTAGVLVDSAVeVAFLWTFGDGeqalhqfqppynesfpvpdpsvaQVLVEHNVMHTYAAPGE 1118
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSI-VSYTWDFGDG-----------------------TSSTGPTVTHTYTKPGT 56
                            90       100
                    ....*....|....*....|..
gi 767987301   1119 YLLTVLASNAFENLTQQVPVSV 1140
Cdd:smart00089   57 YTVTLTVTNAVGSASATVTVVV 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 1.31e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 57.25  E-value: 1.31e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 146
Cdd:COG4886   211 DLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
3138-3245 1.54e-07

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight-stranded beta-barrel, and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 52.93  E-value: 1.54e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3138 YEILVKTGWGRGSGTTAHVGIMLYGVDSRS---------GHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKG 3208
Cdd:cd01754     3 YTIYVQTGSIWKAGTDSRISLQIYDADGPGlrianleawGGLMGAGHDYFERGNLDRFSGRGPCLPSPPCWMNLTSDGTG 82
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 767987301 3209 LSPAWFLQHVIVRDL-QTARSA--FFLVNDWLSVETEANG 3245
Cdd:cd01754    83 NHPGWYVNYVEVTQAgQHAPCMqhLFAVEQWLATDESPYM 122
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1170-1226 1.82e-07

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.82e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301  1170 PSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVRV 1226
Cdd:pfam18911   29 DPDGDILsYRWDFGDGT---TATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1325-1398 2.27e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 50.96  E-value: 2.27e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767987301 1325 TQPDARLTAYVTGNPAHYLFDWTFGDGssNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAhyfTSICVEPEV 1398
Cdd:cd00146    13 LGASVTFSASDSSGGSIVSYKWDFGDG--EVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSS---STKTTTVVV 81
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1322-1387 2.64e-07

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 50.46  E-value: 2.64e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301  1322 CIPTQPDARLTAYV-TGNPAHYLfdWTFGDgsSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAH 1387
Cdd:pfam00801    7 VVAAGQPVTFTATLaDGSNVTYT--WDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSAN 69
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1906-1985 3.35e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 50.53  E-value: 3.35e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1906 VGLVLWASSKVVApGQLVHFQI-LLAAGSAVTFRLQVGgaNPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIV 1984
Cdd:smart00089    1 VADVSASPTVGVA-GESVTFTAtSSDDGSIVSYTWDFG--DGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    .
gi 767987301   1985 V 1985
Cdd:smart00089   78 V 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 7.40e-07


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 7.40e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVglLANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 146
Cdd:COG4886   234 DLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 8.35e-07


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 8.35e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 146
Cdd:COG4886   188 DLSNNQITDLPEPL-GNLTNLEELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLT 241
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1427-1480 8.84e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 49.42  E-value: 8.84e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767987301 1427 PYRYTWDFGTEEAAPTRarGPEVTFIYRDPGSYLVTVTASNNISAAN-DSALVEV 1480
Cdd:cd00146    29 IVSYKWDFGDGEVSSSG--EPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
91-145 1.22e-06

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins, which promotes centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 52.48  E-value: 1.22e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767987301   91 DVSHNLLRALDVglLANLSALAELDISNNKISTLEE--GIFANLFNLSEINLSGNPF 145
Cdd:cd21340   126 NISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1743-1920 1.83e-06


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 53.52  E-value: 1.83e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1743 ASPNPAAVNTSVTLSAeLAGGSGVVYTWSLEEGLSweTSEPFTTHSFPTPGLHLVTMTAGNPLGSANA-----TVEVDVQ 1817
Cdd:COG3291     3 ATPTSGCAPLTVQFTD-TSSGNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTttktiTVGAPNP 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1818 VPVSGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATY 1897
Cdd:COG3291    80 GVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSV 159
                         170       180
                  ....*....|....*....|...
gi 767987301 1898 NLTAEEPIVGLVLWASSKVVAPG 1920
Cdd:COG3291   160 STTDVTSDGTTSASTNPSVTTDT 182
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1505-1545 2.35e-06


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 48.42  E-value: 2.35e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 767987301  1505 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRV 1545
Cdd:pfam18911   26 ASDDPDGDILSYRWDFGDGTTATGANVSHTYAAPGTYTVTL 66
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 2.64e-06


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.40  E-value: 2.64e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 146
Cdd:COG4886   165 DLSNNQLTDLPEEL-GNLTNLKELDLSNNQITDLPEPL-GNLTNLEELDLSGNQLT 218
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 2.76e-06


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.01  E-value: 2.76e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 146
Cdd:COG4886   142 DLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1426-1727 4.04e-06


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 52.36  E-value: 4.04e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1426 FPYRYTWDFGTeeaaPTRARGPEVTFIYRDPGSYLVTVTASNNI-SAANDSALVEVQEPVLVTSIKVNGSlglelqqpYL 1504
Cdd:COG3291    23 NATSYEWDFGD----GTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTST--------TV 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1505 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSV 1584
Cdd:COG3291    91 TTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1585 SFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGGGRYFPT 1664
Cdd:COG3291   171 SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNT 250
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767987301 1665 NHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWAD 1727
Cdd:COG3291   251 VTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGTATLA 313
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
294-366 5.35e-06

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 47.11  E-value: 5.35e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301  294 TLVGPHGPLAS-GQLAAFHIAAPLP--VTATRWDFGDGSAEVdAAGPAASHRYVLPGRYHVTAVLALGAGSALLGT 366
Cdd:cd00146     2 TASVSAPPVAElGASVTFSASDSSGgsIVSYKWDFGDGEVSS-SGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKT 76
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
91-146 1.12e-05


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 51.09  E-value: 1.12e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767987301   91 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 146
Cdd:COG4886   119 DLSGNQLTDLPEEL-ANLTNLKELDLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLT 172
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1043-1402 1.65e-05


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 50.44  E-value: 1.65e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1043 STVPAVLSPNATLALTAgvlVDSAVEVAFLWTFGDGEQAlhqfqppynesfpvpdpsvaqvlVEHNVMHTYAAPGEYLLT 1122
Cdd:COG3291     2 TATPTSGCAPLTVQFTD---TSSGNATSYEWDFGDGTTS-----------------------TEANPSHTYTTPGTYTVT 55
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1123 VLASNAF-ENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYAS 1201
Cdd:COG3291    56 LTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTT 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1202 RGTYHVRLEVNNTVSGAAAQADVRVFEELRGLSVDmslaveqGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHV 1281
Cdd:COG3291   136 TGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT-------SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTA 208
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1282 YLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCP 1361
Cdd:COG3291   209 GVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGL 288
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 767987301 1362 TVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGNVT 1402
Cdd:COG3291   289 GTTTAITPGNVSTTADVTGGTATLAVSSTLTTNDTTGSSST 329
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
318-362 2.18e-05


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.72  E-value: 2.18e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 767987301   318 VTATRWDFGDGSAevdAAGPAASHRYVLPGRYHVTAVLALGAGSA 362
Cdd:pfam18911   34 ILSYRWDFGDGTT---ATGANVSHTYAAPGTYTVTLTVTDDSGAS 75
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1429-1480 2.34e-05


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.34  E-value: 2.34e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 767987301  1429 RYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEV 1480
Cdd:pfam18911   36 SYRWDFGDG----TTATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1990-2076 2.58e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.58e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301   1990 SGLQVPNCCEPgiATGTERNFTARVQR-GSRVAYAWYFslqkvqGDSLViLSGRDVTYTPVAAGLLEIQVRAFNALGSEN 2068
Cdd:smart00089    1 VADVSASPTVG--VAGESVTFTATSSDdGSIVSYTWDF------GDGTS-STGPTVTHTYTKPGTYTVTLTVTNAVGSAS 71

                    ....*...
gi 767987301   2069 RTLVLEVQ 2076
Cdd:smart00089   72 ATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
319-371 2.63e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.63e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 767987301    319 TATRWDFGDGSaevDAAGPAASHRYVLPGRYHVTAVLALGAGSALLGTDVQVE 371
Cdd:smart00089   30 VSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1909-1985 3.71e-05

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 44.79  E-value: 3.71e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301 1909 VLWASSKVVAPGQLVHFQI-LLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVV 1985
Cdd:cd00146     3 ASVSAPPVAELGASVTFSAsDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVV 80
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
2094-2160 4.04e-05


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 49.28  E-value: 4.04e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767987301 2094 SAQFEAATSPSPrrVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLV-SFFVAQATVTVQ 2160
Cdd:COG3291    13 TVQFTDTSSGNA--TSYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVG 75
CLECT_NK_receptors_like cd03593
C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); ...
424-549 5.09e-05

C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL, in addition to inhibiting NK cell function, inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cell's secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress-induced MHC-I homologs MICA and MICB and the ULBP family of glycoproteins. Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and is involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and is thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g. Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.


Pssm-ID: 153063  Cd Length: 116  Bit Score: 45.40  E-value: 5.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  424 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLvSRVTRSLDVWIGFSTVQGVEVGPAPQGE 503
Cdd:cd03593     1 CPKDWICY--GNKCYYFSMEKKTWNESKEACSS-KNSSLLKIDDEEELEFL-QSQIGSSSYWIGLSREKSEKPWKWIDGS 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 767987301  504 AFSlescqNWLpgEPHPATAE-HCVRLGPTGwCNTDLCSAPHSYVCE 549
Cdd:cd03593    77 PLN-----NLF--NIRGSTKSgNCAYLSSTG-IYSEDCSTKKRWICE 115
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
97-146 5.34e-05


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 49.16  E-value: 5.34e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 767987301   97 LRALDVGLLANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 146
Cdd:COG4886   101 LDLSGNEELSNLTNLESLDLSGNQLTDLPEEL-ANLTNLKELDLSNNQLT 149
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
2094-2146 8.17e-05


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 43.80  E-value: 8.17e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 767987301  2094 SAQFEAATS--PSPRRVAYHWDFGDGSPGqdtDEPRAEHSYLRPGDYRVQVNASN 2146
Cdd:pfam18911   19 TVTFDASASddPDGDILSYRWDFGDGTTA---TGANVSHTYAAPGTYTVTLTVTD 70
CLECT_CEL-1_like cd03589
C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and ...
424-550 1.43e-04

C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina; CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acetylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity, which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.


Pssm-ID: 153059  Cd Length: 137  Bit Score: 44.66  E-value: 1.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  424 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAWAG----AALAMVDSPAVQRFL------VSRVTRSLDVWIGFStvQG 493
Cdd:cd03589     1 CPTFWTAF--GGYCYRFFGDRLTWEEAELRCRSFSIpgliAHLVSIHSQEENDFVydlfesSRGPDTPYGLWIGLH--DR 76
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767987301  494 VEVGPA--PQGEAFSLEscqNWLPGEPHPA-TAEHCVRLGPTG-----WcNTDLCSAPHSYVCEL 550
Cdd:cd03589    77 TSEGPFewTDGSPVDFT---KWAGGQPDNYgGNEDCVQMWRRGdagqsW-NDMPCDAVFPYICKM 137
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
3774-4299 1.46e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 48.33  E-value: 1.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3774 EALYPDPPGPRV--------HTCSAAGGFSTSDYDVGWESPHNGSGTWAYSAPDLLGAWSWGSCAVYDSGGYVQELGLSL 3845
Cdd:COG3321   850 SALYPGRGRRRVplptypfqREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAAL 929
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3846 EESRDRLRFLQLHNWLDNRSRAVFLELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALRRLSAGLSLPLLTSVCL 3925
Cdd:COG3321   930 LALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAA 1009
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 3926 LLFAVHFAVAEARTWHREGRWRVLRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVAQLSSAA 4005
Cdd:COG3321  1010 LLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAA 1089
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 4006 RGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLAILLVSSCVDSLWSVAQALLVLCP 4085
Cdd:COG3321  1090 LAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLA 1169
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 4086 GTGLSTLCPAESWHLSPLLCVGLWALRLWGALRLGAVILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVK 4165
Cdd:COG3321  1170 AAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALA 1249
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 4166 EFRHKVRFEGMEPLPSRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFEALLTQFD 4245
Cdd:COG3321  1250 AAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAA 1329
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767987301 4246 RLNQA-TEDVYQLEQQLHSLQGRRSSRAPAGSSRGPSPGLRPALPSRLARASRGV 4299
Cdd:COG3321  1330 LAALAaAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAA 1384
PHA03247 PHA03247
large tegument protein UL36; Provisional
514-809 3.68e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  514 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 582
Cdd:PHA03247 2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  583 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 644
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  645 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYALWREFLFSVPAGPP----------- 713
Cdd:PHA03247 2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagpprrltrpa 2787
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  714 -AQYSVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPHLPAQLEGTWACPACALRLLAA 792
Cdd:PHA03247 2788 vASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         330
                  ....*....|....*..
gi 767987301  793 TEQLTVLLGLRPNPGLR 809
Cdd:PHA03247 2868 SRSPAAKPAAPARPPVR 2884
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
952-1032 5.59e-04

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 41.28  E-value: 5.59e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301    952 GLRATPSPEARVLQGVLVRYSPVVEAGSDMVFRWTINDKQSLTFQNVVFnvIYQSAAVFKLSLTASNHVSNVTVNYNVTV 1031
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGTSSTGPTVTH--TYTKPGTYTVTLTVTNAVGSASATVTVVV 78

                    .
gi 767987301   1032 E 1032
Cdd:smart00089   79 Q 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1661-1733 6.41e-04

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 41.33  E-value: 6.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301 1661 YFPTNHTVQLQAVVR-------DGTNVSYSWTaWRDrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFV 1733
Cdd:cd00146     4 SVSAPPVAELGASVTfsasdssGGSIVSYKWD-FGD-GEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVVV 81
LRR_8 pfam13855
Leucine rich repeat;
111-143 1.24e-03


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 39.82  E-value: 1.24e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767987301   111 LAELDISNNKISTLEEGIFANLFNLSEINLSGN 143
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNN 35
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1741-1818 2.21e-03


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.95  E-value: 2.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767987301  1741 VAASPNPAAVNTSVTLSAE---LAGGSGVVYTWSLEEGLSWETSEPftTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1817
Cdd:pfam18911    7 DAGGDRIVAEGETVTFDASasdDPDGDILSYRWDFGDGTTATGANV--SHTYAAPGTYTVTLTVTDDSGASNSTATDTVT 84

                   .
gi 767987301  1818 V 1818
Cdd:pfam18911   85 V 85
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
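
Each domain hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters state an E-value threshold of 0.01. The following is a minimal Python sketch of how such a summary line could be parsed and compared against that threshold. The names `HIT_LINE`, `parse_hit_line` and `passes_threshold` are hypothetical and not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears in this report, e.g.
# "Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.82e-07"
# The bracketed flag is optional (single-domain models omit it).
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return a dict of the numeric fields, or None if the line is not a summary line."""
    match = HIT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.82e-07"
    hit = parse_hit_line(line)
    print(hit, passes_threshold(hit))
```

Applied to every line of the report, the parser returns None for alignment rows and descriptions, so only the summary lines are collected.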
