NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767929956|ref|XP_011512211|]
View 

slit homolog 2 protein isoform X3 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1153-1286 7.82e-37

Laminin G domain;


:

Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.82e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   1153 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1229
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956   1230 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1286
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
485-831 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  485 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 564
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  565 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 642
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  643 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 722
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  723 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 800
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  801 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 831
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.31e-21

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.31e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886   119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886   196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
289-637 3.90e-17

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.90e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  289 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 368
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  369 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 448
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  449 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 526
Cdd:COG4886   197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  527 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 606
Cdd:COG4886   249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  607 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 637
Cdd:COG4886   303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
802-983 2.33e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.33e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   802 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 878
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   879 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 958
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 767929956   959 GFEGENCEVNVDDCEDNDCENNSTC 983
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1046-1082 2.38e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.38e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956 1046 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1082
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
968-1004 4.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.11e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956  968 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1004
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-226 9.54e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 221
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 767929956   222 VCSDE 226
Cdd:TIGR00864   79 GCDEE 83
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1008-1043 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 767929956 1008 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1043
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
242-274 2.71e-05

Leucine rich repeat N-terminal domain;


:

Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 2.71e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767929956    242 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 274
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1100-1127 5.38e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.38e-03
                          10        20
                  ....*....|....*....|....*...
gi 767929956 1100 CQNGAQCIVRINEPICQCLPGYQGEKCE 1127
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1153-1286 7.82e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.82e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   1153 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1229
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956   1230 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1286
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1131-1284 8.48e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.48e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956 1131 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1206
Cdd:cd00110     1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956 1207 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1284
Cdd:cd00110    79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1160-1286 8.44e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.44e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  1160 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1237
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767929956  1238 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1286
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
485-831 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  485 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 564
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  565 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 642
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  643 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 722
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  723 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 800
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  801 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 831
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.31e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.31e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886   119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886   196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
289-637 3.90e-17

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.90e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  289 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 368
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  369 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 448
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  449 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 526
Cdd:COG4886   197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  527 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 606
Cdd:COG4886   249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  607 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 637
Cdd:COG4886   303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
748-807 1.61e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 1.61e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   748 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 807
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
27-81 3.08e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 3.08e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 767929956    27 DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 81
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
273-330 3.53e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.53e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956   273 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 330
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
802-983 2.33e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.33e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   802 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 878
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   879 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 958
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 767929956   959 GFEGENCEVNVDDCEDNDCENNSTC 983
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
607-683 1.06e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.06e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   607 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 683
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
724-824 2.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.46e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  724 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 803
Cdd:cd21340     2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                          90       100
                  ....*....|....*....|.
gi 767929956  804 GNDISVVpEGaFNDLSALSHL 824
Cdd:cd21340    77 GNRISVV-EG-LENLTNLEEL 95
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-177 5.63e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 5.63e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQIQ 106
Cdd:cd21340     9 LNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNRIS 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956  107 AIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 177
Cdd:cd21340    82 VV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PLN03150 PLN03150
hypothetical protein; Provisional
739-808 1.90e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.90e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  739 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 808
Cdd:PLN03150  434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1046-1082 2.38e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.38e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956 1046 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1082
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
968-1004 4.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.11e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956  968 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1004
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-177 8.68e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 53.70  E-value: 8.68e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLDLSENQIQ 106
Cdd:PLN00113  411 LQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLDLSRNQFS 488
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767929956  107 -AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:PLN00113  489 gAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
257-337 2.66e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  257 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 333
Cdd:PRK15370  228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                  ....
gi 767929956  334 PKSL 337
Cdd:PRK15370  299 PAHL 302
LRRCT smart00082
Leucine rich repeat C-terminal domain;
829-878 4.07e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.07e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 767929956    829 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 878
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
EGF_CA smart00179
Calcium-binding EGF-like domain;
1046-1082 7.32e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.32e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 767929956   1046 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1082
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-226 9.54e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 221
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 767929956   222 VCSDE 226
Cdd:TIGR00864   79 GCDEE 83
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1050-1078 9.65e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.65e-06
                           10        20
                   ....*....|....*....|....*....
gi 767929956  1050 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1078
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
LRRNT smart00013
Leucine rich repeat N-terminal domain;
697-728 1.13e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.13e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 767929956    697 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 728
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1008-1043 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 767929956 1008 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1043
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA smart00179
Calcium-binding EGF-like domain;
968-1004 1.59e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.59e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 767929956    968 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1004
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
175-224 2.23e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.23e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 767929956    175 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 224
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
242-274 2.71e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 2.71e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767929956    242 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 274
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
972-1001 1.07e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.07e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956   972 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1001
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
373-436 2.24e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.24e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956   373 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 436
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
243-269 2.38e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.38e-04
                           10        20
                   ....*....|....*....|....*..
gi 767929956   243 CPAACTCSNNIVDCRGKGLTEIPTNLP 269
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1013-1042 2.53e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.53e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956  1013 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1042
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
LRRCT smart00082
Leucine rich repeat C-terminal domain;
400-430 9.43e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 9.43e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767929956    400 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 430
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
934-966 1.44e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.44e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 767929956  934 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 966
Cdd:cd00054     9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
931-964 2.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.65e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 767929956   931 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 964
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
1008-1044 3.41e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.41e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 767929956   1008 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1044
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1100-1127 5.38e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.38e-03
                          10        20
                  ....*....|....*....|....*...
gi 767929956 1100 CQNGAQCIVRINEPICQCLPGYQGEKCE 1127
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1095-1124 6.18e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.18e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956  1095 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1124
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1153-1286 7.82e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.82e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   1153 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1229
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956   1230 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1286
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1131-1284 8.48e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.48e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956 1131 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1206
Cdd:cd00110     1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956 1207 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1284
Cdd:cd00110    79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1160-1286 8.44e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.44e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  1160 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1237
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767929956  1238 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1286
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
1158-1289 1.86e-28

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 111.64  E-value: 1.86e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  1158 IATDEDSGILLYKGDKDH---IAVELYRGRVRASYDTGSHPASaIYSVETINDGNFHIVELLALDQSLSLSVDGG-NPKI 1233
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYDLGSGAAV-VRSGDKLNDGKWHSVELERNGRSGTLSVDGEaRPTG 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 767929956  1234 ITNLSKQSTLNFDSPLYVGGMPgkSNVASLRQAPgqNGTSFHGCIRNLYINSELQD 1289
Cdd:pfam00054   80 ESPLGATTDLDVDGPLYVGGLP--SLGVKKRRLA--ISPSFDGCIRDVIVNGKPLD 131
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
485-831 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  485 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 564
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  565 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 642
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  643 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 722
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  723 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 800
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  801 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 831
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
490-808 4.02e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 100.78  E-value: 4.02e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  490 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 569
Cdd:COG4886    75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  570 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 649
Cdd:COG4886   132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  650 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 729
Cdd:COG4886   190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767929956  730 YLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLHGNDIS 808
Cdd:COG4886   211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.31e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.31e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886   119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886   196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
13-413 1.62e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 89.99  E-value: 1.62e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   13 ESINGVTLRLPASRDLNGNNITRITKTDFAGLRHLRVLQLMENKistiergAFQDLKELERLRLNRNHLQLFPELLFLGT 92
Cdd:COG4886    64 SLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELANLT 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   93 aKLYRLDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIeDGAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRL 172
Cdd:COG4886   137 -NLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDL 212
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  173 HSNNLYCdchlawLSDWLRQrprvglytqcmgpshlrghnvaevqkrefvcsdeeeghqsfmapscsvlhcpaactCSN- 251
Cdd:COG4886   213 SGNQLTD------LPEPLAN--------------------------------------------------------LTNl 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  252 NIVDCRGKGLTEIP--TNLPEtITEIRLEQNTIKVIPPGAFSPykKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNK 329
Cdd:COG4886   231 ETLDLSNNQLTDLPelGNLTN-LEELDLSNNQLTDLPPLANLT--NLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLL 307
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  330 ITELPksLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLK 409
Cdd:COG4886   308 LNLLE--LLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALL 385

                  ....
gi 767929956  410 WLAD 413
Cdd:COG4886   386 LLTL 389
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
13-418 1.84e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 89.61  E-value: 1.84e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   13 ESINGVTLRLPASRDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlqlfPELLFLgt 92
Cdd:COG4886    40 LSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-----EELSNL-- 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   93 AKLYRLDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRL 172
Cdd:COG4886   113 TNLESLDLSGNQLTDLP-EELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLP-EELGNLTNLKELDL 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  173 HSNNlycdchlawLSDWlrqrprvglytqcmgPSHLRGhnvaevqkrefvcsdeeeghqsfmapscsvlhcpaactCSN- 251
Cdd:COG4886   190 SNNQ---------ITDL---------------PEPLGN--------------------------------------LTNl 207
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  252 NIVDCRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPpgAFSPYKKLRRIDLSNNQISELAPDAfqGLRSLNSLVLYGNK 329
Cdd:COG4886   208 EELDLSGNQLTDLPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQ 283
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  330 ITELP-KSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHL 408
Cdd:COG4886   284 LTDLKlKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLL 363
                         410
                  ....*....|
gi 767929956  409 KWLADYLHTN 418
Cdd:COG4886   364 TLLLTLGLLG 373
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
289-637 3.90e-17

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.90e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  289 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 368
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  369 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 448
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  449 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 526
Cdd:COG4886   197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  527 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 606
Cdd:COG4886   249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                         330       340       350
                  ....*....|....*....|....*....|.
gi 767929956  607 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 637
Cdd:COG4886   303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
748-807 1.61e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 1.61e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   748 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 807
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
253-422 4.24e-16

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 82.67  E-value: 4.24e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  253 IVDCRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKI 330
Cdd:COG4886   140 ELDLSNNQLTDLPEPLGNltNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDLSGNQL 217
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  331 TELPKSLfeglfslqllllnankinclrvdafQDLHNLNLLSLYDNKLQTIAKgtFSPLRAIQTMHLAQN-----PFICD 405
Cdd:COG4886   218 TDLPEPL-------------------------ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNqltdlPPLAN 270
                         170
                  ....*....|....*...
gi 767929956  406 CH-LKWLadYLHTNPIET 422
Cdd:COG4886   271 LTnLKTL--DLSNNQLTD 286
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
272-650 1.62e-14

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 77.67  E-value: 1.62e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  272 ITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLfeglfslqllllna 351
Cdd:COG4886   115 LESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQLTDLPEEL-------------- 178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  352 nkinclrvdafQDLHNLNLLSLYDNKLQTIAKgTFSPLRAIQTMHLAQNPFicdchlkwladylhtNPIETSGARCTspr 431
Cdd:COG4886   179 -----------GNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL---------------TDLPEPLANLT--- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  432 rlankrigqikskkfrcsakeqyfipgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPE--HIPQYTaELR 509
Cdd:COG4886   229 ---------------------------------NL------------------ETLDLSNNQLTDLPElgNLTNLE-ELD 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  510 LNNNEFTVLEATGifkKLPQLRKINFSNNKITDIEEGAFEGASGVNeiLLTSNRLENVQHKMFKGLESLKTLMLRSNRIT 589
Cdd:COG4886   257 LSNNQLTDLPPLA---NLTNLKTLDLSNNQLTDLKLKELELLLGLN--SLLLLLLLLNLLELLILLLLLTTLLLLLLLLK 331
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956  590 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLR 650
Cdd:COG4886   332 GLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTLLLL 392
LRR_8 pfam13855
Leucine rich repeat;
552-612 2.21e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 63.31  E-value: 2.21e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956   552 SGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQI 612
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
577-636 2.63e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.63e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   577 SLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPF 636
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
27-81 3.08e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 3.08e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 767929956    27 DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 81
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
273-330 3.53e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.53e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956   273 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 330
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
120-177 1.10e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.39  E-value: 1.10e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956   120 KNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
802-983 2.33e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.33e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   802 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 878
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   879 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 958
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 767929956   959 GFEGENCEVNVDDCEDNDCENNSTC 983
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
607-683 1.06e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.06e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   607 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 683
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
LRR_8 pfam13855
Leucine rich repeat;
94-153 1.20e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 58.30  E-value: 1.20e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956    94 KLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNI 153
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
724-824 2.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.46e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  724 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 803
Cdd:cd21340     2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                          90       100
                  ....*....|....*....|.
gi 767929956  804 GNDISVVpEGaFNDLSALSHL 824
Cdd:cd21340    77 GNRISVV-EG-LENLTNLEEL 95
LRR_8 pfam13855
Leucine rich repeat;
726-783 3.35e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 57.15  E-value: 3.35e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767929956   726 VTELYLDGNQFTLVPKE-LSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRL 783
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGaFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-177 5.63e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 5.63e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQIQ 106
Cdd:cd21340     9 LNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNRIS 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956  107 AIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 177
Cdd:cd21340    82 VV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
LRR_8 pfam13855
Leucine rich repeat;
771-831 7.34e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 55.99  E-value: 7.34e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956   771 TQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDLSALSHLAIGANPL 831
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-175 1.77e-09

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 59.41  E-value: 1.77e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQLFPELLF-----LGTAK-LYRLDLS 101
Cdd:cd21340    53 LQNNQIEKIE--NLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFdprslAALSNsLRVLNIS 128
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767929956  102 ENQIQaiprkafrgavDIKNLQldynqisciedgafrALRDLEVLTLNNNNITRLSVAS--FNHMPKLRTFRLHSN 175
Cdd:cd21340   129 GNNID-----------SLEPLA---------------PLRNLEQLDASNNQISDLEELLdlLSSWPSLRELDLTGN 178
LRR_8 pfam13855
Leucine rich repeat;
507-564 5.19e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.19e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956   507 ELRLNNNEFTVLEAtGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 564
Cdd:pfam13855    5 SLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
295-378 5.45e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.45e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   295 KLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKITELPKslfeglfslqllllnankinclrvDAFQDLHNLNLLSLY 374
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSP------------------------GAFSGLPSLRYLDLS 57

                   ....
gi 767929956   375 DNKL 378
Cdd:pfam13855   58 GNRL 61
LRR_8 pfam13855
Leucine rich repeat;
45-105 5.95e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.95e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956    45 RHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQLFPELLFLGTAKLYRLDLSENQI 105
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN03150 PLN03150
hypothetical protein; Provisional
739-808 1.90e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.90e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  739 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 808
Cdd:PLN03150  434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
703-831 3.18e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 58.55  E-value: 3.18e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  703 CL---DTVVRCSNKGLKVLPKGIPRDVTELYLDGNQFTLVPKEL-SNYKHLT------------------LIDLSNNRIS 760
Cdd:PRK15370  175 CLknnKTELRLKILGLTTIPACIPEQITTLILDNNELKSLPENLqGNIKTLYansnqltsipatlpdtiqEMELSINRIT 254
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767929956  761 TLSNQSFSnmtQLLTLILSYNRLRCIPPRTFDGlksLRLLSLHGNDISVVPEgafNDLSALSHLAIGANPL 831
Cdd:PRK15370  255 ELPERLPS---ALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPA---HLPSGITHLNVQSNSL 316
LRR_8 pfam13855
Leucine rich repeat;
350-402 5.89e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.60  E-value: 5.89e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 767929956   350 NANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPF 402
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
292-833 1.06e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 56.78  E-value: 1.06e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  292 PYkkLRRIDLSNNQISELAPDA-FQGLRSLNSLVLYGNKITelpKSLFEGLFSLQLLLLNANkiNCLRVDAFQDL---HN 367
Cdd:PLN00113   93 PY--IQTINLSNNQLSGPIPDDiFTTSSSLRYLNLSNNNFT---GSIPRGSIPNLETLDLSN--NMLSGEIPNDIgsfSS 165
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  368 LNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDC--------HLKWLadYLHTN------PIETSGarCTSPRRL 433
Cdd:PLN00113  166 LKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLVGQIprelgqmkSLKWI--YLGYNnlsgeiPYEIGG--LTSLNHL 241
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  434 A---NKRIGQIKSKKFRCSAKEQYFIpgtedYRSKLSGDCFADLACPEKCrcegTTVDCSNQKLN-KIPEHIPQY-TAE- 507
Cdd:PLN00113  242 DlvyNNLTGPIPSSLGNLKNLQYLFL-----YQNKLSGPIPPSIFSLQKL----ISLDLSDNSLSgEIPELVIQLqNLEi 312
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  508 LRLNNNEFTVLEATGIfKKLPQLRKINFSNNKIT----------------DIEEGAFEG-------ASG-VNEILLTSNR 563
Cdd:PLN00113  313 LHLFSNNFTGKIPVAL-TSLPRLQVLQLWSNKFSgeipknlgkhnnltvlDLSTNNLTGeipeglcSSGnLFKLILFSNS 391
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  564 LENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITtvapGAFDT----LHSLSTLNLLANPFNCN 639
Cdd:PLN00113  392 LEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQ----GRINSrkwdMPSLQMLSLARNKFFGG 467
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  640 cylawLGEWLRKKRIvtgnprcqkpyflkeipiqdvaiqdftcddGNDDnscspLSRcptectcldtvvrcsNKGLKVLP 719
Cdd:PLN00113  468 -----LPDSFGSKRL------------------------------ENLD-----LSR---------------NQFSGAVP 492
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  720 KGIPR--DVTELYLDGNQFT-LVPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKS 796
Cdd:PLN00113  493 RKLGSlsELMQLKLSENKLSgEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVES 572
                         570       580       590
                  ....*....|....*....|....*....|....*....
gi 767929956  797 LRLLSLHGNDI--SVVPEGAFndlSALSHLAIGANPLYC 833
Cdd:PLN00113  573 LVQVNISHNHLhgSLPSTGAF---LAINASAVAGNIDLC 608
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1046-1082 2.38e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.38e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956 1046 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1082
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
968-1004 4.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.11e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767929956  968 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1004
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-177 8.68e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 53.70  E-value: 8.68e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLDLSENQIQ 106
Cdd:PLN00113  411 LQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLDLSRNQFS 488
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767929956  107 -AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:PLN00113  489 gAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
257-337 2.66e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  257 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 333
Cdd:PRK15370  228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                  ....
gi 767929956  334 PKSL 337
Cdd:PRK15370  299 PAHL 302
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
260-381 3.29e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 3.29e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  260 GLTEIPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRslnslvLYGNKITELPKSLfe 339
Cdd:PRK15370  189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLPDTIQEME------LSINRITELPERL-- 260
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 767929956  340 gLFSLQLLLLNANKINCLRvDAFQDlhNLNLLSLYDNKLQTI 381
Cdd:PRK15370  261 -PSALQSLDLFHNKISCLP-ENLPE--ELRYLSVYDNSIRTL 298
LRRCT smart00082
Leucine rich repeat C-terminal domain;
829-878 4.07e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.07e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 767929956    829 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 878
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
EGF_CA smart00179
Calcium-binding EGF-like domain;
1046-1082 7.32e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.32e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 767929956   1046 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1082
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-226 9.54e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 221
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 767929956   222 VCSDE 226
Cdd:TIGR00864   79 GCDEE 83
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1050-1078 9.65e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.65e-06
                           10        20
                   ....*....|....*....|....*....
gi 767929956  1050 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1078
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
LRRNT smart00013
Leucine rich repeat N-terminal domain;
697-728 1.13e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.13e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 767929956    697 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 728
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1008-1043 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 767929956 1008 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1043
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA smart00179
Calcium-binding EGF-like domain;
968-1004 1.59e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.59e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 767929956    968 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1004
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
175-224 2.23e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.23e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 767929956    175 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 224
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
242-274 2.71e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 2.71e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767929956    242 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 274
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
490-635 7.84e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.09  E-value: 7.84e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  490 SNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEATGIFKKL---PQLRKINFSNNKITDieegafEGASGVNEILLTSNRL 564
Cdd:COG5238   193 GDEGIEELAEALTQNTTvtTLWLKRNPIGDEGAEILAEALkgnKSLTTLDLSNNQIGD------EGVIALAEALKNNTTV 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  565 E-------NVQH-------KMFKGLESLKTLMLRSNRItcvGNDSFIGL-------SSVRLLSLYDNQITTV-APGAFDT 622
Cdd:COG5238   267 EtlylsgnQIGAegaialaKALQGNTTLTSLDLSVNRI---GDEGAIALaeglqgnKTLHTLNLAYNGIGAQgAIALAKA 343
                         170
                  ....*....|....*.
gi 767929956  623 LH---SLSTLNLLANP 635
Cdd:COG5238   344 LQentTLHSLDLSDNQ 359
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
972-1001 1.07e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.07e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956   972 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1001
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
47-180 1.07e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 46.19  E-value: 1.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   47 LRVLQLMENKIS-TIER---GAFQDLKE-LERLRLNRNHL------QLFPELLFLGtaKLYRLDLSENQI--QAIPR--K 111
Cdd:cd00116   110 LQELKLNNNGLGdRGLRllaKGLKDLPPaLEKLVLGRNRLegasceALAKALRANR--DLKELNLANNGIgdAGIRAlaE 187
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767929956  112 AFRGAVDIKNLQLDYNQISCIED----GAFRALRDLEVLTLNNNNIT-----RLSVASFNHMPKLRTFRLHSNNLYCD 180
Cdd:cd00116   188 GLKANCNLEVLDLNNNGLTDEGAsalaETLASLKSLEVLNLGDNNLTdagaaALASALLSPNISLLTLSLSCNDITDD 265
LRR_8 pfam13855
Leucine rich repeat;
143-177 1.60e-04

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 40.97  E-value: 1.60e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 767929956   143 LEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLL 37
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
697-723 1.61e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.61e-04
                           10        20
                   ....*....|....*....|....*..
gi 767929956   697 CPTECTCLDTVVRCSNKGLKVLPKGIP 723
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
373-436 2.24e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.24e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767929956   373 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 436
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
243-269 2.38e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.38e-04
                           10        20
                   ....*....|....*....|....*..
gi 767929956   243 CPAACTCSNNIVDCRGKGLTEIPTNLP 269
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1013-1042 2.53e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.53e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956  1013 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1042
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
727-808 2.60e-04

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 44.01  E-value: 2.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  727 TELYLD------GNQFTLVP---KELSNykHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPP--RTFDGLK 795
Cdd:cd21340    93 EELHIEnqrlppGEKLTFDPrslAALSN--SLRVLNISGNNIDSLE--PLAPLRNLEQLDASNNQISDLEEllDLLSSWP 168
                          90
                  ....*....|...
gi 767929956  796 SLRLLSLHGNDIS 808
Cdd:cd21340   169 SLRELDLTGNPVC 181
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1055-1076 2.72e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 39.24  E-value: 2.72e-04
                           10        20
                   ....*....|....*....|..
gi 767929956  1055 CKNGAHCTDAVNGYTCICPEGY 1076
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
LRRCT smart00082
Leucine rich repeat C-terminal domain;
634-683 5.44e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 39.34  E-value: 5.44e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 767929956    634 NPFNCNCYLAWLGEWLRKKRIV--TGNPRCQKPYFLKEiPIQDVAIQDFTCD 683
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
475-505 5.86e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 38.45  E-value: 5.86e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 767929956    475 ACPEKCRCEGTTVDCSNQKLNKIPEHIPQYT 505
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDT 31
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
143-177 7.12e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 38.77  E-value: 7.12e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 767929956   143 LEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:pfam12799    3 LEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNK 35
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1051-1082 9.31e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.23  E-value: 9.31e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 767929956 1051 QDNKCKNGAHCTDAVNGYTCICPEGYSGLF-CE 1082
Cdd:cd00053     4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRRCT smart00082
Leucine rich repeat C-terminal domain;
400-430 9.43e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 9.43e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767929956    400 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 430
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
934-966 1.44e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.44e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 767929956  934 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 966
Cdd:cd00054     9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
973-1004 2.19e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 2.19e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 767929956  973 EDNDCENNSTCVDGINNYTCLCPPEYTGEL-CE 1004
Cdd:cd00053     4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
294-334 2.21e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.22  E-value: 2.21e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 767929956   294 KKLRRIDLSNNQISELapDAFQGLRSLNSLVLYGN-KITELP 334
Cdd:pfam12799    1 PNLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLS 40
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
493-636 2.28e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 41.31  E-value: 2.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  493 KLNKIP--EHIPQYTaELRLNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEegAFEGASGVNEILLTSNRLENVQHK 570
Cdd:cd21340    35 KITKIEnlEFLTNLT-HLYLQNNQIEKIEN---LENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKL 108
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767929956  571 MF-----KGL-ESLKTLMLRSNRITCVgnDSFIGLSSVRLLSLYDNQITTVAP--GAFDTLHSLSTLNLLANPF 636
Cdd:cd21340   109 TFdprslAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
931-964 2.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.65e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 767929956   931 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 964
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
1008-1044 3.41e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.41e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 767929956   1008 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1044
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
977-998 3.45e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.16  E-value: 3.45e-03
                           10        20
                   ....*....|....*....|..
gi 767929956   977 CENNSTCVDGINNYTCLCPPEY 998
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
250-419 3.55e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 41.70  E-value: 3.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  250 SNNIVDCRGKGLTEIPTnLPETITEIRLEQNTIKviPPGA------FSPYKKLRRIDLSNNQIS-----ELApDAFQGLR 318
Cdd:COG5238   189 CNQIGDEGIEELAEALT-QNTTVTTLWLKRNPIG--DEGAeilaeaLKGNKSLTTLDLSNNQIGdegviALA-EALKNNT 264
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956  319 SLNSLVLYGNKITE-----LPKSLfEGLFSLQLLLLNANKINCLRVDAFQDL----HNLNLLSLYDNKLQT-----IAKg 384
Cdd:COG5238   265 TVETLYLSGNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIGDEGAIALAEGlqgnKTLHTLNLAYNGIGAqgaiaLAK- 342
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 767929956  385 TFSPLRAIQTMHLAQNPfICDCHLKWLADYLHTNP 419
Cdd:COG5238   343 ALQENTTLHSLDLSDNQ-IGDEGAIALAKYLEGNT 376
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1015-1042 3.58e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.69  E-value: 3.58e-03
                          10        20
                  ....*....|....*....|....*...
gi 767929956 1015 NPCQHDSKCILTPKGFKCDCTPGYVGEH 1042
Cdd:cd00053     6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
LRR smart00370
Leucine-rich repeats, outliers;
295-316 4.15e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 36.18  E-value: 4.15e-03
                            10        20
                    ....*....|....*....|..
gi 767929956    295 KLRRIDLSNNQISELAPDAFQG 316
Cdd:smart00370    3 NLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
295-316 4.15e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 36.18  E-value: 4.15e-03
                            10        20
                    ....*....|....*....|..
gi 767929956    295 KLRRIDLSNNQISELAPDAFQG 316
Cdd:smart00369    3 NLRELDLSNNQLSSLPPGAFQG 24
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1100-1127 5.38e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.38e-03
                          10        20
                  ....*....|....*....|....*...
gi 767929956 1100 CQNGAQCIVRINEPICQCLPGYQGEKCE 1127
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
892-921 5.71e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 5.71e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956   892 CLSNPCKNDGTCNSDPVDfYRCTCPYGFKG 921
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGG-YTCICPEGYTG 29
LRR_9 pfam14580
Leucine-rich repeat;
510-589 6.02e-03

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 39.36  E-value: 6.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   510 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL-ENVQHKMFKGLESLKTLMLRSNRI 588
Cdd:pfam14580   49 FSDNEIRKLDG---FPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLqELGDLDPLASLKKLTFLSLLRNPV 125

                   .
gi 767929956   589 T 589
Cdd:pfam14580  126 T 126
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
259-340 6.07e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 38.29  E-value: 6.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767929956   259 KGLTEIptNLPETITEIR--------LE----QNTIKVIPPGAFSPYKKLRRIDLSNNqISELAPDAFQGLrSLNSLVLy 326
Cdd:pfam13306   34 TSLKSI--TLPSSLTSIGsyafyncsLTsitiPSSLTSIGEYAFSNCSNLKSITLPSN-LTSIGSYAFSNC-SLKSITI- 108
                           90
                   ....*....|....
gi 767929956   327 GNKITELPKSLFEG 340
Cdd:pfam13306  109 PSSVTTIGSYAFSN 122
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1095-1124 6.18e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.18e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 767929956  1095 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1124
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
267-337 7.44e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 40.99  E-value: 7.44e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767929956  267 NLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPKSL 337
Cdd:PLN00113  497 SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1100-1121 8.17e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.39  E-value: 8.17e-03
                           10        20
                   ....*....|....*....|..
gi 767929956  1100 CQNGAQCIVRINEPICQCLPGY 1121
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
890-924 8.39e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.31  E-value: 8.39e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 767929956  890 NPCLS-NPCKNDGTCNSDPVDfYRCTCPYGFKGQDC 924
Cdd:cd00054     3 DECASgNPCQNGGTCVNTVGS-YRCSCPPGYTGRNC 37
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH