NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|440215735|gb|AGB94492|]
View 

proteome of centrioles 1, isoform B [Drosophila melanogaster]

Protein Classification

WD40 repeat domain-containing protein( domain architecture ID 11455410)

WD40 repeat domain-containing protein similar to proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
7-298 3.13e-89

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 275.64  E-value: 3.13e-89
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   7 DPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKI 86
Cdd:COG2319  109 TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRL 188
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  87 WEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDK 166
Cdd:COG2319  189 WDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADG 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 167 SVRIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSG 246
Cdd:COG2319  269 TVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG 348
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 440215735 247 SDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:COG2319  349 SDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWD 400
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
7-298 3.13e-89

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 275.64  E-value: 3.13e-89
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   7 DPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKI 86
Cdd:COG2319  109 TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRL 188
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  87 WEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDK 166
Cdd:COG2319  189 WDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADG 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 167 SVRIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSG 246
Cdd:COG2319  269 TVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG 348
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 440215735 247 SDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:COG2319  349 SDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWD 400
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
10-298 1.05e-87

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 267.66  E-value: 1.05e-87
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  10 LERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKIWEP 89
Cdd:cd00200    1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  90 KLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVR 169
Cdd:cd00200   81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 170 IYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSGSDD 249
Cdd:cd00200  161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED 240
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*....
gi 440215735 250 RTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:cd00200  241 GTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
63-309 9.36e-13

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 70.12  E-value: 9.36e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  63 VNGVAWSPKGNLVASAGHDRTVKIWEPKLRGVSG--------EFVAHSKaVRSVDFDSTGHLMLTASD-DKSAKIWRVAR 133
Cdd:PLN00181 486 VCAIGFDRDGEFFATAGVNKKIKIFECESIIKDGrdihypvvELASRSK-LSGICWNSYIKSQVASSNfEGVVQVWDVAR 564
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 134 RQFVSSFAQQNNWVRSAKFSP-NGKLVATASDDKSVRIYDVDSGECVRTF-TEERAAPRQLAWHPwGNMLAVALGCNRIK 211
Cdd:PLN00181 565 SQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIkTKANICCVQFPSES-GRSLAFGSADHKVY 643
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 212 IFDVSGSQL-LQLYVVHSAPVNDVAFHPSGHfLLSGSDDRTIRILDL------LEGRPIYTLTGHTDAVNAVAFSRDGDK 284
Cdd:PLN00181 644 YYDLRNPKLpLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLsmsisgINETPLHSFMGHTNVKNFVGLSVSDGY 722
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|...
gi 440215735 285 FATGGSDRQLLVWQ------------------SNLHTYDASQF 309
Cdd:PLN00181 723 IATGSETNEVFVYHkafpmpvlsykfktidpvSGLEVDDASQF 765
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
260-297 9.25e-09

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 50.77  E-value: 9.25e-09
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 440215735   260 GRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:smart00320   2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 pfam00400
WD domain, G-beta repeat;
260-297 1.12e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 1.12e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 440215735  260 GRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
7-298 3.13e-89

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 275.64  E-value: 3.13e-89
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   7 DPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKI 86
Cdd:COG2319  109 TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRL 188
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  87 WEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDK 166
Cdd:COG2319  189 WDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADG 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 167 SVRIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSG 246
Cdd:COG2319  269 TVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG 348
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 440215735 247 SDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:COG2319  349 SDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWD 400
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
10-298 1.05e-87

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 267.66  E-value: 1.05e-87
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  10 LERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKIWEP 89
Cdd:cd00200    1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  90 KLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVR 169
Cdd:cd00200   81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 170 IYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSGSDD 249
Cdd:cd00200  161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED 240
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*....
gi 440215735 250 RTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:cd00200  241 GTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
6-297 1.16e-87

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 271.40  E-value: 1.16e-87
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   6 RDPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVK 85
Cdd:COG2319   66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVR 145
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  86 IWEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDD 165
Cdd:COG2319  146 LWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSAD 225
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 166 KSVRIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLS 245
Cdd:COG2319  226 GTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 440215735 246 GSDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:COG2319  306 GSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLW 357
WD40 COG2319
WD40 repeat [General function prediction only];
6-297 8.40e-77

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 243.66  E-value: 8.40e-77
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   6 RDPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVK 85
Cdd:COG2319   24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVR 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  86 IWEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDD 165
Cdd:COG2319  104 LWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDD 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 166 KSVRIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLS 245
Cdd:COG2319  184 GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLAS 263
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 440215735 246 GSDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:COG2319  264 GSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW 315
WD40 COG2319
WD40 repeat [General function prediction only];
25-298 1.65e-65

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 214.39  E-value: 1.65e-65
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  25 RFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKIWEPKLRGVSGEFVAHSKA 104
Cdd:COG2319    1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 105 VRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVRIYDVDSGECVRTFTE 184
Cdd:COG2319   81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 185 ERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNDVAFHPSGHFLLSGSDDRTIRILDLLEGRPIY 264
Cdd:COG2319  161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLR 240
                        250       260       270
                 ....*....|....*....|....*....|....
gi 440215735 265 TLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:COG2319  241 TLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
6-254 8.87e-51

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 172.13  E-value: 8.87e-51
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   6 RDPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVK 85
Cdd:cd00200   81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  86 IWEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDD 165
Cdd:cd00200  161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED 240
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 166 KSVRIYDVDSGECVRTFTEeraaprqlawhpwgnmlavalgcnrikifdvsgsqllqlyvvHSAPVNDVAFHPSGHFLLS 245
Cdd:cd00200  241 GTIRVWDLRTGECVQTLSG------------------------------------------HTNSVTSLAWSPDGKRLAS 278

                 ....*....
gi 440215735 246 GSDDRTIRI 254
Cdd:cd00200  279 GSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
14-175 5.86e-44

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 157.38  E-value: 5.86e-44
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  14 FTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKIWEPKLRG 93
Cdd:COG2319  242 LTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGK 321
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  94 VSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVRIYDV 173
Cdd:COG2319  322 LLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401

                 ..
gi 440215735 174 DS 175
Cdd:COG2319  402 AT 403
WD40 COG2319
WD40 repeat [General function prediction only];
6-132 5.49e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 119.63  E-value: 5.49e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735   6 RDPALERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVK 85
Cdd:COG2319  276 ATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVR 355
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 440215735  86 IWEPKLRGVSGEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIWRVA 132
Cdd:COG2319  356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
227-300 1.02e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 85.46  E-value: 1.02e-18
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 440215735 227 HSAPVNDVAFHPSGHFLLSGSDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQSN 300
Cdd:cd00200    8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE 81
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
63-309 9.36e-13

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 70.12  E-value: 9.36e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  63 VNGVAWSPKGNLVASAGHDRTVKIWEPKLRGVSG--------EFVAHSKaVRSVDFDSTGHLMLTASD-DKSAKIWRVAR 133
Cdd:PLN00181 486 VCAIGFDRDGEFFATAGVNKKIKIFECESIIKDGrdihypvvELASRSK-LSGICWNSYIKSQVASSNfEGVVQVWDVAR 564
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 134 RQFVSSFAQQNNWVRSAKFSP-NGKLVATASDDKSVRIYDVDSGECVRTF-TEERAAPRQLAWHPwGNMLAVALGCNRIK 211
Cdd:PLN00181 565 SQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIkTKANICCVQFPSES-GRSLAFGSADHKVY 643
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 212 IFDVSGSQL-LQLYVVHSAPVNDVAFHPSGHfLLSGSDDRTIRILDL------LEGRPIYTLTGHTDAVNAVAFSRDGDK 284
Cdd:PLN00181 644 YYDLRNPKLpLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLsmsisgINETPLHSFMGHTNVKNFVGLSVSDGY 722
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|...
gi 440215735 285 FATGGSDRQLLVWQ------------------SNLHTYDASQF 309
Cdd:PLN00181 723 IATGSETNEVFVYHkafpmpvlsykfktidpvSGLEVDDASQF 765
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
260-297 9.25e-09

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 50.77  E-value: 9.25e-09
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 440215735   260 GRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:smart00320   2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
155-284 1.17e-08

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 55.47  E-value: 1.17e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 155 NGKLVATASDDKSVRIYDVDSGECVRTFTEERaAPRQLAWHPWGNMLAVA-LGCNRIKIFDVSGSQLLQLYVVHSAPVnD 233
Cdd:COG3391   79 GRRLYVANSGSGRVSVIDLATGKVVATIPVGG-GPRGLAVDPDGGRLYVAdSGNGRVSVIDTATGKVVATIPVGAGPH-G 156
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 440215735 234 VAFHPSGHFLL-SGSDDRTI----RILDLLEGRPIYTLTGHtDAVNAVAFSRDGDK 284
Cdd:COG3391  157 IAVDPDGKRLYvANSGSNTVsvivSVIDTATGKVVATIPVG-GGPVGVAVSPDGRR 211
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
56-88 5.04e-08

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.85  E-value: 5.04e-08
                           10        20        30
                   ....*....|....*....|....*....|...
gi 440215735    56 FASHSAPVNGVAWSPKGNLVASAGHDRTVKIWE 88
Cdd:smart00320   8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
260-297 1.12e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 1.12e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 440215735  260 GRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVW 297
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
219-256 1.62e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 47.31  E-value: 1.62e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 440215735   219 QLLQLYVVHSAPVNDVAFHPSGHFLLSGSDDRTIRILD 256
Cdd:smart00320   3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
148-282 3.76e-07

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 52.35  E-value: 3.76e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  148 RSAKFSPNGKLVATASD-DKSVRIY--DVDSGECVRTFTEERAA-PRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQl 223
Cdd:COG4946   346 RLPAWSPDGKSIAYFSDaSGEYELYiaPADGSGEPKQLTLGDLGrVFNPVWSPDGKKIAFTDNRGRLWVVDLASGKVRK- 424
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 440215735  224 yVVHS---APVNDVAFHPSGHFLL----SGSDDRTIRILDlLEGRPIYTLTGHTDAVNAVAFSRDG 282
Cdd:COG4946   425 -VDTDgygDGISDLAWSPDSKWLAyskpGPNQLSQIFLYD-VETGKTVQLTDGRYDDGSPAFSPDG 488
WD40 pfam00400
WD domain, G-beta repeat;
56-88 5.16e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 45.80  E-value: 5.16e-07
                          10        20        30
                  ....*....|....*....|....*....|...
gi 440215735   56 FASHSAPVNGVAWSPKGNLVASAGHDRTVKIWE 88
Cdd:pfam00400   7 LEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
218-256 5.92e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 45.80  E-value: 5.92e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 440215735  218 SQLLQLYVVHSAPVNDVAFHPSGHFLLSGSDDRTIRILD 256
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
147-194 1.31e-06

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 46.12  E-value: 1.31e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 440215735  147 VRSAKFSPNGKLVATASDDKSVRIYDVDSGECVRTFTEERAAPRQLAW 194
Cdd:pfam12894  41 VTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGW 88
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
150-232 1.79e-06

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 45.73  E-value: 1.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  150 AKFSPNGKLVATASDDKSVRIYDVdSGECVRTF--TEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVH 227
Cdd:pfam12894   1 MSWCPTMDLIALATEDGELLLHRL-NWQRVWTLspDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAG 79

                  ....*
gi 440215735  228 SAPVN 232
Cdd:pfam12894  80 SDLIT 84
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
134-172 1.97e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 44.23  E-value: 1.97e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 440215735   134 RQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVRIYD 172
Cdd:smart00320   2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
192-279 2.24e-06

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 45.35  E-value: 2.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  192 LAWHPWGNMLAVALGCNRIKIFDVSGSQLLQLYV-VHSAPVNDVAFHPSGHFLLSGSDDRTIRILDLLEGRPIYTLTGHT 270
Cdd:pfam12894   1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPdKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80

                  ....*....
gi 440215735  271 DAVNAVAFS 279
Cdd:pfam12894  81 DLITCLGWG 89
WD40 pfam00400
WD domain, G-beta repeat;
10-46 3.93e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 43.49  E-value: 3.93e-06
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 440215735   10 LERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWN 46
Cdd:pfam00400   3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
10-46 4.07e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 4.07e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 440215735    10 LERHFTGHSGGITQLRFGPDGAQIATSSTDSTVILWN 46
Cdd:smart00320   4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
134-172 7.82e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 42.33  E-value: 7.82e-06
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 440215735  134 RQFVSSFAQQNNWVRSAKFSPNGKLVATASDDKSVRIYD 172
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
PTZ00421 PTZ00421
coronin; Provisional
16-215 8.14e-06

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 47.97  E-value: 8.14e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  16 GHSGGITQLRFGP-DGAQIATSSTDSTVILWNL-------NQAARCIRFASHSAPVNGVAWSPKG-NLVASAGHDRTVKI 86
Cdd:PTZ00421  73 GQEGPIIDVAFNPfDPQKLFTASEDGTIMGWGIpeegltqNISDPIVHLQGHTKKVGIVSFHPSAmNVLASAGADMVVNV 152
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  87 WEPKlRGVSGEFVA-HSKAVRSVDfdstghlmltasddksakiWRVarrqfvssfaqqnnwvrsakfspNGKLVATASDD 165
Cdd:PTZ00421 153 WDVE-RGKAVEVIKcHSDQITSLE-------------------WNL-----------------------DGSLLCTTSKD 189
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 440215735 166 KSVRIYDVDSGECVRTFTEERAAPRQLA-WHPWGNmLAVALGCNR-----IKIFDV 215
Cdd:PTZ00421 190 KKLNIIDPRDGTIVSSVEAHASAKSQRClWAKRKD-LIITLGCSKsqqrqIMLWDT 244
TolB COG0823
Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, ...
119-244 9.43e-06

Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 440585 [Multi-domain]  Cd Length: 158  Bit Score: 45.43  E-value: 9.43e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 119 TASDDKSAKIWRV-----ARRQFVSSFAqqnnWVRSAKFSPNGKLVATASDDKS---VRIYDVDSGEcVRTFTEERAAPR 190
Cdd:COG0823    4 TLSRDGNSDIYVVdldggEPRRLTNSPG----IDTSPAWSPDGRRIAFTSDRGGgpqIYVVDADGGE-PRRLTFGGGYNA 78
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 440215735 191 QLAWHPWGNMLAVAL---GCNRIKIFDVSGSQLLQLYVVHSAPvndvAFHPSGHFLL 244
Cdd:COG0823   79 SPSWSPDGKRLAFVSrsdGRFDIYVLDLDGGAPRRLTDGPGSP----SWSPDGRRIV 131
COG5354 COG5354
Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only];
32-287 1.26e-05

Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only];


Pssm-ID: 227657 [Multi-domain]  Cd Length: 561  Bit Score: 47.18  E-value: 1.26e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  32 QIATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAgHDRTVKIWEPKLRGVSGEFVAHSkaVRSVDFD 111
Cdd:COG5354    4 QFPLDYSAVISVFWNSQSEVIHTRFESENWPVAYVSESPLGTYLFSE-HAAGVECWGGPSKAKLVRFRHPD--VKYLDFS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 112 STGHLMLTASD---------------DKSAKIWRVARRQFVSSFAQQNN----WVRSaKFSPNGKLVATASDDkSVRIYD 172
Cdd:COG5354   81 PNEKYLVTWSRepiiepeieispftsKNNVFVWDIASGMIVFSFNGISQpylgWPVL-KFSIDDKYVARVVGS-SLYIHE 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 173 VD-----------SGECVRTF--TEERAAPRQLAWHPWGN----MLAV-ALGCNRI----KIFDVSGSQL--------LQ 222
Cdd:COG5354  159 ITdnieehpfknlRPVGILDFsiSPEGNHDELAYWTPEKLnkpaMVRIlSIPKNSVlvtkNLFKVSGVQLkwqvlgkyLL 238
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 440215735 223 LYVVHSAPVNDVAFHPSGHFLLSgsddrtirildlLEGRPIYTLTGHTDAVNAVAFSRDGDKFAT 287
Cdd:COG5354  239 VLVMTHTKSNKSYFGESNLYLLR------------ITERSIPVEKDLKDPVHDFTWEPLSSRFAV 291
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
65-292 1.28e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 46.55  E-value: 1.28e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  65 GVAWSPKGNLVASAGHDRTVKIWEPKlrgvSGEFVAHSKAVRS----VDFDSTGHLMLTasDDKSAKIWRVARRQ-FVSS 139
Cdd:COG4257   21 DVAVDPDGAVWFTDQGGGRIGRLDPA----TGEFTEYPLGGGSgphgIAVDPDGNLWFT--DNGNNRIGRIDPKTgEITT 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 140 FAQQNNWVR--SAKFSPNGKLVATASDDKSVRIYDVDSGEcVRTFT--EERAAPRQLAWHPWGNMLAVALGCNRIKIFDV 215
Cdd:COG4257   95 FALPGGGSNphGIAFDPDGNLWFTDQGGNRIGRLDPATGE-VTEFPlpTGGAGPYGIAVDPDGNLWVTDFGANAIGRIDP 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 216 SGSQllqlyvVHSAPVNDVAFHPSGhfLLSGSDDR---------TIRILDLLEGRPI-YTLTGHTDAVNAVAFSRDGDK- 284
Cdd:COG4257  174 DTGT------LTEYALPTPGAGPRG--LAVDPDGNlwvadtgsgRIGRFDPKTGTVTeYPLPGGGARPYGVAVDGDGRVw 245

                 ....*...
gi 440215735 285 FATGGSDR 292
Cdd:COG4257  246 FAESGANR 253
WD40 pfam00400
WD domain, G-beta repeat;
96-129 1.91e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.56  E-value: 1.91e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 440215735   96 GEFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIW 129
Cdd:pfam00400   5 KTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
146-244 1.92e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 46.96  E-value: 1.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  146 WVRSAKFSPNGKLVATASDDKSVRIYDVDSGEcVRTFTEER--AAPRQLAWHPWGNMLAVAL----GCNRIKIFDVSGSQ 219
Cdd:COG4946   390 RVFNPVWSPDGKKIAFTDNRGRLWVVDLASGK-VRKVDTDGygDGISDLAWSPDSKWLAYSKpgpnQLSQIFLYDVETGK 468
                          90       100
                  ....*....|....*....|....*..
gi 440215735  220 LLQlyvVHSAPVND--VAFHPSGHFLL 244
Cdd:COG4946   469 TVQ---LTDGRYDDgsPAFSPDGKYLY 492
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
97-129 3.01e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 40.76  E-value: 3.01e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 440215735    97 EFVAHSKAVRSVDFDSTGHLMLTASDDKSAKIW 129
Cdd:smart00320   7 TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
65-295 3.32e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 42.19  E-value: 3.32e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  65 GVAWSPKGNLVASAGHDRTVKIWEPKLRGVSgEFVAHSKAVRSVDFDSTGHLMLTasdDKSAKIWRV-----ARRQFVSS 139
Cdd:COG3386   12 GPVWDPDGRLYWVDIPGGRIHRYDPDGGAVE-VFAEPSGRPNGLAFDPDGRLLVA---DHGRGLVRFdpadgEVTVLADE 87
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 140 FAQQNNWVRSAKFSPNGKLVATASDDKSV--RIYDVDSGECVRTFTEERAAPRQLAWHPWGNMLAVA-LGCNRIKIFDVS 216
Cdd:COG3386   88 YGKPLNRPNDGVVDPDGRLYFTDMGEYLPtgALYRVDPDGSLRVLADGLTFPNGIAFSPDGRTLYVAdTGAGRIYRFDLD 167
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 217 GS-QLLQLYVVHSAPVND-----VAFHPSGHFLLSGSDDRTIRILDlLEGRPIYTLTGHTDAVNAVAFsrdgdkfatGGS 290
Cdd:COG3386  168 ADgTLGNRRVFADLPDGPggpdgLAVDADGNLWVALWGGGGVVRFD-PDGELLGRIELPERRPTNVAF---------GGP 237

                 ....*
gi 440215735 291 DRQLL 295
Cdd:COG3386  238 DLRTL 242
NBCH_WD40 pfam20426
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ...
238-298 4.54e-04

Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.


Pssm-ID: 466575 [Multi-domain]  Cd Length: 350  Bit Score: 41.98  E-value: 4.54e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 440215735  238 PSGHFLLS-GSDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQLLVWQ 298
Cdd:pfam20426  91 PSENFLIScGNWENSFQVISLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTVMVWE 152
PTZ00421 PTZ00421
coronin; Provisional
13-108 4.56e-04

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 42.19  E-value: 4.56e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  13 HFTGHSGGITQLRFGPDGAQI-ATSSTDSTVILWNLNQAARCIRFASHSAPVNGVAWSPKGNLVASAGHDRTVKIWEPKL 91
Cdd:PTZ00421 120 HLQGHTKKVGIVSFHPSAMNVlASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRD 199
                         90
                 ....*....|....*....
gi 440215735  92 RGV--SGEFVAHSKAVRSV 108
Cdd:PTZ00421 200 GTIvsSVEAHASAKSQRCL 218
NHL_TRIM32_like cd14961
NHL repeat domain of the tripartite motif-containing protein 32 (TRIM32) and related proteins; ...
98-296 7.82e-04

NHL repeat domain of the tripartite motif-containing protein 32 (TRIM32) and related proteins; The E3 ubiquitin-protein ligase TRIM32 (HT2A) is widely expressed and is responsible for ubiquinating a large variety of targets, including dysbindin (DTNBP1), NPHP7/Glis2, TAp73, and others. TRIM32 promotes disassociation of the plakoglobin-PI3K complex and reduces PI3K-Akt-FoxO signaling. Mutations in TRIM32 have been implemented in the two diverse diseases limb-girdle muscular dystrophy type 2H (LGMD2H) or sarcotubular myopathy (STM) and Bardet-Biedl syndrome type 11 (BBS11). The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271331 [Multi-domain]  Cd Length: 273  Bit Score: 41.11  E-value: 7.82e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  98 FVAHSKAVRSVDFDSTGHLMltasddksakiwrvarrQFVSSFAQQNNWVRSA---KFSPNGKLVATASDDKSVRIYDVD 174
Cdd:cd14961   25 VVADDGNKRIQVFDSDGNCL-----------------QQFGPKGDAGQDIRYPldvAVTPDGHIVVTDAGDRSVKVFSFD 87
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 175 sGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFDVSG------------SQLLQLYVVHSAPVNDVAF------ 236
Cdd:cd14961   88 -GRLKLFVRKSFSLPWGVAVNPSGEILVTDSEAGKLFVLTVDFklgilkkgqklcSQLCRPRFVAVSRLGAVAVtehlfa 166
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 440215735 237 ----HPSGHFLLSGSDDRTIRILDLLeGRPIYTLTGHTdaVNAVAFSRDGDKFATGGSDRQLLV 296
Cdd:cd14961  167 ngtrSSSTRVKVFSSGGQLLGQIDSF-GLNLVFPSLIC--ASGVAFDSEGNVIVADTGSGAILC 227
PTZ00420 PTZ00420
coronin; Provisional
16-140 8.09e-04

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 41.47  E-value: 8.09e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  16 GHSGGITQLRFGPDGAQI-ATSSTDSTVILW-------NLNQAA--RCIrFASHSAPVNGVAWSPKGN-LVASAGHDRTV 84
Cdd:PTZ00420  72 GHTSSILDLQFNPCFSEIlASGSEDLTIRVWeiphndeSVKEIKdpQCI-LKGHKKKISIIDWNPMNYyIMCSSGFDSFV 150
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 440215735  85 KIWEPKLRGVSGEFVAhSKAVRSVDFDSTGHLMLTASDDKSAKIWRVARRQFVSSF 140
Cdd:PTZ00420 151 NIWDIENEKRAFQINM-PKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSF 205
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
65-255 8.17e-04

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 40.76  E-value: 8.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  65 GVAWSPKGNL-VASAGHDRtVKIWEPKLR-----GVSGEFVAHSKAVRSVDFDSTGHLmltasddksakiwrvarrqFVS 138
Cdd:cd05819   59 GVAVDSDGNLyVADTGNHR-IQKFDPDGNflasfGGSGDGDGEFNGPRGIAVDSSGNI-------------------YVA 118
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 139 SFaqQNNWVRsaKFSPNGKLVAT-----------------ASDDK-----------SVRIYDVDsGECVRTFTEERAAPR 190
Cdd:cd05819  119 DT--GNHRIQ--KFDPDGEFLTTfgsggsgpgqfngptgvAVDSDgniyvadtgnhRIQVFDPD-GNFLTTFGSTGTGPG 193
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 440215735 191 QLAwHPW-------GNMLAVALGCNRIKIFDVSGSQLLQL--YVVHSAPVN---DVAFHPSGHFLLSGSDDRTIRIL 255
Cdd:cd05819  194 QFN-YPTgiavdsdGNIYVADSGNNRVQVFDPDGAGFGGNgnFLGSDGQFNrpsGLAVDSDGNLYVADTGNNRIQVF 269
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
64-252 1.60e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 39.96  E-value: 1.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  64 NGVAWSPKGNLVASAGHDRTVKIWEP-----KLRGVSGEFVAHSKAVRSVDFDSTGHLmltasddksakiwrvarrqFVS 138
Cdd:cd14956   63 RGLAVDKDGWLYVADYWGDRIQVFTLtgelqTIGGSSGSGPGQFNAPRGVAVDADGNL-------------------YVA 123
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 139 SFAQQnnwvRSAKFSPNGKLVATASDdksvriYDVDSGEcvrtFTeeraAPRQLAWHPWGNmLAVALGCN-RIKIFDVSG 217
Cdd:cd14956  124 DFGNQ----RIQKFDPDGSFLRQWGG------TGIEPGS----FN----YPRGVAVDPDGT-LYVADTYNdRIQVFDNDG 184
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|
gi 440215735 218 SQLLQLYVVHSAP-----VNDVAFHPSGHFLLSGSDDRTI 252
Cdd:cd14956  185 AFLRKWGGRGTGPgqfnyPYGIAIDPDGNVFVADFGNNRI 224
TolB COG0823
Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, ...
115-217 4.06e-03

Periplasmic component TolB of the Tol biopolymer transport system [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 440585 [Multi-domain]  Cd Length: 158  Bit Score: 37.73  E-value: 4.06e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 115 HLMLTASDDKSAKIWRVARRqfvSSFAQQ----NNWVRSAKFSPNGKLVATASDDKS---VRIYDVDSGEcVRTFTEERA 187
Cdd:COG0823   44 RIAFTSDRGGGPQIYVVDAD---GGEPRRltfgGGYNASPSWSPDGKRLAFVSRSDGrfdIYVLDLDGGA-PRRLTDGPG 119
                         90       100       110
                 ....*....|....*....|....*....|...
gi 440215735 188 APrqlAWHPWGNMLAVA---LGCNRIKIFDVSG 217
Cdd:COG0823  120 SP---SWSPDGRRIVFSsdrGGRPDLYVVDLDG 149
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
55-236 6.31e-03

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 37.95  E-value: 6.31e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735  55 RFASHSAPVNGVAWSPKGNLVAsAGHDRTVKIWEPKlrgvSGEFV-----AHSKAVRSVD--FDSTGHLMLTASDDK--S 125
Cdd:COG3386   43 VFAEPSGRPNGLAFDPDGRLLV-ADHGRGLVRFDPA----DGEVTvladeYGKPLNRPNDgvVDPDGRLYFTDMGEYlpT 117
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 440215735 126 AKIWRVAR----RQFVSSFAQQNnwvrSAKFSPNGK-LVATASDDKSVRIYDVDS------GECVRTFTEERAAPRQLAW 194
Cdd:COG3386  118 GALYRVDPdgslRVLADGLTFPN----GIAFSPDGRtLYVADTGAGRIYRFDLDAdgtlgnRRVFADLPDGPGGPDGLAV 193
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 440215735 195 HPWGNMLAVALGCNRIKIFDVSGSQLLQLYVVHSAPVNdVAF 236
Cdd:COG3386  194 DADGNLWVALWGGGGVVRFDPDGELLGRIELPERRPTN-VAF 234
PTZ00421 PTZ00421
coronin; Provisional
227-294 8.71e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 38.34  E-value: 8.71e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 440215735 227 HSAPVNDVAFHPSG-HFLLSGSDDRTIRILDLLEGRPIYTLTGHTDAVNAVAFSRDGDKFATGGSDRQL 294
Cdd:PTZ00421 124 HTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKL 192
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
175-214 9.54e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 33.83  E-value: 9.54e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 440215735   175 SGECVRTFTEERAAPRQLAWHPWGNMLAVALGCNRIKIFD 214
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
eIF2A pfam08662
Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation ...
60-129 9.59e-03

Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation initiation factors.


Pssm-ID: 462552 [Multi-domain]  Cd Length: 194  Bit Score: 36.87  E-value: 9.59e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 440215735   60 SAPVNGVAWSPKGNLVASAGHDRT---VKIWEPKLRGVSGEFVaHSKAVrSVDFDSTGHLMLTASD------DKSAKIW 129
Cdd:pfam08662 100 EQPRNTIFWSPFGRLVLLAGFGNLagdIEFWDVVNKKKIATAE-ASNAT-LCEWSPDGRYFLTATTaprlrvDNGFKIW 176
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH