NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034656143|ref|XP_016867870|]
View 

cytoplasmic dynein 2 intermediate chain 1 isoform X1 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein( domain architecture ID 1000017)

WD40 repeat domain-containing protein folds into a beta-propeller structure and functions as a scaffold, providing a platform for the interaction and assembly of several proteins into a signalosome; similar to a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
Gene Ontology:  GO:0005515
SCOP:  4002744

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PTZ00121 super family cl31754
MAEBL; Provisional
5-406 4.66e-08

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 4.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    5 EHKEPRCRDPDQDARsrdRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLK-EKHREAEKSHSRGKDREKEKDRRA 83
Cdd:PTZ00121  1114 ARKAEEAKKKAEDAR---KAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKaEEARKAEDAKKAEAARKAEEVRKA 1190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   84 ----RKEELRQTVAHHNllGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLK 157
Cdd:PTZ00121  1191 eelrKAEDARKAEAARK--AEEERKAEEARKAEdaKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  158 YWLYKEEgerrhrKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHK-EKRHKEGFHFDDERHQSNVDRKEKS 236
Cdd:PTZ00121  1269 QAAIKAE------EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKAEEAK 1342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  237 AKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSvwwKLDQRPGGEETVVRREIEKEE 316
Cdd:PTZ00121  1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK---KAEEDKKKADELKKAAAAKKK 1419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  317 TDLENARADAYTASceddfedyeddfevcdgddDESSNEPESREKLEELplaqKKEIQEIQRAINA-----ENERIGELS 391
Cdd:PTZ00121  1420 ADEAKKKAEEKKKA-------------------DEAKKKAEEAKKADEA----KKKAEEAKKAEEAkkkaeEAKKADEAK 1476
                          410
                   ....*....|....*
gi 1034656143  392 LKLFQKRGRTEFEKE 406
Cdd:PTZ00121  1477 KKAEEAKKADEAKKK 1491
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
869-941 1.06e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 42.32  E-value: 1.06e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034656143  869 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 941
Cdd:cd00200    180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
 
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
5-406 4.66e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 4.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    5 EHKEPRCRDPDQDARsrdRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLK-EKHREAEKSHSRGKDREKEKDRRA 83
Cdd:PTZ00121  1114 ARKAEEAKKKAEDAR---KAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKaEEARKAEDAKKAEAARKAEEVRKA 1190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   84 ----RKEELRQTVAHHNllGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLK 157
Cdd:PTZ00121  1191 eelrKAEDARKAEAARK--AEEERKAEEARKAEdaKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  158 YWLYKEEgerrhrKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHK-EKRHKEGFHFDDERHQSNVDRKEKS 236
Cdd:PTZ00121  1269 QAAIKAE------EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKAEEAK 1342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  237 AKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSvwwKLDQRPGGEETVVRREIEKEE 316
Cdd:PTZ00121  1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK---KAEEDKKKADELKKAAAAKKK 1419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  317 TDLENARADAYTASceddfedyeddfevcdgddDESSNEPESREKLEELplaqKKEIQEIQRAINA-----ENERIGELS 391
Cdd:PTZ00121  1420 ADEAKKKAEEKKKA-------------------DEAKKKAEEAKKADEA----KKKAEEAKKAEEAkkkaeEAKKADEAK 1476
                          410
                   ....*....|....*
gi 1034656143  392 LKLFQKRGRTEFEKE 406
Cdd:PTZ00121  1477 KKAEEAKKADEAKKK 1491
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
51-202 8.23e-05

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 8.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   51 KDREKEKLKEKHREaekshsRGKDREKEKDRRARKEELRQTVAHHnllgqETRDRQLLERAERKGRSVSKVRSEEKDEDS 130
Cdd:TIGR01642    1 RDEEPDREREKSRG------RDRDRSSERPRRRSRDRSRFRDRHR-----RSRERSYREDSRPRDRRRYDSRSPRSLRYS 69
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034656143  131 ERGDedrerryrerklqygdSKDNPlkywlykeegeRRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSF 202
Cdd:TIGR01642   70 SVRR----------------SRDRP-----------RRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLW 114
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
54-254 4.91e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 43.96  E-value: 4.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   54 EKEKLK-EKHREAEKSHSRGKDREKEkdrRARKEELRQTVAHHNLLGQETRDRQLL-ERAERKGRSVSKVRSEEkdEDSE 131
Cdd:pfam17380  338 EQERMAmERERELERIRQEERKRELE---RIRQEEIAMEISRMRELERLQMERQQKnERVRQELEAARKVKILE--EERQ 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  132 RGDEDRERRYRERKLQYGDSKDNPLKYW----------LYKEEGERRHRKPR----EPDRDNKHREKSSTREKREKYSKE 197
Cdd:pfam17380  413 RKIQQQKVEMEQIRAEQEEARQREVRRLeeeraremerVRLEEQERQQQVERlrqqEEERKRKKLELEKEKRDRKRAEEQ 492
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034656143  198 KSNSFSDKGEERHK----EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNR 254
Cdd:pfam17380  493 RRKILEKELEERKQamieEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRR 553
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
869-941 1.06e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 42.32  E-value: 1.06e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034656143  869 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 941
Cdd:cd00200    180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
 
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
5-406 4.66e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 4.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    5 EHKEPRCRDPDQDARsrdRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLK-EKHREAEKSHSRGKDREKEKDRRA 83
Cdd:PTZ00121  1114 ARKAEEAKKKAEDAR---KAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKaEEARKAEDAKKAEAARKAEEVRKA 1190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   84 ----RKEELRQTVAHHNllGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLK 157
Cdd:PTZ00121  1191 eelrKAEDARKAEAARK--AEEERKAEEARKAEdaKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  158 YWLYKEEgerrhrKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHK-EKRHKEGFHFDDERHQSNVDRKEKS 236
Cdd:PTZ00121  1269 QAAIKAE------EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKAEEAK 1342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  237 AKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSvwwKLDQRPGGEETVVRREIEKEE 316
Cdd:PTZ00121  1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK---KAEEDKKKADELKKAAAAKKK 1419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  317 TDLENARADAYTASceddfedyeddfevcdgddDESSNEPESREKLEELplaqKKEIQEIQRAINA-----ENERIGELS 391
Cdd:PTZ00121  1420 ADEAKKKAEEKKKA-------------------DEAKKKAEEAKKADEA----KKKAEEAKKAEEAkkkaeEAKKADEAK 1476
                          410
                   ....*....|....*
gi 1034656143  392 LKLFQKRGRTEFEKE 406
Cdd:PTZ00121  1477 KKAEEAKKADEAKKK 1491
PTZ00121 PTZ00121
MAEBL; Provisional
5-316 6.97e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.61  E-value: 6.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    5 EHKEPRCRDPDQDARSRDRVAEVHTAKESPRGerDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRAR 84
Cdd:PTZ00121  1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKA--DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   85 KEELRQTVAhhnllgqeTRDRQLLERAERK-GRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKE 163
Cdd:PTZ00121  1571 KAEEDKNMA--------LRKAEEAKKAEEArIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  164 EGERrhrkpREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRK 243
Cdd:PTZ00121  1643 AEEK-----KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034656143  244 RESQNGEHRNRGASS----KRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGssvwwKLDQRPGGEETVVRREIEKEE 316
Cdd:PTZ00121  1718 AEELKKAEEENKIKAeeakKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK-----KAEEIRKEKEAVIEEELDEED 1789
PTZ00121 PTZ00121
MAEBL; Provisional
21-376 6.87e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.52  E-value: 6.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   21 RDRVAEVHTAKESPRGERDRDRQRERRRDAKDREK-EKLKEKHREAEKS--HSRGKDREKEKDRRARK-EELRQtvAHHN 96
Cdd:PTZ00121  1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKaDEAKKKAEEAKKKadEAKKAAEAKKKADEAKKaEEAKK--ADEA 1527
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   97 LLGQETRDRQLLERAE--RKGRSVSKV----RSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHR 170
Cdd:PTZ00121  1528 KKAEEAKKADEAKKAEekKKADELKKAeelkKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  171 KPREPDRDNKHREKSSTREKREKySKEKSNSFSDKGEErhkEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRES---Q 247
Cdd:PTZ00121  1608 KAEEAKKAEEAKIKAEELKKAEE-EKKKVEQLKKKEAE---EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEakkA 1683
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  248 NGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEgssvwwklDQRPGGEETvVRREIEKEETDLENARADAY 327
Cdd:PTZ00121  1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAE--------EENKIKAEE-AKKEAEEDKKKAEEAKKDEE 1754
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1034656143  328 TASCEDDFEDYEDDFEVCDGDDDESSNEPESREKLEELPLAQKKEIQEI 376
Cdd:PTZ00121  1755 EKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDI 1803
PTZ00121 PTZ00121
MAEBL; Provisional
2-410 1.55e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 49.37  E-value: 1.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    2 DLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKShsrgKDREKEKDR 81
Cdd:PTZ00121  1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAA----KAEAEAAAD 1357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   82 RARKEELRQTVAHHnllgQETRDRQLLERAERKGRSVSKVRSEEK--DEDSERGDEDRERRYRERKLQYGDSKDNPLKyw 159
Cdd:PTZ00121  1358 EAEAAEEKAEAAEK----KKEEAKKKADAAKKKAEEKKKADEAKKkaEEDKKKADELKKAAAAKKKADEAKKKAEEKK-- 1431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  160 lYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKE--KSNSFSDKGEERHK-----------EKRHKEGFHFDDERH 226
Cdd:PTZ00121  1432 -KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakKADEAKKKAEEAKKadeakkkaeeaKKKADEAKKAAEAKK 1510
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  227 QSNVDRK--EKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGE 304
Cdd:PTZ00121  1511 KADEAKKaeEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  305 EtvvRREIEKEETDLENARADAYTASCEDDFEDYEddfevcdgddDESSNEPESREKLEELPLAQKKEIQEIQRAINAEN 384
Cdd:PTZ00121  1591 E---ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA----------EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657
                          410       420
                   ....*....|....*....|....*....
gi 1034656143  385 E---RIGELSLKLFQKRGRTEFEKEPRTD 410
Cdd:PTZ00121  1658 EnkiKAAEEAKKAEEDKKKAEEAKKAEED 1686
PTZ00121 PTZ00121
MAEBL; Provisional
2-289 4.92e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 47.83  E-value: 4.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143    2 DLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDR 81
Cdd:PTZ00121  1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEK 1605
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   82 RARKEELRQTvahhnllgqetrdrqllERAERKGRSVSKvrSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLY 161
Cdd:PTZ00121  1606 KMKAEEAKKA-----------------EEAKIKAEELKK--AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  162 KEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEErhkEKRHKEGFHFDDERHQSNVDRKEKSAKDEP 241
Cdd:PTZ00121  1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE---EKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034656143  242 RKRES--QNGEHRNRGASSKRDGTSS-----QHAENLVRNHGKDKDSRRKHGHEE 289
Cdd:PTZ00121  1744 KKAEEakKDEEEKKKIAHLKKEEEKKaeeirKEKEAVIEEELDEEDEKRRMEVDK 1798
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
51-202 8.23e-05

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 8.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   51 KDREKEKLKEKHREaekshsRGKDREKEKDRRARKEELRQTVAHHnllgqETRDRQLLERAERKGRSVSKVRSEEKDEDS 130
Cdd:TIGR01642    1 RDEEPDREREKSRG------RDRDRSSERPRRRSRDRSRFRDRHR-----RSRERSYREDSRPRDRRRYDSRSPRSLRYS 69
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034656143  131 ERGDedrerryrerklqygdSKDNPlkywlykeegeRRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSF 202
Cdd:TIGR01642   70 SVRR----------------SRDRP-----------RRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLW 114
PTZ00121 PTZ00121
MAEBL; Provisional
53-406 1.28e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 46.29  E-value: 1.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   53 REKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDS 130
Cdd:PTZ00121  1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEeaRKAEDAKRVEIARKAEDA 1163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  131 ERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEE-- 208
Cdd:PTZ00121  1164 RKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEak 1243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  209 RHKEKRHKEGFHFDDE-------RHQSNVDRKEKSAKDEPRK-------RESQNGEHRNRGASSKRDGTSSQHAENLVRN 274
Cdd:PTZ00121  1244 KAEEERNNEEIRKFEEarmahfaRRQAAIKAEEARKADELKKaeekkkaDEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  275 --HGKDKDSRRKHGHEEGSsvwwKLDQRPGGEETVVRREIEKEEtdlENARADAYTASCEDDFEDYEDDFEvcdgddDES 352
Cdd:PTZ00121  1324 aeEAKKKADAAKKKAEEAK----KAAEAAKAEAEAAADEAEAAE---EKAEAAEKKKEEAKKKADAAKKKA------EEK 1390
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034656143  353 SNEPESREKLEElplaQKKEIQEIQRAiNAENERIGELSLKLFQKRGRTEFEKE 406
Cdd:PTZ00121  1391 KKADEAKKKAEE----DKKKADELKKA-AAAKKKADEAKKKAEEKKKADEAKKK 1439
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
171-260 4.00e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 44.11  E-value: 4.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  171 KPREPDRDN-----KHREKSSTREKREKYSKEKSNSFSDKGEER--HKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRK 243
Cdd:TIGR01642    1 RDEEPDREReksrgRDRDRSSERPRRRSRDRSRFRDRHRRSRERsyREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRR 80
                           90       100
                   ....*....|....*....|
gi 1034656143  244 RE---SQNGEHRNRGASSKR 260
Cdd:TIGR01642   81 RSrsvRSIEQHRRRLRDRSP 100
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
54-254 4.91e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 43.96  E-value: 4.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   54 EKEKLK-EKHREAEKSHSRGKDREKEkdrRARKEELRQTVAHHNLLGQETRDRQLL-ERAERKGRSVSKVRSEEkdEDSE 131
Cdd:pfam17380  338 EQERMAmERERELERIRQEERKRELE---RIRQEEIAMEISRMRELERLQMERQQKnERVRQELEAARKVKILE--EERQ 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  132 RGDEDRERRYRERKLQYGDSKDNPLKYW----------LYKEEGERRHRKPR----EPDRDNKHREKSSTREKREKYSKE 197
Cdd:pfam17380  413 RKIQQQKVEMEQIRAEQEEARQREVRRLeeeraremerVRLEEQERQQQVERlrqqEEERKRKKLELEKEKRDRKRAEEQ 492
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034656143  198 KSNSFSDKGEERHK----EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNR 254
Cdd:pfam17380  493 RRKILEKELEERKQamieEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRR 553
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
59-216 9.26e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.96  E-value: 9.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   59 KEKHREAEKSHSRGKDREKEKDRRARKEELRQtvahhnllgqETRDRQLLERAERKG-RSVSKVRSEEKDEDSERgdedr 137
Cdd:TIGR01642    3 EEPDREREKSRGRDRDRSSERPRRRSRDRSRF----------RDRHRRSRERSYREDsRPRDRRRYDSRSPRSLR----- 67
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034656143  138 erryrerklqygdskdnplkywlykeegERRHRKPREPDRdnkHREKSSTREKREkysKEKSNSFSDKGEERHKEKRHK 216
Cdd:TIGR01642   68 ----------------------------YSSVRRSRDRPR---RRSRSVRSIEQH---RRRLRDRSPSNQWRKDDKKRS 112
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
869-941 1.06e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 42.32  E-value: 1.06e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034656143  869 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 941
Cdd:cd00200    180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
Caldesmon pfam02029
Caldesmon;
13-326 2.92e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 41.39  E-value: 2.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   13 DPDQDARSRDRVAevhtAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTV 92
Cdd:pfam02029    3 DEEEAARERRRRA----REERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQKR 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   93 AHHNLLGQETRDRQLLER----AERKGR----SVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYwlyKEE 164
Cdd:pfam02029   79 LQEALERQKEFDPTIADEkesvAERKENneeeENSSWEKEEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQA---EEE 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  165 GERRHRKPRE----PDRDNKHREKSSTREKREKYSKEKSNSFSDKGEERHKEK------RHKEGFHFDDERHQSNVDRKE 234
Cdd:pfam02029  156 GEEEEDKSEEaeevPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKsqngeeEVTKLKVTTKRRQGGLSQSQE 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  235 KSAKDEPRKRESQNGEHRNRgassKRDGTSSQHAENL----------VRNHGKDKDSRRKHGHEEgssvwwklDQRPGGE 304
Cdd:pfam02029  236 REEEAEVFLEAEQKLEELRR----RRQEKESEEFEKLrqkqqeaeleLEELKKKREERRKLLEEE--------EQRRKQE 303
                          330       340
                   ....*....|....*....|....*...
gi 1034656143  305 ETvvRREIEKEE------TDLENARADA 326
Cdd:pfam02029  304 EA--ERKLREEEekrrmkEEIERRRAEA 329
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
51-219 9.34e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 39.90  E-value: 9.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143   51 KDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRqllERAERKGRSVSKVRSEEKDEDs 130
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGR---ERRSRSRRPNRRYRPREKRRR- 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034656143  131 ergdedrerryrerklqygdskdnplkywlyKEEGERRHRKPREPdRDNKHREKSSTREKREKYSKEKSNSFSDKGEERH 210
Cdd:TIGR01622   79 -------------------------------RGDSYRRRRDDRRS-RREKPRARDGTPEPLTEDERDRRTVFVQQLAARA 126

                   ....*....
gi 1034656143  211 KEKRHKEGF 219
Cdd:TIGR01622  127 RERDLYEFF 135
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH