Conserved domains on  [gi|1435761108|ref|NP_001352058|]

nuclear pore complex protein Nup98-Nup96 isoform 9 [Homo sapiens]

Protein Classification

nucleoporin family protein (domain architecture ID 10614352)

A nucleoporin family protein functions as a component of the nuclear pore complex (NPC) and may serve both as an NPC structural component and as a docking or interaction partner for transiently associated nuclear transport factors.

List of domain hits

Name Accession Description Interval E-value
Nup96 pfam12110
Nuclear protein 96; Nup96 (often known by the name of its yeast homolog Nup145C) is part of ...
1268-1559 1.05e-133

Nuclear protein 96; Nup96 (often known by the name of its yeast homolog Nup145C) is part of the Nup84 heptameric complex in the nuclear pore complex. Nup96 forms a complex with Sec13 in the middle of the heptamer. The function of the heptamer is to coat the curvature of the nuclear pore complex between the inner and outer nuclear membranes. Nup96 is predicted to be an alpha-helical solenoid. The interaction between Nup96 and Sec13 is the point of curvature in the heptameric complex.


Pssm-ID: 463462  Cd Length: 287  Bit Score: 417.38  E-value: 1.05e-133
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108 1268 EAVFSYLTGKRISEACSLAQQSGDHRLALLLSQFVGSQSVRELLTMQLVDWHQLQADSFIQDERLRIFALLAGKPVWQLS 1347
Cdd:pfam12110    1 EKAFALLTGHRVEEACELAIDSGDFRLATLLSQAGGDDSFREDMAEQLDDWRESGVDSEIDEPRRKLYELLAGNVLVSEG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108 1348 EKKQINVCSQLDWKRSLAIHLWYLLPPTASISRALSMYEEAFQntsdSDRYACSPLPSYLEGSGCVIAEEQNSQTPLrDV 1427
Cdd:pfam12110   81 KKSTINISEGLDWKRAFGLRLWYGIPPDTSIEDAVEAYEEALS----QGREPAPPLPWYLEEGDSESWEDPRLKKRE-DL 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108 1428 CFHLLKLYSDRHYDLNQLLEPRSITADPLDYRLSWHLWEVLRA--LNYTHLSAQCEGVLQASYAGQLESEGLWEWAIFVL 1505
Cdd:pfam12110  156 LYHLLKLYADPTAPLEAVLDPESSSPDPLDYRLSWHLYQVLSAvrLGYGHLSSAKADQLTLSFASQLESLGLWQWAVFVL 235
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1435761108 1506 LHIDNSGIREKAVRELLTRHCQLLETPESwaKETFLTQKLRVPAKWIHEAKAVR 1559
Cdd:pfam12110  236 LHLEDPARRERAVRELLARHAELISEDDA--KERFLTEKLKIPEAWIHEAKALY 287
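
The alignment above pairs each query row (gi 1435761108) with the matching row of the domain model (Cdd:pfam12110). As a minimal sketch, the Python below counts identical positions in one pair of rows. It assumes that uppercase letters mark aligned residues while lowercase letters and dashes mark unaligned residues and gaps; the function name `block_identity` is hypothetical and not part of any NCBI tool.

```python
def block_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one pair of alignment rows.

    Assumption: a column is "aligned" only when both rows hold an uppercase
    residue; lowercase residues and '-' characters are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q.isalpha() and q.isupper() and m.isalpha() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Usage with the residue strings of the last block above (coordinates removed).
query = "LHIDNSGIREKAVRELLTRHCQLLETPESwaKETFLTQKLRVPAKWIHEAKAVR"
model = "LHLEDPARRERAVRELLARHAELISEDDA--KERFLTEKLKIPEAWIHEAKALY"
same, total = block_identity(query, model)
print(f"{same} identical of {total} aligned columns")
```

Summing the two counts over all blocks of a hit gives an identity fraction for the whole aligned interval.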
Nucleoporin2 pfam04096
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional ...
674-816 2.53e-63

Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) comprises approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.


Pssm-ID: 461171  Cd Length: 143  Bit Score: 211.97  E-value: 2.53e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  674 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVVVYLDDNQKPPVGE 749
Cdd:pfam04096    1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761108  750 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 816
Cdd:pfam04096   80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
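
Each hit on this page carries a summary line of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, optionally tagged `[Multi-domain]`. The following Python is a minimal sketch for turning such a line into typed fields. The names `HitSummary` and `parse_summary` are hypothetical, and the regular expression assumes only the layout seen in this listing.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern derived from the summary lines shown on this page (an assumption,
# not an official format specification).
_SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID ... E-value' summary line into a HitSummary."""
    m = _SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


hit = parse_summary(
    "Pssm-ID: 461171  Cd Length: 143  Bit Score: 211.97  E-value: 2.53e-63"
)
print(hit.cd_length, hit.bit_score, hit.e_value)
```

With the fields parsed, hits can be sorted by E-value or filtered on the multi-domain flag without re-reading the text.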
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
40-149 9.80e-15

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 71.11  E-value: 9.80e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634    1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1435761108  116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634   65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
239-332 2.81e-14

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 69.95  E-value: 2.81e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  239 GLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNpgglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTiGQPSTNTMGL 318
Cdd:pfam13634    1 GLFGAATSTSGGLFGNTSTTAASGGGLFGAA-------STATATTSGGGLFGNSSSNAPSGGLFGATNT-TTQTATGGGL 72
                           90
                   ....*....|....*...
gi 1435761108  319 FGVTQASQP----GGLFG 332
Cdd:pfam13634   73 FGNNAATTTsttgGGLFG 90
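
The Nucleoporin_FG family is described as a run of FG repeats. As a rough illustration, the Python sketch below counts FG dipeptides in a residue string taken from an alignment row. The helper name `count_fg` is hypothetical; it assumes that gap characters should be dropped and that lowercase (unaligned) residues still belong to the sequence.

```python
def count_fg(row: str) -> int:
    """Count 'FG' dipeptides in an alignment row.

    Gap characters are removed and case is ignored, so residues that are
    adjacent in the underlying sequence are also adjacent here.
    """
    sequence = "".join(ch for ch in row if ch.isalpha()).upper()
    return sequence.count("FG")


# Usage with the first query row of the 239-332 hit (coordinates removed).
row = "GLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNpgglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTiGQPSTNTMGL"
print(count_fg(row))
```

A plain dipeptide count ignores the longer motif context (for example GLFG), so it is only a coarse indicator of repeat density.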
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
24-433 1.64e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.64e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   24 GQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFS 103
Cdd:COG3210    825 TATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTN 904
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  104 SQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS 183
Cdd:COG3210    905 AASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSA 984
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  184 TKHQCITA------MKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKT 257
Cdd:COG3210    985 GSTGGVIAatgilvAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTAS 1064
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  258 AFGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNT 337
Cdd:COG3210   1065 GTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGT 1144
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  338 STGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLG 417
Cdd:COG3210   1145 LTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTA 1224
                          410
                   ....*....|....*.
gi 1435761108  418 AGFGTALTDPNASAAQ 433
Cdd:COG3210   1225 SDTGNTTATGGSSAGQ 1240
 
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
25-402 1.47e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.73  E-value: 1.47e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210    368 NGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIG 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210    448 GLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNA 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  185 KHQCITAmkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTT 264
Cdd:COG3210    528 TSGGTGG----------------DGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGG 591
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  265 GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFG 344
Cdd:COG3210    592 TGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTG 671
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1435761108  345 TGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGT-TSGGLFGFGTNTSGNSI 402
Cdd:COG3210    672 GGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSiTVTGQIGALANANGDTV 730
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
44-273 2.61e-12

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 71.62  E-value: 2.61e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   44 SSNNTGGLFGNSQTKPGGL-FGTSSFSQPatSTSTGFGFGTSTGTANtlfGTASTGTSLFSSQNNAFAQNKPTGFGnfgt 122
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFsFGAAAASNP--GSTGGFSFGTLGAAPA---ATATTTTATLGLGGGLFGQKPATGFT---- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  123 stssgglFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMkeyeskslee 202
Cdd:pfam15967   73 -------FGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVL---------- 135
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761108  203 lrledyqanrKGPQNQVGAGTTTGLFGSSPATSSA--TGLFSSSTTNS-GFAYGQNktafgTSTTGFGTNPGGL 273
Cdd:pfam15967  136 ----------TSTAAQQGATGFTLNLGGTPATTTAvsTGLSLGSTLTSlGGSLFQN-----TNSTGLGQTTLGL 194
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
25-334 4.81e-10

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 65.03  E-value: 4.81e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849   255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849   327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTglfGSSPATSSATGLFSSSTTNSGFAYGQNKTAfg 260
Cdd:NF033849   400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT-- 470
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761108  261 TSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849   471 SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
PPE COG5651
PPE-repeat protein [Function unknown];
31-183 1.07e-06

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 52.97  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   31 TTSGGAFGTSAFGSSNN-------TGGLFGNSQTKPGGL-FGTSSFSQPATSTSTGFgFGTSTGTANTLFGTASTGTSLF 102
Cdd:COG5651    175 TNPGGLLGAQNAGSGNTssnpgfaNLGLTGLNQVGIGGLnSGSGPIGLNSGPGNTGF-AGTGAAAGAAAAAAAAAAAAGA 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  103 SSQNN-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG5651    254 GASAAlaslaATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333

                   ....*.
gi 1435761108  178 VSTNIS 183
Cdd:COG5651    334 AAAAGA 339
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
33-403 5.67e-06

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 51.58  E-value: 5.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176   139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176   219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGfayGQNKTAF 259
Cdd:NF033176   296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQS---GRVNISS 372
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  260 GTSTTGFGTNPGGlfgqqnqqTTSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTST 339
Cdd:NF033176   373 GGYAESTIINSGG--------TQSVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTV 438
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761108  340 GTAFgtgtglFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGfGTNTSGNSIF 403
Cdd:NF033176   439 NTSG------FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYA-GGEASGTQIF 495
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
25-152 2.04e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 42.68  E-value: 2.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118    145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761108   79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118    225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
PRK12688 PRK12688
flagellin; Reviewed
72-444 2.16e-03

flagellin; Reviewed


Pssm-ID: 171664 [Multi-domain]  Cd Length: 751  Bit Score: 42.94  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFaqnkptgFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:PRK12688   276 ATIAVSASGGAVSAAAAGAVTLKSSTGADLSVTGKADL-------LKALGLTTATGAGNATVNANRTTSAGSLGALIQDG 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  152 SfTAAPTGTTIKFN---PPTGTDTMVKAGVSTNISTkhqcitamkeyESKSLEELRLEDYQANRKGPQNQVGAGTTTGLF 228
Cdd:PRK12688   349 S-TLNVDGKTITFKnapIPGAASVPSGYGASGNVLT-----------DGNGNSTVYLQGGTINDVLKAIDLATGVQTATI 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  229 GSSPATSSATGLFSSSTTNSGfayGQNKTAFGT----STTGFGtNPGGLFGQQNQQTTSlfskpfgqatttqnTGFSFGN 304
Cdd:PRK12688   417 ANGTATLATAAGQTASSVNAS---GQLKLSTGLnadlSITGTG-NALSALGLAGNTGTA--------------TAFTAAR 478
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  305 TSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTafgtgtglFGQTNTGFGA--VGSTLFGNNKLTTFGSSTTSAPS 382
Cdd:PRK12688   479 TAGAGGISGKTLTFTSFNGGTAVNVTFGDGTNGTVKT--------LAQLNTALQAnnLTATIDATGKLTISASNDYASST 550
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1435761108  383 FG-TTSGGLFGfGTNTSGNSiFGSKPAPgtlgtglgagfgtaLTDPNASAAQQAVLQQHINSL 444
Cdd:PRK12688   551 LGsTLAGGAIG-GTLTSTLT-FSTASAP--------------VADTVAQTTRANLVKQYNNIL 597
34 PHA02584
long tail fiber, proximal subunit; Provisional
25-175 2.18e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 43.21  E-value: 2.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584   944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584  1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
                          170       180
                   ....*....|....*....|....*
gi 1435761108  151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584  1093 ASYTRAPTADTVGFWSVDINDSATY 1117
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
214-389 3.65e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 3.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  214 GPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPGGLFGQQNQQ-TTSLFSKPFGQA 292
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATlVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  293 TTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTT 372
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170
                   ....*....|....*....
gi 1435761108  373 --FGSSTTSAPSFGTTSGG 389
Cdd:COG3469    189 taSGATTPSATTTATTTGP 207
 
Nucleoporin2 pfam04096
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional ...
674-816 2.53e-63

Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) comprises approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it ends up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.


Pssm-ID: 461171  Cd Length: 143  Bit Score: 211.97  E-value: 2.53e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  674 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVVVYLDDNQKPPVGE 749
Cdd:pfam04096    1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761108  750 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 816
Cdd:pfam04096   80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
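Each hit record on this page carries an interval line (for example `674-816 2.53e-63`) and a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal Python sketch of how those two lines could be parsed from this text dump. The function names `parse_stats` and `parse_interval` and the regular expressions are hypothetical, written only against the line shapes visible here; they are not part of any NCBI tool or API.

```python
import re

# Hypothetical patterns, derived only from the record lines shown on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)
INTERVAL_RE = re.compile(r"^(?P<start>\d+)-(?P<end>\d+)\s+(?P<e_value>[\d.eE+-]+)$")


def parse_stats(line):
    """Parse a 'Pssm-ID: ... E-value: ...' summary line into a dict."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def parse_interval(line):
    """Parse an interval line such as '674-816 2.53e-63' into (start, end, e_value)."""
    m = INTERVAL_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an interval line: {line!r}")
    return int(m["start"]), int(m["end"]), float(m["e_value"])


# Values copied from the Nucleoporin2 (pfam04096) record above.
stats = parse_stats("Pssm-ID: 461171  Cd Length: 143  Bit Score: 211.97  E-value: 2.53e-63")
start, end, e_value = parse_interval("674-816 2.53e-63")
query_span = end - start + 1  # number of query residues covered by the hit
print(stats, query_span)
```

Comparing `query_span` with `cd_length` gives a rough idea of how much of the domain model a hit covers. It is only a rough comparison, because gaps in the alignment mean the two numbers need not match.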
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
40-149 9.80e-15

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 71.11  E-value: 9.80e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634    1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1435761108  116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634   65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
239-332 2.81e-14

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 69.95  E-value: 2.81e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  239 GLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNpgglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTiGQPSTNTMGL 318
Cdd:pfam13634    1 GLFGAATSTSGGLFGNTSTTAASGGGLFGAA-------STATATTSGGGLFGNSSSNAPSGGLFGATNT-TTQTATGGGL 72
                           90
                   ....*....|....*...
gi 1435761108  319 FGVTQASQP----GGLFG 332
Cdd:pfam13634   73 FGNNAATTTsttgGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
25-402 1.47e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.73  E-value: 1.47e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210    368 NGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIG 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210    448 GLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNA 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  185 KHQCITAmkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTT 264
Cdd:COG3210    528 TSGGTGG----------------DGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGG 591
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  265 GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFG 344
Cdd:COG3210    592 TGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTG 671
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1435761108  345 TGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGT-TSGGLFGFGTNTSGNSI 402
Cdd:COG3210    672 GGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSiTVTGQIGALANANGDTV 730
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
24-433 1.64e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.64e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   24 GQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFS 103
Cdd:COG3210    825 TATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTN 904
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  104 SQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS 183
Cdd:COG3210    905 AASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSA 984
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  184 TKHQCITA------MKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKT 257
Cdd:COG3210    985 GSTGGVIAatgilvAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTAS 1064
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  258 AFGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNT 337
Cdd:COG3210   1065 GTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGT 1144
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  338 STGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLG 417
Cdd:COG3210   1145 LTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTA 1224
                          410
                   ....*....|....*.
gi 1435761108  418 AGFGTALTDPNASAAQ 433
Cdd:COG3210   1225 SDTGNTTATGGSSAGQ 1240
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
272-392 1.24e-12

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 65.33  E-value: 1.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  272 GLFGQQNQQTTSLFSkpfGQATTTQNTGFSFGNTSTiGQPSTNTMGLFGVTQASQP-GGLFGTatntstgtafgtgtglf 350
Cdd:pfam13634    1 GLFGAATSTSGGLFG---NTSTTAASGGGLFGAAST-ATATTSGGGLFGNSSSNAPsGGLFGA----------------- 59
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1435761108  351 gQTNTGFGAVGSTLFGNNklttfgssttSAPSFGTTSGGLFG 392
Cdd:pfam13634   60 -TNTTTQTATGGGLFGNN----------AATTTSTTGGGLFG 90
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
44-273 2.61e-12

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 71.62  E-value: 2.61e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   44 SSNNTGGLFGNSQTKPGGL-FGTSSFSQPatSTSTGFGFGTSTGTANtlfGTASTGTSLFSSQNNAFAQNKPTGFGnfgt 122
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFsFGAAAASNP--GSTGGFSFGTLGAAPA---ATATTTTATLGLGGGLFGQKPATGFT---- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  123 stssgglFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMkeyeskslee 202
Cdd:pfam15967   73 -------FGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVL---------- 135
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761108  203 lrledyqanrKGPQNQVGAGTTTGLFGSSPATSSA--TGLFSSSTTNS-GFAYGQNktafgTSTTGFGTNPGGL 273
Cdd:pfam15967  136 ----------TSTAAQQGATGFTLNLGGTPATTTAvsTGLSLGSTLTSlGGSLFQN-----TNSTGLGQTTLGL 194
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
26-402 2.43e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.94  E-value: 2.43e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3210    625 ANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGN 704
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNKPTGFGNFGTSTSSGG--------LFGTTNTTSNPFGSTSGSLFGPSSFTAAP---TGTTIKFNPPTGTDTmV 174
Cdd:COG3210    705 TLTISTGSITVTGQIGALANANGdtvtfgnlGTGATLTLNAGVTITSGNAGTLSIGLTANttaSGTTLTLANANGNTS-A 783
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  175 KAGVSTNISTKHQCITAmkeyesksleelrleDYQANRKGPqNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQ 254
Cdd:COG3210    784 GATLDNAGAEISIDITA---------------DGTITAAGT-TAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTD 847
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  255 NKTAFGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTA 334
Cdd:COG3210    848 TTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLT 927
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1435761108  335 TNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSI 402
Cdd:COG3210    928 GGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATG 995
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
25-334 4.81e-10

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 65.03  E-value: 4.81e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849   255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849   327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTglfGSSPATSSATGLFSSSTTNSGFAYGQNKTAfg 260
Cdd:NF033849   400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT-- 470
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761108  261 TSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849   471 SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
26-462 7.54e-10

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 64.03  E-value: 7.54e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG4625    173 GGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGG 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNkpTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTk 185
Cdd:COG4625    253 GGGGGNG--GGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG- 329
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  186 hqcitamkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTG 265
Cdd:COG4625    330 ----------------------GGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGG 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTT-----QNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTA-TNTST 339
Cdd:COG4625    388 SGGGGGGGAGGGGGGGGAGGTGGGGAGGGGgaaggGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGgAGAGG 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  340 GTAFGTGTGLFGQTNTGFGAVGSTLFGNnkLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTG---- 415
Cdd:COG4625    468 GSGSGAGTLTLTGNNTYTGTTTVNGGGN--YTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTTytil 545
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1435761108  416 -------------LGAGFGTALTDPNASAAQQAVLQ---QHINSLTYSPFGDSPLFRNPMSDP 462
Cdd:COG4625    546 avaaaldalagngDLSALYNALAALDAAAARAALDQlsgEIHASAAAALLQASRALRDALSNR 608
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
25-93 5.40e-09

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 54.93  E-value: 5.40e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGG-LFGNSQTKP--GGLFGTSSFSQPATSTSTGFGFGTSTGTANT---LFG 93
Cdd:pfam13634   16 NTSTTAASGGGLFGAASTATATTSGGgLFGNSSSNApsGGLFGATNTTTQTATGGGLFGNNAATTTSTTgggLFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
90-161 1.52e-08

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 53.39  E-value: 1.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   90 TLFGTA-STGTSLFSSQNNAF------------AQNKPTGFGNFG---TSTSSGGLFGTTNTTSNPfgSTSGSLFGPSSF 153
Cdd:pfam13634    1 GLFGAAtSTSGGLFGNTSTTAasggglfgaastATATTSGGGLFGnssSNAPSGGLFGATNTTTQT--ATGGGLFGNNAA 78

                   ....*...
gi 1435761108  154 TAAPTGTT 161
Cdd:pfam13634   79 TTTSTTGG 86
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
309-404 1.46e-07

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 50.69  E-value: 1.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  309 GQPSTNTMGLFG--VTQASQPGGLFGTATNTstgtafgtgtglfGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPsfGTT 386
Cdd:pfam13634    4 GAATSTSGGLFGntSTTAASGGGLFGAASTA-------------TATTSGGGLFGNSSSNAPSGGLFGATNTTTQ--TAT 68
                           90       100
                   ....*....|....*....|..
gi 1435761108  387 SGGLFG----FGTNTSGNSIFG 404
Cdd:pfam13634   69 GGGLFGnnaaTTTSTTGGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
26-405 2.45e-07

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 55.93  E-value: 2.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTkpGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3210    550 GGASGTTAASGSNTANTLGVLAATGGTSNATT--AGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANA 627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIkfnppTGTDTMVKAGVSTNISTK 185
Cdd:COG3210    628 TGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATG-----GTTGTTLNAATGGTLNNA 702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  186 HQCITAmkeyeskSLEELRLEDYQANRKGPQ------NQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQN---K 256
Cdd:COG3210    703 GNTLTI-------STGSITVTGQIGALANANgdtvtfGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTltlA 775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  257 TAFGTSTTGFGTNPGGLFGQQNQQTTSLFSKP-FGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTAT 335
Cdd:COG3210    776 NANGNTSAGATLDNAGAEISIDITADGTITAAgTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSD 855
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  336 NTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGS 405
Cdd:COG3210    856 GASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
26-399 2.81e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 55.55  E-value: 2.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG5295    219 ASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSA 298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVK---AGVSTNI 182
Cdd:COG5295    299 SSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGggaAATSSSG 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  183 STKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLF--SSSTTNSGFAYGQNKTAFG 260
Cdd:COG5295    379 GSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNgaSGTSSSASSAGAAGGGTAG 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  261 TSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTG 340
Cdd:COG5295    459 AGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGA 538
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1435761108  341 TAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTsAPSFGTTSGGLFGFGTNTSG 399
Cdd:COG5295    539 AAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSV-ASGANSVSVGAAGAENVAAG 596
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
26-436 4.83e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.78  E-value: 4.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG5295    239 ASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGG 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTk 185
Cdd:COG5295    319 GAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAG- 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  186 hqcitamkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTG 265
Cdd:COG5295    398 ----------------------SGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGG 455
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQAS---QPGGLFGTATNTSTGTA 342
Cdd:COG5295    456 TAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAggaANAAAASGATATAGSAG 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  343 FGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSfGTTSGGLFGFGTNTSGNSifgskpAPgtlgtglgagfGT 422
Cdd:COG5295    536 GGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGS-VASGANSVSVGAAGAENV------AA-----------GA 597
                          410
                   ....*....|....
gi 1435761108  423 ALTDPNASAAQQAV 436
Cdd:COG5295    598 TDTDAVNGGGAVAT 611
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
216-320 6.02e-07

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 49.15  E-value: 6.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  216 QNQVGAGTTTGLFGSSPATSSATGLFSSsttnsgfaygqnktaFGTSTTgfGTNPGGLFGQQNQQttslfskpfgqaTTT 295
Cdd:pfam13634   16 NTSTTAASGGGLFGAASTATATTSGGGL---------------FGNSSS--NAPSGGLFGATNTT------------TQT 66
                           90       100
                   ....*....|....*....|....*
gi 1435761108  296 QNTGFSFGNTSTIGQPSTNTmGLFG 320
Cdd:pfam13634   67 ATGGGLFGNNAATTTSTTGG-GLFG 90
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
28-182 9.80e-07

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 53.52  E-value: 9.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   28 GFGTTSGG--AFGTSAFGSSNNTGGL-FGNSQTKP--------------GGLFGtssfSQPATststGFGFGT------S 84
Cdd:pfam15967   11 GSTATAGGgfSFGAAAASNPGSTGGFsFGTLGAAPaatattttatlglgGGLFG----QKPAT----GFTFGTpasstaA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   85 TGTANTLFGTASTGTSlfSSQNNAFAQNKPTG----FGNFGTSTSSGGL-FGTTNTTSNPFGSTSGSLFG----PSSFTA 155
Cdd:pfam15967   83 TGPTGLTLGTPAATTA--ASTGFSLGFNKPAAsatpFSLPASSTSGGGLsLGSVLTSTAAQQGATGFTLNlggtPATTTA 160
                          170       180
                   ....*....|....*....|....*..
gi 1435761108  156 APTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:pfam15967  161 VSTGLSLGSTLTSLGGSLFQNTNSTGL 187
PPE COG5651
PPE-repeat protein [Function unknown];
31-183 1.07e-06

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 52.97  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   31 TTSGGAFGTSAFGSSNN-------TGGLFGNSQTKPGGL-FGTSSFSQPATSTSTGFgFGTSTGTANTLFGTASTGTSLF 102
Cdd:COG5651    175 TNPGGLLGAQNAGSGNTssnpgfaNLGLTGLNQVGIGGLnSGSGPIGLNSGPGNTGF-AGTGAAAGAAAAAAAAAAAAGA 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  103 SSQNN-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG5651    254 GASAAlaslaATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333

                   ....*.
gi 1435761108  178 VSTNIS 183
Cdd:COG5651    334 AAAAGA 339
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
329-409 1.51e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 48.00  E-value: 1.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  329 GLFGTATNTSTGtafgtgtgLFGQTNTGfGAVGSTLFGNNklTTFGSSTTSAPSFG-----TTSGGLFGFGTNTS----G 399
Cdd:pfam13634    1 GLFGAATSTSGG--------LFGNTSTT-AASGGGLFGAA--STATATTSGGGLFGnsssnAPSGGLFGATNTTTqtatG 69
                           90
                   ....*....|
gi 1435761108  400 NSIFGSKPAP 409
Cdd:pfam13634   70 GGLFGNNAAT 79
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
115-275 1.63e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 47.61  E-value: 1.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  115 TGFGNfgTSTSSGGLFGTTNTTsnpfGSTSGSLFGPSSFTAAPTgttikfnpptgtdtmvkagvstnistkhqcitamke 194
Cdd:pfam13634    1 GLFGA--ATSTSGGLFGNTSTT----AASGGGLFGAASTATATT------------------------------------ 38
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  195 yesksleelrledyqanrkgpqnqvgagTTTGLFGSSPATSSATGLFSSSTTNSGFAygQNKTAFG-TSTTGFGTNPGGL 273
Cdd:pfam13634   39 ----------------------------SGGGLFGNSSSNAPSGGLFGATNTTTQTA--TGGGLFGnNAATTTSTTGGGL 88

                   ..
gi 1435761108  274 FG 275
Cdd:pfam13634   89 FG 90
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
26-335 1.96e-06

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 53.03  E-value: 1.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLF--- 102
Cdd:COG3468    100 GTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGggg 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  103 -----SSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG3468    180 ggaggSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAA 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  178 VSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPA---TSSATGLFSSSTTNSGFAYGQ 254
Cdd:COG3468    260 GTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGgggGSNAGGGSGGGGGGGGGGGGG 339
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  255 NKTA--FGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFG 332
Cdd:COG3468    340 GTTLngAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGGGLTLT 419

                   ...
gi 1435761108  333 TAT 335
Cdd:COG3468    420 GGT 422
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
26-309 2.35e-06

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 52.64  E-value: 2.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3468    162 GSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGG 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  106 NNAFAQNK------PTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVS 179
Cdd:COG3468    242 GGSAGGTGgggltgGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGSN 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  180 TNISTkhqcitamkeyesksleelrledyqANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAF 259
Cdd:COG3468    322 AGGGS-------------------------GGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGG 376
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1435761108  260 GTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIG 309
Cdd:COG3468    377 GGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVN 426
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
33-403 5.67e-06

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 51.58  E-value: 5.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176   139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176   219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGfayGQNKTAF 259
Cdd:NF033176   296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQS---GRVNISS 372
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  260 GTSTTGFGTNPGGlfgqqnqqTTSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTST 339
Cdd:NF033176   373 GGYAESTIINSGG--------TQSVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTV 438
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761108  340 GTAFgtgtglFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGfGTNTSGNSIF 403
Cdd:NF033176   439 NTSG------FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYA-GGEASGTQIF 495
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
26-172 2.42e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.98  E-value: 2.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTST----------GTANTLFGTA 95
Cdd:COG3469     52 AASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTvtttstgagsVTSTTSSTAG 131
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761108   96 STGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGlfgTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDT 172
Cdd:COG3469    132 STTTSGASATSSAGSTTTTTTVSGTETATGGTT---TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
26-167 1.57e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.28  E-value: 1.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3469     75 TTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1435761108  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNpfgSTSGSLFGPSSFTAAPTGTTIKFNPP 167
Cdd:COG3469    155 GTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTGPPTPGLP 213
NupH_GANP pfam16768
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ...
25-311 1.82e-04

Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.


Pssm-ID: 435572 [Multi-domain]  Cd Length: 292  Bit Score: 45.67  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTsafgssnntgglfgnSQTKPGGLFGTSS-FSQPATSTSTGFGFGTSTGtantlFGTASTGTSLFS 103
Cdd:pfam16768   10 QPSAFSTSSSPSTGT---------------FQAKPPFRFGQPSlFGQNNTLSGKNSGFSQVSS-----FPTTSGVSHSSS 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  104 SQNNAFAQnkptgfgnfgtsTSSGGLFGTTNTTSnPFGSTSgslfGPSSfTAAPTGTTIKFNPPTGTdtmvkaGVSTNIS 183
Cdd:pfam16768   70 GQTLGFTQ------------TSGVGLFSGLEHTP-SFVATS----GPSS-SSVPSNPGFSFKSPTNL------GAFPSTS 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  184 T-------KHQCITAMKEYESKSLEELRLEDYQANRKGP---QNQVGAGTTTglFgSSPATSSATGLF--------SSST 245
Cdd:pfam16768  126 TfgpesgeVASSGFGKTEFSFKPPENAVFRPIFGAESEPektQSQITSGFFT--F-SHPVSSGPGGLApfsfsqvtSSSA 202
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761108  246 TNSGFAYGQNKTAFGTSTTGFGTNPGglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQP 311
Cdd:pfam16768  203 TSSNFTFSKPVSSNNSSSAFAPALSS-----QNVEEEKRGPKSFFGSSNSSFTSFPNSSSGSLGEP 263
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
72-337 2.94e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 2.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  152 SFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTkhqcitamkeyesksleelrledyqanrkgPQNQVGAGTTTGLFGSS 231
Cdd:COG3469     83 TAAAAAATSTSATLVATSTASGANTGTSTVTTT------------------------------STGAGSVTSTTSSTAGS 132
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  232 PATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPgglfgqqnqqttslfskpfgqaTTTQNTGFSFGNTSTIGQP 311
Cdd:COG3469    133 TTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTT----------------------TTSASTTPSATTTATATTA 190
                          250       260
                   ....*....|....*....|....*.
gi 1435761108  312 StntmglfGVTQASQPGglfgTATNT 337
Cdd:COG3469    191 S-------GATTPSATT----TATTT 205
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
226-407 5.32e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 5.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  226 GLFGSSPATSSATGLFSSSTTNSGFA--YGQNKTAFGTSTTGFGtnpgglFGqqnqqttslFSKPFGQAT-------TTQ 296
Cdd:pfam15967   61 GLFGQKPATGFTFGTPASSTAATGPTglTLGTPAATTAASTGFS------LG---------FNKPAASATpfslpasSTS 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  297 NTGFSFGNTSTIGQP----STNTMGLFGVTQASQPG--GLFGTATntstgtAFGTGTGLFGQTN-TGFGAvgSTLFGNNK 369
Cdd:pfam15967  126 GGGLSLGSVLTSTAAqqgaTGFTLNLGGTPATTTAVstGLSLGST------LTSLGGSLFQNTNsTGLGQ--TTLGLTLL 197
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1435761108  370 LTTFGSSTTSAPSFGTtsGGL-FGFGTNTSGNSIFGSKP 407
Cdd:pfam15967  198 ATSTAPVSAPAASEGL--GGLdFSTSSEKKSDKASGTRP 234
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
26-269 1.03e-03

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 43.99  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGF-GTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG5295    398 SGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGgGTAGAGGAANVGAATTAASAAAT 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTN-IS 183
Cdd:COG5295    478 AAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNsVA 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  184 TKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSpATSSATGLFSSSTTNSGFAYGQNKTAFGTST 263
Cdd:COG5295    558 VGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNG-GGAVATGDNSVAVGNNAQASGANSVALGAGA 636

                   ....*.
gi 1435761108  264 TGFGTN 269
Cdd:COG5295    637 TATANN 642
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
25-152 2.04e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 42.68  E-value: 2.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118    145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761108   79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118    225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
PRK12688 PRK12688
flagellin; Reviewed
72-444 2.16e-03

flagellin; Reviewed


Pssm-ID: 171664 [Multi-domain]  Cd Length: 751  Bit Score: 42.94  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFaqnkptgFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:PRK12688   276 ATIAVSASGGAVSAAAAGAVTLKSSTGADLSVTGKADL-------LKALGLTTATGAGNATVNANRTTSAGSLGALIQDG 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  152 SfTAAPTGTTIKFN---PPTGTDTMVKAGVSTNISTkhqcitamkeyESKSLEELRLEDYQANRKGPQNQVGAGTTTGLF 228
Cdd:PRK12688   349 S-TLNVDGKTITFKnapIPGAASVPSGYGASGNVLT-----------DGNGNSTVYLQGGTINDVLKAIDLATGVQTATI 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  229 GSSPATSSATGLFSSSTTNSGfayGQNKTAFGT----STTGFGtNPGGLFGQQNQQTTSlfskpfgqatttqnTGFSFGN 304
Cdd:PRK12688   417 ANGTATLATAAGQTASSVNAS---GQLKLSTGLnadlSITGTG-NALSALGLAGNTGTA--------------TAFTAAR 478
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  305 TSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTafgtgtglFGQTNTGFGA--VGSTLFGNNKLTTFGSSTTSAPS 382
Cdd:PRK12688   479 TAGAGGISGKTLTFTSFNGGTAVNVTFGDGTNGTVKT--------LAQLNTALQAnnLTATIDATGKLTISASNDYASST 550
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1435761108  383 FG-TTSGGLFGfGTNTSGNSiFGSKPAPgtlgtglgagfgtaLTDPNASAAQQAVLQQHINSL 444
Cdd:PRK12688   551 LGsTLAGGAIG-GTLTSTLT-FSTASAP--------------VADTVAQTTRANLVKQYNNIL 597
34 PHA02584
long tail fiber, proximal subunit; Provisional
25-175 2.18e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 43.21  E-value: 2.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584   944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584  1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
                          170       180
                   ....*....|....*....|....*
gi 1435761108  151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584  1093 ASYTRAPTADTVGFWSVDINDSATY 1117
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
214-389 3.65e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 3.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  214 GPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPGGLFGQQNQQ-TTSLFSKPFGQA 292
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATlVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  293 TTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTT 372
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170
                   ....*....|....*....
gi 1435761108  373 --FGSSTTSAPSFGTTSGG 389
Cdd:COG3469    189 taSGATTPSATTTATTTGP 207
PPE COG5651
PPE-repeat protein [Function unknown];
26-159 3.91e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.80  E-value: 3.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAFGTSAFGSSNN-TGGLFGNSQTKPGGLFGTS---SFSQPATSTSTGFGFGTST---------GTANTLF 92
Cdd:COG5651    194 NPGFANLGLTGLNQVGIGGLNSgSGPIGLNSGPGNTGFAGTGaaaGAAAAAAAAAAAAGAGASAalaslaatlLNASSLG 273
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761108   93 GTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTG 159
Cdd:COG5651    274 LAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
PTZ00473 PTZ00473
Plasmodium Vir superfamily; Provisional
26-100 4.37e-03

Plasmodium Vir superfamily; Provisional


Pssm-ID: 240430 [Multi-domain]  Cd Length: 420  Bit Score: 41.37  E-value: 4.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108   26 NTGFGTTSGGAF--------------GTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTl 91
Cdd:PTZ00473   315 RGPYNANYGGQFnsrsgrtgssesirGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGGSTYGGSST- 393

                   ....*....
gi 1435761108   92 FGTASTGTS 100
Cdd:PTZ00473   394 FDGSSRGSS 402
PPE COG5651
PPE-repeat protein [Function unknown];
214-401 4.93e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.42  E-value: 4.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  214 GPQNQVGAGTTTGLFGSSPAtssATGLFSSSTTNSGFAYGqnktafgTSTTGF---GTNPGGLFGQQNQQTTSLF---SK 287
Cdd:COG5651    189 GNTSSNPGFANLGLTGLNQV---GIGGLNSGSGPIGLNSG-------PGNTGFagtGAAAGAAAAAAAAAAAAGAgasAA 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761108  288 PFGQATTTQNTGFSFGNTSTIGQPSTNTmgLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGN 367
Cdd:COG5651    259 LASLAATLLNASSLGLAATAASSAATNL--GLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAA 336
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1435761108  368 NKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNS 401
Cdd:COG5651    337 AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGS 370
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
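
Each hit in the listing above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, not part of the NCBI output, of how such a line could be parsed and checked against the E-value threshold of 0.01 stated in the parameters. The names `parse_hit_summary` and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary lines shown in this listing, e.g.
# "Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 49.15  E-value: 6.02e-07"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold
```

Applied to the summary lines in this listing, every hit would be expected to pass, since the page only reports hits under the preset threshold.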

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.