Conserved domains on  [gi|1435761100|ref|NP_001352054|]

nuclear pore complex protein Nup98-Nup96 isoform 5 [Homo sapiens]

Protein Classification

Nucleoporin2 and Nup96 domain-containing protein (domain architecture ID 13837623)

protein containing domains Herpes_BLLF1, Nucleoporin_FG, Nucleoporin2, and Nup96


List of domain hits

Name Accession Description Interval E-value
Nup96 pfam12110
1346-1637 1.53e-133

Nuclear protein 96; Nup96 (often known by the name of its yeast homolog Nup145C) is part of the Nup84 heptameric complex in the nuclear pore complex. Nup96 complexes with Sec13 in the middle of the heptamer. The function of the heptamer is to coat the curvature of the nuclear pore complex between the inner and outer nuclear membranes. Nup96 is predicted to be an alpha helical solenoid. The interaction between Nup96 and Sec13 is the point of curvature in the heptameric complex.



Pssm-ID: 463462  Cd Length: 287  Bit Score: 417.77  E-value: 1.53e-133
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100 1346 EAVFSYLTGKRISEACSLAQQSGDHRLALLLSQFVGSQSVRELLTMQLVDWHQLQADSFIQDERLRIFALLAGKPVWQLS 1425
Cdd:pfam12110    1 EKAFALLTGHRVEEACELAIDSGDFRLATLLSQAGGDDSFREDMAEQLDDWRESGVDSEIDEPRRKLYELLAGNVLVSEG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100 1426 EKKQINVCSQLDWKRSLAIHLWYLLPPTASISRALSMYEEAFQntsdSDRYACSPLPSYLEGSGCVIAEEQNSQTPLrDV 1505
Cdd:pfam12110   81 KKSTINISEGLDWKRAFGLRLWYGIPPDTSIEDAVEAYEEALS----QGREPAPPLPWYLEEGDSESWEDPRLKKRE-DL 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100 1506 CFHLLKLYSDRHYDLNQLLEPRSITADPLDYRLSWHLWEVLRA--LNYTHLSAQCEGVLQASYAGQLESEGLWEWAIFVL 1583
Cdd:pfam12110  156 LYHLLKLYADPTAPLEAVLDPESSSPDPLDYRLSWHLYQVLSAvrLGYGHLSSAKADQLTLSFASQLESLGLWQWAVFVL 235
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1435761100 1584 LHIDNSGIREKAVRELLTRHCQLLETPESwaKETFLTQKLRVPAKWIHEAKAVR 1637
Cdd:pfam12110  236 LHLEDPARRERAVRELLARHAELISEDDA--KERFLTEKLKIPEAWIHEAKALY 287
Nucleoporin2 pfam04096
721-863 2.64e-63

Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is composed of approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.



Pssm-ID: 461171  Cd Length: 143  Bit Score: 211.97  E-value: 2.64e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  721 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVVVYLDDNQKPPVGE 796
Cdd:pfam04096    1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761100  797 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 863
Cdd:pfam04096   80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
COG4625 COG4625
25-435 5.05e-16

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];



Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 84.44  E-value: 5.05e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG4625     74 AGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGG 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG4625    154 GGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGG 233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  185 KHQCITAMkeYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTT 264
Cdd:COG4625    234 GGGGGGGG--GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGG 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  265 GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFG 344
Cdd:COG4625    312 GGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGG 391
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  345 TGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTAL 424
Cdd:COG4625    392 GGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGS 471
                          410
                   ....*....|.
gi 1435761100  425 GAGQASLFGNN 435
Cdd:COG4625    472 GAGTLTLTGNN 482
Herpes_BLLF1 super family cl37540
243-666 1.76e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  243 SSTTNSGFAYGQNKTaFGTSTTGFGTNPGGLFGQQNQQTTSlfskpfgqaTTTQNTGFSFGNTSTIGQPSTNTMGlfgvt 322
Cdd:pfam05109  374 SGCENISGAFASNRT-FDITVSGLGTAPKTLIITRTATNAT---------TTTHKVIFSKAPESTTTSPTLNTTG----- 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  323 qasqpgglfgtatntstgtafgtgtglFGQTNTGFGAVGSTLFGNNkLTTFGSSTTSAPSFGTTSGGLFGfgtNTSGNSI 402
Cdd:pfam05109  439 ---------------------------FAAPNTTTGLPSSTHVPTN-LTAPASTGPTVSTADVTSPTPAG---TTSGASP 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  403 FGSKPAPGTLGTGLGAGFGTAlgagQASLFGNNQPKIGGPlgTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQA 482
Cdd:pfam05109  488 VTPSPSPRDNGTESKAPDMTS----PTSAVTTPTPNATSP--TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPA 561
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  483 VLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAAQKalTTPTHYKLTPRPATRVRPKALQTTGTAKSHlfdgl 562
Cdd:pfam05109  562 VTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANT--TNHTLGGTSSTPVVTSPPKNATSAVTTGQH----- 634
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  563 dddepslangaFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASP---SEYPENGERFSFLSKPVDENHqqdgdedslvs 639
Cdd:pfam05109  635 -----------NITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHPTGGENITQVTPASTSTH----------- 692
                          410       420
                   ....*....|....*....|....*..
gi 1435761100  640 HFYTNPIAkPIPQTPESAGNKHSNSNS 666
Cdd:pfam05109  693 HVSTSSPA-PRPGTTSQASGPGNSSTS 718
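Note: the summary rows of the hit list above (name, accession, interval, E-value) can be read into records for further processing. The following is a minimal Python sketch, not part of the NCBI output; the names `DomainHit`, `parse_hit`, `overlaps`, and `ROWS` are hypothetical, and the row layout (a name/accession line plus an "interval E-value" line) is assumed from the text shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float

    @property
    def span(self) -> int:
        """Number of query residues covered (interval is inclusive)."""
        return self.end - self.start + 1


def parse_hit(name_line: str, interval_line: str) -> DomainHit:
    """Parse one hit from its name/accession line and its interval/E-value line."""
    tokens = name_line.split()
    interval, evalue = interval_line.split()
    start, end = interval.split("-")
    # The accession is assumed to be the last token, which also covers
    # rows such as "Herpes_BLLF1 super family cl37540".
    return DomainHit(tokens[0], tokens[-1], int(start), int(end), float(evalue))


def overlaps(a: DomainHit, b: DomainHit) -> bool:
    """True if two hits share at least one query residue."""
    return a.start <= b.end and b.start <= a.end


# Rows copied from the hit list above.
ROWS = [
    ("Nup96 pfam12110", "1346-1637 1.53e-133"),
    ("Nucleoporin2 pfam04096", "721-863 2.64e-63"),
    ("COG4625 COG4625", "25-435 5.05e-16"),
    ("Herpes_BLLF1 super family cl37540", "243-666 1.76e-04"),
]
HITS = [parse_hit(name, interval) for name, interval in ROWS]

# List the hits in order along the query sequence.
for hit in sorted(HITS, key=lambda h: h.start):
    print(hit.name, hit.accession, f"{hit.start}-{hit.end}", hit.span, hit.evalue)
```

Sorting by start position makes it easier to see that the low-complexity N-terminal hits (COG4625, Herpes_BLLF1) overlap each other, while the Nucleoporin2 and Nup96 hits occupy separate regions.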
 
Name Accession Description Interval E-value
Nucleoporin_FG pfam13634
40-149 5.63e-14

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 69.18  E-value: 5.63e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634    1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1435761100  116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634   65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
ser_rich_anae_1 NF033849
25-334 6.75e-10

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 64.26  E-value: 6.75e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849   255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849   327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTglfGSSPATSSATGLFSSSTTNSGFAYGQNKTAfg 260
Cdd:NF033849   400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT-- 470
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761100  261 TSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849   471 SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
auto_AIDA-I NF033176
33-403 9.55e-06

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 50.81  E-value: 9.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176   139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176   219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGfayGQNKTAF 259
Cdd:NF033176   296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQS---GRVNISS 372
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  260 GTSTTGFGTNPGGlfgqqnqqTTSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTST 339
Cdd:NF033176   373 GGYAESTIINSGG--------TQSVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTV 438
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761100  340 GTAFgtgtglFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGfGTNTSGNSIF 403
Cdd:NF033176   439 NTSG------FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYA-GGEASGTQIF 495
Herpes_BLLF1 pfam05109
243-666 1.76e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  243 SSTTNSGFAYGQNKTaFGTSTTGFGTNPGGLFGQQNQQTTSlfskpfgqaTTTQNTGFSFGNTSTIGQPSTNTMGlfgvt 322
Cdd:pfam05109  374 SGCENISGAFASNRT-FDITVSGLGTAPKTLIITRTATNAT---------TTTHKVIFSKAPESTTTSPTLNTTG----- 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  323 qasqpgglfgtatntstgtafgtgtglFGQTNTGFGAVGSTLFGNNkLTTFGSSTTSAPSFGTTSGGLFGfgtNTSGNSI 402
Cdd:pfam05109  439 ---------------------------FAAPNTTTGLPSSTHVPTN-LTAPASTGPTVSTADVTSPTPAG---TTSGASP 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  403 FGSKPAPGTLGTGLGAGFGTAlgagQASLFGNNQPKIGGPlgTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQA 482
Cdd:pfam05109  488 VTPSPSPRDNGTESKAPDMTS----PTSAVTTPTPNATSP--TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPA 561
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  483 VLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAAQKalTTPTHYKLTPRPATRVRPKALQTTGTAKSHlfdgl 562
Cdd:pfam05109  562 VTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANT--TNHTLGGTSSTPVVTSPPKNATSAVTTGQH----- 634
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  563 dddepslangaFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASP---SEYPENGERFSFLSKPVDENHqqdgdedslvs 639
Cdd:pfam05109  635 -----------NITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHPTGGENITQVTPASTSTH----------- 692
                          410       420
                   ....*....|....*....|....*..
gi 1435761100  640 HFYTNPIAkPIPQTPESAGNKHSNSNS 666
Cdd:pfam05109  693 HVSTSSPA-PRPGTTSQASGPGNSSTS 718
34 PHA02584
25-175 2.43e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 42.82  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584   944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584  1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
                          170       180
                   ....*....|....*....|....*
gi 1435761100  151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584  1093 ASYTRAPTADTVGFWSVDINDSATY 1117
dermokine cd21118
25-152 3.48e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.91  E-value: 3.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118    145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761100   79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118    225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
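Note: the pairwise alignment blocks above can be summarized by counting aligned and identical columns. The following is a minimal Python sketch, not part of the NCBI output. The function name `alignment_stats` is hypothetical, and it assumes that `-` marks a gap and that lowercase residues are unaligned, which is how the rows appear to be drawn here. The example rows are the final block of the Nucleoporin_FG (pfam13634) alignment, query residues 116-149.

```python
def alignment_stats(query: str, subject: str) -> tuple[int, int]:
    """Return (aligned_columns, identical_columns) for one pair of alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = identical = 0
    for q, s in zip(query, subject):
        # Skip gap columns and columns with lowercase (assumed unaligned) residues.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return aligned, identical


# Final block of the Nucleoporin_FG alignment, copied from the listing above.
QUERY = "GfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG"
SUBJECT = "------QTATGGGLFGNNAATTTS--TTGGGLFG"

aligned, identical = alignment_stats(QUERY, SUBJECT)
print(f"{identical} of {aligned} aligned columns are identical")
```

Applying the same function to each block of a hit and summing the two counts gives a rough percent identity for that hit. This is a simplification and is not the scoring CD-Search uses for its bit scores.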
 
gi 1435761100  425 GAGQASLFGNN 435
Cdd:COG4625    472 GAGTLTLTGNN 482
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
40-149 5.63e-14

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 69.18  E-value: 5.63e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634    1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1435761100  116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634   65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
24-492 7.28e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 77.50  E-value: 7.28e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   24 GQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFS 103
Cdd:COG3210    825 TATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTN 904
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  104 SQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS 183
Cdd:COG3210    905 AASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSA 984
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  184 TKHQCIT-AMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTS 262
Cdd:COG3210    985 GSTGGVIaATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTAS 1064
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  263 TTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTA 342
Cdd:COG3210   1065 GTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGT 1144
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  343 FGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGT 422
Cdd:COG3210   1145 LTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTA 1224
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  423 ALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLT 492
Cdd:COG3210   1225 SDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSAT 1294
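
Several of the hits listed so far cover overlapping stretches of the query (for example 25-435, 40-149 and 24-492 in the FG-rich N-terminal region). To see which query residues are covered by at least one hit, the intervals can be merged. The sketch below uses only intervals copied from this page; `merge_intervals` is an illustrative helper, and treating directly adjacent intervals as one region is a design choice, not something CD-Search does.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end), 1-based and inclusive


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent intervals into covered regions."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hit intervals copied from the entries above:
# pfam04096, COG4625, Nucleoporin_FG (pfam13634) and FhaB (COG3210).
hits = [(721, 863), (25, 435), (40, 149), (24, 492)]
print(merge_intervals(hits))  # [(24, 492), (721, 863)]
```
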
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
239-332 1.28e-13

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 68.03  E-value: 1.28e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  239 GLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNpgglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTiGQPSTNTMGL 318
Cdd:pfam13634    1 GLFGAATSTSGGLFGNTSTTAASGGGLFGAA-------STATATTSGGGLFGNSSSNAPSGGLFGATNT-TTQTATGGGL 72
                           90
                   ....*....|....*...
gi 1435761100  319 FGVTQASQP----GGLFG 332
Cdd:pfam13634   73 FGNNAATTTsttgGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
25-464 1.09e-12

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 73.65  E-value: 1.09e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210    297 TNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAG 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210    377 AGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTN 456
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  185 KHQCITAmkeyesksleelrledyqANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTA----FG 260
Cdd:COG3210    457 GAGLSGN------------------TDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATgltgIT 518
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  261 TSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTG 340
Cdd:COG3210    519 AGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGG 598
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  341 TAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGF 420
Cdd:COG3210    599 TVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTV 678
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1435761100  421 GTALGAGQASLFGNNQpKIGGPLGTGAFGAPGFNTTTATLGFGA 464
Cdd:COG3210    679 TSGATGGTTGTTLNAA-TGGTLNNAGNTLTISTGSITVTGQIGA 721
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
25-402 2.96e-12

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 72.49  E-value: 2.96e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210    368 NGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIG 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210    448 GLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNA 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  185 KHQCITAmkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTT 264
Cdd:COG3210    528 TSGGTGG----------------DGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGG 591
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  265 GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFG 344
Cdd:COG3210    592 TGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTG 671
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1435761100  345 TGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGT-TSGGLFGFGTNTSGNSI 402
Cdd:COG3210    672 GGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSiTVTGQIGALANANGDTV 730
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
44-273 6.33e-12

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 70.47  E-value: 6.33e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   44 SSNNTGGLFGNSQTKPGGL-FGTSSFSQPatSTSTGFGFGTSTGTANtlfGTASTGTSLFSSQNNAFAQNKPTGFGnfgt 122
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFsFGAAAASNP--GSTGGFSFGTLGAAPA---ATATTTTATLGLGGGLFGQKPATGFT---- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  123 stssgglFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMkeyeskslee 202
Cdd:pfam15967   73 -------FGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVL---------- 135
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761100  203 lrledyqanrKGPQNQVGAGTTTGLFGSSPATSSA--TGLFSSSTTNS-GFAYGQNktafgTSTTGFGTNPGGL 273
Cdd:pfam15967  136 ----------TSTAAQQGATGFTLNLGGTPATTTAvsTGLSLGSTLTSlGGSLFQN-----TNSTGLGQTTLGL 194
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
272-392 8.12e-12

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 63.02  E-value: 8.12e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  272 GLFGQQNQQTTSLFSkpfGQATTTQNTGFSFGNTSTiGQPSTNTMGLFGVTQASQP-GGLFGTatntstgtafgtgtglf 350
Cdd:pfam13634    1 GLFGAATSTSGGLFG---NTSTTAASGGGLFGAAST-ATATTSGGGLFGNSSSNAPsGGLFGA----------------- 59
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1435761100  351 gQTNTGFGAVGSTLFGNNklttfgssttSAPSFGTTSGGLFG 392
Cdd:pfam13634   60 -TNTTTQTATGGGLFGNN----------AATTTSTTGGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
26-464 1.85e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 66.33  E-value: 1.85e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3210    625 ANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGN 704
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  106 NNAFAQNKPTGFGNFGTSTSSGG--------LFGTTNTTSNPFGSTSGSLFGPSSFTAAP---TGTTIKFNPPTGTDTmV 174
Cdd:COG3210    705 TLTISTGSITVTGQIGALANANGdtvtfgnlGTGATLTLNAGVTITSGNAGTLSIGLTANttaSGTTLTLANANGNTS-A 783
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  175 KAGVSTNISTKHQCITAmkeyesksleelrleDYQANRKGPqNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQ 254
Cdd:COG3210    784 GATLDNAGAEISIDITA---------------DGTITAAGT-TAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTD 847
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  255 NKTAFGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTA 334
Cdd:COG3210    848 TTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLT 927
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  335 TNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGFGTNTSGNSIFGSKPAPGTLGT 414
Cdd:COG3210    928 GGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTAS 1007
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|
gi 1435761100  415 GLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGA 464
Cdd:COG3210   1008 TTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGN 1057
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
25-334 6.75e-10

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 64.26  E-value: 6.75e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849   255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849   327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTglfGSSPATSSATGLFSSSTTNSGFAYGQNKTAfg 260
Cdd:NF033849   400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT-- 470
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761100  261 TSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849   471 SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
26-492 1.26e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 63.26  E-value: 1.26e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG4625    173 GGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGG 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  106 NNAFAQNkpTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTk 185
Cdd:COG4625    253 GGGGGNG--GGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG- 329
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  186 hqcitamkeyesksleelrledYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTG 265
Cdd:COG4625    330 ----------------------GGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGG 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTT-----QNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTA-TNTST 339
Cdd:COG4625    388 SGGGGGGGAGGGGGGGGAGGTGGGGAGGGGgaaggGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGgAGAGG 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  340 GTAFGTGTGLFGQTNTGFGAVGSTLFGNNkltTFGSSTTSAPSFGTTSGGLfgfgTNTSGN-SIFGSKPAPGTLGTGLGA 418
Cdd:COG4625    468 GSGSGAGTLTLTGNNTYTGTTTVNGGGNY---TQSAGSTLAVEVDAANSDR----LVVTGTaTLNGGTVVVLAGGYAPGT 540
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761100  419 GFgTALGAGQAslfgnnqpkiggpLGTGAFGAPGFNTTTATLGFGAPQAPVALTD--PNASAAQQAVLQQHINSLT 492
Cdd:COG4625    541 TY-TILAVAAA-------------LDALAGNGDLSALYNALAALDAAAARAALDQlsGEIHASAAAALLQASRALR 602
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
25-93 1.80e-08

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 53.39  E-value: 1.80e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGG-LFGNSQTKP--GGLFGTSSFSQPATSTSTGFGFGTSTGTANT---LFG 93
Cdd:pfam13634   16 NTSTTAASGGGLFGAASTATATTSGGgLFGNSSSNApsGGLFGATNTTTQTATGGGLFGNNAATTTSTTgggLFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
90-161 5.98e-08

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 51.85  E-value: 5.98e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   90 TLFGTA-STGTSLFSSQNNAF------------AQNKPTGFGNFG---TSTSSGGLFGTTNTTSNPfgSTSGSLFGPSSF 153
Cdd:pfam13634    1 GLFGAAtSTSGGLFGNTSTTAasggglfgaastATATTSGGGLFGnssSNAPSGGLFGATNTTTQT--ATGGGLFGNNAA 78

                   ....*...
gi 1435761100  154 TAAPTGTT 161
Cdd:pfam13634   79 TTTSTTGG 86
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
363-463 3.05e-07

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 49.92  E-value: 3.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  363 TLFGNNKLTT---FGSSTTSApsfgTTSGGLFGFG----TNTSGNSIFGSKPAPgtlgtglgagfgtalgAGQASLFGNN 435
Cdd:pfam13634    1 GLFGAATSTSgglFGNTSTTA----ASGGGLFGAAstatATTSGGGLFGNSSSN----------------APSGGLFGAT 60
                           90       100       110
                   ....*....|....*....|....*....|
gi 1435761100  436 QPKIGGPLGTGAFGAPGFNTTTATLG--FG 463
Cdd:pfam13634   61 NTTTQTATGGGLFGNNAATTTSTTGGglFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
309-404 5.60e-07

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 49.15  E-value: 5.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  309 GQPSTNTMGLFG--VTQASQPGGLFGTATNTstgtafgtgtglfGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPsfGTT 386
Cdd:pfam13634    4 GAATSTSGGLFGntSTTAASGGGLFGAASTA-------------TATTSGGGLFGNSSSNAPSGGLFGATNTTTQ--TAT 68
                           90       100
                   ....*....|....*....|..
gi 1435761100  387 SGGLFG----FGTNTSGNSIFG 404
Cdd:pfam13634   69 GGGLFGnnaaTTTSTTGGGLFG 90
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
26-402 8.45e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 8.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG5295    219 ASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSA 298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG5295    299 SSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSG 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  186 hqcitamkeyesksleelrLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTG 265
Cdd:COG5295    379 -------------------GSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGA 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  266 FGTNpGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGT 345
Cdd:COG5295    440 SGTS-SSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGA 518
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761100  346 GTGLFGQTNTGFGAVGSTLFGNnklTTFGSSTTSAPSFGTTSgglFGFGTNTSGNSI 402
Cdd:COG5295    519 ANAAAASGATATAGSAGGGAAA---AAGGGSTTAATGTNSVA---VGNNTATGANSV 569
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
28-182 2.16e-06

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 52.75  E-value: 2.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   28 GFGTTSGG--AFGTSAFGSSNNTGGL-FGNSQTKP--------------GGLFGtssfSQPATststGFGFGT------S 84
Cdd:pfam15967   11 GSTATAGGgfSFGAAAASNPGSTGGFsFGTLGAAPaatattttatlglgGGLFG----QKPAT----GFTFGTpasstaA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   85 TGTANTLFGTASTGTSlfSSQNNAFAQNKPTG----FGNFGTSTSSGGL-FGTTNTTSNPFGSTSGSLFG----PSSFTA 155
Cdd:pfam15967   83 TGPTGLTLGTPAATTA--ASTGFSLGFNKPAAsatpFSLPASSTSGGGLsLGSVLTSTAAQQGATGFTLNlggtPATTTA 160
                          170       180
                   ....*....|....*....|....*..
gi 1435761100  156 APTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:pfam15967  161 VSTGLSLGSTLTSLGGSLFQNTNSTGL 187
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
216-320 2.31e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 47.23  E-value: 2.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  216 QNQVGAGTTTGLFGSSPATSSATGLFSSsttnsgfaygqnktaFGTSTTgfGTNPGGLFGQQNQQttslfskpfgqaTTT 295
Cdd:pfam13634   16 NTSTTAASGGGLFGAASTATATTSGGGL---------------FGNSSS--NAPSGGLFGATNTT------------TQT 66
                           90       100
                   ....*....|....*....|....*
gi 1435761100  296 QNTGFSFGNTSTIGQPSTNTmGLFG 320
Cdd:pfam13634   67 ATGGGLFGNNAATTTSTTGG-GLFG 90
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
26-335 2.97e-06

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 52.26  E-value: 2.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLF--- 102
Cdd:COG3468    100 GTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGggg 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  103 -----SSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG3468    180 ggaggSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAA 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  178 VSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPA---TSSATGLFSSSTTNSGFAYGQ 254
Cdd:COG3468    260 GTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGgggGSNAGGGSGGGGGGGGGGGGG 339
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  255 NKTA--FGTSTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFG 332
Cdd:COG3468    340 GTTLngAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGGGLTLT 419

                   ...
gi 1435761100  333 TAT 335
Cdd:COG3468    420 GGT 422
PPE COG5651
31-183 3.33e-06

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 51.43  E-value: 3.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   31 TTSGGAFGTSAFGSSNN-------TGGLFGNSQTKPGGL-FGTSSFSQPATSTSTGFgFGTSTGTANTLFGTASTGTSLF 102
Cdd:COG5651    175 TNPGGLLGAQNAGSGNTssnpgfaNLGLTGLNQVGIGGLnSGSGPIGLNSGPGNTGF-AGTGAAAGAAAAAAAAAAAAGA 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  103 SSQNN-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG5651    254 GASAAlaslaATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333

                   ....*.
gi 1435761100  178 VSTNIS 183
Cdd:COG5651    334 AAAAGA 339
AidA COG3468
26-309 3.34e-06

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 52.26  E-value: 3.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3468    142 GGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGA 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIkfNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG3468    222 GGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG--GANGGGSGGGGGASGTGGGGT 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  186 HQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTG 265
Cdd:COG3468    300 ASTGGGGGGGGGNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGA 379
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1435761100  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTTQN---TGFSFGNTSTIG 309
Cdd:COG3468    380 GGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGgggGLTLTGGTLTVN 426
Nucleoporin_FG pfam13634
115-275 8.96e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 45.68  E-value: 8.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  115 TGFGNfgTSTSSGGLFGTTNTTsnpfGSTSGSLFGPSSFTAAPTgttikfnpptgtdtmvkagvstnistkhqcitamke 194
Cdd:pfam13634    1 GLFGA--ATSTSGGLFGNTSTT----AASGGGLFGAASTATATT------------------------------------ 38
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  195 yesksleelrledyqanrkgpqnqvgagTTTGLFGSSPATSSATGLFSSSTTNSGFAygQNKTAFG-TSTTGFGTNPGGL 273
Cdd:pfam13634   39 ----------------------------SGGGLFGNSSSNAPSGGLFGATNTTTQTA--TGGGLFGnNAATTTSTTGGGL 88

                   ..
gi 1435761100  274 FG 275
Cdd:pfam13634   89 FG 90
auto_AIDA-I NF033176
33-403 9.55e-06

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 50.81  E-value: 9.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176   139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176   219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGfayGQNKTAF 259
Cdd:NF033176   296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQS---GRVNISS 372
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  260 GTSTTGFGTNPGGlfgqqnqqTTSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTST 339
Cdd:NF033176   373 GGYAESTIINSGG--------TQSVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTV 438
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761100  340 GTAFgtgtglFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFGfGTNTSGNSIF 403
Cdd:NF033176   439 NTSG------FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYA-GGEASGTQIF 495
Chi1 COG3469
26-172 3.73e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.60  E-value: 3.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTST----------GTANTLFGTA 95
Cdd:COG3469     52 AASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTvtttstgagsVTSTTSSTAG 131
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761100   96 STGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGlfgTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDT 172
Cdd:COG3469    132 STTTSGASATSSAGSTTTTTTVSGTETATGGTT---TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
PPE COG5651
233-476 1.30e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 46.42  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  233 ATSSATGL--FSS---STTNSGFAYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTTSLFS--KPFGQATTTQNTGFSfgnt 305
Cdd:COG5651    157 ASAAAVALtpFTQpppTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSgsGPIGLNSGPGNTGFA---- 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  306 stigqpSTNTMGLFGVTQASQPGGLFGTATNtstgtafgtgtgLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGT 385
Cdd:COG5651    233 ------GTGAAAGAAAAAAAAAAAAGAGASA------------ALASLAATLLNASSLGLAATAASSAATNLGLAGSPLG 294
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  386 TSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAP 465
Cdd:COG5651    295 LAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAA 374
                          250
                   ....*....|.
gi 1435761100  466 QAPVALTDPNA 476
Cdd:COG5651    375 AGAASGGGAAA 385
Herpes_BLLF1 pfam05109
243-666 1.76e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  243 SSTTNSGFAYGQNKTaFGTSTTGFGTNPGGLFGQQNQQTTSlfskpfgqaTTTQNTGFSFGNTSTIGQPSTNTMGlfgvt 322
Cdd:pfam05109  374 SGCENISGAFASNRT-FDITVSGLGTAPKTLIITRTATNAT---------TTTHKVIFSKAPESTTTSPTLNTTG----- 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  323 qasqpgglfgtatntstgtafgtgtglFGQTNTGFGAVGSTLFGNNkLTTFGSSTTSAPSFGTTSGGLFGfgtNTSGNSI 402
Cdd:pfam05109  439 ---------------------------FAAPNTTTGLPSSTHVPTN-LTAPASTGPTVSTADVTSPTPAG---TTSGASP 487
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  403 FGSKPAPGTLGTGLGAGFGTAlgagQASLFGNNQPKIGGPlgTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQA 482
Cdd:pfam05109  488 VTPSPSPRDNGTESKAPDMTS----PTSAVTTPTPNATSP--TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPA 561
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  483 VLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAAQKalTTPTHYKLTPRPATRVRPKALQTTGTAKSHlfdgl 562
Cdd:pfam05109  562 VTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANT--TNHTLGGTSSTPVVTSPPKNATSAVTTGQH----- 634
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  563 dddepslangaFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASP---SEYPENGERFSFLSKPVDENHqqdgdedslvs 639
Cdd:pfam05109  635 -----------NITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHPTGGENITQVTPASTSTH----------- 692
                          410       420
                   ....*....|....*....|....*..
gi 1435761100  640 HFYTNPIAkPIPQTPESAGNKHSNSNS 666
Cdd:pfam05109  693 HVSTSSPA-PRPGTTSQASGPGNSSTS 718
Chi1 COG3469
26-167 2.19e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.90  E-value: 2.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3469     75 TTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1435761100  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNpfgSTSGSLFGPSSFTAAPTGTTIKFNPP 167
Cdd:COG3469    155 GTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTGPPTPGLP 213
NupH_GANP pfam16768
25-311 3.68e-04

Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.


Pssm-ID: 435572 [Multi-domain]  Cd Length: 292  Bit Score: 44.51  E-value: 3.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTsafgssnntgglfgnSQTKPGGLFGTSS-FSQPATSTSTGFGFGTSTGtantlFGTASTGTSLFS 103
Cdd:pfam16768   10 QPSAFSTSSSPSTGT---------------FQAKPPFRFGQPSlFGQNNTLSGKNSGFSQVSS-----FPTTSGVSHSSS 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  104 SQNNAFAQnkptgfgnfgtsTSSGGLFGTTNTTSnPFGSTSgslfGPSSfTAAPTGTTIKFNPPTGTdtmvkaGVSTNIS 183
Cdd:pfam16768   70 GQTLGFTQ------------TSGVGLFSGLEHTP-SFVATS----GPSS-SSVPSNPGFSFKSPTNL------GAFPSTS 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  184 TKHQCITAMK-------EYESKSLEELRLEDYQANRKGP---QNQVGAGTTTglFgSSPATSSATGLF--------SSST 245
Cdd:pfam16768  126 TFGPESGEVAssgfgktEFSFKPPENAVFRPIFGAESEPektQSQITSGFFT--F-SHPVSSGPGGLApfsfsqvtSSSA 202
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761100  246 TNSGFAYGQNKTAFGTSTTGFGTNPGglfgqQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQP 311
Cdd:pfam16768  203 TSSNFTFSKPVSSNNSSSAFAPALSS-----QNVEEEKRGPKSFFGSSNSSFTSFPNSSSGSLGEP 263
Chi1 COG3469
72-337 4.76e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 4.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  152 SFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTkhqcitamkeyesksleelrledyqanrkgPQNQVGAGTTTGLFGSS 231
Cdd:COG3469     83 TAAAAAATSTSATLVATSTASGANTGTSTVTTT------------------------------STGAGSVTSTTSSTAGS 132
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  232 PATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPgglfgqqnqqttslfskpfgqaTTTQNTGFSFGNTSTIGQP 311
Cdd:COG3469    133 TTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTT----------------------TTSASTTPSATTTATATTA 190
                          250       260
                   ....*....|....*....|....*.
gi 1435761100  312 StntmglfGVTQASQPGglfgTATNT 337
Cdd:COG3469    191 S-------GATTPSATT----TATTT 205
Nucleoporin_FG2 pfam15967
226-407 1.15e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 43.89  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  226 GLFGSSPATSSATGLFSSSTTNSGFA--YGQNKTAFGTSTTGFGtnpgglFGqqnqqttslFSKPFGQAT-------TTQ 296
Cdd:pfam15967   61 GLFGQKPATGFTFGTPASSTAATGPTglTLGTPAATTAASTGFS------LG---------FNKPAASATpfslpasSTS 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  297 NTGFSFGNTSTIGQP----STNTMGLFGVTQASQPgglfGTATNTSTGTAFGTGTGLFGQTN-TGFGAvgSTLFGNNKLT 371
Cdd:pfam15967  126 GGGLSLGSVLTSTAAqqgaTGFTLNLGGTPATTTA----VSTGLSLGSTLTSLGGSLFQNTNsTGLGQ--TTLGLTLLAT 199
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1435761100  372 TFGSSTTSAPSFGTtsGGL-FGFGTNTSGNSIFGSKP 407
Cdd:pfam15967  200 STAPVSAPAASEGL--GGLdFSTSSEKKSDKASGTRP 234
34 PHA02584
25-175 2.43e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 42.82  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584   944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584  1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
                          170       180
                   ....*....|....*....|....*
gi 1435761100  151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584  1093 ASYTRAPTADTVGFWSVDINDSATY 1117
dermokine cd21118
25-152 3.48e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.91  E-value: 3.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118    145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761100   79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118    225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
PTZ00473 PTZ00473
26-100 4.21e-03

Plasmodium Vir superfamily; Provisional


Pssm-ID: 240430 [Multi-domain]  Cd Length: 420  Bit Score: 41.76  E-value: 4.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAF--------------GTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTl 91
Cdd:PTZ00473   315 RGPYNANYGGQFnsrsgrtgssesirGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGGSTYGGSST- 393

                   ....*....
gi 1435761100   92 FGTASTGTS 100
Cdd:PTZ00473   394 FDGSSRGSS 402
Chi1 COG3469
214-389 4.73e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 4.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  214 GPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPGGLFGQQNQQ-TTSLFSKPFGQA 292
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATlVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100  293 TTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTT 372
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170
                   ....*....|....*....
gi 1435761100  373 --FGSSTTSAPSFGTTSGG 389
Cdd:COG3469    189 taSGATTPSATTTATTTGP 207
PPE COG5651
26-159 9.71e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 40.26  E-value: 9.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761100   26 NTGFGTTSGGAFGTSAFGSSNN-TGGLFGNSQTKPGGLFGTS---SFSQPATSTSTGFGFGTST---------GTANTLF 92
Cdd:COG5651    194 NPGFANLGLTGLNQVGIGGLNSgSGPIGLNSGPGNTGFAGTGaaaGAAAAAAAAAAAAGAGASAalaslaatlLNASSLG 273
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761100   93 GTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTG 159
Cdd:COG5651    274 LAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
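
The per-hit summary lines above (Pssm-ID, Cd Length, Bit Score, E-value) can be read programmatically and checked against the E-value threshold listed in the search parameters. The following is a minimal Python sketch, not an NCBI tool: the function names parse_summary and passes_threshold are hypothetical, and treating a hit as reportable when its E-value is less than or equal to the threshold is an assumption about how the cutoff is applied.

```python
import re

# Matches the summary line printed above each alignment, e.g.
# "Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 40.26  E-value: 9.71e-03"
# The "[Multi-domain]" flag is optional (specific single-domain hits omit it).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit when its E-value does not exceed the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 40.26  E-value: 9.71e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Applied to every line of the listing, parse_summary skips alignment rows and description text (it returns None for them), so the result is one record per domain hit.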
