|
Name |
Accession |
Description |
Interval |
E-value |
| Nup96 |
pfam12110 |
Nuclear protein 96; Nup96 (often known by the name of its yeast homolog Nup145C) is part of ... |
1298-1589 |
9.58e-134 |
|
Nuclear protein 96; Nup96 (often known by the name of its yeast homolog Nup145C) is part of the Nup84 heptameric complex in the nuclear pore complex. Nup96 complexes with Sec13 in the middle of the heptamer. The function of the heptamer is to coat the curvature of the nuclear pore complex between the inner and outer nuclear membranes. Nup96 is predicted to be an alpha helical solenoid. The interaction between Nup96 and Sec13 is the point of curvature in the heptameric complex.
Pssm-ID: 463462 Cd Length: 287 Bit Score: 417.77 E-value: 9.58e-134
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 1298 EAVFSYLTGKRISEACSLAQQSGDHRLALLLSQFVGSQSVRELLTMQLVDWHQLQADSFIQDERLRIFALLAGKPVWQLS 1377
Cdd:pfam12110 1 EKAFALLTGHRVEEACELAIDSGDFRLATLLSQAGGDDSFREDMAEQLDDWRESGVDSEIDEPRRKLYELLAGNVLVSEG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 1378 EKKQINVCSQLDWKRSLAIHLWYLLPPTASISRALSMYEEAFQntsdSDRYACSPLPSYLEGSGCVIAEEQNSQTPLrDV 1457
Cdd:pfam12110 81 KKSTINISEGLDWKRAFGLRLWYGIPPDTSIEDAVEAYEEALS----QGREPAPPLPWYLEEGDSESWEDPRLKKRE-DL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 1458 CFHLLKLYSDRHYDLNQLLEPRSITADPLDYRLSWHLWEVLRA--LNYTHLSAQCEGVLQASYAGQLESEGLWEWAIFVL 1535
Cdd:pfam12110 156 LYHLLKLYADPTAPLEAVLDPESSSPDPLDYRLSWHLYQVLSAvrLGYGHLSSAKADQLTLSFASQLESLGLWQWAVFVL 235
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 1536 LHIDNSGIREKAVRELLTRHCQLLETPESwaKETFLTQKLRVPAKWIHEAKAVR 1589
Cdd:pfam12110 236 LHLEDPARRERAVRELLARHAELISEDDA--KERFLTEKLKIPEAWIHEAKALY 287
|
|
| Nucleoporin2 |
pfam04096 |
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional ... |
704-846 |
2.57e-63 |
|
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is composed of approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.
Pssm-ID: 461171 Cd Length: 143 Bit Score: 211.97 E-value: 2.57e-63
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 704 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVVVYLDDNQKPPVGE 779
Cdd:pfam04096 1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761106 780 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 846
Cdd:pfam04096 80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
40-149 |
2.54e-14 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 69.95 E-value: 2.54e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634 1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
|
90 100 110
....*....|....*....|....*....|....
gi 1435761106 116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634 65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
226-323 |
8.02e-14 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 68.41 E-value: 8.02e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 226 GLFGSSPATSsaTGLFSSSTTNsgfaygttgfGTNPGGLFGQQNQQTTSLF-SKPFGQATTTQNTGFSFGNTSTiGQPST 304
Cdd:pfam13634 1 GLFGAATSTS--GGLFGNTSTT----------AASGGGLFGAASTATATTSgGGLFGNSSSNAPSGGLFGATNT-TTQTA 67
|
90 100
....*....|....*....|...
gi 1435761106 305 NTMGLFGVTQASQP----GGLFG 323
Cdd:pfam13634 68 TGGGLFGNNAATTTsttgGGLFG 90
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
24-462 |
9.00e-14 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 77.11 E-value: 9.00e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 24 GQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFS 103
Cdd:COG3210 825 TATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTN 904
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 104 SQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS 183
Cdd:COG3210 905 AASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSA 984
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 184 TKHQCITAmkeyeskSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGG 263
Cdd:COG3210 985 GSTGGVIA-------ATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGN 1057
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 264 LFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQ 343
Cdd:COG3210 1058 AAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTAST 1137
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 344 TNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIG 423
Cdd:COG3210 1138 EAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVT 1217
|
410 420 430
....*....|....*....|....*....|....*....
gi 1435761106 424 GPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAA 462
Cdd:COG3210 1218 TTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDA 1256
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
44-262 |
2.32e-12 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 72.01 E-value: 2.32e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 44 SSNNTGGLFGNSQTKPGGL-FGTSSFSQPatSTSTGFGFGTSTGTANtlfGTASTGTSLFSSQNNAFAQNKPTGFGnfgt 122
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFsFGAAAASNP--GSTGGFSFGTLGAAPA---ATATTTTATLGLGGGLFGQKPATGFT---- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 123 stssgglFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS---TKHQCITAMKEYESKS 199
Cdd:pfam15967 73 -------FGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGgglSLGSVLTSTAAQQGAT 145
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1435761106 200 LEELRLedyqanrkGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTnSGFAYGTTGFGTNPG 262
Cdd:pfam15967 146 GFTLNL--------GGTPATTTAVSTGLSLGSTLTSLGGSLFQNTNS-TGLGQTTLGLTLLAT 199
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
25-325 |
1.75e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 59.63 E-value: 1.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849 255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849 327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTGlfgsspaTSSATGLFSSSTTNSGFAYGTT---GF 257
Cdd:NF033849 400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSG-------HSDSSSHSTSSGQADSVSQGTSwseGT 468
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 258 GTNPGGLFGQ-----QNQQTTSLFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 325
Cdd:NF033849 469 GTSQGQSVGTseswsTSQSETDSVGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
|
|
| auto_AIDA-I |
NF033176 |
autotransporter adhesin AIDA-I; |
33-386 |
9.51e-06 |
|
autotransporter adhesin AIDA-I;
Pssm-ID: 380183 [Multi-domain] Cd Length: 1287 Bit Score: 50.81 E-value: 9.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176 139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176 219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGFAYGTTGFGT 259
Cdd:NF033176 296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQSGRVNISSGGY 375
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 260 NPGGLFGQQNQQttSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTSTGTAFgtgtg 339
Cdd:NF033176 376 AESTIINSGGTQ--SVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTVNTSG----- 442
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 340 lFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNT-------SGNSIF 386
Cdd:NF033176 443 -FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTvyaggeaSGTQIF 495
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
26-172 |
3.03e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.60 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTST----------GTANTLFGTA 95
Cdd:COG3469 52 AASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTvtttstgagsVTSTTSSTAG 131
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761106 96 STGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGlfgTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDT 172
Cdd:COG3469 132 STTTSGASATSSAGSTTTTTTVSGTETATGGTT---TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
|
|
| PRK12688 |
PRK12688 |
flagellin; Reviewed |
72-474 |
2.12e-04 |
|
flagellin; Reviewed
Pssm-ID: 171664 [Multi-domain] Cd Length: 751 Bit Score: 46.02 E-value: 2.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFaqnkptgFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:PRK12688 276 ATIAVSASGGAVSAAAAGAVTLKSSTGADLSVTGKADL-------LKALGLTTATGAGNATVNANRTTSAGSLGALIQDG 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 152 SfTAAPTGTTIKFN---PPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLF 228
Cdd:PRK12688 349 S-TLNVDGKTITFKnapIPGAASVPSGYGASGNVLTDGNGNSTVYLQGGTINDVLKAIDLATGVQTATIANGTATLATAA 427
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 229 GSSPATSSATGLFSSST-TNSGFAYGTTGFGTNPGGLFGQQnqqttslfskpfGQATTtqntgFSFGNTSTIGQPSTNTM 307
Cdd:PRK12688 428 GQTASSVNASGQLKLSTgLNADLSITGTGNALSALGLAGNT------------GTATA-----FTAARTAGAGGISGKTL 490
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 308 GLFGVTQASQPGGLFGTATNTSTGTafgtgtglFGQTNTgfgavgsTLFGNNKLTTFGSSttsapsfGFGTNTSGNSIFG 387
Cdd:PRK12688 491 TFTSFNGGTAVNVTFGDGTNGTVKT--------LAQLNT-------ALQANNLTATIDAT-------GKLTISASNDYAS 548
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 388 SkpapgtlgtglgaGFGTALGAGqaslfgnnqpKIGGplgtgafgapgfnTTTATLGFGAPQAPVAltDPNASAAQQAVL 467
Cdd:PRK12688 549 S-------------TLGSTLAGG----------AIGG-------------TLTSTLTFSTASAPVA--DTVAQTTRANLV 590
|
....*..
gi 1435761106 468 QQHINSL 474
Cdd:PRK12688 591 KQYNNIL 597
|
|
| 34 |
PHA02584 |
long tail fiber, proximal subunit; Provisional |
25-175 |
2.24e-03 |
|
long tail fiber, proximal subunit; Provisional
Pssm-ID: 222890 [Multi-domain] Cd Length: 1229 Bit Score: 42.82 E-value: 2.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584 944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584 1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
|
170 180
....*....|....*....|....*
gi 1435761106 151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584 1093 ASYTRAPTADTVGFWSVDINDSATY 1117
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
25-152 |
2.85e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 42.29 E-value: 2.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118 145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761106 79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118 225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
220-388 |
5.84e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 41.53 E-value: 5.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 220 GAGTTTGlFGSSPATSSATGL-------FSSSTTNSGFAYGTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFS 292
Cdd:NF033849 236 GQSAGTG-YGESVGHSTSQGQshsvgtsESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 293 FGNTSTIGQ---PSTNTMGLFGVTQASQ--PGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSS 367
Cdd:NF033849 315 EGTSTTDSSshsQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFS 394
|
170 180
....*....|....*....|....*
gi 1435761106 368 TTSAP----SFGFGTNTSGNSIFGS 388
Cdd:NF033849 395 GGIAGggvtSEGLGASQGGSEGWGS 419
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
26-418 |
1.38e-13 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 76.36 E-value: 1.38e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG4625 85 GGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGG 164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG4625 165 GGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 244
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 186 HQCITAMKEYESKS-----LEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTN 260
Cdd:COG4625 245 GGGAGGGGGGGGGNgggggAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 324
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 261 PGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGL 340
Cdd:COG4625 325 GGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGG 404
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1435761106 341 FGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNN 418
Cdd:COG4625 405 AGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNN 482
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
25-447 |
1.37e-12 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 73.26 E-value: 1.37e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210 297 TNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAG 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210 377 AGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTN 456
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 185 KHQCITAmkeyeSKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGGL 264
Cdd:COG3210 457 GAGLSGN-----TDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGG 531
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 265 FGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQT 344
Cdd:COG3210 532 TGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGAT 611
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 345 NTGFGAVGST-------LFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGN 417
Cdd:COG3210 612 GTITLGAGTSgaganatGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTL 691
|
410 420 430
....*....|....*....|....*....|
gi 1435761106 418 NQPKIGGPLGTGAFGAPGFNTTTATLGFGA 447
Cdd:COG3210 692 NAATGGTLNNAGNTLTISTGSITVTGQIGA 721
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
30-447 |
1.93e-11 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 69.80 E-value: 1.93e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 30 GTTSGGAFGTSAFGSSNNTGGLFGNSQT----KPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTL-FGTASTGTSLFSS 104
Cdd:COG3210 756 TLSIGLTANTTASGTTLTLANANGNTSAgatlDNAGAEISIDITADGTITAAGTTAINVTGSGGTItINTATTGLTGTGD 835
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210 836 TTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLAT 915
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 185 KHqcITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGGL 264
Cdd:COG3210 916 VT--ATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAA 993
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 265 FGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQT 344
Cdd:COG3210 994 TGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGT 1073
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 345 NTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGG 424
Cdd:COG3210 1074 AASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSA 1153
|
410 420
....*....|....*....|...
gi 1435761106 425 PLGTGAFGAPGFNTTTATLGFGA 447
Cdd:COG3210 1154 VAGGASSASAGDTTAVAAATTTT 1176
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
263-375 |
3.12e-11 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 61.09 E-value: 3.12e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 263 GLFGQQNQQTTSLFSkpfGQATTTQNTGFSFGNTSTiGQPSTNTMGLFGVTQASQP-GGLFGTatntstgtafgtgtglf 341
Cdd:pfam13634 1 GLFGAATSTSGGLFG---NTSTTAASGGGLFGAAST-ATATTSGGGLFGNSSSNAPsGGLFGA----------------- 59
|
90 100 110
....*....|....*....|....*....|....
gi 1435761106 342 gQTNTGFGAVGSTLFGNNklTTFGSSTTSAPSFG 375
Cdd:pfam13634 60 -TNTTTQTATGGGLFGNN--AATTTSTTGGGLFG 90
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
25-385 |
1.03e-10 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 67.10 E-value: 1.03e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSS 104
Cdd:COG3210 368 NGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIG 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 105 QNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIST 184
Cdd:COG3210 448 GLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNA 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 185 KHQCITAMKEY-ESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGG 263
Cdd:COG3210 528 TSGGTGGDGTTlSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGS 607
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 264 LFGQQ--NQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLF 341
Cdd:COG3210 608 AGATGtiTLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTT 687
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1435761106 342 GQTNTGFGAVGSTLFGNNkLTTFGSSTTSAPSFGFGTNTSGNSI 385
Cdd:COG3210 688 GTTLNAATGGTLNNAGNT-LTISTGSITVTGQIGALANANGDTV 730
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
26-466 |
2.98e-10 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 65.56 E-value: 2.98e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3210 585 STSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGV 664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLF-GPSSFTAA--------PTGTTIKF-NPPTGTDTMVK 175
Cdd:COG3210 665 NTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTiSTGSITVTgqigalanANGDTVTFgNLGTGATLTLN 744
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 176 AGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGlfgsspATSSATGLFSSSTTNSGFAYGTT 255
Cdd:COG3210 745 AGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAEIS------IDITADGTITAAGTTAINVTGSG 818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 256 G--------FGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATN 327
Cdd:COG3210 819 GtitintatTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTN 898
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 328 TSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTAL 407
Cdd:COG3210 899 LGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVG 978
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1435761106 408 GAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAV 466
Cdd:COG3210 979 TSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTA 1037
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
26-384 |
4.72e-10 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 64.58 E-value: 4.72e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLF--- 102
Cdd:COG3468 100 GTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGggg 179
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 103 -----SSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG3468 180 ggaggSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAA 259
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 178 VSTNISTKHqcitamkeyesksleelrleDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGF 257
Cdd:COG3468 260 GTGGGGGGT--------------------GTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGG 319
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 258 GTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTG 337
Cdd:COG3468 320 SNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGG 399
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1435761106 338 TGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNS 384
Cdd:COG3468 400 TGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTGNNGTLVLNTVLGDDN 446
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
26-475 |
2.09e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 62.49 E-value: 2.09e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQpATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG4625 173 GGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGG-GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG4625 252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 186 HQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGGLF 265
Cdd:COG4625 332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 266 GQQNQQTTSlfskpFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTStgtaFGTGTGLFGQTN 345
Cdd:COG4625 412 GAGGGGGAA-----GGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSG----SGAGTLTLTGNN 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 346 TGFGAVGSTLFGNNkltTFGSSTTSAPSFGFGT----NTSGN-SIFGSKPAPGTLGTGLGAGFgTALGAGQAslfgnnqp 420
Cdd:COG4625 483 TYTGTTTVNGGGNY---TQSAGSTLAVEVDAANsdrlVVTGTaTLNGGTVVVLAGGYAPGTTY-TILAVAAA-------- 550
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761106 421 kiggpLGTGAFGAPGFNTTTATLGFGAPQAPVALTD--PNASAAQQAVLQQHINSLT 475
Cdd:COG4625 551 -----LDALAGNGDLSALYNALAALDAAAARAALDQlsGEIHASAAAALLQASRALR 602
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
25-93 |
1.08e-08 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 54.16 E-value: 1.08e-08
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGG-LFGNSQTKP--GGLFGTSSFSQPATSTSTGFGFGTSTGTANT---LFG 93
Cdd:pfam13634 16 NTSTTAASGGGLFGAASTATATTSGGgLFGNSSSNApsGGLFGATNTTTQTATGGGLFGNNAATTTSTTgggLFG 90
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
25-325 |
1.75e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 59.63 E-value: 1.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTantlfgtaSTGTS 100
Cdd:NF033849 255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTD--------SSSHS 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 101 LFSSQNNAFAQnkptgfgNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVST 180
Cdd:NF033849 327 QSSSYNVSSGT-------GVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAG 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 181 NISTKHQCITAMKEYESKSLEelrlEDYQANRKGPQNQVGAGTTTGlfgsspaTSSATGLFSSSTTNSGFAYGTT---GF 257
Cdd:NF033849 400 GGVTSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSG-------HSDSSSHSTSSGQADSVSQGTSwseGT 468
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 258 GTNPGGLFGQ-----QNQQTTSLFSKPFGQATTT-QNTGFSFGNTSTIGQPSTNTMGlfgvtQASQPGGLFGTA 325
Cdd:NF033849 469 GTSQGQSVGTseswsTSQSETDSVGDSTGTSESVsQGDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
26-462 |
1.94e-08 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 59.78 E-value: 1.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTG---------FGFGTSTGTANTLFGTAS 96
Cdd:COG3210 651 TGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGntltistgsITVTGQIGALANANGDTV 730
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 97 TGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTaaPTGTTIKFNpPTGTDTMVKA 176
Cdd:COG3210 731 TFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLD--NAGAEISID-ITADGTITAA 807
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 177 GVST-NISTKHQCITamkeyesksleelrledyqanrkgpqnqVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTT 255
Cdd:COG3210 808 GTTAiNVTGSGGTIT----------------------------INTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASG 859
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 256 GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFG 335
Cdd:COG3210 860 GGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAG 939
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 336 TGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLF 415
Cdd:COG3210 940 NGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGG 1019
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 1435761106 416 GNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAA 462
Cdd:COG3210 1020 NGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGT 1066
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
90-161 |
3.24e-08 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 52.62 E-value: 3.24e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 90 TLFGTA-STGTSLFSSQNNAF------------AQNKPTGFGNFG---TSTSSGGLFGTTNTTSNPfgSTSGSLFGPSSF 153
Cdd:pfam13634 1 GLFGAAtSTSGGLFGNTSTTAasggglfgaastATATTSGGGLFGnssSNAPSGGLFGATNTTTQT--ATGGGLFGNNAA 78
|
....*...
gi 1435761106 154 TAAPTGTT 161
Cdd:pfam13634 79 TTTSTTGG 86
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
26-380 |
3.43e-08 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 58.63 E-value: 3.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG5295 278 SGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAA 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKfnppTGTDTMVKAGVSTNISTK 185
Cdd:COG5295 358 ADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAG----GAAAGSAAAGTSSNTSAV 433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 186 HQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGGLF 265
Cdd:COG5295 434 GASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSA 513
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 266 GQQNQQTTSLFSkpfgQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQTN 345
Cdd:COG5295 514 AAGGAANAAAAS----GATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAG 589
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1435761106 346 TGFGAVGSTL-----FGNNKLTTFGSSTTSAPSFGFGTNT 380
Cdd:COG5295 590 AENVAAGATDtdavnGGGAVATGDNSVAVGNNAQASGANS 629
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
216-311 |
6.12e-08 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 51.85 E-value: 6.12e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 216 QNQVGAGTTTGLFGSSPATssatglfsSSTTNSGFAYGTTGFGTNPGGLFGQQNQQttslfskpfgqaTTTQNTGFSFGN 295
Cdd:pfam13634 16 NTSTTAASGGGLFGAASTA--------TATTSGGGLFGNSSSNAPSGGLFGATNTT------------TQTATGGGLFGN 75
|
90
....*....|....*.
gi 1435761106 296 TSTIGQPSTNTmGLFG 311
Cdd:pfam13634 76 NAATTTSTTGG-GLFG 90
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
26-477 |
1.72e-07 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 56.70 E-value: 1.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3210 537 TTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITL 616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG3210 617 GAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATG 696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 186 HQCITAmkeyesksleelrledyqanrkGPQNQVGAG--TTTGLFGSSPATSSATglFSSSTTNSGFAYGTTGFGTNPGG 263
Cdd:COG3210 697 GTLNNA----------------------GNTLTISTGsiTVTGQIGALANANGDT--VTFGNLGTGATLTLNAGVTITSG 752
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 264 LFGQQNQQTTSLFSKpFGQATTTQNTGfsfGNTSTIGQPSTNTMGLFGVTQASqpgGLFGTATNTSTGTAFGTGTGLFGQ 343
Cdd:COG3210 753 NAGTLSIGLTANTTA-SGTTLTLANAN---GNTSAGATLDNAGAEISIDITAD---GTITAAGTTAINVTGSGGTITINT 825
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 344 TNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIG 423
Cdd:COG3210 826 ATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNA 905
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 424 GPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYS 477
Cdd:COG3210 906 ASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASA 959
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
26-418 |
2.72e-07 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 55.55 E-value: 2.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG5295 239 ASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGG 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG5295 319 GAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGS 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 186 HQCITAMkeyesksleelrleDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGGLF 265
Cdd:COG5295 399 GGSSTGA--------------SAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAAN 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 266 GQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTN----TMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLF 341
Cdd:COG5295 465 VGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAgaagGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGG 544
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1435761106 342 GQTNTGFGAvGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASL-FGNN 418
Cdd:COG5295 545 GSTTAATGT-NSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGGAVATGDNSVaVGNN 621
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
31-261 |
3.47e-07 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 54.51 E-value: 3.47e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 31 TTSGGAFGTSAFGSSNN-------TGGLFGNSQTKPGGL-FGTSSFSQPATSTSTGFgFGTSTGTANTLFGTASTGTSLF 102
Cdd:COG5651 175 TNPGGLLGAQNAGSGNTssnpgfaNLGLTGLNQVGIGGLnSGSGPIGLNSGPGNTGF-AGTGAAAGAAAAAAAAAAAAGA 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 103 SSQNN-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG5651 254 GASAAlaslaATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 178 VSTNistkhqcitamkeyesksleelrledyqanrkGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGF 257
Cdd:COG5651 334 AAAA--------------------------------GAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGG 381
|
....
gi 1435761106 258 GTNP 261
Cdd:COG5651 382 GAAA 385
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
340-446 |
8.87e-07 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 48.38 E-value: 8.87e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 340 LFGQTNTGfgavGSTLFGNNklttfgSSTTSAPSFGFG------TNTSGNSIFGSKPAPgtlgtglgagfgtalgAGQAS 413
Cdd:pfam13634 2 LFGAATST----SGGLFGNT------STTAASGGGLFGaastatATTSGGGLFGNSSSN----------------APSGG 55
|
90 100 110
....*....|....*....|....*....|....*
gi 1435761106 414 LFGNNQPKIGGPLGTGAFGAPGFNTTTATLG--FG 446
Cdd:pfam13634 56 LFGATNTTTQTATGGGLFGNNAATTTSTTGGglFG 90
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
33-182 |
1.43e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 53.13 E-value: 1.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 33 SGGAFGTSAFGSSNNTGGL-FGNSQTKP----GGL-FGTSSFSQPATSTST-----------------GFGFGT------ 83
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFsFGAAAASNpgstGGFsFGTLGAAPAATATTTtatlglggglfgqkpatGFTFGTpassta 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 84 STGTANTLFGTASTGTSlfSSQNNAFAQNKPTG----FGNFGTSTSSGGL-FGTTNTTSNPFGSTSGSLFG----PSSFT 154
Cdd:pfam15967 82 ATGPTGLTLGTPAATTA--ASTGFSLGFNKPAAsatpFSLPASSTSGGGLsLGSVLTSTAAQQGATGFTLNlggtPATTT 159
|
170 180
....*....|....*....|....*...
gi 1435761106 155 AAPTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:pfam15967 160 AVSTGLSLGSTLTSLGGSLFQNTNSTGL 187
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
72-320 |
2.28e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 2.28e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 152 SFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTKhqcitamkeyesksleelrledyqanrkgpqnqVGAGTTTGlfGSS 231
Cdd:COG3469 83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTS---------------------------------TGAGSVTS--TTS 127
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 232 PATSSATGLFSSSTTNSGFAYGTTGFGTNPGGLFGQQNQQTTSlfskpfgqATTTQNTGFSFGNTSTIGQPSTNTMGLFG 311
Cdd:COG3469 128 STAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT--------TTTSASTTPSATTTATATTASGATTPSAT 199
|
....*....
gi 1435761106 312 VTQASQPGG 320
Cdd:COG3469 200 TTATTTGPP 208
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
115-276 |
3.69e-06 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 46.84 E-value: 3.69e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 115 TGFGNfgTSTSSGGLFGTTNTTsnpfGSTSGSLFGPSSFTAAPTgttikfnpptgtdtmvkagvstnistkhqcitamke 194
Cdd:pfam13634 1 GLFGA--ATSTSGGLFGNTSTT----AASGGGLFGAASTATATT------------------------------------ 38
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 195 yesksleelrledyqanrkgpqnqvgagTTTGLFGSSPATSSATGLFSSSTTNSGFAYGTTGFGTNPGglfGQQNQQTTS 274
Cdd:pfam13634 39 ----------------------------SGGGLFGNSSSNAPSGGLFGATNTTTQTATGGGLFGNNAA---TTTSTTGGG 87
|
..
gi 1435761106 275 LF 276
Cdd:pfam13634 88 LF 89
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
221-392 |
9.43e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 50.44 E-value: 9.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 221 AGTTTGL-FGSSPATSSATglfsSSTTNSGFAYGTTGFGTNPGGLFGQQNQQTTSLFSKPFG------QATTTQNTGFSF 293
Cdd:pfam15967 30 PGSTGGFsFGTLGAAPAAT----ATTTTATLGLGGGLFGQKPATGFTFGTPASSTAATGPTGltlgtpAATTAASTGFSL 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 294 GntstIGQPSTNTMGLFGVTQASQPGGL-FGTATNTSTGTAFGTGTGL-FGQTNTGFGAVGSTLFGNNKLTTFGSSTTSA 371
Cdd:pfam15967 106 G----FNKPAASATPFSLPASSTSGGGLsLGSVLTSTAAQQGATGFTLnLGGTPATTTAVSTGLSLGSTLTSLGGSLFQN 181
|
170 180
....*....|....*....|..
gi 1435761106 372 P-SFGFGTNTSGNSIFGSKPAP 392
Cdd:pfam15967 182 TnSTGLGQTTLGLTLLATSTAP 203
|
|
| auto_AIDA-I |
NF033176 |
autotransporter adhesin AIDA-I; |
33-386 |
9.51e-06 |
|
autotransporter adhesin AIDA-I;
Pssm-ID: 380183 [Multi-domain] Cd Length: 1287 Bit Score: 50.81 E-value: 9.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 33 SGGAFGTSAFGSSNNTGGLFGNSQTK-PGGLFGTSSFSQPATS--TSTGFGFGT---STGTANTLFGTASTGTSLFSSQN 106
Cdd:NF033176 139 SGGAQNIYNLGHASNTVIFNGGNQTIfSGGISDDTNISSGGQQrvSSGGVASNTtinSSGTQNILSGGSTVSTHISSGGN 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 107 N-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAaptGTTIKFNPPTGTDTMVKAGVSTN 181
Cdd:NF033176 219 QyisagGNASATVVSSGGFQRVSSGGTATGTVLSGGTQNVSSGGSAISTSVYSS---GVQTVYAGATVTDTTVNSGGKQN 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 182 ISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLFGSSPATSS--ATGLFSSSTTNSGFAYGTTGFGT 259
Cdd:NF033176 296 ISSGGIVSGTIVNSSGTQNIYSGGSALSANIKGSQIVNSDGTAINTLVNDGGYQHirNGGVASGTIINQSGRVNISSGGY 375
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 260 NPGGLFGQQNQQttSLFSKPFGQATTTQNTGFSfgNTSTiGQPSTNTMGLFGVTQASQPGGlfgTATNTSTGTAFgtgtg 339
Cdd:NF033176 376 AESTIINSGGTQ--SVLSGGYASGTLINNSGRE--NVSN-GGSAYNTIINAGGNQYIYSNG---EASGTTVNTSG----- 442
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1435761106 340 lFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNT-------SGNSIF 386
Cdd:NF033176 443 -FQRVNSGGTATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTvyaggeaSGTQIF 495
|
|
| Nucleoporin_FG |
pfam13634 |
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ... |
300-387 |
1.24e-05 |
|
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.
Pssm-ID: 463941 [Multi-domain] Cd Length: 90 Bit Score: 45.30 E-value: 1.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 300 GQPSTNTMGLFG--VTQASQPGGLFGTATNtstGTAFGTGTGLFGQTNTgfGAVGSTLFGNNKLTTFGSSTTSApsFG-- 375
Cdd:pfam13634 4 GAATSTSGGLFGntSTTAASGGGLFGAAST---ATATTSGGGLFGNSSS--NAPSGGLFGATNTTTQTATGGGL--FGnn 76
|
90
....*....|....
gi 1435761106 376 --FGTNTSGNSIFG 387
Cdd:pfam13634 77 aaTTTSTTGGGLFG 90
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
230-459 |
2.47e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 48.74 E-value: 2.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 230 SSPATSSA--TGLFSSSTTNSGFAYGTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFS-FGNTSTIGQPSTNT 306
Cdd:COG5651 168 TQPPPTITnpGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAgTGAAAGAAAAAAAA 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 307 MGLFGVTQASQPGGLFGTATNtstgtafgtgtglfgQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGFGTNTSGNSIF 386
Cdd:COG5651 248 AAAAGAGASAALASLAATLLN---------------ASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGA 312
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1435761106 387 GSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAPQAPVALTDPNA 459
Cdd:COG5651 313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| NupH_GANP |
pfam16768 |
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the ... |
25-302 |
2.48e-05 |
|
Nucleoporin homology of Germinal-centre associated nuclear protein; NupH_GANP is the nucleoporin-homology domain at the N-terminus of human GANP or germinal-centre associated nuclear proteins. GANP is part of the TREX-2 complex that links transcription with nuclear messenger RNA export, and it associates with the mRNP particle through the interaction of the NupH_GANP with NXF1, the export factor. This attachment mediates efficient delivery of mRNPs to nuclear pore complexes.
Pssm-ID: 435572 [Multi-domain] Cd Length: 292 Bit Score: 48.37 E-value: 2.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTsafgssnntgglfgnSQTKPGGLFGTSS-FSQPATSTSTGFGFGTSTGtantlFGTASTGTSLFS 103
Cdd:pfam16768 10 QPSAFSTSSSPSTGT---------------FQAKPPFRFGQPSlFGQNNTLSGKNSGFSQVSS-----FPTTSGVSHSSS 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 104 SQNNAFAQnkptgfgnfgtsTSSGGLFGTTNTTSnPFGSTSgslfGPSSfTAAPTGTTIKFNPPTGTdtmvkaGVSTNIS 183
Cdd:pfam16768 70 GQTLGFTQ------------TSGVGLFSGLEHTP-SFVATS----GPSS-SSVPSNPGFSFKSPTNL------GAFPSTS 125
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 184 TKHQCITAMK-------EYESKSLEELRLEDYQANRKGP---QNQVGAGTTTglFgSSPATSSATGLF--------SSST 245
Cdd:pfam16768 126 TFGPESGEVAssgfgktEFSFKPPENAVFRPIFGAESEPektQSQITSGFFT--F-SHPVSSGPGGLApfsfsqvtSSSA 202
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1435761106 246 TNSGFAYGTTGFGTNPGGLFG----QQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQP 302
Cdd:pfam16768 203 TSSNFTFSKPVSSNNSSSAFApalsSQNVEEEKRGPKSFFGSSNSSFTSFPNSSSGSLGEP 263
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
26-172 |
3.03e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.60 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTST----------GTANTLFGTA 95
Cdd:COG3469 52 AASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTvtttstgagsVTSTTSSTAG 131
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761106 96 STGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGlfgTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDT 172
Cdd:COG3469 132 STTTSGASATSSAGSTTTTTTVSGTETATGGTT---TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
26-167 |
1.90e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 1.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQ 105
Cdd:COG3469 75 TTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1435761106 106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNpfgSTSGSLFGPSSFTAAPTGTTIKFNPP 167
Cdd:COG3469 155 GTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTGPPTPGLP 213
|
|
| PRK12688 |
PRK12688 |
flagellin; Reviewed |
72-474 |
2.12e-04 |
|
flagellin; Reviewed
Pssm-ID: 171664 [Multi-domain] Cd Length: 751 Bit Score: 46.02 E-value: 2.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 72 ATSTSTGFGFGTSTGTANTLFGTASTGTSLFSSQNNAFaqnkptgFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPS 151
Cdd:PRK12688 276 ATIAVSASGGAVSAAAAGAVTLKSSTGADLSVTGKADL-------LKALGLTTATGAGNATVNANRTTSAGSLGALIQDG 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 152 SfTAAPTGTTIKFN---PPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGAGTTTGLF 228
Cdd:PRK12688 349 S-TLNVDGKTITFKnapIPGAASVPSGYGASGNVLTDGNGNSTVYLQGGTINDVLKAIDLATGVQTATIANGTATLATAA 427
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 229 GSSPATSSATGLFSSST-TNSGFAYGTTGFGTNPGGLFGQQnqqttslfskpfGQATTtqntgFSFGNTSTIGQPSTNTM 307
Cdd:PRK12688 428 GQTASSVNASGQLKLSTgLNADLSITGTGNALSALGLAGNT------------GTATA-----FTAARTAGAGGISGKTL 490
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 308 GLFGVTQASQPGGLFGTATNTSTGTafgtgtglFGQTNTgfgavgsTLFGNNKLTTFGSSttsapsfGFGTNTSGNSIFG 387
Cdd:PRK12688 491 TFTSFNGGTAVNVTFGDGTNGTVKT--------LAQLNT-------ALQANNLTATIDAT-------GKLTISASNDYAS 548
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 388 SkpapgtlgtglgaGFGTALGAGqaslfgnnqpKIGGplgtgafgapgfnTTTATLGFGAPQAPVAltDPNASAAQQAVL 467
Cdd:PRK12688 549 S-------------TLGSTLAGG----------AIGG-------------TLTSTLTFSTASAPVA--DTVAQTTRANLV 590
|
....*..
gi 1435761106 468 QQHINSL 474
Cdd:PRK12688 591 KQYNNIL 597
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
229-649 |
7.73e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.52 E-value: 7.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 229 GSSPATSSATGLFSSSTTnsgFAYGTTGFGTNPGGLFGQQNQQTTSlfskpfgqaTTTQNTGFSFGNTSTIGQPSTNTMG 308
Cdd:pfam05109 371 GTPSGCENISGAFASNRT---FDITVSGLGTAPKTLIITRTATNAT---------TTTHKVIFSKAPESTTTSPTLNTTG 438
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 309 lfgvtqasqpgglfgtatntstgtafgtgtglFGQTNTGFGAVGSTLFGNNkLTTFGS-----STTSAPSFGFGTNTSGN 383
Cdd:pfam05109 439 --------------------------------FAAPNTTTGLPSSTHVPTN-LTAPAStgptvSTADVTSPTPAGTTSGA 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 384 SIFGSKPAPGTLGTGLGAGFGTAlgagQASLFGNNQPKIGGPlgTGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQ 463
Cdd:pfam05109 486 SPVTPSPSPRDNGTESKAPDMTS----PTSAVTTPTPNATSP--TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT 559
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 464 QAVLQQHINSLTYSPFGDSPLFRNPMSDPKKKEERLKPTNPAAQKalTTPTHYKLTPRPATRVRPKALQTTGTAKSHlfd 543
Cdd:pfam05109 560 PAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANT--TNHTLGGTSSTPVVTSPPKNATSAVTTGQH--- 634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 544 gldddepslangaFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASP---SEYPENGERFSFLSKPVDENHqqdgdedsl 620
Cdd:pfam05109 635 -------------NITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltSAHPTGGENITQVTPASTSTH--------- 692
|
410 420
....*....|....*....|....*....
gi 1435761106 621 vsHFYTNPIAkPIPQTPESAGNKHSNSNS 649
Cdd:pfam05109 693 --HVSTSSPA-PRPGTTSQASGPGNSSTS 718
|
|
| 34 |
PHA02584 |
long tail fiber, proximal subunit; Provisional |
25-175 |
2.24e-03 |
|
long tail fiber, proximal subunit; Provisional
Pssm-ID: 222890 [Multi-domain] Cd Length: 1229 Bit Score: 42.82 E-value: 2.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGG---------------------------LFGNSQTKPGGlfGTSSFSQPATSTST 77
Cdd:PHA02584 944 QNTSNGTVVVVDETSIAFYSQNNTTGnivfnidgtvdpinvnangtlnatgvaTNGRAVYAEGG--GIARTNNAARAITG 1021
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 78 GFGFGTSTGTANTLFGTASTGTSLFSSQ-----NNAFAQNK--PTGFGNFGTSTSSGGLfgttnTTSNPFGSTSGSlfgp 150
Cdd:PHA02584 1022 GFTIRNDGSTTVFLLTAAGDQTGGFNGLksliiNNANGQVTinDNYIINAGGTIMSGGL-----TVNSRIRSQGTK---- 1092
|
170 180
....*....|....*....|....*
gi 1435761106 151 SSFTAAPTGTTIKFNPPTGTDTMVK 175
Cdd:PHA02584 1093 ASYTRAPTADTVGFWSVDINDSATY 1117
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
25-152 |
2.85e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 42.29 E-value: 2.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 25 QNTGFGTTSGGAFGTSAFGSSNNTGGLFG------NSQ--------------------TKPGGLFGTSSFSQPATSTSTG 78
Cdd:cd21118 145 GGTGGPWASGGNYGTNSLGGSVGQGGNGGplnygtNSQgavaqpgygtvrgnnqnsgcTNPPPSGSHESFSNSGGSSSSG 224
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1435761106 79 FGFGTSTGTANTLFGTASTGTSLFSSQNNAFAQNK-PTGFGNFGTSTSSGGLFG-TTNTTSNPFGSTSGSLFGPSS 152
Cdd:cd21118 225 SSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSgNSGGSNGGSSGNSGSGSGgSSSGGSNGWGGSSSSGGSGGS 300
|
|
| PTZ00473 |
PTZ00473 |
Plasmodium Vir superfamily; Provisional |
26-100 |
4.09e-03 |
|
Plasmodium Vir superfamily; Provisional
Pssm-ID: 240430 [Multi-domain] Cd Length: 420 Bit Score: 41.76 E-value: 4.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAF--------------GTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTl 91
Cdd:PTZ00473 315 RGPYNANYGGQFnsrsgrtgssesirGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGGSTYGGSST- 393
|
....*....
gi 1435761106 92 FGTASTGTS 100
Cdd:PTZ00473 394 FDGSSRGSS 402
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
214-448 |
4.73e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 4.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 214 GPQNQVGAGTTTGLFGSSPAtssATGLFSSSTTNSGFAYGTTGFGTnpgglfgqqnqqttslfskpfgqATTTQNTGFSF 293
Cdd:COG5651 189 GNTSSNPGFANLGLTGLNQV---GIGGLNSGSGPIGLNSGPGNTGF-----------------------AGTGAAAGAAA 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 294 GNTSTIGQPSTNTMGLFGVTQASQPGGLFGTATNTStgtafgtgtglFGQTNTGFGAVGSTLFGNNkLTTFGSSTTSAPS 373
Cdd:COG5651 243 AAAAAAAAAGAGASAALASLAATLLNASSLGLAATA-----------ASSAATNLGLAGSPLGLAG-GGAGAAAATGLGL 310
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1435761106 374 FGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTTTATLGFGAP 448
Cdd:COG5651 311 GAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
220-388 |
5.84e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 41.53 E-value: 5.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 220 GAGTTTGlFGSSPATSSATGL-------FSSSTTNSGFAYGTTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFS 292
Cdd:NF033849 236 GQSAGTG-YGESVGHSTSQGQshsvgtsESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 293 FGNTSTIGQ---PSTNTMGLFGVTQASQ--PGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSS 367
Cdd:NF033849 315 EGTSTTDSSshsQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFS 394
|
170 180
....*....|....*....|....*
gi 1435761106 368 TTSAP----SFGFGTNTSGNSIFGS 388
Cdd:NF033849 395 GGIAGggvtSEGLGASQGGSEGWGS 419
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
26-159 |
6.29e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.03 E-value: 6.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1435761106 26 NTGFGTTSGGAFGTSAFGSSNN-TGGLFGNSQTKPGGLFGTS---SFSQPATSTSTGFGFGTST---------GTANTLF 92
Cdd:COG5651 194 NPGFANLGLTGLNQVGIGGLNSgSGPIGLNSGPGNTGFAGTGaaaGAAAAAAAAAAAAGAGASAalaslaatlLNASSLG 273
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1435761106 93 GTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTG 159
Cdd:COG5651 274 LAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
|
|
|