|
Name |
Accession |
Description |
Interval |
E-value |
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1064-1431 |
1.09e-25 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 115.49 E-value: 1.09e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849 284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849 346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849 418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 768037089 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849 497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
451-611 |
3.93e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.88 E-value: 3.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 768037089 575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
925-1290 |
1.66e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 102.01 E-value: 1.66e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849 239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849 307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849 381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849 426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 768037089 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849 498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
964-1302 |
4.38e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.38e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849 238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849 310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849 384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849 454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
|
330
....*....|....*....
gi 768037089 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849 527 TSGAGGSMGLGPSISLGKS 545
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
864-1280 |
6.08e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 97.00 E-value: 6.08e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849 293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849 409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849 460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
|
410 420
....*....|....*....|
gi 768037089 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849 524 GGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
854-1138 |
2.88e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 84.67 E-value: 2.88e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849 244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849 324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849 404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037089 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849 482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
726-1425 |
1.42e-13 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 76.34 E-value: 1.42e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210 791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210 871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210 951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210 1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210 1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210 1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210 1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210 1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210 1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
713-1080 |
3.74e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.57 E-value: 3.74e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849 237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849 301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849 353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849 431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 768037089 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
800-1285 |
1.05e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 63.26 E-value: 1.05e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625 14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625 94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625 174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625 252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625 332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625 412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491
|
....*.
gi 768037089 1280 GGGLVT 1285
Cdd:COG4625 492 GGGNYT 497
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1096-1320 |
2.13e-09 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 61.99 E-value: 2.13e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967 5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967 80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037089 1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967 156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1221-1424 |
1.01e-05 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 50.00 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 768037089 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
1009-1231 |
4.04e-05 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 48.53 E-value: 4.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395 409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 768037089 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395 472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
922-1107 |
3.99e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 44.66 E-value: 3.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967 28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967 103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 768037089 1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967 181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1064-1431 |
1.09e-25 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 115.49 E-value: 1.09e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849 284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849 346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849 418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 768037089 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849 497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
451-611 |
3.93e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.88 E-value: 3.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 768037089 575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
925-1290 |
1.66e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 102.01 E-value: 1.66e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849 239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849 307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849 381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849 426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 768037089 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849 498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
964-1302 |
4.38e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.38e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849 238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849 310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849 384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849 454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
|
330
....*....|....*....
gi 768037089 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849 527 TSGAGGSMGLGPSISLGKS 545
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
864-1280 |
6.08e-20 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 97.00 E-value: 6.08e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849 293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849 409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849 460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
|
410 420
....*....|....*....|
gi 768037089 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849 524 GGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
854-1138 |
2.88e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 84.67 E-value: 2.88e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849 244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849 324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849 404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037089 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849 482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
726-1425 |
1.42e-13 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 76.34 E-value: 1.42e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210 791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210 871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210 951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210 1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210 1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210 1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210 1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210 1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210 1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
871-1311 |
4.96e-13 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 74.21 E-value: 4.96e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSpGTSVSFGSALNTNAGYG 1030
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGG-GGGGTGVGGTGAAAAGG 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1031 GAVSTNTDFGGTLSTSVCFGGspsTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3468 160 GTGSGGGGSGGGGGAGGGGGG---GAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGApstslcFGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3468 237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG------GANGGGSGGGGGASGTGGGGTASTGGGGGGGG 310
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3468 311 GNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDG 390
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 768037089 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSG 1311
Cdd:COG3468 391 VGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
713-1080 |
3.74e-12 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 71.57 E-value: 3.74e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849 237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849 301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849 353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849 431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 768037089 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
836-1312 |
2.74e-11 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 68.27 E-value: 2.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 836 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 915
Cdd:COG4625 27 GAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGG 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 916 SICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTsicfDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMST 995
Cdd:COG4625 107 GGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGG----GGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGG 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 996 SADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 1075
Cdd:COG4625 183 GGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGG 262
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1076 SASFSGAVSTsacfsGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPST 1155
Cdd:COG4625 263 GAGGGGGGGG-----GGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1156 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLST 1235
Cdd:COG4625 338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037089 1236 SSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGG 1312
Cdd:COG4625 418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
711-1425 |
3.02e-10 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 65.17 E-value: 3.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 711 AQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSA 790
Cdd:COG3210 606 GSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGG 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 791 ASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFG 870
Cdd:COG3210 686 TTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANT 765
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3210 766 TASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNT 845
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYG 1030
Cdd:COG3210 846 TDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1031 GAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3210 926 LTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTT 1005
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTslcfGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3210 1006 ASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNA----SGISGGNAAALTASGTAGTTGGTAASNGGGG 1081
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3210 1082 TAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSA 1161
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGE 1350
Cdd:COG3210 1162 SAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQT 1241
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 768037089 1351 PSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210 1242 GSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGT 1316
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
714-1345 |
3.86e-10 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 64.79 E-value: 3.86e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG3210 115 TLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGV 194
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG3210 195 TGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIG 274
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG3210 275 TTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGT 354
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG3210 355 TGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLG 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAcfsgapITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG3210 435 ITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGG------GIGTVTTNATISNNAGGDAN 508
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG3210 509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGL 1273
Cdd:COG3210 589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768037089 1274 GTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1345
Cdd:COG3210 669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
800-1285 |
1.05e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 63.26 E-value: 1.05e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625 14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625 94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625 174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625 252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625 332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625 412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491
|
....*.
gi 768037089 1280 GGGLVT 1285
Cdd:COG4625 492 GGGNYT 497
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1096-1320 |
2.13e-09 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 61.99 E-value: 2.13e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967 5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967 80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037089 1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967 156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
767-1268 |
1.16e-08 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 59.79 E-value: 1.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 767 ASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGAS 846
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 847 SGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 926
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 927 TGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTS 1006
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1007 VCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTS 1086
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1087 ACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTN 1166
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1167 LCFGGPPSTSACFSGATSPSFcDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG4625 401 GGGGAGGTGGGGAGGGGGAAG-GGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
|
490 500
....*....|....*....|..
gi 768037089 1247 AGFGGGPGTSTGFGGGLGTSAG 1268
Cdd:COG4625 480 GNNTYTGTTTVNGGGNYTQSAG 501
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1049-1281 |
6.27e-07 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 53.90 E-value: 6.27e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1049 FGGSPSTSAGFGGALntnaSFGCAVSTSASFSGAVSTSAcFSGAPITNPGfggAFSTSAGFGGALstaadFGGTPSNSIG 1128
Cdd:pfam15967 6 FGGGPGSTATAGGGF----SFGAAAASNPGSTGGFSFGT-LGAAPAATAT---TTTATLGLGGGL-----FGQKPATGFT 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1129 FGAAPSTSvsfgGAHGTSLCFGGAPSTSlcfgSASNT--NLCFGGPPSTSACFS-GATSPSfcdgpstSTGFSFGNGLST 1205
Cdd:pfam15967 73 FGTPASST----AATGPTGLTLGTPAAT----TAASTgfSLGFNKPAASATPFSlPASSTS-------GGGLSLGSVLTS 137
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768037089 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGL---STSSGFDGGLGTSAGfGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGG 1281
Cdd:pfam15967 138 TAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
982-1196 |
6.80e-07 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.60 E-value: 6.80e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 982 VLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGG 1061
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1062 ALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNpgfggafSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGG 1141
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTT-------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 768037089 1142 AHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACfSGATSPSFCDGPSTSTG 1196
Cdd:COG3469 154 SGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGP 207
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
852-1431 |
7.17e-07 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 54.01 E-value: 7.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 852 TLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 931
Cdd:COG5295 5 AGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVA 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 932 TLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 1011
Cdd:COG5295 85 SGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSST 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1012 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 1091
Cdd:COG5295 165 ANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAA 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1092 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIG--FGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCF 1169
Cdd:COG5295 245 SGNATTASASSVSGSAVAAGTASTATTASTTAASGAAgtATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALG 324
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1170 GGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGF 1249
Cdd:COG5295 325 SAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTG 404
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1250 GGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDR 1329
Cdd:COG5295 405 ASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSS 484
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1330 GLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGG 1409
Cdd:COG5295 485 AAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTAT 564
|
570 580
....*....|....*....|..
gi 768037089 1410 PSTSAGFGSGAASLGACGFSYG 1431
Cdd:COG5295 565 GANSVALGAGSVASGANSVSVG 586
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1195-1431 |
2.50e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 51.98 E-value: 2.50e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1195 TGFSFGNGLSTNAGFGGGLNtsagFGGGLGTSAGFSGGLstssGFDGGLGTSAGFGGGPGTSTGFGGGLgtsagFSGGLG 1274
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFS----FGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1275 TSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSrpnaSFDRGLSTIIGFGSGSNTSTGFTGEPSTS 1354
Cdd:pfam15967 69 TGFTFGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASAT----PFSLPASSTSGGGLSLGSVLTSTAAQQGA 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1355 TGFSSGPSSIVGFSGGPSTGVGFCSGPSTSG---FSGGPSTGAGfgggpNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:pfam15967 145 TGFTLNLGGTPATTTAVSTGLSLGSTLTSLGgslFQNTNSTGLG-----QTTLGLTLLATSTAPVSAPAASEGLGGLDFS 219
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1221-1424 |
1.01e-05 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 50.00 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 768037089 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1169-1396 |
1.32e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 49.12 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1169 FGGPPSTSACFSGATSPSFCDGPSTSTGFSFGN-GLSTNAGFG-GGLNTSAGfggglgtSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG5651 167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANlGLTGLNQVGiGGLNSGSG-------PIGLNSGPGNTGFAGTGAAAG 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1247 AGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNAs 1326
Cdd:COG5651 240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA- 318
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1327 fdrGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGF 1396
Cdd:COG5651 319 ---AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1230-1425 |
1.88e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 48.74 E-value: 1.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1230 SGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGtnaSFGSTLGTSAGF 1309
Cdd:COG5651 177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA---AAAAAAAAAAGA 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1310 SGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGG 1389
Cdd:COG5651 254 GASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
|
170 180 190
....*....|....*....|....*....|....*.
gi 768037089 1390 PSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG5651 334 AAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
714-1256 |
2.37e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 2.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG4625 2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG4625 82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG4625 162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG4625 242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG4625 322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSlcfGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG4625 402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG---ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 768037089 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTS 1256
Cdd:COG4625 479 TGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1018-1251 |
3.01e-05 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 48.51 E-value: 3.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1018 SFGSALNTNAGYGGAVStntdFGGTLStsvcfggSPSTSAG---FGGALNTNASFGCAVSTSASFSGAVstsacFSGAPI 1094
Cdd:pfam15967 5 SFGGGPGSTATAGGGFS----FGAAAA-------SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1095 TNPGFGGAFSTSAGFGGALSTaadfGGTPSNSigfgAAPSTSVSFGGAHGTslcfGGAPSTSLCFGSASNTNLCFGGPPS 1174
Cdd:pfam15967 69 TGFTFGTPASSTAATGPTGLT----LGTPAAT----TAASTGFSLGFNKPA----ASATPFSLPASSTSGGGLSLGSVLT 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1175 TSACFSGATSPSFCDGPSTSTGFSFGNGL---STNAGFGGGLNTSAGfGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGG 1251
Cdd:pfam15967 137 STAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1200-1418 |
3.03e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 47.97 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1200 GNGLSTNAGFGGGLNTSAGFGgglgtSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1280 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDgfgsrPNASFDRGLSTIIGFGSGSNTSTGFTGePSTSTGFSS 1359
Cdd:COG5651 253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAG-----SPLGLAGGGAGAAAATGLGLGAGGAAG-AAGATGAGA 326
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 768037089 1360 GPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGS 1418
Cdd:COG5651 327 ALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
1009-1231 |
4.04e-05 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 48.53 E-value: 4.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395 409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 768037089 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395 472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1190-1398 |
5.56e-05 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 47.74 E-value: 5.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1190 GPSTSTGFSFGNGLSTNAGFGGGLntsaGFGGGLGTSAGFSGGLSTSSGFDGGLgtsagFGGGPGTSTGFGGGLGTSAGF 1269
Cdd:pfam15967 13 TATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAAT 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1270 SGGLGTSAGFGGGLVTSDGFGGGLGTNASFGS--TLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGfGSGSNTSTGF 1347
Cdd:pfam15967 84 GPTGLTLGTPAATTAASTGFSLGFNKPAASATpfSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLG-GTPATTTAVS 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 768037089 1348 TGEP--STSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGG 1398
Cdd:pfam15967 163 TGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| Keratin_2_head |
pfam16208 |
Keratin type II head; |
1204-1308 |
8.02e-05 |
|
Keratin type II head;
Pssm-ID: 465068 [Multi-domain] Cd Length: 156 Bit Score: 44.65 E-value: 8.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1204 STNAGFGGGLNTSAGFGGGLGTSA--GFSGGLSTSSGFDGG---LGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAG 1278
Cdd:pfam16208 21 SSSRRGGGGGGGGGGGGGGFGSRSlyNLGGSKSISISVAGGgsrPGSGFGFGGGGGGGFGGGFGGGGGGGFGGGGGFGGG 100
|
90 100 110
....*....|....*....|....*....|.
gi 768037089 1279 FGGGLVTSDGFG-GGLGTNASFGSTLGTSAG 1308
Cdd:pfam16208 101 FGGGGYGGGGFGgGGFGGRGGFGGPPCPPGG 131
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1101-1328 |
1.02e-04 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 46.53 E-value: 1.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1101 GAFSTSAGFGGALSTAADFGGTPSNSIG-FGAAPSTSVSFGGAHGTSLCFGGAPSTSLC---FGSASNTNL---CFGGPP 1173
Cdd:cd21118 128 GAYGSQGGPGVQGHGIPGGTGGPWASGGnYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAqpgYGTVRGNNQnsgCTNPPP 207
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1174 STSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLN--TSAGFGGGLGTSAGFSGGLSTSSG-FDGGLGTSAGFG 1250
Cdd:cd21118 208 SGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNggNNGSSSSNSGNSGGSNGGSSGNSGsGSGGSSSGGSNG 287
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 768037089 1251 GGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAgfSGGLSTSDGFGSRPNASFD 1328
Cdd:cd21118 288 WGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEA--VGGLNTLNSDASTLPFNFD 363
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1182-1403 |
2.34e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 2.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1182 ATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGG 1261
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1262 GLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFgsrpNASFDRGLSTIIGFGSGS 1341
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGA----SATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768037089 1342 NTSTGFTGEPSTSTGFSSGPSSivgfsggPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTG 1403
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTT-------PSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
922-1107 |
3.99e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 44.66 E-value: 3.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967 28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967 103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 768037089 1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967 181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
1258-1404 |
6.20e-04 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 44.68 E-value: 6.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1258 GFGGGL--GTSAGF-SGGLGTSAGFGG-GLVTSD---GFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPnasfdrg 1330
Cdd:PTZ00395 341 GFHDGSpnAASAGApFNGLGNQADGGHiNQVHPDargAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAG------- 413
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037089 1331 lstiigFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPstsgFSGGPSTGAGFGGGPNTGA 1404
Cdd:PTZ00395 414 ------YSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLP----YSNTPYSNAPLSNAPPSSA 477
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1180-1431 |
8.64e-04 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 43.83 E-value: 8.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1180 SGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFG--------- 1250
Cdd:cd21118 14 GGEASPLHSGGEGTGAGESAGHGLGDAISHGIGEAVGQGAKEAASSGIQNALGQGHGEEGGSTLGSRGDVFehrlgeaar 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1251 --GGPGTSTG-----------------FGGGLGTSAGFSGGLGTSAGFG--GGLVTSDGFGGGLGTNASFGS-------- 1301
Cdd:cd21118 94 slGNAGNEIGrqaediirhgvdavhnsWQGSGGHGAYGSQGGPGVQGHGipGGTGGPWASGGNYGTNSLGGSvgqggngg 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1302 --TLGTSagfSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:cd21118 174 plNYGTN---SQGAVAQPGYGTVRGNNQNSGCTNPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGG 250
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 768037089 1380 GPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:cd21118 251 NNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGSSSSGGSGGSGG 302
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
989-1165 |
3.28e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 41.96 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 989 FGGAMSTSADFGGTLSTSVCFGGSPGTS--VSFGSALNTNAGYGGAVSTNTDFGGTLstsvcFGGSPSTSAGFGGALNTN 1066
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGSTggFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1067 ASFGCAVSTSASFSGAVSTSACFSgapitnPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTS 1146
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGFS------LGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGT 154
|
170
....*....|....*....
gi 768037089 1147 LCFGGAPSTSLCFGSASNT 1165
Cdd:pfam15967 155 PATTTAVSTGLSLGSTLTS 173
|
|
| Gly_rich |
pfam12810 |
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. ... |
1207-1281 |
4.40e-03 |
|
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. The proteins are composed of several glycine rich motifs interspersed through the sequence. Although many proteins have been annotated by similarity in the family these annotations given the biased composition of the sequences these are unlikely to be functionally relevant.
Pssm-ID: 403882 [Multi-domain] Cd Length: 257 Bit Score: 40.72 E-value: 4.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1207 AGFGGG----LNTSAGFGGGLGTSAGFSG------GLSTSSGFDGGLGTSAGFGGGP-----GTSTGFGGGLGTSAGFSG 1271
Cdd:pfam12810 107 AGGGGGsgegDDGSGGYGGGLTGGGGGSGcyegsyGATQTSGGIGGYGINGSFGQGGngrnsGGGGGGGGGGGYYGGFGG 186
|
90
....*....|
gi 768037089 1272 GLGTSAGFGG 1281
Cdd:pfam12810 187 GSYGGGGGGG 196
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1253-1425 |
5.03e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.03 E-value: 5.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1253 PGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLS 1332
Cdd:COG5651 171 PPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAA 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037089 1333 TIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAG--FGGGPNTGAGFGGGP 1410
Cdd:COG5651 251 AGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGgaAGAAGATGAGAALGA 330
|
170
....*....|....*
gi 768037089 1411 STSAGFGSGAASLGA 1425
Cdd:COG5651 331 GAAAAAAGAAAGAGA 345
|
|
|