NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034675055|ref|XP_016885256|]
View 

trophinin isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849   218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849   284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849   346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849   418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034675055 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849   497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


:

Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1034675055  575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849   244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849   324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849   404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034675055 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849   482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849   218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849   284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849   346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849   418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034675055 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849   497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1034675055  575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
925-1290 1.66e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 102.01  E-value: 1.66e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849   239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849   307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849   381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849   426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1034675055 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849   498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
964-1302 4.38e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 100.46  E-value: 4.38e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849   238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849   310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849   384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849   454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                          330
                   ....*....|....*....
gi 1034675055 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849   527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
864-1280 6.08e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 6.08e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849   218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849   293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849   357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849   409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849   460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                          410       420
                   ....*....|....*....|
gi 1034675055 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849   524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849   244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849   324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849   404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034675055 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849   482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
726-1425 1.42e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.42e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210    791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210    871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210    951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210   1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210   1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210   1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210   1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210   1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210   1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
713-1080 3.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.57  E-value: 3.74e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849   237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849   301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849   353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849   431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034675055 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849   493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
800-1285 1.05e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 63.26  E-value: 1.05e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625     14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625     94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625    174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625    252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625    332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625    412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                   ....*.
gi 1034675055 1280 GGGLVT 1285
Cdd:COG4625    492 GGGNYT 497
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1096-1320 2.13e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 61.99  E-value: 2.13e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034675055 1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1221-1424 1.01e-05

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 50.00  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118    125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118    202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1034675055 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118    279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1009-1231 4.04e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 4.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395   339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395   409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034675055 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395   472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
922-1107 3.99e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 3.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1034675055 1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849   218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849   284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849   346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849   418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034675055 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849   497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1034675055  575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
925-1290 1.66e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 102.01  E-value: 1.66e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849   239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849   307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849   381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849   426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1034675055 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849   498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
964-1302 4.38e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 100.46  E-value: 4.38e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849   238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849   310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849   384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849   454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                          330
                   ....*....|....*....
gi 1034675055 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849   527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
864-1280 6.08e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 6.08e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849   218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849   293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849   357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849   409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849   460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                          410       420
                   ....*....|....*....|
gi 1034675055 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849   524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849   244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849   324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849   404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034675055 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849   482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
726-1425 1.42e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.42e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210    791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210    871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210    951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210   1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210   1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210   1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210   1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210   1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210   1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
871-1311 4.96e-13

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 74.21  E-value: 4.96e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3468      1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSpGTSVSFGSALNTNAGYG 1030
Cdd:COG3468     81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGG-GGGGTGVGGTGAAAAGG 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1031 GAVSTNTDFGGTLSTSVCFGGspsTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3468    160 GTGSGGGGSGGGGGAGGGGGG---GAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGApstslcFGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3468    237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG------GANGGGSGGGGGASGTGGGGTASTGGGGGGGG 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3468    311 GNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDG 390
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 1034675055 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSG 1311
Cdd:COG3468    391 VGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
713-1080 3.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.57  E-value: 3.74e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849   237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849   301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849   353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849   431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034675055 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849   493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
836-1312 2.74e-11

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 68.27  E-value: 2.74e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  836 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 915
Cdd:COG4625     27 GAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGG 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  916 SICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTsicfDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMST 995
Cdd:COG4625    107 GGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGG----GGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGG 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  996 SADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 1075
Cdd:COG4625    183 GGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGG 262
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1076 SASFSGAVSTsacfsGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPST 1155
Cdd:COG4625    263 GAGGGGGGGG-----GGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1156 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLST 1235
Cdd:COG4625    338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034675055 1236 SSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGG 1312
Cdd:COG4625    418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
711-1425 3.02e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.17  E-value: 3.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  711 AQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSA 790
Cdd:COG3210    606 GSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGG 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  791 ASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFG 870
Cdd:COG3210    686 TTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANT 765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3210    766 TASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNT 845
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYG 1030
Cdd:COG3210    846 TDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1031 GAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3210    926 LTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTT 1005
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTslcfGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3210   1006 ASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNA----SGISGGNAAALTASGTAGTTGGTAASNGGGG 1081
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3210   1082 TAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSA 1161
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGE 1350
Cdd:COG3210   1162 SAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQT 1241
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034675055 1351 PSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210   1242 GSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGT 1316
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
714-1345 3.86e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 64.79  E-value: 3.86e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG3210    115 TLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGV 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG3210    195 TGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIG 274
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG3210    275 TTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGT 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG3210    355 TGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLG 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAcfsgapITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG3210    435 ITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGG------GIGTVTTNATISNNAGGDAN 508
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG3210    509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGL 1273
Cdd:COG3210    589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034675055 1274 GTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1345
Cdd:COG3210    669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
800-1285 1.05e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 63.26  E-value: 1.05e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625     14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625     94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625    174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625    252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625    332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625    412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                   ....*.
gi 1034675055 1280 GGGLVT 1285
Cdd:COG4625    492 GGGNYT 497
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1096-1320 2.13e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 61.99  E-value: 2.13e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034675055 1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
767-1268 1.16e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.79  E-value: 1.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  767 ASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGAS 846
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  847 SGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 926
Cdd:COG4625     81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  927 TGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTS 1006
Cdd:COG4625    161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1007 VCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTS 1086
Cdd:COG4625    241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1087 ACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTN 1166
Cdd:COG4625    321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1167 LCFGGPPSTSACFSGATSPSFcDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG4625    401 GGGGAGGTGGGGAGGGGGAAG-GGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
                          490       500
                   ....*....|....*....|..
gi 1034675055 1247 AGFGGGPGTSTGFGGGLGTSAG 1268
Cdd:COG4625    480 GNNTYTGTTTVNGGGNYTQSAG 501
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1049-1281 6.27e-07

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 53.90  E-value: 6.27e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1049 FGGSPSTSAGFGGALntnaSFGCAVSTSASFSGAVSTSAcFSGAPITNPGfggAFSTSAGFGGALstaadFGGTPSNSIG 1128
Cdd:pfam15967    6 FGGGPGSTATAGGGF----SFGAAAASNPGSTGGFSFGT-LGAAPAATAT---TTTATLGLGGGL-----FGQKPATGFT 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1129 FGAAPSTSvsfgGAHGTSLCFGGAPSTSlcfgSASNT--NLCFGGPPSTSACFS-GATSPSfcdgpstSTGFSFGNGLST 1205
Cdd:pfam15967   73 FGTPASST----AATGPTGLTLGTPAAT----TAASTgfSLGFNKPAASATPFSlPASSTS-------GGGLSLGSVLTS 137
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034675055 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGL---STSSGFDGGLGTSAGfGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGG 1281
Cdd:pfam15967  138 TAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
982-1196 6.80e-07

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.60  E-value: 6.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  982 VLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGG 1061
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1062 ALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNpgfggafSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGG 1141
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTT-------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034675055 1142 AHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACfSGATSPSFCDGPSTSTG 1196
Cdd:COG3469    154 SGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGP 207
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
852-1431 7.17e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 7.17e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  852 TLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 931
Cdd:COG5295      5 AGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVA 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  932 TLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 1011
Cdd:COG5295     85 SGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSST 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1012 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 1091
Cdd:COG5295    165 ANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAA 244
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1092 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIG--FGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCF 1169
Cdd:COG5295    245 SGNATTASASSVSGSAVAAGTASTATTASTTAASGAAgtATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALG 324
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1170 GGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGF 1249
Cdd:COG5295    325 SAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTG 404
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1250 GGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDR 1329
Cdd:COG5295    405 ASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSS 484
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1330 GLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGG 1409
Cdd:COG5295    485 AAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTAT 564
                          570       580
                   ....*....|....*....|..
gi 1034675055 1410 PSTSAGFGSGAASLGACGFSYG 1431
Cdd:COG5295    565 GANSVALGAGSVASGANSVSVG 586
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1195-1431 2.50e-06

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 51.98  E-value: 2.50e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1195 TGFSFGNGLSTNAGFGGGLNtsagFGGGLGTSAGFSGGLstssGFDGGLGTSAGFGGGPGTSTGFGGGLgtsagFSGGLG 1274
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFS----FGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1275 TSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSrpnaSFDRGLSTIIGFGSGSNTSTGFTGEPSTS 1354
Cdd:pfam15967   69 TGFTFGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASAT----PFSLPASSTSGGGLSLGSVLTSTAAQQGA 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1355 TGFSSGPSSIVGFSGGPSTGVGFCSGPSTSG---FSGGPSTGAGfgggpNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:pfam15967  145 TGFTLNLGGTPATTTAVSTGLSLGSTLTSLGgslFQNTNSTGLG-----QTTLGLTLLATSTAPVSAPAASEGLGGLDFS 219
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1221-1424 1.01e-05

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 50.00  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118    125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118    202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1034675055 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118    279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
PPE COG5651
PPE-repeat protein [Function unknown];
1169-1396 1.32e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 49.12  E-value: 1.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1169 FGGPPSTSACFSGATSPSFCDGPSTSTGFSFGN-GLSTNAGFG-GGLNTSAGfggglgtSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG5651    167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANlGLTGLNQVGiGGLNSGSG-------PIGLNSGPGNTGFAGTGAAAG 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1247 AGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNAs 1326
Cdd:COG5651    240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA- 318
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1327 fdrGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGF 1396
Cdd:COG5651    319 ---AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PPE COG5651
PPE-repeat protein [Function unknown];
1230-1425 1.88e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.74  E-value: 1.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1230 SGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGtnaSFGSTLGTSAGF 1309
Cdd:COG5651    177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA---AAAAAAAAAAGA 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1310 SGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGG 1389
Cdd:COG5651    254 GASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1034675055 1390 PSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG5651    334 AAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
714-1256 2.37e-05

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 49.01  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG4625      2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG4625     82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG4625    162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG4625    242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG4625    322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSlcfGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG4625    402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG---ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034675055 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTS 1256
Cdd:COG4625    479 TGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1018-1251 3.01e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 48.51  E-value: 3.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1018 SFGSALNTNAGYGGAVStntdFGGTLStsvcfggSPSTSAG---FGGALNTNASFGCAVSTSASFSGAVstsacFSGAPI 1094
Cdd:pfam15967    5 SFGGGPGSTATAGGGFS----FGAAAA-------SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1095 TNPGFGGAFSTSAGFGGALSTaadfGGTPSNSigfgAAPSTSVSFGGAHGTslcfGGAPSTSLCFGSASNTNLCFGGPPS 1174
Cdd:pfam15967   69 TGFTFGTPASSTAATGPTGLT----LGTPAAT----TAASTGFSLGFNKPA----ASATPFSLPASSTSGGGLSLGSVLT 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1175 TSACFSGATSPSFCDGPSTSTGFSFGNGL---STNAGFGGGLNTSAGfGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGG 1251
Cdd:pfam15967  137 STAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
PPE COG5651
PPE-repeat protein [Function unknown];
1200-1418 3.03e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.97  E-value: 3.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1200 GNGLSTNAGFGGGLNTSAGFGgglgtSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG5651    178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1280 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDgfgsrPNASFDRGLSTIIGFGSGSNTSTGFTGePSTSTGFSS 1359
Cdd:COG5651    253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAG-----SPLGLAGGGAGAAAATGLGLGAGGAAG-AAGATGAGA 326
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034675055 1360 GPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGS 1418
Cdd:COG5651    327 ALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1009-1231 4.04e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 4.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395   339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395   409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034675055 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395   472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1190-1398 5.56e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 47.74  E-value: 5.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1190 GPSTSTGFSFGNGLSTNAGFGGGLntsaGFGGGLGTSAGFSGGLSTSSGFDGGLgtsagFGGGPGTSTGFGGGLGTSAGF 1269
Cdd:pfam15967   13 TATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAAT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1270 SGGLGTSAGFGGGLVTSDGFGGGLGTNASFGS--TLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGfGSGSNTSTGF 1347
Cdd:pfam15967   84 GPTGLTLGTPAATTAASTGFSLGFNKPAASATpfSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLG-GTPATTTAVS 162
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034675055 1348 TGEP--STSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGG 1398
Cdd:pfam15967  163 TGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Keratin_2_head pfam16208
Keratin type II head;
1204-1308 8.02e-05

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 44.65  E-value: 8.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1204 STNAGFGGGLNTSAGFGGGLGTSA--GFSGGLSTSSGFDGG---LGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAG 1278
Cdd:pfam16208   21 SSSRRGGGGGGGGGGGGGGFGSRSlyNLGGSKSISISVAGGgsrPGSGFGFGGGGGGGFGGGFGGGGGGGFGGGGGFGGG 100
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1034675055 1279 FGGGLVTSDGFG-GGLGTNASFGSTLGTSAG 1308
Cdd:pfam16208  101 FGGGGYGGGGFGgGGFGGRGGFGGPPCPPGG 131
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1101-1328 1.02e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 46.53  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1101 GAFSTSAGFGGALSTAADFGGTPSNSIG-FGAAPSTSVSFGGAHGTSLCFGGAPSTSLC---FGSASNTNL---CFGGPP 1173
Cdd:cd21118    128 GAYGSQGGPGVQGHGIPGGTGGPWASGGnYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAqpgYGTVRGNNQnsgCTNPPP 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1174 STSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLN--TSAGFGGGLGTSAGFSGGLSTSSG-FDGGLGTSAGFG 1250
Cdd:cd21118    208 SGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNggNNGSSSSNSGNSGGSNGGSSGNSGsGSGGSSSGGSNG 287
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034675055 1251 GGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAgfSGGLSTSDGFGSRPNASFD 1328
Cdd:cd21118    288 WGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEA--VGGLNTLNSDASTLPFNFD 363
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1182-1403 2.34e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 2.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1182 ATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGG 1261
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1262 GLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFgsrpNASFDRGLSTIIGFGSGS 1341
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGA----SATSSAGSTTTTTTVSGT 156
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034675055 1342 NTSTGFTGEPSTSTGFSSGPSSivgfsggPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTG 1403
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTT-------PSATTTATATTASGATTPSATTTATTTGPPTPG 211
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
922-1107 3.99e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 3.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1034675055 1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1258-1404 6.20e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 44.68  E-value: 6.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1258 GFGGGL--GTSAGF-SGGLGTSAGFGG-GLVTSD---GFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPnasfdrg 1330
Cdd:PTZ00395   341 GFHDGSpnAASAGApFNGLGNQADGGHiNQVHPDargAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAG------- 413
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034675055 1331 lstiigFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPstsgFSGGPSTGAGFGGGPNTGA 1404
Cdd:PTZ00395   414 ------YSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLP----YSNTPYSNAPLSNAPPSSA 477
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1180-1431 8.64e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 43.83  E-value: 8.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1180 SGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFG--------- 1250
Cdd:cd21118     14 GGEASPLHSGGEGTGAGESAGHGLGDAISHGIGEAVGQGAKEAASSGIQNALGQGHGEEGGSTLGSRGDVFehrlgeaar 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1251 --GGPGTSTG-----------------FGGGLGTSAGFSGGLGTSAGFG--GGLVTSDGFGGGLGTNASFGS-------- 1301
Cdd:cd21118     94 slGNAGNEIGrqaediirhgvdavhnsWQGSGGHGAYGSQGGPGVQGHGipGGTGGPWASGGNYGTNSLGGSvgqggngg 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1302 --TLGTSagfSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:cd21118    174 plNYGTN---SQGAVAQPGYGTVRGNNQNSGCTNPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGG 250
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1034675055 1380 GPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:cd21118    251 NNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGSSSSGGSGGSGG 302
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
989-1165 3.28e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 41.96  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055  989 FGGAMSTSADFGGTLSTSVCFGGSPGTS--VSFGSALNTNAGYGGAVSTNTDFGGTLstsvcFGGSPSTSAGFGGALNTN 1066
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGSTggFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1067 ASFGCAVSTSASFSGAVSTSACFSgapitnPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTS 1146
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFS------LGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGT 154
                          170
                   ....*....|....*....
gi 1034675055 1147 LCFGGAPSTSLCFGSASNT 1165
Cdd:pfam15967  155 PATTTAVSTGLSLGSTLTS 173
Gly_rich pfam12810
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. ...
1207-1281 4.40e-03

Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. The proteins are composed of several glycine rich motifs interspersed through the sequence. Although many proteins have been annotated by similarity in the family these annotations given the biased composition of the sequences these are unlikely to be functionally relevant.


Pssm-ID: 403882 [Multi-domain]  Cd Length: 257  Bit Score: 40.72  E-value: 4.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1207 AGFGGG----LNTSAGFGGGLGTSAGFSG------GLSTSSGFDGGLGTSAGFGGGP-----GTSTGFGGGLGTSAGFSG 1271
Cdd:pfam12810  107 AGGGGGsgegDDGSGGYGGGLTGGGGGSGcyegsyGATQTSGGIGGYGINGSFGQGGngrnsGGGGGGGGGGGYYGGFGG 186
                           90
                   ....*....|
gi 1034675055 1272 GLGTSAGFGG 1281
Cdd:pfam12810  187 GSYGGGGGGG 196
PPE COG5651
PPE-repeat protein [Function unknown];
1253-1425 5.03e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.03  E-value: 5.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1253 PGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLS 1332
Cdd:COG5651    171 PPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAA 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034675055 1333 TIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAG--FGGGPNTGAGFGGGP 1410
Cdd:COG5651    251 AGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGgaAGAAGATGAGAALGA 330
                          170
                   ....*....|....*
gi 1034675055 1411 STSAGFGSGAASLGA 1425
Cdd:COG5651    331 GAAAAAAGAAAGAGA 345
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH