NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|768037095|ref|XP_011529115|]
View 

trophinin isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 768037095 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


:

Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 768037095   575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037095 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 768037095 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 768037095   575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
925-1290 1.66e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 102.01  E-value: 1.66e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849  239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849  307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849  381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849  426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 768037095 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849  498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
964-1302 4.38e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 100.46  E-value: 4.38e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 768037095 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
864-1280 6.08e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 6.08e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 768037095 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037095 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
726-1425 1.42e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.42e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
713-1080 3.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.57  E-value: 3.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849  237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849  301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849  353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849  431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768037095 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849  493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
800-1285 1.05e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 63.26  E-value: 1.05e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625    14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625    94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625   174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625   252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625   332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625   412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                  ....*.
gi 768037095 1280 GGGLVT 1285
Cdd:COG4625   492 GGGNYT 497
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1096-1320 2.13e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 61.99  E-value: 2.13e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037095  1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1221-1424 1.01e-05

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 50.00  E-value: 1.01e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118   125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118   202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 768037095 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118   279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1009-1231 4.04e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 4.04e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 768037095 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
922-1107 3.99e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 3.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 768037095  1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1064-1431 1.09e-25

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 115.49  E-value: 1.09e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1064 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 1143
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1144 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 1223
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1224 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 1299
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1300 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 768037095 1380 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1431
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
451-611 3.93e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   510 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 574
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 768037095   575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
925-1290 1.66e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 102.01  E-value: 1.66e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  925 TSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGgamstsadfggtLS 1004
Cdd:NF033849  239 AGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVG------------TS 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1005 TSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggtlSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVS 1084
Cdd:NF033849  307 ESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGV------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1085 TSACFSgapitnpgfggaFSTSAGFGGALSTaadfGGTPSNSIGFgaapSTSVSFGGAHGTSLcfggapstslcfgsaSN 1164
Cdd:NF033849  381 SSRSSS------------SGVSGGFSGGIAG----GGVTSEGLGA----SQGGSEGWGSGDSV---------------QS 425
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1165 TNLCFGGPPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLG 1244
Cdd:NF033849  426 VSQSYGSSSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TG 497
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 768037095 1245 TSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 1290
Cdd:NF033849  498 TSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
964-1302 4.38e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 100.46  E-value: 4.38e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  964 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 1043
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1044 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 1123
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1124 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 1203
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1204 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 1283
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 768037095 1284 VTSDGFGGGLGTNASFGST 1302
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
864-1280 6.08e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 6.08e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  864 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 942
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  943 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 1020
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1021 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 1100
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1101 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 1180
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1181 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 1260
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 768037095 1261 GGLGTSAGFSGGLGTSAGFG 1280
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
854-1138 2.88e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 84.67  E-value: 2.88e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  854 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 927
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  928 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 1003
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1004 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 1078
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037095 1079 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 1138
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
726-1425 1.42e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 1.42e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  726 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 805
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  806 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 885
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  886 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 965
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  966 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 1045
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1046 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 1125
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1126 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 1205
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 1285
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1286 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 1365
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1366 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
871-1311 4.96e-13

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 74.21  E-value: 4.96e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3468     1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSpGTSVSFGSALNTNAGYG 1030
Cdd:COG3468    81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGG-GGGGTGVGGTGAAAAGG 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1031 GAVSTNTDFGGTLSTSVCFGGspsTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3468   160 GTGSGGGGSGGGGGAGGGGGG---GAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGApstslcFGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3468   237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG------GANGGGSGGGGGASGTGGGGTASTGGGGGGGG 310
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3468   311 GNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDG 390
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|.
gi 768037095 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSG 1311
Cdd:COG3468   391 VGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
713-1080 3.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.57  E-value: 3.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  713 ENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISfggtlstsssfssaas 792
Cdd:NF033849  237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS---------------- 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  793 ISFGCAHSTSTSFSSeasisfggmpctsasfsggvsssfsgplSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSA 872
Cdd:NF033849  301 SSVGTSESQSHGTTE----------------------------GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTS 352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  873 PTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFG---GSSSTSANF 949
Cdd:NF033849  353 ISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGsgdSVQSVSQSY 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  950 GGTLSTSIcfdgspSTGAGFGGALNTSASfgsvlnTSTGFGGAMSTSADFGGTLSTSVcfggspGTSVSFGSALNTNAGY 1029
Cdd:NF033849  431 GSSSSTGT------SSGHSDSSSHSTSSG------QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768037095 1030 GGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFS 1080
Cdd:NF033849  493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
836-1312 2.74e-11

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 68.27  E-value: 2.74e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  836 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 915
Cdd:COG4625    27 GAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGG 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  916 SICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTsicfDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMST 995
Cdd:COG4625   107 GGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGG----GGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGG 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  996 SADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 1075
Cdd:COG4625   183 GGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGG 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1076 SASFSGAVSTsacfsGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPST 1155
Cdd:COG4625   263 GAGGGGGGGG-----GGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1156 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLST 1235
Cdd:COG4625   338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037095 1236 SSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGG 1312
Cdd:COG4625   418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
711-1425 3.02e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.17  E-value: 3.02e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  711 AQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSA 790
Cdd:COG3210   606 GSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGG 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  791 ASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFG 870
Cdd:COG3210   686 TTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANT 765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  871 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 950
Cdd:COG3210   766 TASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNT 845
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  951 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYG 1030
Cdd:COG3210   846 TDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1031 GAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 1110
Cdd:COG3210   926 LTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTT 1005
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1111 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTslcfGSASNTNLCFGGPPSTSACFSGATSPSFCDG 1190
Cdd:COG3210  1006 ASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNA----SGISGGNAAALTASGTAGTTGGTAASNGGGG 1081
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1191 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 1270
Cdd:COG3210  1082 TAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSA 1161
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1271 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGE 1350
Cdd:COG3210  1162 SAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQT 1241
                         650       660       670       680       690       700       710
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 768037095 1351 PSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG3210  1242 GSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGT 1316
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
714-1345 3.86e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 64.79  E-value: 3.86e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG3210   115 TLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGV 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG3210   195 TGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIG 274
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG3210   275 TTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGT 354
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG3210   355 TGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLG 434
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAcfsgapITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG3210   435 ITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGG------GIGTVTTNATISNNAGGDAN 508
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG3210   509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGL 1273
Cdd:COG3210   589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768037095 1274 GTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1345
Cdd:COG3210   669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
800-1285 1.05e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 63.26  E-value: 1.05e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  800 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 879
Cdd:COG4625    14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  880 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 959
Cdd:COG4625    94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  960 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 1039
Cdd:COG4625   174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1040 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 1119
Cdd:COG4625   252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1120 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 1199
Cdd:COG4625   332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1200 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG4625   412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                  ....*.
gi 768037095 1280 GGGLVT 1285
Cdd:COG4625   492 GGGNYT 497
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1096-1320 2.13e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 61.99  E-value: 2.13e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1096 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 1175
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1176 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 1253
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768037095  1254 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 1320
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
767-1268 1.16e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.79  E-value: 1.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  767 ASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGAS 846
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  847 SGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 926
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  927 TGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTS 1006
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1007 VCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTS 1086
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1087 ACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTN 1166
Cdd:COG4625   321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1167 LCFGGPPSTSACFSGATSPSFcDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG4625   401 GGGGAGGTGGGGAGGGGGAAG-GGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
                         490       500
                  ....*....|....*....|..
gi 768037095 1247 AGFGGGPGTSTGFGGGLGTSAG 1268
Cdd:COG4625   480 GNNTYTGTTTVNGGGNYTQSAG 501
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1049-1281 6.27e-07

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 53.90  E-value: 6.27e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1049 FGGSPSTSAGFGGALntnaSFGCAVSTSASFSGAVSTSAcFSGAPITNPGfggAFSTSAGFGGALstaadFGGTPSNSIG 1128
Cdd:pfam15967    6 FGGGPGSTATAGGGF----SFGAAAASNPGSTGGFSFGT-LGAAPAATAT---TTTATLGLGGGL-----FGQKPATGFT 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1129 FGAAPSTSvsfgGAHGTSLCFGGAPSTSlcfgSASNT--NLCFGGPPSTSACFS-GATSPSfcdgpstSTGFSFGNGLST 1205
Cdd:pfam15967   73 FGTPASST----AATGPTGLTLGTPAAT----TAASTgfSLGFNKPAASATPFSlPASSTS-------GGGLSLGSVLTS 137
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768037095  1206 NAGFGGGLNTSAGFGGGLGTSAGFSGGL---STSSGFDGGLGTSAGfGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGG 1281
Cdd:pfam15967  138 TAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
982-1196 6.80e-07

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.60  E-value: 6.80e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  982 VLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGG 1061
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1062 ALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNpgfggafSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGG 1141
Cdd:COG3469    81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTT-------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 768037095 1142 AHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACfSGATSPSFCDGPSTSTG 1196
Cdd:COG3469   154 SGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGP 207
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
852-1431 7.17e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 7.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  852 TLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 931
Cdd:COG5295     5 AGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVA 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  932 TLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 1011
Cdd:COG5295    85 SGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSST 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1012 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 1091
Cdd:COG5295   165 ANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAA 244
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1092 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIG--FGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCF 1169
Cdd:COG5295   245 SGNATTASASSVSGSAVAAGTASTATTASTTAASGAAgtATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALG 324
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1170 GGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGF 1249
Cdd:COG5295   325 SAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTG 404
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1250 GGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDR 1329
Cdd:COG5295   405 ASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSS 484
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1330 GLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGG 1409
Cdd:COG5295   485 AAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTAT 564
                         570       580
                  ....*....|....*....|..
gi 768037095 1410 PSTSAGFGSGAASLGACGFSYG 1431
Cdd:COG5295   565 GANSVALGAGSVASGANSVSVG 586
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1195-1431 2.50e-06

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 51.98  E-value: 2.50e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1195 TGFSFGNGLSTNAGFGGGLNtsagFGGGLGTSAGFSGGLstssGFDGGLGTSAGFGGGPGTSTGFGGGLgtsagFSGGLG 1274
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFS----FGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1275 TSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSrpnaSFDRGLSTIIGFGSGSNTSTGFTGEPSTS 1354
Cdd:pfam15967   69 TGFTFGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASAT----PFSLPASSTSGGGLSLGSVLTSTAAQQGA 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1355 TGFSSGPSSIVGFSGGPSTGVGFCSGPSTSG---FSGGPSTGAGfgggpNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:pfam15967  145 TGFTLNLGGTPATTTAVSTGLSLGSTLTSLGgslFQNTNSTGLG-----QTTLGLTLLATSTAPVSAPAASEGLGGLDFS 219
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1221-1424 1.01e-05

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 50.00  E-value: 1.01e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1221 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 1300
Cdd:cd21118   125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1301 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 1380
Cdd:cd21118   202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 768037095 1381 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1424
Cdd:cd21118   279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
PPE COG5651
PPE-repeat protein [Function unknown];
1169-1396 1.32e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 49.12  E-value: 1.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1169 FGGPPSTSACFSGATSPSFCDGPSTSTGFSFGN-GLSTNAGFG-GGLNTSAGfggglgtSAGFSGGLSTSSGFDGGLGTS 1246
Cdd:COG5651   167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANlGLTGLNQVGiGGLNSGSG-------PIGLNSGPGNTGFAGTGAAAG 239
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1247 AGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNAs 1326
Cdd:COG5651   240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA- 318
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1327 fdrGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGF 1396
Cdd:COG5651   319 ---AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PPE COG5651
PPE-repeat protein [Function unknown];
1230-1425 1.88e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.74  E-value: 1.88e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1230 SGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGtnaSFGSTLGTSAGF 1309
Cdd:COG5651   177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA---AAAAAAAAAAGA 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1310 SGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGG 1389
Cdd:COG5651   254 GASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 768037095 1390 PSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1425
Cdd:COG5651   334 AAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
714-1256 2.37e-05

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 49.01  E-value: 2.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  714 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 793
Cdd:COG4625     2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  794 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 873
Cdd:COG4625    82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  874 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 953
Cdd:COG4625   162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  954 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 1033
Cdd:COG4625   242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1034 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL 1113
Cdd:COG4625   322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1114 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSlcfGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 1193
Cdd:COG4625   402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG---ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 768037095 1194 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTS 1256
Cdd:COG4625   479 TGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1018-1251 3.01e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 48.51  E-value: 3.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1018 SFGSALNTNAGYGGAVStntdFGGTLStsvcfggSPSTSAG---FGGALNTNASFGCAVSTSASFSGAVstsacFSGAPI 1094
Cdd:pfam15967    5 SFGGGPGSTATAGGGFS----FGAAAA-------SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1095 TNPGFGGAFSTSAGFGGALSTaadfGGTPSNSigfgAAPSTSVSFGGAHGTslcfGGAPSTSLCFGSASNTNLCFGGPPS 1174
Cdd:pfam15967   69 TGFTFGTPASSTAATGPTGLT----LGTPAAT----TAASTGFSLGFNKPA----ASATPFSLPASSTSGGGLSLGSVLT 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1175 TSACFSGATSPSFCDGPSTSTGFSFGNGL---STNAGFGGGLNTSAGfGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGG 1251
Cdd:pfam15967  137 STAAQQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
PPE COG5651
PPE-repeat protein [Function unknown];
1200-1418 3.03e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.97  E-value: 3.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1200 GNGLSTNAGFGGGLNTSAGFGgglgtSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 1279
Cdd:COG5651   178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1280 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDgfgsrPNASFDRGLSTIIGFGSGSNTSTGFTGePSTSTGFSS 1359
Cdd:COG5651   253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAG-----SPLGLAGGGAGAAAATGLGLGAGGAAG-AAGATGAGA 326
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 768037095 1360 GPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGS 1418
Cdd:COG5651   327 ALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1009-1231 4.04e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 4.04e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1009 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 1088
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1089 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 1168
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 768037095 1169 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 1231
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1190-1398 5.56e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 47.74  E-value: 5.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1190 GPSTSTGFSFGNGLSTNAGFGGGLntsaGFGGGLGTSAGFSGGLSTSSGFDGGLgtsagFGGGPGTSTGFGGGLGTSAGF 1269
Cdd:pfam15967   13 TATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAAT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1270 SGGLGTSAGFGGGLVTSDGFGGGLGTNASFGS--TLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGfGSGSNTSTGF 1347
Cdd:pfam15967   84 GPTGLTLGTPAATTAASTGFSLGFNKPAASATpfSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLG-GTPATTTAVS 162
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 768037095  1348 TGEP--STSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGG 1398
Cdd:pfam15967  163 TGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Keratin_2_head pfam16208
Keratin type II head;
1204-1308 8.02e-05

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 44.65  E-value: 8.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1204 STNAGFGGGLNTSAGFGGGLGTSA--GFSGGLSTSSGFDGG---LGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAG 1278
Cdd:pfam16208   21 SSSRRGGGGGGGGGGGGGGFGSRSlyNLGGSKSISISVAGGgsrPGSGFGFGGGGGGGFGGGFGGGGGGGFGGGGGFGGG 100
                           90       100       110
                   ....*....|....*....|....*....|.
gi 768037095  1279 FGGGLVTSDGFG-GGLGTNASFGSTLGTSAG 1308
Cdd:pfam16208  101 FGGGGYGGGGFGgGGFGGRGGFGGPPCPPGG 131
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1101-1328 1.02e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 46.53  E-value: 1.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1101 GAFSTSAGFGGALSTAADFGGTPSNSIG-FGAAPSTSVSFGGAHGTSLCFGGAPSTSLC---FGSASNTNL---CFGGPP 1173
Cdd:cd21118   128 GAYGSQGGPGVQGHGIPGGTGGPWASGGnYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAqpgYGTVRGNNQnsgCTNPPP 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1174 STSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLN--TSAGFGGGLGTSAGFSGGLSTSSG-FDGGLGTSAGFG 1250
Cdd:cd21118   208 SGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNggNNGSSSSNSGNSGGSNGGSSGNSGsGSGGSSSGGSNG 287
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 768037095 1251 GGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAgfSGGLSTSDGFGSRPNASFD 1328
Cdd:cd21118   288 WGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEA--VGGLNTLNSDASTLPFNFD 363
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1182-1403 2.34e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 2.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1182 ATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGG 1261
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1262 GLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFgsrpNASFDRGLSTIIGFGSGS 1341
Cdd:COG3469    81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGA----SATSSAGSTTTTTTVSGT 156
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768037095 1342 NTSTGFTGEPSTSTGFSSGPSSivgfsggPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTG 1403
Cdd:COG3469   157 ETATGGTTTTSTTTTTTSASTT-------PSATTTATATTASGATTPSATTTATTTGPPTPG 211
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
922-1107 3.99e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 3.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   922 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 988
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   989 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 1065
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 768037095  1066 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 1107
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
PTZ00395 PTZ00395
Sec24-related protein; Provisional
1258-1404 6.20e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 44.68  E-value: 6.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1258 GFGGGL--GTSAGF-SGGLGTSAGFGG-GLVTSD---GFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPnasfdrg 1330
Cdd:PTZ00395  341 GFHDGSpnAASAGApFNGLGNQADGGHiNQVHPDargAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAG------- 413
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768037095 1331 lstiigFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPstsgFSGGPSTGAGFGGGPNTGA 1404
Cdd:PTZ00395  414 ------YSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLP----YSNTPYSNAPLSNAPPSSA 477
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1180-1431 8.64e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 43.83  E-value: 8.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1180 SGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFG--------- 1250
Cdd:cd21118    14 GGEASPLHSGGEGTGAGESAGHGLGDAISHGIGEAVGQGAKEAASSGIQNALGQGHGEEGGSTLGSRGDVFehrlgeaar 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1251 --GGPGTSTG-----------------FGGGLGTSAGFSGGLGTSAGFG--GGLVTSDGFGGGLGTNASFGS-------- 1301
Cdd:cd21118    94 slGNAGNEIGrqaediirhgvdavhnsWQGSGGHGAYGSQGGPGVQGHGipGGTGGPWASGGNYGTNSLGGSvgqggngg 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1302 --TLGTSagfSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 1379
Cdd:cd21118   174 plNYGTN---SQGAVAQPGYGTVRGNNQNSGCTNPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGG 250
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 768037095 1380 GPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1431
Cdd:cd21118   251 NNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGSSSSGGSGGSGG 302
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
989-1165 3.28e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 41.96  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095   989 FGGAMSTSADFGGTLSTSVCFGGSPGTS--VSFGSALNTNAGYGGAVSTNTDFGGTLstsvcFGGSPSTSAGFGGALNTN 1066
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGSTggFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1067 ASFGCAVSTSASFSGAVSTSACFSgapitnPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTS 1146
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFS------LGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGT 154
                          170
                   ....*....|....*....
gi 768037095  1147 LCFGGAPSTSLCFGSASNT 1165
Cdd:pfam15967  155 PATTTAVSTGLSLGSTLTS 173
Gly_rich pfam12810
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. ...
1207-1281 4.40e-03

Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. The proteins are composed of several glycine rich motifs interspersed through the sequence. Although many proteins have been annotated by similarity in the family these annotations given the biased composition of the sequences these are unlikely to be functionally relevant.


Pssm-ID: 403882 [Multi-domain]  Cd Length: 257  Bit Score: 40.72  E-value: 4.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095  1207 AGFGGG----LNTSAGFGGGLGTSAGFSG------GLSTSSGFDGGLGTSAGFGGGP-----GTSTGFGGGLGTSAGFSG 1271
Cdd:pfam12810  107 AGGGGGsgegDDGSGGYGGGLTGGGGGSGcyegsyGATQTSGGIGGYGINGSFGQGGngrnsGGGGGGGGGGGYYGGFGG 186
                           90
                   ....*....|
gi 768037095  1272 GLGTSAGFGG 1281
Cdd:pfam12810  187 GSYGGGGGGG 196
PPE COG5651
PPE-repeat protein [Function unknown];
1253-1425 5.03e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.03  E-value: 5.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1253 PGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLS 1332
Cdd:COG5651   171 PPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAA 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768037095 1333 TIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAG--FGGGPNTGAGFGGGP 1410
Cdd:COG5651   251 AGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGgaAGAAGATGAGAALGA 330
                         170
                  ....*....|....*
gi 768037095 1411 STSAGFGSGAASLGA 1425
Cdd:COG5651   331 GAAAAAAGAAAGAGA 345
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH