Conserved domains on  [gi|403310649|ref|NP_001258112|]

trophinin isoform 6 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
595-962 7.23e-24

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 108.55  E-value: 7.23e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  595 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 674
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  675 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 754
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  755 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 830
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  831 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 910
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310649  911 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 962
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
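Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) ahead of its alignment. The following is a minimal Python sketch for pulling those fields out of the text as it appears here. The names `HIT_LINE`, `parse_hit_line` and `model_coverage` are hypothetical helpers introduced for illustration, not part of any NCBI tool, and the regular expression assumes the exact label spelling seen in this dump.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
# The "[Multi-domain]" tag is optional (the pfam01454 hit does not have it).
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the aligned model interval."""
    return (cd_to - cd_from + 1) / cd_length
```

For the hit above, the Cdd rows run from model position 218 to 543, so `model_coverage(218, 543, 1122)` gives the share of the 1122-position model that the alignment spans. This is only a span-based estimate, since it ignores gaps inside the alignment.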
MAGE super family cl03220
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
13-142 2.82e-16

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


The actual alignment was detected with superfamily member pfam01454:

Pssm-ID: 426270  Cd Length: 205  Bit Score: 78.47  E-value: 2.82e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   13 FPEIIERASYTLEKMFRVNLKEID--------------------KQSSLYILIST---QESSAGILGTTK---------D 60
Cdd:pfam01454  33 FKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSKSYILVSTlppEYRVPAIIWPSKapsfvldqdE 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   61 TPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFGEVRKLItDEFVKQKYLEYKRVPNSRP--PEYE 135
Cdd:pfam01454 113 ATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNGNTDDLL-KRLVKQGYLVRTKEGASDDgeEIIE 191

                  ....*..
gi 403310649  136 FFWGLRS 142
Cdd:pfam01454 192 YRVGPRA 198
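The alignment blocks above pair a query row with a model row, column by column. As a rough sketch, identical positions in one block can be counted as follows. The function `count_identities` is a hypothetical helper, and it rests on an assumption about this display: uppercase letters are taken as aligned residues, while dashes and lowercase letters are taken as unaligned.

```python
def count_identities(query_row, model_row):
    """Count aligned columns and identical residues in one alignment block.

    Assumption: a column counts as aligned only when both rows hold an
    uppercase letter; '-' and lowercase letters are treated as unaligned.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    aligned = identical = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return aligned, identical


# Final block of the pfam01454 alignment above (query 136-142, model 192-198).
aligned, identical = count_identities("FFWGLRS", "YRVGPRA")
print(aligned, identical)  # prints "7 2"
```

Summing the two counts over all blocks of a hit gives an overall identity fraction. Only the residue strings should be passed in, with the `gi`/`Cdd:` labels and position numbers stripped first.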
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
385-669 3.52e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 80.43  E-value: 3.52e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  385 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 458
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  459 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 534
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  535 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 609
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310649  610 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 669
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
595-962 7.23e-24

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 108.55  E-value: 7.23e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  595 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 674
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  675 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 754
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  755 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 830
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  831 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 910
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310649  911 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 962
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
465-821 3.32e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 3.32e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  465 STSVSFGgsSSTSANFGGTLSTSicfdGSPSTGAGFGGAL--NTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 542
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQS----AGTGYGESVGHSTsqGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQST 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  543 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 622
Cdd:NF033849  292 SESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHST 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  623 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGgahgtslcfggapstslcfGSASNTNLCFGG 702
Cdd:NF033849  372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSG-------------------DSVQSVSQSYGS 432
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  703 PPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLGTSAGFGG 782
Cdd:NF033849  433 SSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TGTSESVSQ 504
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 403310649  783 GPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 821
Cdd:NF033849  505 GDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
495-833 9.24e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 95.46  E-value: 9.24e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  495 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 574
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  575 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 654
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  655 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 734
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  735 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 814
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 403310649  815 VTSDGFGGGLGTNASFGST 833
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
395-811 3.80e-18

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.06  E-value: 3.80e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  395 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 473
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  474 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 551
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  552 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 631
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  632 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 711
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  712 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 791
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 403310649  792 GGLGTSAGFSGGLGTSAGFG 811
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
13-142 2.82e-16

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 78.47  E-value: 2.82e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   13 FPEIIERASYTLEKMFRVNLKEID--------------------KQSSLYILIST---QESSAGILGTTK---------D 60
Cdd:pfam01454  33 FKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSKSYILVSTlppEYRVPAIIWPSKapsfvldqdE 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   61 TPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFGEVRKLItDEFVKQKYLEYKRVPNSRP--PEYE 135
Cdd:pfam01454 113 ATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNGNTDDLL-KRLVKQGYLVRTKEGASDDgeEIIE 191

                  ....*..
gi 403310649  136 FFWGLRS 142
Cdd:pfam01454 192 YRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
385-669 3.52e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 80.43  E-value: 3.52e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  385 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 458
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  459 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 534
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  535 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 609
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310649  610 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 669
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
257-956 6.42e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.73  E-value: 6.42e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  257 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 336
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  337 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 416
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  417 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 496
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  497 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 576
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  577 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 656
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  657 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 736
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  737 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 816
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  817 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 896
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  897 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 956
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
289-611 2.27e-11

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 68.11  E-value: 2.27e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  289 TFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISF--GGMPCTSASFSGgvsssfsgpl 366
Cdd:NF033849  246 SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSsvGTSESQSHGTTE---------- 315
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  367 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 446
Cdd:NF033849  316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  447 SIcfGGSPCTSTGFGGTLSTSVSFGGSSSTSanfggtlSTSICFDGSPSTGAGFGGALNTSASFGSvlNTSTGFGGAMST 526
Cdd:NF033849  396 GI--AGGGVTSEGLGASQGGSEGWGSGDSVQ-------SVSQSYGSSSSTGTSSGHSDSSSHSTSS--GQADSVSQGTSW 464
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  527 SADFGGTLSTSVcfggspGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 606
Cdd:NF033849  465 SEGTGTSQGQSV------GTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*
gi 403310649  607 SASFS 611
Cdd:NF033849  539 SISLG 543
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
627-851 6.07e-08

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 56.60  E-value: 6.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  627 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 706
Cdd:pfam15967   5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  707 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 784
Cdd:pfam15967  80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310649  785 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 851
Cdd:pfam15967 156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
331-816 8.24e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 8.24e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 331 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 410
Cdd:COG4625   14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 411 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 490
Cdd:COG4625   94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 491 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 570
Cdd:COG4625  174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 571 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 650
Cdd:COG4625  252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 651 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 730
Cdd:COG4625  332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 731 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 810
Cdd:COG4625  412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                 ....*.
gi 403310649 811 GGGLVT 816
Cdd:COG4625  492 GGGNYT 497
PTZ00395 PTZ00395
Sec24-related protein; Provisional
540-762 2.51e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 2.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  540 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 619
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  620 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 699
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310649  700 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 762
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
752-955 4.21e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and may therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes. It is found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine and in macrophages and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 40.75  E-value: 4.21e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 752 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 831
Cdd:cd21118  125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 832 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 911
Cdd:cd21118  202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 403310649 912 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 955
Cdd:cd21118  279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
453-638 8.30e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 40.04  E-value: 8.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  453 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 519
Cdd:pfam15967  28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  520 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 596
Cdd:pfam15967 103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 403310649  597 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 638
Cdd:pfam15967 181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
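Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". For readers who want to tabulate these values, the following is a minimal sketch of how such a line could be parsed. The function name `parse_hit_summary` and the returned field names are hypothetical and are not part of any NCBI tool; the regular expression assumes only the layout visible on this page, including the optional "[Multi-domain]" flag.

```python
import re

# Hypothetical pattern, derived only from the summary lines shown on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" flag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the numeric fields of one summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    example = ("Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  "
               "Bit Score: 40.04  E-value: 8.30e-03")
    print(parse_hit_summary(example))
```

Lines without the bracketed flag (for example the MAGE hit) are handled by the same pattern, with `multi_domain` set to `False`.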
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
595-962 7.23e-24

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 108.55  E-value: 7.23e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  595 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 674
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  675 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 754
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  755 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 830
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  831 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 910
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310649  911 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 962
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
465-821 3.32e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.00  E-value: 3.32e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  465 STSVSFGgsSSTSANFGGTLSTSicfdGSPSTGAGFGGAL--NTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 542
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQS----AGTGYGESVGHSTsqGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQST 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  543 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 622
Cdd:NF033849  292 SESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHST 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  623 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGgahgtslcfggapstslcfGSASNTNLCFGG 702
Cdd:NF033849  372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSG-------------------DSVQSVSQSYGS 432
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  703 PPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLGTSAGFGG 782
Cdd:NF033849  433 SSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TGTSESVSQ 504
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 403310649  783 GPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 821
Cdd:NF033849  505 GDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
495-833 9.24e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 95.46  E-value: 9.24e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  495 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 574
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  575 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 654
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  655 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 734
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  735 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 814
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 403310649  815 VTSDGFGGGLGTNASFGST 833
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
395-811 3.80e-18

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.06  E-value: 3.80e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  395 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 473
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  474 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 551
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  552 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 631
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  632 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 711
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  712 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 791
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 403310649  792 GGLGTSAGFSGGLGTSAGFG 811
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
13-142 2.82e-16

MAGE homology domain; Members of the MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein Nse3, which is part of the Smc5-6 complex and has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 78.47  E-value: 2.82e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   13 FPEIIERASYTLEKMFRVNLKEID--------------------KQSSLYILIST---QESSAGILGTTK---------D 60
Cdd:pfam01454  33 FKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSKSYILVSTlppEYRVPAIIWPSKapsfvldqdE 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649   61 TPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFGEVRKLItDEFVKQKYLEYKRVPNSRP--PEYE 135
Cdd:pfam01454 113 ATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNGNTDDLL-KRLVKQGYLVRTKEGASDDgeEIIE 191

                  ....*..
gi 403310649  136 FFWGLRS 142
Cdd:pfam01454 192 YRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
385-669 3.52e-15

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 80.43  E-value: 3.52e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  385 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 458
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  459 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 534
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  535 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 609
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310649  610 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 669
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
257-956 6.42e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.73  E-value: 6.42e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  257 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 336
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  337 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 416
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  417 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 496
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  497 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 576
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  577 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 656
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  657 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 736
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  737 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 816
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  817 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 896
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  897 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 956
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
289-611 2.27e-11

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) that have a highly serine-rich central region averaging over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 68.11  E-value: 2.27e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  289 TFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISF--GGMPCTSASFSGgvsssfsgpl 366
Cdd:NF033849  246 SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSsvGTSESQSHGTTE---------- 315
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  367 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 446
Cdd:NF033849  316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  447 SIcfGGSPCTSTGFGGTLSTSVSFGGSSSTSanfggtlSTSICFDGSPSTGAGFGGALNTSASFGSvlNTSTGFGGAMST 526
Cdd:NF033849  396 GI--AGGGVTSEGLGASQGGSEGWGSGDSVQ-------SVSQSYGSSSSTGTSSGHSDSSSHSTSS--GQADSVSQGTSW 464
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  527 SADFGGTLSTSVcfggspGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 606
Cdd:NF033849  465 SEGTGTSQGQSV------GTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*
gi 403310649  607 SASFS 611
Cdd:NF033849  539 SISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
242-956 1.14e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.94  E-value: 1.14e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  242 AQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSA 321
Cdd:COG3210   606 GSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGG 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  322 ASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFG 401
Cdd:COG3210   686 TTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANT 765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  402 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 481
Cdd:COG3210   766 TASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNT 845
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  482 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYG 561
Cdd:COG3210   846 TDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  562 GAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 641
Cdd:COG3210   926 LTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTT 1005
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  642 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTslcfGSASNTNLCFGGPPSTSACFSGATSPSFCDG 721
Cdd:COG3210  1006 ASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNA----SGISGGNAAALTASGTAGTTGGTAASNGGGG 1081
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  722 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 801
Cdd:COG3210  1082 TAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSA 1161
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  802 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGE 881
Cdd:COG3210  1162 SAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQT 1241
                         650       660       670       680       690       700       710
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 403310649  882 PSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 956
Cdd:COG3210  1242 GSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGT 1316
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
245-876 1.77e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.17  E-value: 1.77e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  245 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 324
Cdd:COG3210   115 TLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGV 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  325 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 404
Cdd:COG3210   195 TGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIG 274
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  405 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 484
Cdd:COG3210   275 TTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGT 354
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  485 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 564
Cdd:COG3210   355 TGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLG 434
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  565 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAcfsgapITNPGFGGAFSTSAGFGGAL 644
Cdd:COG3210   435 ITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGG------GIGTVTTNATISNNAGGDAN 508
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  645 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 724
Cdd:COG3210   509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  725 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGL 804
Cdd:COG3210   589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 403310649  805 GTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 876
Cdd:COG3210   669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
411-842 1.26e-09

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 62.27  E-value: 1.26e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 411 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 490
Cdd:COG3468    3 SGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAGSG 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 491 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSvcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 570
Cdd:COG3468   83 GTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGG---GGGTGSAGGGGGGGGGGTGVGGTGAAAAGG 159
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 571 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 650
Cdd:COG3468  160 GTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVG 239
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 651 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 730
Cdd:COG3468  240 GGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGG 319
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 731 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 810
Cdd:COG3468  320 SNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGG 399
                        410       420       430
                 ....*....|....*....|....*....|..
gi 403310649 811 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSG 842
Cdd:COG3468  400 TGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
367-843 2.09e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 61.72  E-value: 2.09e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 367 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 446
Cdd:COG4625   18 GGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGG 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 447 SICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMST 526
Cdd:COG4625   98 GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGG 177
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 527 SADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 606
Cdd:COG4625  178 GGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGG 257
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 607 SASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPST 686
Cdd:COG4625  258 NGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 687 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLST 766
Cdd:COG4625  338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
                        410       420       430       440       450       460       470
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310649 767 SSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGG 843
Cdd:COG4625  418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
627-851 6.07e-08

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 56.60  E-value: 6.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  627 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 706
Cdd:pfam15967   5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  707 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 784
Cdd:pfam15967  80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310649  785 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 851
Cdd:pfam15967 156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
331-816 8.24e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 8.24e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 331 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 410
Cdd:COG4625   14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 411 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 490
Cdd:COG4625   94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 491 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 570
Cdd:COG4625  174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 571 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 650
Cdd:COG4625  252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 651 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 730
Cdd:COG4625  332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 731 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 810
Cdd:COG4625  412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                 ....*.
gi 403310649 811 GGGLVT 816
Cdd:COG4625  492 GGGNYT 497
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
383-962 4.09e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 4.09e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 383 TLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 462
Cdd:COG5295    5 AGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVA 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 463 TLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 542
Cdd:COG5295   85 SGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSST 164
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 543 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 622
Cdd:COG5295  165 ANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAA 244
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 623 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIG--FGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCF 700
Cdd:COG5295  245 SGNATTASASSVSGSAVAAGTASTATTASTTAASGAAgtATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALG 324
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 701 GGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGF 780
Cdd:COG5295  325 SAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTG 404
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 781 GGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDR 860
Cdd:COG5295  405 ASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSS 484
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 861 GLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGG 940
Cdd:COG5295  485 AAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTAT 564
                        570       580
                 ....*....|....*....|..
gi 403310649 941 PSTSAGFGSGAASLGACGFSYG 962
Cdd:COG5295  565 GANSVALGAGSVASGANSVSVG 586
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
298-799 7.34e-07

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 53.24  E-value: 7.34e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 298 ASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGAS 377
Cdd:COG4625    1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 378 SGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 457
Cdd:COG4625   81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 458 TGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTS 537
Cdd:COG4625  161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 538 VCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTS 617
Cdd:COG4625  241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 618 ACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTN 697
Cdd:COG4625  321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 698 LCFGGPPSTSACFSGATSPSFcDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTS 777
Cdd:COG4625  401 GGGGAGGTGGGGAGGGGGAAG-GGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
                        490       500
                 ....*....|....*....|..
gi 403310649 778 AGFGGGPGTSTGFGGGLGTSAG 799
Cdd:COG4625  480 GNNTYTGTTTVNGGGNYTQSAG 501
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
513-727 6.92e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 6.92e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 513 VLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGG 592
Cdd:COG3469    1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 593 ALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNpgfggafSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGG 672
Cdd:COG3469   81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTT-------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 403310649 673 AHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACfSGATSPSFCDGPSTSTG 727
Cdd:COG3469  154 SGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGP 207
PTZ00395 PTZ00395
Sec24-related protein; Provisional
540-762 2.51e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 2.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  540 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 619
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  620 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 699
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310649  700 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 762
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
580-812 3.28e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 47.74  E-value: 3.28e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  580 FGGSPSTSAGFGGALntnaSFGCAVSTSASFSGAVSTSAcFSGAPITNPGfggAFSTSAGFGGALstaadFGGTPSNSIG 659
Cdd:pfam15967   6 FGGGPGSTATAGGGF----SFGAAAASNPGSTGGFSFGT-LGAAPAATAT---TTTATLGLGGGL-----FGQKPATGFT 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  660 FGAAPSTSvsfgGAHGTSLCFGGAPSTSlcfgSASNTNLCFGGPPSTSAcfsgATSPSFCDGPSTSTGFSFGNGLSTNAG 739
Cdd:pfam15967  73 FGTPASST----AATGPTGLTLGTPAAT----TAASTGFSLGFNKPAAS----ATPFSLPASSTSGGGLSLGSVLTSTAA 140
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310649  740 FGGGLNTSAGFGGGLGTSAGFSGGL---STSSGFDGGLGTSAGfGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGG 812
Cdd:pfam15967 141 QQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
726-962 5.07e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 46.97  E-value: 5.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  726 TGFSFGNGLSTNAGFGGGLNtsagFGGGLGTSAGFSGGLstssGFDGGLGTSAGFGGGPGTSTGFGGGLgtsagFSGGLG 805
Cdd:pfam15967   2 SGFSFGGGPGSTATAGGGFS----FGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  806 TSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSrpnaSFDRGLSTIIGFGSGSNTSTGFTGEPSTS 885
Cdd:pfam15967  69 TGFTFGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASAT----PFSLPASSTSGGGLSLGSVLTSTAAQQGA 144
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310649  886 TGFSSGPSSIVGFSGGPSTGVGFcsGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 962
Cdd:pfam15967 145 TGFTLNLGGTPATTTAVSTGLSL--GSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGGLDFS 219
PPE COG5651
PPE-repeat protein [Function unknown];
761-956 1.86e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.88  E-value: 1.86e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 761 SGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGtnaSFGSTLGTSAGF 840
Cdd:COG5651  177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA---AAAAAAAAAAGA 253
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 841 SGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGG 920
Cdd:COG5651  254 GASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 403310649 921 PSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 956
Cdd:COG5651  334 AAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
PPE COG5651
PPE-repeat protein [Function unknown];
700-932 2.10e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.88  E-value: 2.10e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 700 FGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNglstnagfggglntsAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAG 779
Cdd:COG5651  167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAN---------------LGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGF 231
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 780 FGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFD 859
Cdd:COG5651  232 AGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLG 311
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 403310649 860 RGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPN 932
Cdd:COG5651  312 AGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PTZ00395 PTZ00395
Sec24-related protein; Provisional
789-935 3.90e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 44.68  E-value: 3.90e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  789 GFGGGL--GTSAGF-SGGLGTSAGFGG-GLVTSD---GFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPnasfdrg 861
Cdd:PTZ00395  341 GFHDGSpnAASAGApFNGLGNQADGGHiNQVHPDargAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAG------- 413
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310649  862 lstiigFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPstsgFSGGPSTGAGFGGGPNTGA 935
Cdd:PTZ00395  414 ------YSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLP----YSNTPYSNAPLSNAPPSSA 477
PPE COG5651
PPE-repeat protein [Function unknown];
742-949 4.21e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 43.73  E-value: 4.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 742 GGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 821
Cdd:COG5651  178 GGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASA 257
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 822 GGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGG 901
Cdd:COG5651  258 ALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAA 337
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 403310649 902 PSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGS 949
Cdd:COG5651  338 GAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
245-787 6.43e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 43.61  E-value: 6.43e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 245 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 324
Cdd:COG4625    2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 325 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 404
Cdd:COG4625   82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 405 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 484
Cdd:COG4625  162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 485 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 564
Cdd:COG4625  242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 565 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL 644
Cdd:COG4625  322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 645 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSlcfGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 724
Cdd:COG4625  402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG---ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
                        490       500       510       520       530       540
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 403310649 725 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTS 787
Cdd:COG4625  479 TGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
721-929 7.26e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 43.50  E-value: 7.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  721 GPSTSTGFSFGNGLSTNAGFGGGLntsaGFGGGLGTSAGFSGGLSTSSGFDGGLgtsagFGGGPGTSTGFGGGLGTSAGF 800
Cdd:pfam15967  13 TATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAAT 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  801 SGGLGTSAGFGGGLVTSDGFGGGLGTNASFGS--TLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGfGSGSNTSTGF 878
Cdd:pfam15967  84 GPTGLTLGTPAATTAASTGFSLGFNKPAASATpfSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLG-GTPATTTAVS 162
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310649  879 TGEP--STSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGG 929
Cdd:pfam15967 163 TGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
560-782 7.26e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 43.50  E-value: 7.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  560 YGGAVSTNTDFGGTLSTSVCFGGSPSTSAG--FGGALNTNASFGCAVSTSASFSGAVstsacFSGAPITNPGFGGAFSTS 637
Cdd:pfam15967   6 FGGGPGSTATAGGGFSFGAAAASNPGSTGGfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  638 AGFGGALSTaadfGGTPsnsigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPS 717
Cdd:pfam15967  81 AATGPTGLT----LGTP--------AATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFT 148
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 403310649  718 FCDGPSTSTGFSFGNGL---STNAGFGGGLNTSAGfGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGG 782
Cdd:pfam15967 149 LNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
713-934 1.78e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 1.78e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 713 ATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGG 792
Cdd:COG3469    1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 793 GLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFgsrpNASFDRGLSTIIGFGSGS 872
Cdd:COG3469   81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGA----SATSSAGSTTTTTTVSGT 156
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 403310649 873 NTSTGFTGEPSTSTGFSSGPSSivgfsggPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTG 934
Cdd:COG3469  157 ETATGGTTTTSTTTTTTSASTT-------PSATTTATATTASGATTPSATTTATTTGPPTPG 211
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
752-955 4.21e-03

Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 40.75  E-value: 4.21e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 752 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 831
Cdd:cd21118  125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649 832 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 911
Cdd:cd21118  202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 403310649 912 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 955
Cdd:cd21118  279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
453-638 8.30e-03

Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 40.04  E-value: 8.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  453 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 519
Cdd:pfam15967  28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310649  520 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 596
Cdd:pfam15967 103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 403310649  597 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 638
Cdd:pfam15967 181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
PRK13729 PRK13729
conjugal transfer pilus assembly protein TraB; Provisional
779-823 9.87e-03

Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 39.42  E-value: 9.87e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*
gi 403310649 779 GFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGG 823
Cdd:PRK13729 324 GWAWGAGFVDGIGQGMERASQPAVGLGATAAYGAGDVLKMGIGGG 368
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
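
The E-value threshold above can be checked against the per-hit statistics lines on this page. The following is a minimal sketch, not part of the CD-Search output: the regular expression, the `parse_stats` helper, and the field names are illustrative assumptions based only on how the "Pssm-ID: ..." lines are laid out here, and the comparison assumes that hits with an E-value at or below the threshold are the ones reported.

```python
import re

# Assumed layout of a per-hit statistics line, copied from this page:
#   Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 43.50  E-value: 7.26e-04
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict, or None if it does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Threshold taken from the "Preset Options" line of this page.
E_VALUE_THRESHOLD = 0.01

# Two statistics lines copied verbatim from the hits listed above.
lines = [
    "Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 43.50  E-value: 7.26e-04",
    "Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 39.42  E-value: 9.87e-03",
]

hits = [parse_stats(line) for line in lines]
# Assumption: a hit is reported when its E-value does not exceed the threshold.
reported = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]

for hit in reported:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Both sample hits fall under the 0.01 threshold, which is consistent with their appearing in the list of domain hits.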
