NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|446115929|ref|WP_000193784|]
View 

MULTISPECIES: aminopeptidase N [Salmonella]

Protein Classification

M1 family metallopeptidase( domain architecture ID 11487037)

M1 family metallopeptidase is a zinc-dependent metallopeptidase that functions as an aminopeptidase and contains an HEXXH motif as part of its active site; such as aminopeptidase N, which is a type II integral membrane protease that preferentially cleaves neutral amino acids from the N-terminus of oligopeptides

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
pepN PRK14015
aminopeptidase N; Provisional
1-870 0e+00

aminopeptidase N; Provisional


:

Pssm-ID: 237585 [Multi-domain]  Cd Length: 875  Bit Score: 1687.61  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   1 MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHGAPD--APLRLDGEDLTLVSIHVNDAPWT--AYKE 76
Cdd:PRK14015   2 RTQQPQAIYLKDYRPPDYLIDTVDLDFDLDPDKTRVTARLQVRRNPDAAhsAPLVLDGEDLELLSLALDGQPLApsAYEL 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  77 EEGALVISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLL 156
Cdd:PRK14015  82 DEEGLTIENLPDRFTLEIETEIDPEANTALEGLYRSGGMFCTQCEAEGFRRITYFLDRPDVLARYTVRIEADKAKYPVLL 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 157 SNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKW 236
Cdd:PRK14015 162 SNGNLVESGELPDGRHWATWEDPFPKPSYLFALVAGDLDVLEDTFTTRSGREVALEIYVEPGNLDKCDHAMDSLKKSMKW 241
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 237 DEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQL 316
Cdd:PRK14015 242 DEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQL 321
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 317 SLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLG 396
Cdd:PRK14015 322 SLKEGLTVFRDQEFSADLGSRAVKRIEDVRVLRAAQFAEDAGPMAHPVRPDSYIEINNFYTATVYEKGAEVIRMLHTLLG 401
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 397 EENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQA 476
Cdd:PRK14015 402 EEGFRKGMDLYFERHDGQAVTCEDFVAAMEDASGRDLSQFRRWYSQAGTPRVTVSDEYDAAAGTYTLTLSQSTPPTPGQP 481
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 477 EKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMR 556
Cdd:PRK14015 482 EKQPLHIPVAIGLLDPDGKELPLQLEGEPVERVLELTEAEQTFTFENVAERPVPSLLRGFSAPVKLEYDYSDEDLLFLMA 561
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 557 HARNDFSRWDAAQSLLATYIKLNVARHqqGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIA 636
Cdd:PRK14015 562 HDSDPFNRWEAGQRLATRLLLANVARH--GQPLSLDEALIDAFRAVLLDESLDPAFAAELLTLPSEAELAEQMEVIDPDA 639
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 637 IAQVREALTRTLAAELADEFLAIYNANHLDE-YRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALA 715
Cdd:PRK14015 640 IHAAREALRRALATALKDELLALYEALQTDGpYSPDAEAAGRRALRNVCLSYLAAADDEEAAELAEAQFDQADNMTDRLA 719
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 716 ALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPA 795
Cdd:PRK14015 720 ALSALVNADLPERDEALADFYDRWKDDPLVMDKWFALQATSPAPDTLERVRALMQHPAFDLKNPNRVRSLIGAFAAANPA 799
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 446115929 796 AFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKALA 870
Cdd:PRK14015 800 GFHAADGSGYRFLADQILALDKINPQVAARLATPLIRWRRYDPKRQALMRAALERIAALPNLSKDVREIVSKALA 874
 
Name Accession Description Interval E-value
pepN PRK14015
aminopeptidase N; Provisional
1-870 0e+00

aminopeptidase N; Provisional


Pssm-ID: 237585 [Multi-domain]  Cd Length: 875  Bit Score: 1687.61  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   1 MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHGAPD--APLRLDGEDLTLVSIHVNDAPWT--AYKE 76
Cdd:PRK14015   2 RTQQPQAIYLKDYRPPDYLIDTVDLDFDLDPDKTRVTARLQVRRNPDAAhsAPLVLDGEDLELLSLALDGQPLApsAYEL 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  77 EEGALVISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLL 156
Cdd:PRK14015  82 DEEGLTIENLPDRFTLEIETEIDPEANTALEGLYRSGGMFCTQCEAEGFRRITYFLDRPDVLARYTVRIEADKAKYPVLL 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 157 SNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKW 236
Cdd:PRK14015 162 SNGNLVESGELPDGRHWATWEDPFPKPSYLFALVAGDLDVLEDTFTTRSGREVALEIYVEPGNLDKCDHAMDSLKKSMKW 241
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 237 DEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQL 316
Cdd:PRK14015 242 DEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQL 321
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 317 SLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLG 396
Cdd:PRK14015 322 SLKEGLTVFRDQEFSADLGSRAVKRIEDVRVLRAAQFAEDAGPMAHPVRPDSYIEINNFYTATVYEKGAEVIRMLHTLLG 401
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 397 EENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQA 476
Cdd:PRK14015 402 EEGFRKGMDLYFERHDGQAVTCEDFVAAMEDASGRDLSQFRRWYSQAGTPRVTVSDEYDAAAGTYTLTLSQSTPPTPGQP 481
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 477 EKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMR 556
Cdd:PRK14015 482 EKQPLHIPVAIGLLDPDGKELPLQLEGEPVERVLELTEAEQTFTFENVAERPVPSLLRGFSAPVKLEYDYSDEDLLFLMA 561
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 557 HARNDFSRWDAAQSLLATYIKLNVARHqqGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIA 636
Cdd:PRK14015 562 HDSDPFNRWEAGQRLATRLLLANVARH--GQPLSLDEALIDAFRAVLLDESLDPAFAAELLTLPSEAELAEQMEVIDPDA 639
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 637 IAQVREALTRTLAAELADEFLAIYNANHLDE-YRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALA 715
Cdd:PRK14015 640 IHAAREALRRALATALKDELLALYEALQTDGpYSPDAEAAGRRALRNVCLSYLAAADDEEAAELAEAQFDQADNMTDRLA 719
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 716 ALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPA 795
Cdd:PRK14015 720 ALSALVNADLPERDEALADFYDRWKDDPLVMDKWFALQATSPAPDTLERVRALMQHPAFDLKNPNRVRSLIGAFAAANPA 799
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 446115929 796 AFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKALA 870
Cdd:PRK14015 800 GFHAADGSGYRFLADQILALDKINPQVAARLATPLIRWRRYDPKRQALMRAALERIAALPNLSKDVREIVSKALA 874
pepN_proteo TIGR02414
aminopeptidase N, Escherichia coli type; The M1 family of zinc metallopeptidases contains a ...
12-869 0e+00

aminopeptidase N, Escherichia coli type; The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile (see SP|P04825 for a description of catalytic activity). This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc. [Protein fate, Degradation of proteins, peptides, and glycopeptides]


Pssm-ID: 274122 [Multi-domain]  Cd Length: 863  Bit Score: 1330.42  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   12 DYRAPDYQITDIDLTFDLDAEKTVVTAISQAVR-HGAPDAPLRLDGEDLTLVSIHVNDAPWTA--YKEEEGALVISDLPE 88
Cdd:TIGR02414   1 DYKPPPFLIEKTHLDFDLHEEETVVRARLTVRRnPDGNGAPLVLDGEELKLLSIAIDGKPLAAgdYQLDDETLTIASVPE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   89 RFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELE 168
Cdd:TIGR02414  81 SFTLEIETEIHPEENTSLEGLYKSGGNFCTQCEAEGFRRITYFPDRPDVMSRYTVTITADKKKYPVLLSNGNKIASGELP 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  169 NGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLD 248
Cdd:TIGR02414 161 DGRHWAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLD 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  249 IYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQ 328
Cdd:TIGR02414 241 IFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQ 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  329 EFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYF 408
Cdd:TIGR02414 321 EFSADMTSRAVKRIEDVRLLRAHQFPEDAGPMAHPVRPESYVEINNFYTATVYEKGAEVIRMLHTLLGEEGFRKGMDLYF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  409 ERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIE 488
Cdd:TIGR02414 401 SRHDGQAVTCEDFVAAMEDASGRDLNQFRRWYSQAGTPVLEVKENYDAAKKTYTLTVRQSTPPTPGQTEKKPLHIPIAVG 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  489 LYDNEGNVIPLQ-KGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDA 567
Cdd:TIGR02414 481 LLGPNGRKLMLSlDGERDTTRVLELTEAEQTFVFEGIAEKPVPSLLRGFSAPVNLEYPYSDEDLLLLLAHDSDPFNRWEA 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  568 AQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRT 647
Cdd:TIGR02414 561 GQRLARRVILANIARAQGGEELPVDPAFIDALGKLLNDPHLDAAFKALLLALPSEAYLAELMENIDPDALHAAREFLRAA 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  648 LAAELADEFLAIYNANHL-DEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLP 726
Cdd:TIGR02414 641 IARQLADDLLRLYDALQEnGPYSVDPAAAGRRALRNACLSYLSAADDAEIRNLALEQFKSADNMTDRLAALSALVHFESD 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  727 CRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPAAFHAQDGSGYQ 806
Cdd:TIGR02414 721 FRERALAAFYQKWKDDPLVMDKWFALQATSPRPDTLERVKALLQHPAFDLKNPNRVRALIGAFANNNLVRFHDISGSGYR 800
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 446115929  807 FLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKAL 869
Cdd:TIGR02414 801 FLADQIIAIDRFNPQVAARLLEPLTRWRKLDPKRQELMKAALERIAAEENLSKDVREVVSKAL 863
M1_APN cd09600
Peptidase M1 family, including aminopeptidase N catalytic domain; This model represents the ...
12-441 0e+00

Peptidase M1 family, including aminopeptidase N catalytic domain; This model represents the catalytic domain of aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease belonging to the M1 gluzincin family. It includes bacterial-type alanyl aminopeptidases as well as PfA-M1 aminopeptidase (Plasmodium falciparum-type). APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and, in higher eukaryotes, is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation, thus considered a marker of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. APNs are also present in many pathogenic bacteria and represent potential drug targets. Some APNs have been used commercially, such as one from Lactococcus lactis used in the food industry. APN also serves as a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs have also been extensively studied as putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341063 [Multi-domain]  Cd Length: 434  Bit Score: 871.84  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  12 DYRAPDYQITDIDLTFDLDAEKTVVTAISQAVR--HGAPDAPLRLDGEDLTLVSIHVNDAP--WTAYKEEEGALVISDLP 87
Cdd:cd09600    1 DYKPPDFLIDHVDLDFDLDDDETIVTSRLRVRRnpDSGEGAPLVLDGEDLELLSVKIDGKPlsPSDYTLDEEGLTIKNVP 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  88 ERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGEL 167
Cdd:cd09600   81 DRFVLEIEVRINPAANTSLEGLYKSGGILCTQCEAEGFRRITYFPDRPDVMSKFTVTIEADKEKYPVLLSNGNLIEEGEL 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 168 ENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDL 247
Cdd:cd09600  161 PNGRHFAVWEDPFPKPSYLFALVAGDLGSVEDTFTTKSGRKVKLRIYVEPGNEDKCHHAMESLKKAMKWDEERFGLEYDL 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 248 DIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRD 327
Cdd:cd09600  241 DLFNIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRD 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 328 QEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLY 407
Cdd:cd09600  321 QEFSADMNSRAVKRIEDVRRLRSAQFPEDAGPMAHPIRPDSYIEINNFYTVTVYEKGAEVIRMLHTLLGEEGFRKGMDLY 400
                        410       420       430
                 ....*....|....*....|....*....|....
gi 446115929 408 FERHDGSAATCDDFVQAMEDASNVDLSHFRRWYS 441
Cdd:cd09600  401 FERHDGQAVTCEDFVAAMEDASGRDLSQFKRWYS 434
PepN COG0308
Aminopeptidase N, contains DUF3458 domain [Amino acid transport and metabolism];
10-659 0e+00

Aminopeptidase N, contains DUF3458 domain [Amino acid transport and metabolism];


Pssm-ID: 440077 [Multi-domain]  Cd Length: 609  Bit Score: 579.29  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  10 RHDYRAPDYQITDIDLTFDLDAEKTVVTAIS--QAVRHGAPDAPLRLDGEDLTLVSIHVNDAPWTaYKEEEGALVIsDLP 87
Cdd:COG0308    7 LEAYRPPGYDVTHYDLDLDLDPATTRLSGTAtiTFTATEAPLDSLVLDLKGLEVTSVTVDGKPLD-FTRDGERLTI-TLP 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  88 ------ERFTLRIVNEISPaaNTALEGLYQSGDA------LCTQCEAEGFRhiTWYL--DRPDVLARFTTKIIADKskYP 153
Cdd:COG0308   85 kplapgETFTLEIEYSGKP--SNGGEGLYRSGDPpdgppyLYTQCEPEGAR--RWFPcfDHPDDKATFTLTVTVPA--GW 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 154 FLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFttRSGreVALELYVDRGNLDRAPWAMTSLKNS 233
Cdd:COG0308  159 VAVSNGNLVSETELGDGRTTWHWADTQPIPTYLFALAAGDYAVVEDTF--ASG--VPLRVYVRPGLADKAKEAFESTKRM 234
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 234 MKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLarTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDW 313
Cdd:COG0308  235 LDFFEELFGVPYPFDKYDQVAVPDFNFGAMENQGLVTFGEKVLA--DETATDADYERRESVIAHELAHQWFGNLVTCADW 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 314 FQLSLKEGLTVFRDQEFSSDLGSR-AVNRIsNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIH 392
Cdd:COG0308  313 DDLWLNEGFATYMEQLFSEDLYGKdAADRI-FVGALRSYAFAEDAGPNAHPIRPDDYPEIENFFDGIVYEKGALVLHMLR 391
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 393 TLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSH-FRRWYSQSGTPIVTVKDDYNPETeQYTLTISQRtpa 471
Cdd:COG0308  392 TLLGDEAFRAGLRLYFARHAGGNATTEDFLAALEEASGRDLSAfFDQWLYQAGLPTLEVEYEYDADG-KVTLTLRQT--- 467
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 472 tadQAEKQPLHIPFAIELYDnegnviplqkgGHPVNAVLNVTQAEQTFTfdnvyFQPVPALLCEFsapvkleykwsDQQL 551
Cdd:COG0308  468 ---PPRPHPFHIPLEVGLLG-----------GKLTARTVLLDGEQTELV-----AKPDPVLLLRL-----------DDEL 517
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 552 TFLMRHARNDFSRWDAAQSLLatyiklnvarhQQGQPLSLpvhvaDAFRAvlLDEKiDPALAAEILTLPSANEiaelfev 631
Cdd:COG0308  518 AFLLAHDSDPFNRWEALQALW-----------RDGEADYL-----DALRA--LADT-DPAVRAEALALLGSDQ------- 571
                        650       660
                 ....*....|....*....|....*...
gi 446115929 632 idpiaIAQVREALTRTLAAELADEFLAI 659
Cdd:COG0308  572 -----LALARAALALAAELALLRALDDL 594
DUF3458_C pfam17432
Domain of unknown function (DUF3458_C) ARM repeats; This presumed domain is functionally ...
548-870 9.09e-155

Domain of unknown function (DUF3458_C) ARM repeats; This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes.


Pssm-ID: 465424 [Multi-domain]  Cd Length: 324  Bit Score: 456.21  E-value: 9.09e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  548 DQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAE 627
Cdd:pfam17432   1 DEDLAFLLAHDSDPFNRWEAGQTLALRLLLALVAALQAGEPLALDAAFIDAFRAVLADAALDPAFKAEALTLPSEAYLAE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  628 LFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLD-EYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRD 706
Cdd:pfam17432  81 QMDVVDPDAIHAAREALRRALAEALRDELLALYQALAATgPYSPDAAAAGRRALRNLALSYLAAAGDPEAADLAAAQFES 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  707 ANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLI 786
Cdd:pfam17432 161 ADNMTDRLAALAALVNSDLPEREEALADFYQRWKDDPLVMDKWFALQATSPRPDTLERVKALMQHPAFDLKNPNRVRALI 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  787 GAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKIT 866
Cdd:pfam17432 241 GAFAAANPVAFHAADGSGYRFLADQVLELDAINPQVAARLLTPLTRWRRYDPPRQALMRAALERIAATPGLSKDVFEIVS 320

                  ....
gi 446115929  867 KALA 870
Cdd:pfam17432 321 KALA 324
 
Name Accession Description Interval E-value
pepN PRK14015
aminopeptidase N; Provisional
1-870 0e+00

aminopeptidase N; Provisional


Pssm-ID: 237585 [Multi-domain]  Cd Length: 875  Bit Score: 1687.61  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   1 MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHGAPD--APLRLDGEDLTLVSIHVNDAPWT--AYKE 76
Cdd:PRK14015   2 RTQQPQAIYLKDYRPPDYLIDTVDLDFDLDPDKTRVTARLQVRRNPDAAhsAPLVLDGEDLELLSLALDGQPLApsAYEL 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  77 EEGALVISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLL 156
Cdd:PRK14015  82 DEEGLTIENLPDRFTLEIETEIDPEANTALEGLYRSGGMFCTQCEAEGFRRITYFLDRPDVLARYTVRIEADKAKYPVLL 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 157 SNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKW 236
Cdd:PRK14015 162 SNGNLVESGELPDGRHWATWEDPFPKPSYLFALVAGDLDVLEDTFTTRSGREVALEIYVEPGNLDKCDHAMDSLKKSMKW 241
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 237 DEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQL 316
Cdd:PRK14015 242 DEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQL 321
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 317 SLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLG 396
Cdd:PRK14015 322 SLKEGLTVFRDQEFSADLGSRAVKRIEDVRVLRAAQFAEDAGPMAHPVRPDSYIEINNFYTATVYEKGAEVIRMLHTLLG 401
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 397 EENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQA 476
Cdd:PRK14015 402 EEGFRKGMDLYFERHDGQAVTCEDFVAAMEDASGRDLSQFRRWYSQAGTPRVTVSDEYDAAAGTYTLTLSQSTPPTPGQP 481
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 477 EKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMR 556
Cdd:PRK14015 482 EKQPLHIPVAIGLLDPDGKELPLQLEGEPVERVLELTEAEQTFTFENVAERPVPSLLRGFSAPVKLEYDYSDEDLLFLMA 561
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 557 HARNDFSRWDAAQSLLATYIKLNVARHqqGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIA 636
Cdd:PRK14015 562 HDSDPFNRWEAGQRLATRLLLANVARH--GQPLSLDEALIDAFRAVLLDESLDPAFAAELLTLPSEAELAEQMEVIDPDA 639
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 637 IAQVREALTRTLAAELADEFLAIYNANHLDE-YRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALA 715
Cdd:PRK14015 640 IHAAREALRRALATALKDELLALYEALQTDGpYSPDAEAAGRRALRNVCLSYLAAADDEEAAELAEAQFDQADNMTDRLA 719
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 716 ALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPA 795
Cdd:PRK14015 720 ALSALVNADLPERDEALADFYDRWKDDPLVMDKWFALQATSPAPDTLERVRALMQHPAFDLKNPNRVRSLIGAFAAANPA 799
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 446115929 796 AFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKALA 870
Cdd:PRK14015 800 GFHAADGSGYRFLADQILALDKINPQVAARLATPLIRWRRYDPKRQALMRAALERIAALPNLSKDVREIVSKALA 874
pepN_proteo TIGR02414
aminopeptidase N, Escherichia coli type; The M1 family of zinc metallopeptidases contains a ...
12-869 0e+00

aminopeptidase N, Escherichia coli type; The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile (see SP|P04825 for a description of catalytic activity). This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc. [Protein fate, Degradation of proteins, peptides, and glycopeptides]


Pssm-ID: 274122 [Multi-domain]  Cd Length: 863  Bit Score: 1330.42  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   12 DYRAPDYQITDIDLTFDLDAEKTVVTAISQAVR-HGAPDAPLRLDGEDLTLVSIHVNDAPWTA--YKEEEGALVISDLPE 88
Cdd:TIGR02414   1 DYKPPPFLIEKTHLDFDLHEEETVVRARLTVRRnPDGNGAPLVLDGEELKLLSIAIDGKPLAAgdYQLDDETLTIASVPE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   89 RFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELE 168
Cdd:TIGR02414  81 SFTLEIETEIHPEENTSLEGLYKSGGNFCTQCEAEGFRRITYFPDRPDVMSRYTVTITADKKKYPVLLSNGNKIASGELP 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  169 NGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLD 248
Cdd:TIGR02414 161 DGRHWAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLD 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  249 IYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQ 328
Cdd:TIGR02414 241 IFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQ 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  329 EFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYF 408
Cdd:TIGR02414 321 EFSADMTSRAVKRIEDVRLLRAHQFPEDAGPMAHPVRPESYVEINNFYTATVYEKGAEVIRMLHTLLGEEGFRKGMDLYF 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  409 ERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIE 488
Cdd:TIGR02414 401 SRHDGQAVTCEDFVAAMEDASGRDLNQFRRWYSQAGTPVLEVKENYDAAKKTYTLTVRQSTPPTPGQTEKKPLHIPIAVG 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  489 LYDNEGNVIPLQ-KGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDA 567
Cdd:TIGR02414 481 LLGPNGRKLMLSlDGERDTTRVLELTEAEQTFVFEGIAEKPVPSLLRGFSAPVNLEYPYSDEDLLLLLAHDSDPFNRWEA 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  568 AQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRT 647
Cdd:TIGR02414 561 GQRLARRVILANIARAQGGEELPVDPAFIDALGKLLNDPHLDAAFKALLLALPSEAYLAELMENIDPDALHAAREFLRAA 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  648 LAAELADEFLAIYNANHL-DEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLP 726
Cdd:TIGR02414 641 IARQLADDLLRLYDALQEnGPYSVDPAAAGRRALRNACLSYLSAADDAEIRNLALEQFKSADNMTDRLAALSALVHFESD 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  727 CRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLIGAFAGSNPAAFHAQDGSGYQ 806
Cdd:TIGR02414 721 FRERALAAFYQKWKDDPLVMDKWFALQATSPRPDTLERVKALLQHPAFDLKNPNRVRALIGAFANNNLVRFHDISGSGYR 800
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 446115929  807 FLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKITKAL 869
Cdd:TIGR02414 801 FLADQIIAIDRFNPQVAARLLEPLTRWRKLDPKRQELMKAALERIAAEENLSKDVREVVSKAL 863
M1_APN cd09600
Peptidase M1 family, including aminopeptidase N catalytic domain; This model represents the ...
12-441 0e+00

Peptidase M1 family, including aminopeptidase N catalytic domain; This model represents the catalytic domain of aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease belonging to the M1 gluzincin family. It includes bacterial-type alanyl aminopeptidases as well as PfA-M1 aminopeptidase (Plasmodium falciparum-type). APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and, in higher eukaryotes, is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation, thus considered a marker of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. APNs are also present in many pathogenic bacteria and represent potential drug targets. Some APNs have been used commercially, such as one from Lactococcus lactis used in the food industry. APN also serves as a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs have also been extensively studied as putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341063 [Multi-domain]  Cd Length: 434  Bit Score: 871.84  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  12 DYRAPDYQITDIDLTFDLDAEKTVVTAISQAVR--HGAPDAPLRLDGEDLTLVSIHVNDAP--WTAYKEEEGALVISDLP 87
Cdd:cd09600    1 DYKPPDFLIDHVDLDFDLDDDETIVTSRLRVRRnpDSGEGAPLVLDGEDLELLSVKIDGKPlsPSDYTLDEEGLTIKNVP 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  88 ERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGEL 167
Cdd:cd09600   81 DRFVLEIEVRINPAANTSLEGLYKSGGILCTQCEAEGFRRITYFPDRPDVMSKFTVTIEADKEKYPVLLSNGNLIEEGEL 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 168 ENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDL 247
Cdd:cd09600  161 PNGRHFAVWEDPFPKPSYLFALVAGDLGSVEDTFTTKSGRKVKLRIYVEPGNEDKCHHAMESLKKAMKWDEERFGLEYDL 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 248 DIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRD 327
Cdd:cd09600  241 DLFNIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRD 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 328 QEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLY 407
Cdd:cd09600  321 QEFSADMNSRAVKRIEDVRRLRSAQFPEDAGPMAHPIRPDSYIEINNFYTVTVYEKGAEVIRMLHTLLGEEGFRKGMDLY 400
                        410       420       430
                 ....*....|....*....|....*....|....
gi 446115929 408 FERHDGSAATCDDFVQAMEDASNVDLSHFRRWYS 441
Cdd:cd09600  401 FERHDGQAVTCEDFVAAMEDASGRDLSQFKRWYS 434
PepN COG0308
Aminopeptidase N, contains DUF3458 domain [Amino acid transport and metabolism];
10-659 0e+00

Aminopeptidase N, contains DUF3458 domain [Amino acid transport and metabolism];


Pssm-ID: 440077 [Multi-domain]  Cd Length: 609  Bit Score: 579.29  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  10 RHDYRAPDYQITDIDLTFDLDAEKTVVTAIS--QAVRHGAPDAPLRLDGEDLTLVSIHVNDAPWTaYKEEEGALVIsDLP 87
Cdd:COG0308    7 LEAYRPPGYDVTHYDLDLDLDPATTRLSGTAtiTFTATEAPLDSLVLDLKGLEVTSVTVDGKPLD-FTRDGERLTI-TLP 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  88 ------ERFTLRIVNEISPaaNTALEGLYQSGDA------LCTQCEAEGFRhiTWYL--DRPDVLARFTTKIIADKskYP 153
Cdd:COG0308   85 kplapgETFTLEIEYSGKP--SNGGEGLYRSGDPpdgppyLYTQCEPEGAR--RWFPcfDHPDDKATFTLTVTVPA--GW 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 154 FLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFttRSGreVALELYVDRGNLDRAPWAMTSLKNS 233
Cdd:COG0308  159 VAVSNGNLVSETELGDGRTTWHWADTQPIPTYLFALAAGDYAVVEDTF--ASG--VPLRVYVRPGLADKAKEAFESTKRM 234
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 234 MKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLarTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDW 313
Cdd:COG0308  235 LDFFEELFGVPYPFDKYDQVAVPDFNFGAMENQGLVTFGEKVLA--DETATDADYERRESVIAHELAHQWFGNLVTCADW 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 314 FQLSLKEGLTVFRDQEFSSDLGSR-AVNRIsNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIH 392
Cdd:COG0308  313 DDLWLNEGFATYMEQLFSEDLYGKdAADRI-FVGALRSYAFAEDAGPNAHPIRPDDYPEIENFFDGIVYEKGALVLHMLR 391
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 393 TLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSH-FRRWYSQSGTPIVTVKDDYNPETeQYTLTISQRtpa 471
Cdd:COG0308  392 TLLGDEAFRAGLRLYFARHAGGNATTEDFLAALEEASGRDLSAfFDQWLYQAGLPTLEVEYEYDADG-KVTLTLRQT--- 467
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 472 tadQAEKQPLHIPFAIELYDnegnviplqkgGHPVNAVLNVTQAEQTFTfdnvyFQPVPALLCEFsapvkleykwsDQQL 551
Cdd:COG0308  468 ---PPRPHPFHIPLEVGLLG-----------GKLTARTVLLDGEQTELV-----AKPDPVLLLRL-----------DDEL 517
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 552 TFLMRHARNDFSRWDAAQSLLatyiklnvarhQQGQPLSLpvhvaDAFRAvlLDEKiDPALAAEILTLPSANEiaelfev 631
Cdd:COG0308  518 AFLLAHDSDPFNRWEALQALW-----------RDGEADYL-----DALRA--LADT-DPAVRAEALALLGSDQ------- 571
                        650       660
                 ....*....|....*....|....*...
gi 446115929 632 idpiaIAQVREALTRTLAAELADEFLAI 659
Cdd:COG0308  572 -----LALARAALALAAELALLRALDDL 594
DUF3458_C pfam17432
Domain of unknown function (DUF3458_C) ARM repeats; This presumed domain is functionally ...
548-870 9.09e-155

Domain of unknown function (DUF3458_C) ARM repeats; This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes.


Pssm-ID: 465424 [Multi-domain]  Cd Length: 324  Bit Score: 456.21  E-value: 9.09e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  548 DQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAE 627
Cdd:pfam17432   1 DEDLAFLLAHDSDPFNRWEAGQTLALRLLLALVAALQAGEPLALDAAFIDAFRAVLADAALDPAFKAEALTLPSEAYLAE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  628 LFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLD-EYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRD 706
Cdd:pfam17432  81 QMDVVDPDAIHAAREALRRALAEALRDELLALYQALAATgPYSPDAAAAGRRALRNLALSYLAAAGDPEAADLAAAQFES 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  707 ANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRIRSLI 786
Cdd:pfam17432 161 ADNMTDRLAALAALVNSDLPEREEALADFYQRWKDDPLVMDKWFALQATSPRPDTLERVKALMQHPAFDLKNPNRVRALI 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  787 GAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDDKRQEKMRAALEQLKGLENLSGDLYEKIT 866
Cdd:pfam17432 241 GAFAAANPVAFHAADGSGYRFLADQVLELDAINPQVAARLLTPLTRWRRYDPPRQALMRAALERIAATPGLSKDVFEIVS 320

                  ....
gi 446115929  867 KALA 870
Cdd:pfam17432 321 KALA 324
M1 cd09595
Peptidase M1 family includes the catalytic domains of aminopeptidase N and leukotriene A4 ...
21-427 8.25e-122

Peptidase M1 family includes the catalytic domains of aminopeptidase N and leukotriene A4 hydrolase; The model represents the catalytic domains of M1 peptidase family members including aminopeptidase N (APN) and leukotriene A4 hydrolase (LTA4H). All peptidases in this family bind a single catalytic zinc ion which is tetrahedrally co-ordinated by three amino acid ligands and a water molecule that forms the nucleophile upon activation during catalysis. APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and is present in a variety of human tissues and cell types. APN expression is dysregulated in many inflammatory diseases and is enhanced in numerous tumor cells, making it a lead target in the development of anti-cancer and anti-inflammatory drugs. LTA4H is a bifunctional enzyme, possessing an aminopeptidase as well as an epoxide hydrolase activity. The two activities occupy different, but overlapping sites. The activity and physiological relevance of the aminopeptidase in LTA4H is as yet unknown, while the epoxide hydrolase converts leukotriene A4 (LTA4) into leukotriene B4 (LTB4), a potent chemotaxin that is fundamental to the inflammatory response of mammals.


Pssm-ID: 341058 [Multi-domain]  Cd Length: 413  Bit Score: 374.47  E-value: 8.25e-122
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  21 TDIDLTFDLDAEKTVVTAiSQAVRHGAPDAPLRLDGEDLTLVSIHVNDAPwTAYKEEE-------GALVISDLPERFTLR 93
Cdd:cd09595    3 YDLDLDVDFTTKTLNGTE-TLTVDASQVGRELVLDLVGLTIHSVSVNGAA-VDFGEREhydgeklTIPGPKPPGQTFTVR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  94 IVNEISPAANTAL----EGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKskYPFLLSNGNRVAQGELEN 169
Cdd:cd09595   81 ISFEAKPSKNLLGwlweQTAGKEKPYLFTQFEATHARRIFPCIDHPAVKATFTVTITTPK--KDLLASNGALVGEETGAN 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 170 GRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRsgREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDI 249
Cdd:cd09595  159 GRKTYRFEDTPPIPTYLVAVVVGDLEFKYVTVKSQ--PRVGLSVYSEPLQVDQAQYAFDATRAALAWFEDYFGGPYPLPK 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 250 YMIVAVDFFNMGAMENKGLNIFNSKYVLArtDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQE 329
Cdd:cd09595  237 YDLLAVPDFNSGAMENPGLITFRTTYLLR--SKVTDTGARSIENVIAHELAHQWFGNLVTMRWWNDLWLNEGFAVYYENR 314
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 330 FSSDLG---SRAVNRISNVRTMRGLQFAEDASPMAHPIRPDkvIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQL 406
Cdd:cd09595  315 IMDATFgtsSRHLDQLSGSSDLNTEQLLEDSSPTSTPVRSP--ADPDVAYDGVTYAKGALVLRMLEELVGEEAFDKGVQA 392
                        410       420
                 ....*....|....*....|.
gi 446115929 407 YFERHDGSAATCDDFVQAMED 427
Cdd:cd09595  393 YFNRHKFKNATTDDFIDALEE 413
Peptidase_M1 pfam01433
Peptidase family M1 domain; Members of this family are aminopeptidases. The members differ ...
225-439 1.93e-67

Peptidase family M1 domain; Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase activity.


Pssm-ID: 426262 [Multi-domain]  Cd Length: 219  Bit Score: 223.70  E-value: 1.93e-67
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  225 WAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWT 304
Cdd:pfam01433   1 YALEITVKLLEFYEDYFNIPYPLPKYDLVALPDFSAGAMENWGLITYRETLLLYDPGNSSTSDKQRVASVIAHELAHQWF 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  305 GNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPI--RPDKVIEMNNFYTLTVYE 382
Cdd:pfam01433  81 GNLVTMKWWDDLWLNEGFATYMEYLGTDALFPEWNIWEQFLLDEVQNAMARDALDSSHPItqNVNDPSEIDDIFDAIPYE 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 446115929  383 KGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASN-VDLSHF-RRW 439
Cdd:pfam01433 161 KGASVLRMLETLLGEEVFQKGLRSYLKKFQYGNATTEDLWDALSEASGpLDVDSFmDTW 219
M1_APN_like cd09603
Peptidase M1 family similar to aminopeptidase N catalytic domain; This family contains mostly ...
18-440 7.38e-50

Peptidase M1 family similar to aminopeptidase N catalytic domain; This family contains mostly bacterial and some archaeal M1 peptidases with smilarity to the catalytic domain of aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease belonging to the M1 gluzincin family. APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and, in higher eukaryotes, is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation, thus considered a marker of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. APNs are also present in many pathogenic bacteria and represent potential drug targets. Some APNs have been used commercially, such as one from Lactococcus lactis used in the food industry. APN also serves as a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs have also been extensively studied as putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341066 [Multi-domain]  Cd Length: 410  Bit Score: 181.24  E-value: 7.38e-50
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  18 YQITDIDLTFDLDAEKTVV--TAISQAVRHGAPDApLRLDGEDLTLVSIHVNDAPWTAYKEEEGALVIsDLP------ER 89
Cdd:cd09603    1 YDVLHYDLDLDYDPATKSLsgTATITFRATQDLDS-LQLDLVGLTVSSVTVDGVPAAFFTHDGDKLVI-TLPrplaagET 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  90 FTLRI----VNEISPAANTALEGLYQSGDALCTQCEAEGFRHitWY--LDRPDVLARFTTKIIADKSKYPflLSNGNRVA 163
Cdd:cd09603   79 FTVTVrysgKPRPAGYPPGDGGGWEEGDDGVWTAGQPEGAST--WFpcNDHPDDKATYDITVTVPAGLTV--VSNGRLVS 154
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 164 QGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTfttrSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGl 243
Cdd:cd09603  155 TTTNGGGTTTWHWKMDYPIATYLVTLAVGRYAVVEDG----SGGGIPLRYYVPPGDAAKAKASFARTPEMLDFFEELFG- 229
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 244 EYDLDIYMIVAVDFFNmGAMENKGLNIFNSKYVLARtdtatdkdyLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLT 323
Cdd:cd09603  230 PYPFEKYGQVVVPDLG-GGMEHQTATTYGNNFLNGD---------RGSERLIAHELAHQWFGDSVTCADWADIWLNEGFA 299
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 324 VFRDQEFSSDLGSRAVNRiSNVRTMRGLQFAEDASPMAHPIRPDkviemnnFYTLTVYEKGAEVIRMIHTLLGEENFQKG 403
Cdd:cd09603  300 TYAEWLWSEHKGGADAYR-AYLAGQRQDYLNADPGPGRPPDPDD-------LFDRDVYQKGALVLHMLRNLLGDEAFFAA 371
                        410       420       430
                 ....*....|....*....|....*....|....*...
gi 446115929 404 MQLYFERHDGSAATCDDFVQAMEDASNVDLSH-FRRWY 440
Cdd:cd09603  372 LRAYLARYAHGNVTTEDFIAAAEEVSGRDLTWfFDQWL 409
M1_APN cd09602
Peptidase M1 family including aminopeptidase N catalytic domain; This model represents the ...
4-439 6.24e-49

Peptidase M1 family including aminopeptidase N catalytic domain; This model represents the catalytic domain of bacterial and eukaryotic aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease belonging to the M1 gluzincin family. APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and, in higher eukaryotes, is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation, thus considered a marker of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. APNs are also present in many pathogenic bacteria and represent potential drug targets. Some APNs have been used commercially, such as one from Lactococcus lactis used in the food industry. APN also serves as a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs have also been extensively studied as putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341065 [Multi-domain]  Cd Length: 440  Bit Score: 179.63  E-value: 6.24e-49
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   4 QPQAKYRHDyrapdyQITDI--DLTFDLDAEKTVVTAISQ-AVRHGAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGA 80
Cdd:cd09602    3 REEAEQRAA------LISVVsyDLDLDLTEGAETFRGTVTiRFTLREPGASLFLDFRGGEVKSVTLNGRPLDPSAFDGER 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  81 LVISDLP--ERFTLRIVNEispAANTAL-EGLYQSGDA------LCTQCEAEGFRhiTWY--LDRPDVLARFTTKIIADK 149
Cdd:cd09602   77 ITLPGLLkaGENTVVVEFT---APYSSDgEGLHRFVDPadgetyLYTLFEPDDAR--RVFpcFDQPDLKATFTLTVTAPA 151
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 150 S-KypfLLSNGNRVAQGELENGRHWVqwqdpFPK----PCYLFALVAGDFDVLRDTfttrsGREVALELYVDRG----NL 220
Cdd:cd09602  152 DwT---VISNGPETSTEEAGGRKRWR-----FAEtpplSTYLFAFVAGPYHRVEDE-----HDGIPLGLYCRESlaeyER 218
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 221 DRAPW-AMTslKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLarTDTATDKDYLDIERVIGHEY 299
Cdd:cd09602  219 DADEIfEVT--KQGLDFYEDYFGIPYPFGKYDQVFVPEFNFGAMENPGAVTFRESYLF--REEPTRAQRLRRANTILHEM 294
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 300 FHNWTGNRVTCRDWFQLSLKEgltVFRDqeFSSDLGSRAVNRISNVRtmrgLQFA---------EDASPMAHPIRPDkvI 370
Cdd:cd09602  295 AHMWFGDLVTMKWWDDLWLNE---SFAD--FMAAKALAEATPFTDAW----LTFLlrrkpwayrADQLPTTHPIAQD--V 363
                        410       420       430       440       450       460       470
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 446115929 371 E-----MNNFYTLTvYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHF-RRW 439
Cdd:cd09602  364 PdleaaGSNFDGIT-YAKGASVLKQLVALVGEEAFRAGLREYFKKHAYGNATLDDLIAALDEASGRDLSAWaDAW 437
M1_APN-Q_like cd09601
Peptidase M1 aminopeptidase N catalytic domain family which includes aminopeptidase N (APN), ...
22-434 3.64e-46

Peptidase M1 aminopeptidase N catalytic domain family which includes aminopeptidase N (APN), aminopeptidase Q (APQ), tricorn interacting factor F3, and endoplasmic reticulum aminopeptidase 1 (ERAP1); This M1 peptidase family includes eukaryotic and bacterial members: the catalytic domains of aminopeptidase N (APN), aminopeptidase Q (APQ, laeverin), endoplasmic reticulum aminopeptidase 1 (ERAP1) as well as tricorn interacting factor F3. Aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease, preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is considered a marker of differentiation since it is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. ERAP1, also known as endoplasmic reticulum aminopeptidase associated with antigen processing (ERAAP), adipocyte derived leucine aminopeptidase (A-LAP), or aminopeptidase regulating tumor necrosis factor receptor I (THFRI) shedding (ARTS-1), associates with the closely related ER aminopeptidase ERAP2, for the final trimming of peptides within the ER for presentation by MHC class I molecules. ERAP1 is associated with ankylosing spondylitis (AS), an inflammatory arthritis that predominantly affects the spine. ERAP1 also aids in the shedding of membrane-bound cytokine receptors. The tricorn interacting factor F3, together with factors F1 and F2, degrades the tricorn protease products, producing free amino acids, thus completing the proteasomal degradation pathway. F3 is homologous to F2, but not F1, and shows a strong preference for glutamate in the P1' position. APQ, also known as laeverin, is specifically expressed in human embryo-derived extravillous trophoblasts (EVTs) that invade the uterus during early placentation. It cleaves the N-terminal amino acid of various peptides such as angiotensin III, endokinin C, and kisspeptin-10, all expressed in the placenta in large quantities. APN is a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs are also putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341064 [Multi-domain]  Cd Length: 442  Bit Score: 171.61  E-value: 3.64e-46
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  22 DIDLTFDLDAEK---------TVVTAISQAVRHGAPdapLRLDGEDLTLVSIHVNDAPWTAYKEEEGALVI---SDLP-- 87
Cdd:cd09601    4 DLTLTPDLENFTfsgsvtitlEVLEPTDTIVLHAKD---LTITSASLTLKGGSGIIEVTVVTDEETEFLTItldETLPpg 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  88 ERFTLRI--VNEIspaaNTALEGLYQS--GDA-------LCTQCEAEGFRHI--TWylDRPDVLARFTTKIIADKSKYPf 154
Cdd:cd09601   81 ENYTLSIefTGKL----NDDLRGFYRSsyTDEdgetrylAATQFEPTDARRAfpCF--DEPAFKATFDITITHPKGYTA- 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 155 lLSNGNRVAQGELENGRHWVQwqdpF---PK-PCYLFALVAGDFDVLrdtfTTRSGREVALELYVDRGNLDRAPWAMTSL 230
Cdd:cd09601  154 -LSNMPPVESTELEDGWKTTT----FettPPmSTYLVAFVVGDFEYI----ESTTKSGVPVRVYARPGKIEQGDFALEVA 224
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 231 KNSMKWDEARFGLEYDL---DIymiVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNR 307
Cdd:cd09601  225 PKILDFYEDYFGIPYPLpklDL---VAIPDFAAGAMENWGLITYRETALLYDPKTSSASDKQRVAEVIAHELAHQWFGNL 301
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 308 VTCRDWFQLSLKEGltvfrdqeFSSDLGSRAVN------RISNVRTMRGLQ--FAEDASPMAHPIRPD--KVIEMNNFYT 377
Cdd:cd09601  302 VTMKWWDDLWLNEG--------FATYMEYLAVDklfpewNMWDQFVVDELQsaLELDSLASSHPIEVPveSPSEISEIFD 373
                        410       420       430       440       450
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 446115929 378 LTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLS 434
Cdd:cd09601  374 AISYSKGASVLRMLENFLGEEVFRKGLRKYLKKHAYGNATTDDLWEALQEASGESKP 430
DUF3458 pfam11940
Domain of unknown function (DUF3458) Ig-like fold; This presumed domain is functionally ...
444-544 3.43e-45

Domain of unknown function (DUF3458) Ig-like fold; This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes. The domain has an Ig-like fold. This domain is found associated with pfam01433.


Pssm-ID: 463405 [Multi-domain]  Cd Length: 95  Bit Score: 157.29  E-value: 3.43e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  444 GTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLqkgghpvNAVLNVTQAEQTFTFDN 523
Cdd:pfam11940   1 GTPRVTVSDSYDAAAGTYTLTLSQTTPPTPGQPEKQPLHIPIRIALLDPNGQELAL-------ERVLELTEAEQTFTFEG 73
                          90       100
                  ....*....|....*....|.
gi 446115929  524 VYFQPVPALLCEFSAPVKLEY 544
Cdd:pfam11940  74 VAEKPVPSLLRGFSAPVKLEY 94
leuko_A4_hydro TIGR02411
leukotriene A-4 hydrolase/aminopeptidase; Members of this family represent a distinctive ...
21-446 2.49e-20

leukotriene A-4 hydrolase/aminopeptidase; Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.


Pssm-ID: 274120 [Multi-domain]  Cd Length: 602  Bit Score: 96.00  E-value: 2.49e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   21 TDIDLTFDLDAEKTVVTAISQAVRHGAPDAPLRLDGEDLTLVSIHVNDAPwTAYKEEE------GALVISdLP------E 88
Cdd:TIGR02411  16 TDLNLSVDFTKRKLSGSVTFTLKSLTDNLNKLVLDTSYLDIQKVTINGLP-ADFAIGErkeplgSPLTIS-LPiatsknD 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   89 RFTLRIVNEISPAAnTALEGLY------QSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIiadKSKYPFLLSnGNRV 162
Cdd:TIGR02411  94 EFVLNISFSTTPKC-TALQWLNpeqtsgKKHPYLFSQCQAIHARSLFPCQDTPSVKSTYTAEV---ESPLPVLMS-GIRD 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  163 AQGELENGRHwvQWQDPFPKPCYLFALVAGDFDvlRDTFTTRSgrevalELYVDRGNLDRAPWAMTSLKNSMKWDEARFG 242
Cdd:TIGR02411 169 GETSNDPGKY--LFKQKVPIPAYLIAIASGDLA--SAPIGPRS------TVYSEPEQLEKCQYEFENDTEKFIKTAEDLI 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  243 LEYDLDIY-MIVAVDFFNMGAMENKGLNiFNSKYVLARtdtatDKDYLDierVIGHEYFHNWTGNRVTCRDWFQLSLKEG 321
Cdd:TIGR02411 239 FPYEWGQYdLLVLPPSFPYGGMENPNLT-FATPTLIAG-----DRSNVD---VIAHELAHSWSGNLVTNCSWEHFWLNEG 309
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  322 LTVFRDQEFSSDL---GSRAVNRISN-------VRTMrglqfaEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMI 391
Cdd:TIGR02411 310 WTVYLERRIIGRLygeKTRHFSALIGwgdlqesVKTL------GETPEFTKLVVDLKDNDPDDAFSSVPYEKGFNFLFYL 383
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 446115929  392 HTLLG-EENFQKGMQLYFERHDGSAATCDDFVQAMED-------ASNVDLSHFRRWYSQSGTP 446
Cdd:TIGR02411 384 EQLLGgPAEFDPFLRHYFKKFAYKSLDTYQFKDALYEyfkdkkkVDKLDAVDWETWLYSPGMP 446
Peptidase_M1_N pfam17900
Peptidase M1 N-terminal domain; This domain is found at the N-terminus of aminopeptidases from ...
18-186 2.66e-20

Peptidase M1 N-terminal domain; This domain is found at the N-terminus of aminopeptidases from the M1 family.


Pssm-ID: 465557 [Multi-domain]  Cd Length: 186  Bit Score: 89.33  E-value: 2.66e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   18 YQIT-DIDLT-FDLDAEKTVVTAISQAVRHgapdapLRLDGEDLTLVSIHVNDAPWTAYK-------EEEGALVISDLPE 88
Cdd:pfam17900   5 YDLDlKIDLKnFTFSGSVTITLQLNNATNV------IVLHASDLTIRSISLSDEVTSDGVpadftedQKDGEKLTIVLPE 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929   89 ------RFTLRIvnEISPAANTALEGLYQS--------GDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKypF 154
Cdd:pfam17900  79 tlnqtgPYTLEI--EYSGELNDSMTGFYRStytdngekKVLVTTQFEPTDARSAFPCFDEPSVKATFTISIIHPKDY--T 154
                         170       180       190
                  ....*....|....*....|....*....|..
gi 446115929  155 LLSNGNRVAQGELENGRHWVQWQDPFPKPCYL 186
Cdd:pfam17900 155 ALSNMPVIASEPLENGWVITTFEQTPKMSTYL 186
M1_LTA4H cd09599
Peptidase M1 family including Leukotriene A4 hydrolase catalytic domain; This model represents ...
17-447 8.24e-19

Peptidase M1 family including Leukotriene A4 hydrolase catalytic domain; This model represents the N-terminal catalytic domain of leukotriene A4 hydrolase (LTA4H; E.C. 3.3.2.6) and the close homolog cold-active aminopeptidase (Colwellia psychrerythraea-type peptidase; ColAP), both members of the aminopeptidase M1 family. LTA4H is a bifunctional enzyme, possessing an aminopeptidase as well as an epoxide hydrolase activity. The two activities occupy different, but overlapping sites. The activity and physiological relevance of the aminopeptidase is poorly understood while the epoxide hydrolase converts leukotriene A4 (LTA4) into leukotriene B4 (LTB4), a potent chemotaxin that is fundamental to the inflammatory response of mammals. It accepts a variety of substrates, including some opioid, di- and tripeptides, as well as chromogenic aminoacyl-p-nitroanilide derivatives. The aminopeptidase activity of LTA4H is possibly involved in the processing of peptides related to inflammation and host defense. Kinetic analysis shows that LTA4H hydrolyzes arginyl tripeptides with high efficiency and specificity, indicating its function as an arginyl aminopeptidase. Thermodynamic characterization using different biophysical methods shows that structurally distinct inhibitors of the LTA4H occupy different regions of the binding site; while some (RB202, ARM1 and SC57461A) bind to the hydrophobic hydrolase side, both bestatin and captopril are located at the hydrophilic peptidase side. LTB4H overexpression is associated with different pathological conditions and diseases such as cystic fibrosis, coronary heart disease, sepsis, shock, connective tissue disease, and chronic obstructive pulmonary disease. It is also overexpressed in certain human cancers, and has been identified as a functionally important target for mediating anticancer properties of resveratrol, a well-known red wine polyphenolic compound with cancer chemopreventive activity.


Pssm-ID: 341062 [Multi-domain]  Cd Length: 442  Bit Score: 90.21  E-value: 8.24e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  17 DYQITDIDLTFDLDAEKTVV--TAISQAVRHGAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEE------GALVIS---- 84
Cdd:cd09599   10 EVRTTHLDLDLTVDFDKKTIsgSATLTLEVLQDGADELVLDTRDLDISSVTVNGGKELKFELGPrdpvlgSALTITlpsp 89
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929  85 -DLPERFTLRIVNEISPAAnTAL--------EG-----LYqsgdalcTQCEAEGFRhiTWY--LDRPDVLARFTTKIIAD 148
Cdd:cd09599   90 lAKGDTFKVKIEYSTTPQA-TALqwltpeqtAGkkhpyLF-------TQCQAIHAR--SLFpcQDTPSVKSTYSATVTVP 159
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 149 KSkYPFLLSnGNRVaQGELENGRHWVQWQDPFPKPCYLFALVAGDFdVLRDTfTTRSGrevaleLYVDRGNLDRAPWAMT 228
Cdd:cd09599  160 KG-LTALMS-ALRT-GEKEEAGTGTYTFEQPVPIPSYLIAIAVGDL-ESREI-GPRSG------VWAEPSVVDAAAEEFA 228
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 229 SLKNSMKWDEARFGlEYDLDIY-MIVAVDFFNMGAMENKGLnIFNSKYVLartdtATDKDYLDierVIGHEYFHNWTGNR 307
Cdd:cd09599  229 DTEKFLKAAEKLYG-PYVWGRYdLLVLPPSFPYGGMENPCL-TFATPTLI-----AGDRSLVD---VIAHEIAHSWSGNL 298
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 308 VTCRDWFQLSLKEGLTVFrdqefssdlgsrAVNRIS---NVRTMRGLQFAEDASPMAHPIrpdKVIEMNNFYTLTV---- 380
Cdd:cd09599  299 VTNANWEHFWLNEGFTVY------------LERRILerlYGEEYRQFEAILGWKDLQESI---KEFGEDPPYTLLVpdlk 363
                        410       420       430       440       450       460       470
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 446115929 381 ------------YEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFvqamedasnvdLSHFRRWYSQSGTPI 447
Cdd:cd09599  364 gvdpddafssvpYEKGFQFLYYLEQLGGREVFDPFLRAYFKKFAFQSIDTEDF-----------KDFLLEYFAEDKPEI 431
M1_APN_like cd09604
Peptidase M1 family similar to aminopeptidase N catalytic domain; This family contains ...
187-440 5.95e-17

Peptidase M1 family similar to aminopeptidase N catalytic domain; This family contains bacterial M1 peptidases with smilarity to the catalytic domain of aminopeptidase N (APN; CD13; alanyl aminopeptidase; EC 3.4.11.2), a type II integral membrane protease belonging to the M1 gluzincin family. APN preferentially cleaves neutral amino acids from the N-terminus of oligopeptides and, in higher eukaryotes, is present in a variety of human tissues and cell types (leukocyte, fibroblast, endothelial and epithelial cells). APN expression is dysregulated in inflammatory diseases such as chronic pain, rheumatoid arthritis, multiple sclerosis, systemic sclerosis, systemic lupus erythematosus, polymyositis/dermatomyosytis and pulmonary sarcoidosis, and is enhanced in tumor cells such as melanoma, renal, prostate, pancreas, colon, gastric and thyroid cancers. It is predominantly expressed on stem cells and on cells of the granulocytic and monocytic lineages at distinct stages of differentiation, thus considered a marker of differentiation. Thus, APN inhibition may lead to the development of anti-cancer and anti-inflammatory drugs. APNs are also present in many pathogenic bacteria and represent potential drug targets. Some APNs have been used commercially, such as one from Lactococcus lactis used in the food industry. APN also serves as a receptor for coronaviruses, although the virus receptor interaction site seems to be distinct from the enzymatic site and aminopeptidase activity is not necessary for viral infection. APNs have also been extensively studied as putative Cry toxin receptors. Cry1 proteins are pore-forming toxins that bind to the midgut epithelial cell membrane of susceptible insect larvae, causing extensive damage. Several different toxins, including Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to bind to APNs; however, a direct role of APN in cytotoxicity has been yet to be firmly established.


Pssm-ID: 341067 [Multi-domain]  Cd Length: 440  Bit Score: 84.25  E-value: 5.95e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 187 FALVAG-DFDVlrdtfTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGlEYDLDIYMIVAVDFFNmGAMEN 265
Cdd:cd09604  204 FAWAASpDFVV-----DAATVDGVTVNVYYLPENAEAAERALEYAKDALEFFSEKFG-PYPYPELDVVQGPFGG-GGMEY 276
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 266 KGLnIFNSKYVlartdtatDKDYLDIERVIGHEYFHNW----TGNrvtcrD-----WfqlsLKEGLTVFRDQEFSSDLGS 336
Cdd:cd09604  277 PGL-VFIGSRL--------YDPKRSLEGVVVHEIAHQWfygiVGN-----DerrepW----LDEGLATYAESLYLEEKYG 338
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446115929 337 RAVNRISNVRTMRGLQFAEDASPMAHPIR--PDkviemNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGS 414
Cdd:cd09604  339 KEAADELLGRRYYRAYARGPGGPINLPLDtfPD-----GSYYSNAVYSKGALFLEELREELGDEAFDKALREYYRRYKFK 413
                        250       260
                 ....*....|....*....|....*..
gi 446115929 415 AATCDDFVQAMEDASNVDLS-HFRRWY 440
Cdd:cd09604  414 HPTPEDFFRTAEEVSGKDLDwFFRGWL 440
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH