Conserved domains on  [gi|1039290979|ref|NP_001315538|]

T-cell surface antigen CD2 isoform 1 precursor [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
IgV_CD2_like_N cd05775
N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; ...
29-127 3.66e-25

N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; The members here are composed of the N-terminal immunoglobulin (Ig)-like domain (or domain 1) of T-cell surface antigen Cluster of Differentiation (CD) 2 and similar proteins. CD2 is a T-cell specific surface glycoprotein and is critically important for mediating adhesion between T cells and antigen-presenting cells or between cytolytic T cells and target cells. CD2 is located on chromosome 1 at 1p13 in humans and on chromosome 3 in mice. CD2 contains an extracellular domain with two Ig-like domains, a single transmembrane segment, and a cytoplasmic region rich in proline and basic residues.



Pssm-ID: 409431  Cd Length: 98  Bit Score: 97.80  E-value: 3.66e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  29 NALETWGALGQDINLDIPSFQmsDDIDDIKWEKTSDKkkIAQFRK-EKETFKE--KDTYKLFKN-GTLKIKHLKTDDQDI 104
Cdd:cd05775     1 SSGEVYGALGGNVTLTISSLQ--DDIDEIKWKKTKDK--IVEWENnIGPTYFGsfKDRVLLDKEsGSLTIKNLTKEDSGT 76
                          90       100
                  ....*....|....*....|...
gi 1039290979 105 YKVSIYDTKGKnVLEKIFDLKIQ 127
Cdd:cd05775    77 YELEITSTNGK-VLSSKFTLEVL 98
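
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch for readers who want to tabulate these values from the plain-text listing, the Python below parses such a line with a regular expression. The function name `parse_summary` and the returned keys are illustrative choices, not part of any NCBI tool, and the pattern assumes the field order shown on this page.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional and only present on some hits.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line as a dict (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 409431  Cd Length: 98  Bit Score: 97.80  E-value: 3.66e-25"
    print(parse_summary(line))
```
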
C2-set pfam05790
Immunoglobulin C2-set domain;
135-205 9.27e-16

Immunoglobulin C2-set domain;



Pssm-ID: 399065  Cd Length: 80  Bit Score: 71.61  E-value: 9.27e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 135 ISWTCINTTLTCEVMNGT-------DPELNLYQDGKH-LKLSQRVITHKWTTSLSAKFKCTA-GNKVSKESSVEPVSCPG 205
Cdd:pfam05790   1 VTVSCSNNLLTCEVLELTlpkgskmDPSLKLKGQEAKsLETKKLESTFQPTTEDSGTWVCLAsDNDQKKLESVIEVLVLE 80
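
The alignment rows above pair a query line ("gi ...") with a model line ("Cdd:..."), each giving a start coordinate, the aligned string, and an end coordinate. The sketch below counts aligned and identical columns for one such pair. It assumes that "-" marks a gap and that lowercase letters mark residues not aligned to the model, which is how this page appears to use them; the helper names `parse_row` and `alignment_stats` are hypothetical.

```python
def parse_row(line):
    """Split one alignment row into (start, sequence, end).

    The last three whitespace-separated tokens are taken, so both
    'gi 1039290979 135 SEQ 205' and 'Cdd:pfam05790 1 SEQ 80' work.
    """
    start, seq, end = line.split()[-3:]
    return int(start), seq, int(end)


def alignment_stats(query_row, model_row):
    """Count total, aligned, and identical columns for one row pair."""
    _, query, _ = parse_row(query_row)
    _, model, _ = parse_row(model_row)
    if len(query) != len(model):
        raise ValueError("rows must have the same number of columns")
    aligned = identical = 0
    for q, m in zip(query, model):
        # Skip gap columns and (assumed) unaligned lowercase residues.
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return {"columns": len(query), "aligned": aligned, "identical": identical}


if __name__ == "__main__":
    query = "gi 1039290979 135 ISWTCINTTLTCEVMNGT-------DPELNLYQDGKH-LKLSQRVITHKWTTSLSAKFKCTA-GNKVSKESSVEPVSCPG 205"
    model = "Cdd:pfam05790   1 VTVSCSNNLLTCEVLELTlpkgskmDPSLKLKGQEAKsLETKKLESTFQPTTEDSGTWVCLAsDNDQKKLESVIEVLVLE 80"
    print(alignment_stats(query, model))
```

Identity computed this way is only a rough descriptor; the bit score and E-value reported by the search remain the primary measures of hit quality.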
Amelogenin super family cl33250
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
291-364 1.37e-06

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9 bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


The actual alignment was detected with superfamily member smart00818:

Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 47.86  E-value: 1.37e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039290979  291 PHQIPASTPQNPATSQHPP-PPPGHRSQAPS--HRPPPPGHrVQHQPQKRPPA-PSGTQVHQQKGPPLPRPRVQPKPP 364
Cdd:smart00818  55 HHHIPVLPAQQPVVPQQPLmPVPGQHSMTPTqhHQPNLPQP-AQQPFQPQPLQpPQPQQPMQPQPPVHPIPPLPPQPP 131
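
The model coordinates in an alignment can be compared with the "Cd Length" value to see how much of the domain model a hit spans. A minimal sketch, assuming "Cd Length" is the model length in residues and that the numbers on the "Cdd:" rows are model positions; the function name `model_coverage` is hypothetical:

```python
def model_coverage(model_start, model_end, model_length):
    """Fraction of the domain model spanned by the aligned region."""
    if not (1 <= model_start <= model_end <= model_length):
        raise ValueError("expected 1 <= start <= end <= model length")
    return (model_end - model_start + 1) / model_length


if __name__ == "__main__":
    # cd05775: model positions 1-98 of a 98-residue model.
    print(f"cd05775   {model_coverage(1, 98, 98):.2f}")
    # smart00818: model positions 55-131 of a 165-residue model.
    print(f"smart00818 {model_coverage(55, 131, 165):.2f}")
```

On the values shown above, the cd05775 hit spans the whole model, while the smart00818 hit spans 77 of 165 model positions, so it is a partial match to that model.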
 
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
290-364 3.12e-06

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 46.95  E-value: 3.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 290 KPHQIPASTPQNPATSQHPPPPPGHRSQAP------SHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 363
Cdd:pfam15240  76 PPPQGGKQKPQGPPPQGGPRPPPGKPQGPPpqggnqQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPPGNPQGPPQR 155

                  .
gi 1039290979 364 P 364
Cdd:pfam15240 156 P 156
PHA03247 PHA03247
large tegument protein UL36; Provisional
267-375 8.65e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 8.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  267 RSRRNDEELETRAHRVATEERGRKPHQiPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQV 346
Cdd:PHA03247  2585 RARRPDAPPQSARPRAPVDDRGDPRGP-APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663
                           90       100
                   ....*....|....*....|....*....
gi 1039290979  347 HQQKGPPlPRPRVQPKPPHGAAENSLSPS 375
Cdd:PHA03247  2664 PRRARRL-GRAAQASSPPQRPRRRAARPT 2691
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
291-376 4.40e-04

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 41.81  E-value: 4.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPASTPQNPatsqhPPPPPGHRSQAPSHRPPPpghrvqhqPQKRPPAPsgtqvhqqkgPPLPRPRVQPKPPHGAAEN 370
Cdd:NF040983   86 PNKVPPPPPPPP-----PPPPPPPTPPPPPPPPPP--------PPPPSPPP----------PPPPSPPPSPPPPTTTPPT 142

                  ....*.
gi 1039290979 371 SLSPSS 376
Cdd:NF040983  143 RTTPST 148
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
288-343 1.89e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 39.24  E-value: 1.89e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039290979 288 GRKPHQIPASTPQNPATSQHPPPPPGHRSQAPShRPPPPGHRVQHQPQKRPPAPSG 343
Cdd:COG3416    91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAP-QQPGYGQPQYGQPAAGPSGGGG 145
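
Several of the hits listed so far cover nearly the same stretch of the query. One way to see how many distinct query regions are involved is to merge overlapping intervals. The sketch below does this for the intervals reported above; `merge_intervals` is a hypothetical helper, and treating touching or overlapping intervals as one region is an assumption made for illustration.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals, inclusive coordinates."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    # Query intervals of the hits listed above.
    hits = [
        (29, 127),   # IgV_CD2_like_N
        (135, 205),  # C2-set
        (291, 364),  # Amelogenin
        (290, 364),  # Pro-rich
        (267, 375),  # PHA03247
        (291, 376),  # BimA_second
        (288, 343),  # COG3416
    ]
    print(merge_intervals(hits))
```

The seven hits collapse into three query regions: 29-127, 135-205, and 267-376. The last region lies in the part of CD2 that the cd05775 description calls a cytoplasmic region rich in proline and basic residues, so the multiple weaker hits there may reflect sequence composition rather than separate domains.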
 
PHA03247 PHA03247
large tegument protein UL36; Provisional
278-374 9.03e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 9.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  278 RAHRVATE--ERGRKPHQIPASTPQNPATSQHPPPPpghRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGP--- 352
Cdd:PHA03247  2882 PVRRLARPavSRSTESFALPPDQPERPPQPQAPPPP---QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsga 2958
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039290979  353 --------------PLPRPRV-QPKPPHGAAENSLSP 374
Cdd:PHA03247  2959 vpqpwlgalvpgrvAVPRFRVpQPAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
287-371 9.52e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 9.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  287 RGRKPhqiPASTPQNPATSQHPP--------PPPGHRSQA----PSHRPPPPGHRVQHQPQKRPPAPSGTQvHQQKGPPL 354
Cdd:PHA03247  2863 RRRPP---SRSPAAKPAAPARPPvrrlarpaVSRSTESFAlppdQPERPPQPQAPPPPQPQPQPPPPPQPQ-PPPPPPPR 2938
                           90
                   ....*....|....*...
gi 1039290979  355 PRPRVQPKP-PHGAAENS 371
Cdd:PHA03247  2939 PQPPLAPTTdPAGAGEPS 2956
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
293-365 4.00e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.80  E-value: 4.00e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039290979 293 QIPASTPQNPATSQHPP-PPPGHRSQAPSHRPPPPGHrvQHQPQKRPPAPsgtQVHQQKGPP---LPRPRVQPKPPH 365
Cdd:pfam09770 206 QAKKPAQQPAPAPAQPPaAPPAQQAQQQQQFPPQIQQ--QQQPQQQPQQP---QQHPGQGHPvtiLQRPQSPQPDPA 277
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
273-368 4.15e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.80  E-value: 4.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 273 EELET--RAH-RVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPghrVQHQPQKRPPAPSGTQVHQQ 349
Cdd:pfam09770 197 EEVEAamRAQaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP---QQHPGQGHPVTILQRPQSPQ 273
                          90
                  ....*....|....*....
gi 1039290979 350 KGPPLPRPRVQPKPPHGAA 368
Cdd:pfam09770 274 PDPAQPSIQPQAQQFHQQP 292
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
291-365 7.05e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.03  E-value: 7.05e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039290979 291 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPH 365
Cdd:pfam09770 222 PAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
291-377 9.06e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 9.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPAS---TPQNPATSQHPPPP----PGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPP------LPRP 357
Cdd:pfam03154 290 QHPVPPQpfpLTPQSSQSQVPPGPspaaPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPttpipqLPNP 369
                          90       100
                  ....*....|....*....|
gi 1039290979 358 RVQPKPPHGAAENSLSPSSN 377
Cdd:pfam03154 370 QSHKHPPHLSGPSPFQMNSN 389
PHA03247 PHA03247
large tegument protein UL36; Provisional
281-368 9.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 9.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  281 RVATEERGRKPHQIP--ASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 358
Cdd:PHA03247  2660 RVSRPRRARRLGRAAqaSSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                           90
                   ....*....|
gi 1039290979  359 VQPKPPHGAA 368
Cdd:PHA03247  2740 APPAVPAGPA 2749
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
288-366 1.13e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 42.33  E-value: 1.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 288 GRKPHQIPASTPQN----PATSQHPPPPPGHrsQAPSHRPPPPGHRVQHQPQKRPPAPSGTqvhQQKGPPLPRPRVQPKP 363
Cdd:pfam15240  52 GGFPPQPPASDDPPgpppPGGPQQPPPQGGK--QKPQGPPPQGGPRPPPGKPQGPPPQGGN---QQQGPPPPGKPQGPPP 126

                  ...
gi 1039290979 364 PHG 366
Cdd:pfam15240 127 QGG 129
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
291-361 1.14e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 1.14e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039290979 291 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPpgHRVQHQ--PQKRPPAPSGTQVHQQKGPPLPRPRVQP 361
Cdd:pfam09770 229 QAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPV--TILQRPqsPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
291-367 1.15e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 1.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPASTPQNPatSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQ------------QKGPPLPRPR 358
Cdd:pfam09770 246 PQQQPQQPQQHP--GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilqnpnrlsaarVGYPQNPQPG 323

                  ....*....
gi 1039290979 359 VQPKPPHGA 367
Cdd:pfam09770 324 VQPAPAHQA 332
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
279-370 1.20e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 279 AHRVATEERGRKPHQIPASTPQnPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 358
Cdd:PRK07764  419 AAAAPAPAAAPQPAPAPAPAPA-PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP 497
                          90
                  ....*....|..
gi 1039290979 359 VQPKPPHGAAEN 370
Cdd:PRK07764  498 AAPAAPAGADDA 509
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
290-365 1.76e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.49  E-value: 1.76e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039290979 290 KPHQiPASTPQNPATSQHPPPPPGHRSQA---PsHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPH 365
Cdd:pfam09770 275 DPAQ-PSIQPQAQQFHQQPPPVPVQPTQIlqnP-NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQ 351
PRK10927 PRK10927
cell division protein FtsN;
276-363 2.56e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 42.36  E-value: 2.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 276 ETRAHRVATEERGRKPHQIPASTP--QNPATSQHPPPPPGHRSQAP--SHRPPPPGHRVQHQPQK---RPPAPSGTQVHQ 348
Cdd:PRK10927  144 QTPEQRQQTLQRQRQAQQLAEQQRlaQQSRTTEQSWQQQTRTSQAApvQAQPRQSKPASTQQPYQdllQTPAHTTAQSKP 223
                          90
                  ....*....|....*
gi 1039290979 349 QKGPPLPRPRVQPKP 363
Cdd:PRK10927  224 QQAAPVTRAADAPKP 238
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
265-371 3.41e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 3.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  265 KQRSRRNDEELETRAHRVATEERGRKPHQIPASTPQNPATSQHP----------PPPPGHRSQAPSHRPPPPGHRVQHQP 334
Cdd:PHA03307   809 ADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSEsskskpaaagGRARGKNGRRRPRPPEPRARPGAAAP 888
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039290979  335 QKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENS 371
Cdd:PHA03307   889 PKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRG 925
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
294-364 3.48e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 3.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 294 IPASTPQNPATSQHPPPP---PGHRSQAPSHRPPPP-------GHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 363
Cdd:pfam03154 278 MPHSLQTGPSHMQHPVPPqpfPLTPQSSQSQVPPGPspaapgqSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKP 357

                  .
gi 1039290979 364 P 364
Cdd:pfam03154 358 P 358
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
279-376 4.68e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 4.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 279 AHRVATEERGRKPHQIPASTPQNPATSQhPPPPPGHRSQAPSHRPPPPGHRVqhqPQKRPPAPSGTQVHQQKGPPLPrPR 358
Cdd:PRK14951  391 AAPVAQAAAAPAPAAAPAAAASAPAAPP-AAAPPAPVAAPAAAAPAAAPAAA---PAAVALAPAPPAQAAPETVAIP-VR 465
                          90
                  ....*....|....*...
gi 1039290979 359 VQPKPPHGAAENSLSPSS 376
Cdd:PRK14951  466 VAPEPAVASAAPAPAAAP 483
PHA03247 PHA03247
large tegument protein UL36; Provisional
277-364 5.19e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 5.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  277 TRAHRV---ATEERGRKPHQIPASTPqnpATSQHPPPPPGhrsqapSHRPPPPGhrvqhqPQKRPPAPSGTQVHQQKGPP 353
Cdd:PHA03247  2584 SRARRPdapPQSARPRAPVDDRGDPR---GPAPPSPLPPD------THAPDPPP------PSPSPAANEPDPHPPPTVPP 2648
                           90
                   ....*....|.
gi 1039290979  354 LPRPRVQPKPP 364
Cdd:PHA03247  2649 PERPRDDPAPG 2659
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
283-376 5.23e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 5.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 283 ATEERGRKPHQIPASTPQNPATSQHPPPPPghRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPK 362
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAP--AAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA 473
                          90
                  ....*....|....
gi 1039290979 363 PPHGAAENSLSPSS 376
Cdd:PRK07764  474 PEPTAAPAPAPPAA 487
Gag_spuma pfam03276
Spumavirus gag protein;
291-368 8.23e-04

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 41.27  E-value: 8.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPASTPQN--PATSQHPPP------PPGHR-SQAPSHRP-PPPGHRVQHQPQKR---PPAPSGTQVHQQKGPPLPRP 357
Cdd:pfam03276 218 PGNIARSLGDDimPSLGDAGMPqprfafHPGNPfAEAEGHPFaEAEGERPRDIPRAPridAPSAPAIPAIQPIAPPMIPP 297
                          90
                  ....*....|..
gi 1039290979 358 R-VQPKPPHGAA 368
Cdd:pfam03276 298 IgAPIPIPHGAS 309
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
291-365 9.69e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.29  E-value: 9.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRP---PPPGHRVQHQPQKRPPAPSGTQVHQQ-----KGPPLPRPrVQPK 362
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlPSPHPPLQPMTQPPPPSQVSPQPLPQpslhgQMPPMPHS-LQTG 285

                  ...
gi 1039290979 363 PPH 365
Cdd:pfam03154 286 PSH 288
PHA03247 PHA03247
large tegument protein UL36; Provisional
262-376 1.22e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  262 KRKKQRSRrndeeletraHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGhrsqaPSHRPPPPGHRVqhqPQKRPPAP 341
Cdd:PHA03247   382 TRKRRSAR----------HAATPFARGPGGDDQTRPAAPVPASVPTPAPTPV-----PASAPPPPATPL---PSAEPGSD 443
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1039290979  342 SGTQVhqqkgPPLPRPRVQPKPPHGAAENSLSPSS 376
Cdd:PHA03247   444 DGPAP-----PPERQPPAPATEPAPDDPDDATRKA 473
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
286-376 1.37e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 1.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 286 ERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRP-----PAPSGTQVHQQKGPPLPRPRVQ 360
Cdd:PRK07764  610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAsdggdGWPAKAGGAAPAAPPPAPAPAA 689
                          90
                  ....*....|....*.
gi 1039290979 361 PKPPHGAAENSLSPSS 376
Cdd:PRK07764  690 PAAPAGAAPAQPAPAP 705
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
279-369 1.47e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 279 AHRVATEERgRKPHQIPASTPQNPATSQHPPPP----PGHRSQAPSHRPPPPGHRVQH--QPQKRPPAPSGTQVHQQKGP 352
Cdd:PRK07764  375 LARLERLER-RLGVAGGAGAPAAAAPSAAAAAPaaapAPAAAAPAAAAAPAPAAAPQPapAPAPAPAPPSPAGNAPAGGA 453
                          90
                  ....*....|....*..
gi 1039290979 353 PLPRPRVQPKPPHGAAE 369
Cdd:PRK07764  454 PSPPPAAAPSAQPAPAP 470
IgV_CEACAM_like cd05741
Immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion ...
35-116 1.66e-03

Immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) and similar proteins; The members here are composed of the immunoglobulin (Ig)-like domain in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) and related domains. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions: it is a cell adhesion molecule and a signaling molecule that regulates the growth of tumor cells, an angiogenic factor, and a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two (D1, D4) or four (D1-D4) Ig-like domains on the cell surface. This family corresponds to the D1 Ig-like domain. Also belonging to this group is the N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84-like family. The SLAM family is a group of immune-cell specific receptors that can regulate both adaptive and innate immune responses. SLAM family proteins are organized as an extracellular domain having two or four Ig-like domains, a single transmembrane segment, and a cytoplasmic region having Tyr-based motifs. The extracellular domain is organized as a membrane-distal Ig variable (IgV) domain that is responsible for ligand recognition and a membrane-proximal truncated Ig constant-2 (IgC2) domain.


Pssm-ID: 409403  Cd Length: 102  Bit Score: 37.50  E-value: 1.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  35 GALGQDINLDIPSFQMSDDidDIKWEKTSD---KKKIAQFRKEKETF----KEKDTYKLFKNGTLKIKHLKTDDQDIYKV 107
Cdd:cd05741     7 GAEGKNVLLLVPNLQTPLK--SVSWYKGKQvsrNDEIAEYENSSDEFragsAFSGREYIYTNGSLLIQNITLSDTGFYTL 84

                  ....*....
gi 1039290979 108 SIYDTKGKN 116
Cdd:cd05741    85 ESTNIGGKT 93
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
288-343 1.89e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 39.24  E-value: 1.89e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039290979 288 GRKPHQIPASTPQNPATSQHPPPPPGHRSQAPShRPPPPGHRVQHQPQKRPPAPSG 343
Cdd:COG3416    91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAP-QQPGYGQPQYGQPAAGPSGGGG 145
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
290-375 1.89e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 1.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 290 KPHQIPASTPQNPATSQHPP------PPPGHRSQA-PSHRPPP------PGHRVQHQPQKRPPAPSGTQVHQQKGPPLPR 356
Cdd:pfam03154 250 QPMTQPPPPSQVSPQPLPQPslhgqmPPMPHSLQTgPSHMQHPvppqpfPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP 329
                          90
                  ....*....|....*....
gi 1039290979 357 PRVQPKPPHGAAENSLSPS 375
Cdd:pfam03154 330 SQSQLQSQQPPREQPLPPA 348
PRK10263 PRK10263
DNA translocase FtsK; Provisional
276-374 2.01e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.45  E-value: 2.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  276 ETRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKrpPAPSGTQVHQQKGPPLP 355
Cdd:PRK10263   752 VQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA--PQPQYQQPQQPVAPQPQ 829
                           90
                   ....*....|....*....
gi 1039290979  356 RPRVQPKPPHGAAENSLSP 374
Cdd:PRK10263   830 YQQPQQPVAPQPQDTLLHP 848
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
295-375 2.33e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.97  E-value: 2.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 295 PASTPQNPATSQHPPPPPGhrSQAPSHRPPPPGHRVQHQPQKrPPAPSGTQVHQQKGPPLPRPRVQPKPPhGAAENSLSP 374
Cdd:PRK07764  425 PAAAPQPAPAPAPAPAPPS--PAGNAPAGGAPSPPPAAAPSA-QPAPAPAAAPEPTAAPAPAPPAAPAPA-AAPAAPAAP 500

                  .
gi 1039290979 375 S 375
Cdd:PRK07764  501 A 501
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
283-364 2.49e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 2.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  283 ATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRP--------PAPSGTQVHQQKGPPL 354
Cdd:PHA03307   328 STSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGrptrrrarAAVAGRARRRDATGRF 407
                           90
                   ....*....|
gi 1039290979  355 PRPRVQPKPP 364
Cdd:PHA03307   408 PAGRPRPSPL 417
PHA03247 PHA03247
large tegument protein UL36; Provisional
295-375 2.89e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 2.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  295 PASTPQNPATSQHPPPP---------PGH--RSQAPSHRPP-----PPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 358
Cdd:PHA03247  2832 TSAQPTAPPPPPGPPPPslplggsvaPGGdvRRRPPSRSPAakpaaPARPPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
                           90
                   ....*....|....*..
gi 1039290979  359 VQPKPPHGAAENSLSPS 375
Cdd:PHA03247  2912 QAPPPPQPQPQPPPPPQ 2928
PHA03247 PHA03247
large tegument protein UL36; Provisional
278-376 3.02e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 3.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  278 RAHRVATEERGRKPHQIPASTPQNPATSQHPP---PPPGHRSQAPSHRPPP--PGHRVQHQPQKRPPAPSGTQVHQQKGP 352
Cdd:PHA03247   344 RQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPkraSLPTRKRRSARHAATPfaRGPGGDDQTRPAAPVPASVPTPAPTPV 423
                           90       100
                   ....*....|....*....|....*
gi 1039290979  353 PLPRPRVQPKP-PHGAAENSLSPSS 376
Cdd:PHA03247   424 PASAPPPPATPlPSAEPGSDDGPAP 448
PHA03247 PHA03247
large tegument protein UL36; Provisional
263-366 3.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 3.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  263 RKKQRSRRNDEELETRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAP----SHRPPPPGHRVQHQPQKRP 338
Cdd:PHA03247   382 TRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAepgsDDGPAPPPERQPPAPATEP 461
                           90       100
                   ....*....|....*....|....*...
gi 1039290979  339 PAPSGTQVHQQKGPPLpRPRVQPKPPHG 366
Cdd:PHA03247   462 APDDPDDATRKALDAL-RERRPPEPPGA 488
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
296-369 3.34e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 39.34  E-value: 3.34e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039290979 296 ASTPQNPATSQHPPPPPG--HRSQAPSHRPPPPGHRVQHQPQKRPPAPSgtqvhqqkGPPLPRPRVQPKPPHGAAE 369
Cdd:PRK14965  381 APAPPSAAWGAPTPAAPAapPPAAAPPVPPAAPARPAAARPAPAPAPPA--------AAAPPARSADPAAAASAGD 448
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
294-376 3.50e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 39.41  E-value: 3.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 294 IPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQ--PQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENS 371
Cdd:PRK14950  360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPkePVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD 439

                  ....*
gi 1039290979 372 LSPSS 376
Cdd:PRK14950  440 EKPKY 444
Ig_2 pfam13895
Immunoglobulin domain; This domain contains immunoglobulin-like domains.
141-197 3.58e-03

Immunoglobulin domain; This domain contains immunoglobulin-like domains.


Pssm-ID: 464026 [Multi-domain]  Cd Length: 79  Bit Score: 36.22  E-value: 3.58e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039290979 141 NTTLTCEVMNGTDPELNLYQDGKHLKLSQRVITHKWTTSLSAKFKCTAGNKVSKESS 197
Cdd:pfam13895  16 PVTLTCSAPGNPPPSYTWYKDGSAISSSPNFFTLSVSAEDSGTYTCVARNGRGGKVS 72
DUF6264 pfam19779
Family of unknown function (DUF6264); This family of putative integral membrane proteins is ...
283-343 3.62e-03

Family of unknown function (DUF6264); This family of putative integral membrane proteins is functionally uncharacterized. This family of proteins is found in bacteria. Proteins in this family are typically between 179 and 218 amino acids in length.


Pssm-ID: 466182  Cd Length: 182  Bit Score: 38.01  E-value: 3.62e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039290979 283 ATEERGRKPHQIPASTpQNPATSQHPPPPPGHRSQAPSHRP---PPPGHRVQHQPQKRPPAPSG 343
Cdd:pfam19779   8 APPGWQRAPIGDPAAA-AAAAPPAAPAPAAPAPPAAPAAPPaapPPPGAPAPGAPAAARRARRW 70
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
299-364 3.99e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 37.71  E-value: 3.99e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039290979 299 PQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQkrPPAPSgtqvhQQKGPPL--PRPRVQPKPP 364
Cdd:pfam15240 106 PQGGNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPP--PPPPG-----NPQGPPQrpPQPGNPQGPP 166
PHA03419 PHA03419
E4 protein; Provisional
292-375 4.17e-03

E4 protein; Provisional


Pssm-ID: 223079 [Multi-domain]  Cd Length: 200  Bit Score: 38.01  E-value: 4.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 292 HQIPASTPQNPATSQHP-------PPPPGHRSQAP-SHRP-----PPPGHR--VQHQP---------QKRPPAPSGTQVH 347
Cdd:PHA03419   43 RQLETGYPFCPPTTPHPssqpppcPPSPGHPPQTNdTHEKdlalqPPPGGKkkEKKKKetekpaqggEKPDQGPEAKGEG 122
                          90       100
                  ....*....|....*....|....*...
gi 1039290979 348 QQKGPPLPRPRVQPKPPHGAAENSLSPS 375
Cdd:PHA03419  123 EGHEPEDPPPEDTPPPPGGEGEVEGGPS 150
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
283-368 4.20e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.20  E-value: 4.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 283 ATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPL-----PRP 357
Cdd:PRK07764  620 APAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAapagaAPA 699
                          90
                  ....*....|.
gi 1039290979 358 RVQPKPPHGAA 368
Cdd:PRK07764  700 QPAPAPAATPP 710
PHA03378 PHA03378
EBNA-3B; Provisional
283-363 4.26e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 39.28  E-value: 4.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 283 ATEERGRKPHQIPASTPQNPATSQHPPPPPGhrsqAPSHRPPP---PGHRVQHQPQKRPPAPsgtQVHQQKGP-PLPRPR 358
Cdd:PHA03378  729 AAPGRARPPAAAPGRARPPAAAPGRARPPAA----APGRARPPaaaPGAPTPQPPPQAPPAP---QQRPRGAPtPQPPPQ 801

                  ....*
gi 1039290979 359 VQPKP 363
Cdd:PHA03378  802 AGPTS 806
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
287-364 4.76e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 39.08  E-value: 4.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 287 RGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQ--QKGPPLPRPRVQPKPP 364
Cdd:PRK07994  360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQllAARQQLQRAQGATKAK 439
Jun pfam03957
Jun-like transcription factor;
294-342 6.00e-03

Jun-like transcription factor;


Pssm-ID: 461108 [Multi-domain]  Cd Length: 231  Bit Score: 37.97  E-value: 6.00e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1039290979 294 IPASTPQNPATSQHPPPPPGhRSQAPSHRPPPPGHRVQHQPQKRPPAPS 342
Cdd:pfam03957 179 APAQPPQPVSYAAEPPPFAV-PVQHPPPGRPPRLQALKEEPQTVPEVPS 226
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
298-364 6.31e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 38.90  E-value: 6.31e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039290979 298 TPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAP----SGTQVHQQKGPPLPRPRVQPKPP 364
Cdd:PTZ00449  596 KPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqrpsSPERPEGPKIIKSPKPPKSPKPP 666
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
292-364 6.32e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 37.08  E-value: 6.32e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039290979  292 HQIPASTPQNPATsqHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPsgTQVHQQKGPPLPRPRVQPKPP 364
Cdd:smart00818  37 HQIIPVSQQHPPT--HTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTP--TQHHQPNLPQPAQQPFQPQPL 105
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
291-376 6.77e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 38.43  E-value: 6.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAEN 370
Cdd:PRK07764  650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAS 729

                  ....*.
gi 1039290979 371 SLSPSS 376
Cdd:PRK07764  730 APSPAA 735
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
291-367 7.05e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 38.48  E-value: 7.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 291 PHQIPAST---PQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQ--HQPQKRPPAPSGTQVHQQKG----PPLPRPRVQP 361
Cdd:pfam09770 258 GQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVGYPQNPQPGvqpaPAHQAHRQQG 337

                  ....*.
gi 1039290979 362 KPPHGA 367
Cdd:pfam09770 338 SFGRQA 343
PRK12757 PRK12757
cell division protein FtsN; Provisional
293-364 7.12e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 37.71  E-value: 7.12e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039290979 293 QIPASTPQNPATSQHPPPPPghRSQAPSHRPPPPGHRVQHQPQKRPPapsgtqVHQQKGPPLPRPRVQPKPP 364
Cdd:PRK12757  113 QVPRSTVQIQQQAQQQQPPA--TTAQPQPVTPPRQTTAPVQPQTPAP------VRTQPAAPVTQAVEAPKVE 176
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
273-364 8.37e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 38.23  E-value: 8.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979  273 EELETRAHRVATEERGRKPHqiPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGP 352
Cdd:PHA03307   103 EGSPTPPGPSSPDPPPPTPP--PASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                           90
                   ....*....|..
gi 1039290979  353 PLPRPRVQPKPP 364
Cdd:PHA03307   181 ETARAPSSPPAE 192
PHA03378 PHA03378
EBNA-3B; Provisional
277-364 9.04e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 38.12  E-value: 9.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 277 TRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVqhqPQKRP-PAPSGTQvhQQKGPPLP 355
Cdd:PHA03378  732 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA---PQQRPrGAPTPQP--PPQAGPTS 806

                  ....*....
gi 1039290979 356 RPRVQPKPP 364
Cdd:PHA03378  807 MQLMPRAAP 815
PHA03378 PHA03378
EBNA-3B; Provisional
277-363 9.20e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 38.12  E-value: 9.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039290979 277 TRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPshRPPPPGHRVQHQPQKRPPAPSGTQvhQQKGPPLP- 355
Cdd:PHA03378  712 GRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRA--RPPAAAPGRARPPAAAPGAPTPQP--PPQAPPAPq 787

                  ....*....
gi 1039290979 356 -RPRVQPKP 363
Cdd:PHA03378  788 qRPRGAPTP 796
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
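
Each hit in the list above carries a one-line summary (Pssm-ID, optional [Multi-domain] flag, Cd Length, Bit Score, E-value), and the search reports only hits under the E-value threshold of 0.01. The following is a minimal sketch, not part of the CD-Search output, of how such summary lines could be read from this text and checked against that threshold. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical, and treating the cutoff as inclusive is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this
# text dump, e.g.
# "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 3.48e-04"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the cutoff."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 409403  Cd Length: 102  Bit Score: 37.50  E-value: 1.66e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

A stricter cutoff can be applied to the same parsed hits by passing a smaller `threshold`, for example to set aside the many low-complexity proline-rich matches that score near the 0.01 limit.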

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.