(A) Top 50 k-mers ranked by the weight vector, depicted as a k-mer graph, which the PLS procedure associates with the expression pattern of oocyte genes. Graph motif patterns were identified as k-mer clusters using the MCODE plug-in in Cytoscape. PSSMs generated through hierarchical sequence agglomeration of the corresponding k-mer sets are indicated, revealing several CG-rich motifs. (B) Analysis of oocyte
k-mer conservation using the motif conservation score (MCS). The plot shows the distribution of (oocyte MCS − non-oocyte MCS) for the top 50 k-mers versus the remaining k-mers in . The score distribution for the top 50 k-mers has a heavy right tail, showing that, as a distribution, the top 50 k-mers have higher oocyte-specific conservation scores than the other k-mers ( e-13 by a one-sided KS statistic). Significantly conserved k-mers are annotated, including CG-rich k-mers for oocyte genes. (C) Distribution of (sperm MCS
− non-sperm MCS) for the top 50 k-mers versus the remaining k-mers in . The score distribution for the top 50 k-mers has a heavy right tail, showing that the top 50 k-mers have higher sperm-specific conservation scores than the other k-mers ( e-5, one-sided KS statistic). Significantly conserved k-mers are annotated, including the ACGTG motif for sperm genes.
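
The one-sided KS comparison used in panels (B) and (C) can be illustrated with a minimal sketch. It assumes two lists of MCS differences, one for the top 50 k-mers and one for the remaining k-mers, and computes the one-sided two-sample statistic D+ = max over x of [F_rest(x) − F_top(x)], which is large when the top-50 scores are shifted to the right. The p-value uses the standard large-sample approximation exp(−2 D+² nm/(n+m)). The function name `one_sided_ks` and the toy scores are hypothetical; this is not necessarily the implementation behind the figure, and the approximation is rough for small samples.

```python
import math
from bisect import bisect_right


def one_sided_ks(top, rest):
    """One-sided two-sample KS statistic and asymptotic p-value.

    Tests whether `top` tends to take larger values than `rest`,
    i.e. whether the empirical CDF of `top` lies below that of `rest`.
    """
    top_sorted = sorted(top)
    rest_sorted = sorted(rest)
    n, m = len(top_sorted), len(rest_sorted)

    d_plus = 0.0
    # The empirical CDFs only change at observed values, so it is
    # enough to evaluate the difference at every data point.
    for x in set(top_sorted) | set(rest_sorted):
        f_top = bisect_right(top_sorted, x) / n
        f_rest = bisect_right(rest_sorted, x) / m
        d_plus = max(d_plus, f_rest - f_top)

    # Large-sample approximation for the one-sided statistic.
    p_value = math.exp(-2.0 * d_plus ** 2 * n * m / (n + m))
    return d_plus, p_value


# Toy, made-up score differences (not data from the figure).
top_scores = [0.8, 1.1, 1.5, 2.3, 3.0]
rest_scores = [-0.4, -0.1, 0.0, 0.2, 0.3, 0.5, 0.9]
d, p = one_sided_ks(top_scores, rest_scores)
print(f"D+ = {d:.3f}, approximate p = {p:.3g}")
```

A heavy right tail in the top-50 distribution shows up here as a large D+, because the top-50 empirical CDF rises more slowly than that of the remaining k-mers.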