Combining bioinformatics and phylogenetics to identify large sets of single-copy orthologous genes (COSII) for comparative, evolutionary and systematic studies: a test case in the euasterid plant clade

Genetics. 2006 Nov;174(3):1407-20. doi: 10.1534/genetics.106.062455. Epub 2006 Sep 1.

Abstract

We report herein the application of a set of algorithms to identify a large number (2869) of single-copy orthologs (COSII), which are shared by most, if not all, euasterid plant species as well as the model species Arabidopsis. Alignments of the orthologous sequences across multiple species enabled the design of "universal PCR primers," which can be used to amplify the corresponding orthologs from a broad range of taxa, including those lacking any sequence databases. Functional annotation revealed that these conserved, single-copy orthologs encode a higher-than-expected frequency of proteins transported and utilized in organelles and a paucity of proteins associated with cell walls, protein kinases, transcription factors, and signal transduction. The enabling power of this new ortholog resource was demonstrated in phylogenetic studies, as well as in comparative mapping across the plant families tomato (family Solanaceae) and coffee (family Rubiaceae). The combined results of these studies provide compelling evidence that (1) the ancestral species that gave rise to the core euasterid families Solanaceae and Rubiaceae had a basic chromosome number of x=11 or 12.2) No whole-genome duplication event (i.e., polyploidization) occurred immediately prior to or after the radiation of either Solanaceae or Rubiaceae as has been recently suggested.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Arabidopsis / genetics
  • Base Sequence
  • Chromosome Mapping
  • Chromosomes, Plant
  • Coffee / genetics
  • Computational Biology*
  • Databases, Genetic
  • Expressed Sequence Tags
  • Gene Dosage*
  • Genes, Plant*
  • Magnoliopsida / classification
  • Magnoliopsida / genetics*
  • Molecular Sequence Data
  • Phylogeny*
  • Plants / genetics*
  • Sequence Homology, Amino Acid
  • Solanum lycopersicum / genetics

Substances

  • Coffee

Associated data

  • GENBANK/DQ422968
  • GENBANK/DQ422969
  • GENBANK/DQ422970
  • GENBANK/DQ422971
  • GENBANK/DQ422972
  • GENBANK/DQ422973
  • GENBANK/DQ422974
  • GENBANK/DQ422975
  • GENBANK/DQ422976
  • GENBANK/DQ422977
  • GENBANK/DQ422978
  • GENBANK/DQ422979
  • GENBANK/DQ422980
  • GENBANK/DQ422981
  • GENBANK/DQ422982
  • GENBANK/DQ422983
  • GENBANK/DQ422984
  • GENBANK/DQ422985
  • GENBANK/DQ422986
  • GENBANK/DQ422987
  • GENBANK/DQ422988
  • GENBANK/DQ422989
  • GENBANK/DQ422990
  • GENBANK/DQ422991
  • GENBANK/DQ422992
  • GENBANK/DQ422993
  • GENBANK/DQ422994
  • GENBANK/DQ422995
  • GENBANK/DQ422996
  • GENBANK/DQ422997
  • GENBANK/DQ422998
  • GENBANK/DQ422999
  • GENBANK/DQ423000
  • GENBANK/DQ423001
  • GENBANK/DQ423002
  • GENBANK/DQ423003
  • GENBANK/DQ423004
  • GENBANK/DQ423005
  • GENBANK/DQ423006
  • GENBANK/DQ423007
  • GENBANK/DQ423008
  • GENBANK/DQ423009
  • GENBANK/DQ423010
  • GENBANK/DQ423011
  • GENBANK/DQ423012
  • GENBANK/DQ423013
  • GENBANK/DQ423014
  • GENBANK/DQ423015
  • GENBANK/DQ423016
  • GENBANK/DQ423017
  • GENBANK/DQ423018
  • GENBANK/DQ423019
  • GENBANK/DQ423020
  • GENBANK/DQ423021
  • GENBANK/DQ423022
  • GENBANK/DQ423023
  • GENBANK/DQ423024
  • GENBANK/DQ423025
  • GENBANK/DQ423026
  • GENBANK/DQ423027
  • GENBANK/DQ423028
  • GENBANK/DQ423029
  • GENBANK/DQ423030
  • GENBANK/DQ423031
  • GENBANK/DQ423032
  • GENBANK/DQ423033
  • GENBANK/DQ423034
  • GENBANK/DQ423035
  • GENBANK/DQ423036
  • GENBANK/DQ423037
  • GENBANK/DQ423038
  • GENBANK/DQ423039
  • GENBANK/DQ423040
  • GENBANK/DQ423041
  • GENBANK/DQ423042
  • GENBANK/DQ423043
  • GENBANK/DQ423044
  • GENBANK/DQ423045
  • GENBANK/DQ423046
  • GENBANK/DQ423047
  • GENBANK/DQ423048
  • GENBANK/DQ423049
  • GENBANK/DQ423050
  • GENBANK/DQ423051
  • GENBANK/DQ423052
  • GENBANK/DQ423053
  • GENBANK/DQ423054
  • GENBANK/DQ423055
  • GENBANK/DQ423056
  • GENBANK/DQ423057
  • GENBANK/DQ423058
  • GENBANK/DQ423059
  • GENBANK/DQ423060
  • GENBANK/DQ423061
  • GENBANK/DQ423062
  • GENBANK/DQ423063
  • GENBANK/DQ423064
  • GENBANK/DQ423065
  • GENBANK/DQ423066
  • GENBANK/DQ423067
  • GENBANK/DQ423068
  • GENBANK/DQ423069
  • GENBANK/DQ423070
  • GENBANK/DQ423071
  • GENBANK/DQ423072
  • GENBANK/DQ423073
  • GENBANK/DQ423074
  • GENBANK/DQ423075
  • GENBANK/DQ423076
  • GENBANK/DQ423077
  • GENBANK/DQ423078
  • GENBANK/DQ423079
  • GENBANK/DQ423080
  • GENBANK/DQ423081
  • GENBANK/DQ423082
  • GENBANK/DQ423083
  • GENBANK/DQ423084
  • GENBANK/DQ423085
  • GENBANK/DQ423086
  • GENBANK/DQ423087
  • GENBANK/DQ423088
  • GENBANK/DQ423089
  • GENBANK/DQ423090
  • GENBANK/DQ423091
  • GENBANK/DQ423092
  • GENBANK/DQ423093
  • GENBANK/DQ423094
  • GENBANK/DQ423095
  • GENBANK/DQ423096
  • GENBANK/DQ423097
  • GENBANK/DQ423098
  • GENBANK/DQ423099
  • GENBANK/DQ423100
  • GENBANK/DQ423101
  • GENBANK/DQ423102
  • GENBANK/DQ423103
  • GENBANK/DQ423104
  • GENBANK/DQ423105
  • GENBANK/DQ423106
  • GENBANK/DQ423107
  • GENBANK/DQ423108
  • GENBANK/DQ423109
  • GENBANK/DQ423110
  • GENBANK/DQ423111
  • GENBANK/DQ423112
  • GENBANK/DQ423113
  • GENBANK/DQ423114
  • GENBANK/DQ423115
  • GENBANK/DQ423116
  • GENBANK/DQ423117
  • GENBANK/DQ423118
  • GENBANK/DQ423119
  • GENBANK/DQ423120
  • GENBANK/DQ423121
  • GENBANK/DQ423122
  • GENBANK/DQ423123
  • GENBANK/DQ423124
  • GENBANK/DQ423125
  • GENBANK/DQ423126
  • GENBANK/DQ423127
  • GENBANK/DQ423128
  • GENBANK/DQ423129
  • GENBANK/DQ423130
  • GENBANK/DQ423131
  • GENBANK/DQ423132
  • GENBANK/DQ423133
  • GENBANK/DQ423134
  • GENBANK/DQ423135
  • GENBANK/DQ423136
  • GENBANK/DQ423137
  • GENBANK/DQ423138
  • GENBANK/DQ423139
  • GENBANK/DQ423140
  • GENBANK/DQ423141
  • GENBANK/DQ423142
  • GENBANK/DQ423143
  • GENBANK/DQ423144
  • GENBANK/DQ423145
  • GENBANK/DQ423146
  • GENBANK/DQ423147
  • GENBANK/DQ423148
  • GENBANK/DQ423149
  • GENBANK/DQ423150
  • GENBANK/DQ423151
  • GENBANK/DQ423152
  • GENBANK/DQ423153
  • GENBANK/DQ423154
  • GENBANK/DQ423155
  • GENBANK/DQ423156
  • GENBANK/DQ423157
  • GENBANK/DQ423158
  • GENBANK/DQ423159
  • GENBANK/DQ423160
  • GENBANK/DQ423161
  • GENBANK/DQ423162
  • GENBANK/DQ423163
  • GENBANK/DQ423164
  • GENBANK/DQ423165
  • GENBANK/DQ423166
  • GENBANK/DQ423167
  • GENBANK/DQ423168
  • GENBANK/DQ423169
  • GENBANK/DQ423170
  • GENBANK/DQ423171
  • GENBANK/DQ423172
  • GENBANK/DQ423173
  • GENBANK/DQ423174
  • GENBANK/DQ423175
  • GENBANK/DQ423176
  • GENBANK/DQ423177
  • GENBANK/DQ423178
  • GENBANK/DQ423179
  • GENBANK/DQ423180
  • GENBANK/DQ423181
  • GENBANK/DQ423182
  • GENBANK/DQ423183
  • GENBANK/DQ423184
  • GENBANK/DQ423185
  • GENBANK/DQ423186
  • GENBANK/DQ423187
  • GENBANK/DQ423188
  • GENBANK/DQ423189
  • GENBANK/DQ423190
  • GENBANK/DQ423191
  • GENBANK/DQ423192
  • GENBANK/DQ423193
  • GENBANK/DQ423194
  • GENBANK/DQ423195
  • GENBANK/DQ423196
  • GENBANK/DQ423197
  • GENBANK/DQ423198
  • GENBANK/DQ423199
  • GENBANK/DQ423200
  • GENBANK/DQ423201
  • GENBANK/DQ423202