A little labeling goes a long way: Semi-supervised learning in infancy

Dev Sci. 2019 Jan;22(1):e12736. doi: 10.1111/desc.12736. Epub 2018 Sep 18.

Abstract

There is considerable evidence that labeling supports infants' object categorization. Yet in daily life, most of the category exemplars that infants encounter will remain unlabeled. Inspired by recent evidence from machine learning, we propose that infants successfully exploit this sparsely labeled input through "semi-supervised learning." Providing only a few labeled exemplars leads infants to initiate the process of categorization, after which they can integrate all subsequent exemplars, labeled or unlabeled, into their evolving category representations. Using a classic novelty preference task, we introduced 2-year-old infants (n = 96) to a novel object category, varying whether and when its exemplars were labeled. Infants were equally successful whether all exemplars were labeled (fully supervised condition) or only the first two exemplars were labeled (semi-supervised condition), but they failed when no exemplars were labeled (unsupervised condition). Furthermore, the timing of the labeling mattered: when the labeled exemplars were provided at the end, rather than the beginning, of familiarization (reversed semi-supervised condition), infants failed to learn the category. This provides the first evidence of semi-supervised learning in infancy, revealing that infants excel at learning from exactly the kind of input that they typically receive in acquiring real-world categories and their names.

Keywords: category learning; conceptual development; language acquisition; language and thought; semi-supervised learning.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Child, Preschool
  • Humans
  • Infant
  • Language Development
  • Learning
  • Supervised Machine Learning*