Why flashcards work for kids — and when they don't

A research-backed guide for parents using picture flashcards with young children, with a focus on multilingual households.


Flashcards have a polarizing reputation in early childhood education. Some parents swear by them; some pediatricians warn against them. The truth, supported by 30+ years of developmental and cognitive-science research, is more nuanced: picture-based flashcards can produce real, durable learning in young children — but only under specific conditions. This article walks through what the evidence actually says, with a focus on multilingual households where parents want to teach the same concepts in two, three, or four languages.

When does picture-word learning actually work?

The first question is whether young children can even understand that a picture stands for something in the real world. The answer turns out to depend on age, and the developmental window is well-mapped.

By 18 to 24 months, toddlers shown a picture of a novel object (e.g., a "whisk") and taught its label will reliably map that label onto the real object, not the picture itself (Preissler & Carey, 2004, Journal of Cognition and Development). This is referential learning, not rote pairing — the child has understood that the picture represents a kind of thing in the world. Earlier work showed that even at 15 months children begin extending newly learned nouns flexibly across colors and from 2D pictures to 3D objects (Geraghty, Waxman & Gelman, 2014, Cognitive Development), though the evidence at the very youngest end (12–15 months) is weaker and partly contested (Ganea, Allen, Butler, Carey & DeLoache, 2009, Journal of Experimental Child Psychology).

By around two and a half years, children "very readily appreciate the relation between a picture and its referent" (DeLoache, 1991, Child Development). Pictures, importantly, are easier symbolic stimuli for young children than scale models or other 3D representations, because pictures don't impose the dual-representation burden of being interesting objects in their own right (DeLoache, 2004, Trends in Cognitive Sciences00334-6)).

Bottom line: picture-word flashcards become genuinely productive — not just associative — somewhere between 18 and 30 months, and consolidate from age 2.5 onward. For toddlers under 18 months, treat what's happening as exposure rather than acquisition. For 2-7 year-olds, the mechanism is sound.

Spaced retrieval beats spaced study (yes, even for preschoolers)

The second question is whether the spaced repetition logic that powers adult flashcard apps works for young children. The recent literature is unambiguous: repeated retrieval at spaced intervals is the active ingredient, not repeated viewing.

A 2020 review covering vocabulary intervention research concluded that retrieval-based and spaced practice "can enhance word learning" and applies "both within and across sessions to support encoding and retention" in young children (Gordon, 2020, Language, Speech & Hearing Services in Schools).

The cleanest experimental evidence comes from Leonard and colleagues. In one study, repeated spaced retrieval of new words produced greater recall of word form and meaning than either repeated study or repeated immediate retrieval — and crucially, the benefits generalized to new pictures of the same word, indicating real comprehension rather than rote pairing. The effect held in both typically developing preschoolers and children with developmental language disorder (Leonard & Deevy, 2020, Journal of Speech, Language, and Hearing Research).

A follow-up study showed that combining a quick immediate retrieval check with subsequent spaced retrievals produced accuracy that was "very stable" across a one-week interval, while immediate-only retrieval decayed substantially: "Immediate retrieval boosts the phonetic accuracy of new words in the short term but spaced retrieval promotes stability" (Leonard, Kueser, Deevy, Haebig, Karpicke & Weber, 2022, Autism & Developmental Language Impairments).

One scope caveat from this literature: the strongest evidence is concentrated on children aged 4–5. Direct empirical findings for ages 2–3 are weaker. For toddlers, the underlying mechanism almost certainly works the same way — but the published intervention studies haven't extensively tested those ages yet.

Should you teach the same word in multiple languages?

For multilingual families, this is the central question. The research is reassuring and clear: bilingual and multilingual children build parallel lexicons from the very start, and acquiring the same concept in multiple languages is the norm, not a problem.

Pearson, Fernández, and Oller followed 27 English–Spanish bilingual infants from 8 to 30 months. They found that translation equivalents — the same concept learned in both languages — were observed in 26 of 27 children, averaging 30% of the child's vocabulary. This held both at the very early stages (the first 2–12 words) and continuing up to 500-word vocabularies (Pearson, Fernández & Oller, 1995, Journal of Child Language). This finding refuted an older "unitary language hypothesis" that bilingual children supposedly start with one mixed system and only later differentiate. They don't. They build two from the beginning.

A second result from the same research group showed that simultaneous bilingual children at 8–30 months had no statistically detectable disadvantage in early vocabulary growth compared to monolinguals when both languages are counted together. The authors specifically recommended that vocabulary assessment "reference performance in both languages together rather than just one" (Pearson, Fernández & Oller, 1993, Language Learning).

How do bilingual children manage two languages without confusion? They actively control them in real time. A 2017 study using eye-tracking showed that 20-month-old simultaneous bilinguals "use language-control mechanisms to preferentially activate the currently heard language during listening" — processing similarly to bilingual adults (Byers-Heinlein, Morin-Lessard & Lew-Williams, 2017, PNAS). Children manage their languages; they don't muddle them.

The number of languages also reshapes how children approach new words. Monolingual children strongly use the mutual exclusivity heuristic — when they hear a new word, they assume it refers to a new object, since they already have a label for everything familiar. But this heuristic weakens as exposure to more languages grows: bilingual 17–18-month-olds show only marginal mutual exclusivity, and trilingual children show none at all (Byers-Heinlein & Werker, 2009, Developmental Science). In other words, children growing up with three or more languages already expect multiple labels for the same thing. A trilingual or quadrilingual flashcard practice doesn't fight against their cognitive grain — it works with it.

Practical implication for households running 3+ languages: show the same picture, switch the language. Don't force one-language-at-a-time silos that contradict the heuristics multilingual kids already use.

What words to start with

The MacArthur-Bates Communicative Development Inventories (CDI) are the gold-standard instruments for tracking early vocabulary, with adaptations in dozens of languages. Aggregated CDI data across thousands of children — including English, Mandarin, Japanese, French, and Spanish samples — show consistent patterns in what young children learn first (Wordbank database, Frank, Braginsky, Yurovsky & Marchman, 2021, Variability and Consistency in Early Language Learning: The Wordbank Project, MacArthur-Bates CDI norms).

Across languages, the canonical first-vocabulary categories for ages roughly 18–30 months consistently include:

A picture-flashcard app for very young children should weight content toward the first five categories, with numbers and colors as Level 2 or 3 content rather than starter material.

Parent involvement matters

There's a strong literature showing that parent-led, interactive engagement with reading material — particularly the dialogic reading technique developed by Whitehurst and colleagues — produces robustly better language outcomes than passive consumption (Whitehurst et al., 1988; Mol, Bus & de Jong, 2008, Review of Educational Research via meta-analysis; Reese et al., 2010). The active ingredients are: the parent following the child's lead, asking open questions ("What do you see?"), expanding the child's response, and praising attempts.

While these studies focus on shared book reading rather than flashcards specifically, the mechanism — turning a passive viewing experience into an interactive language exchange — is the same. A flashcard session where the parent shows the card, prompts the child, listens, and responds with warmth has more in common with dialogic reading than with rote drilling. A child alone with a screen is not what the literature endorses.

Session length and frequency

This is one area where direct primary-source evidence specific to flashcards in early childhood is thin. The deep-research workflow that generated this article flagged it as an open question. Drawing on adjacent attention-span and cognitive-load literature, the consensus practitioner guidance is:

The pattern matters more than the duration: short, frequent, positive sessions beat long, occasional, frustrating ones. A 3-year-old who finishes 10 cards smiling has learned more than one who reluctantly completes 30.

Honest limitations: when flashcards are the wrong tool

Not everyone in early childhood education endorses flashcards, and the critique deserves to be heard. The Montessori tradition has long argued that direct concrete experience — handling real objects, observing in the natural environment — should precede symbolic representation in early childhood. The argument is that introducing decontextualized pictures too early can encourage rote labeling without conceptual understanding (Montessori practitioner literature on rote memorization, Montessori "from concrete to abstract" framing).

The cognitive-science community has produced more nuanced critiques of educational apps generally, focusing on whether they actually deliver active, engaging, meaningful, and socially-interactive learning — the four pillars of evidence-based design (Hirsh-Pasek et al., 2015, "Putting Education in Educational Apps"). Flashcards delivered as solo-screen-time content fail on social interaction. Flashcards as a parent-mediated interaction designed to spark conversation pass that test.

The honest take: flashcards are not a silver bullet, they cannot replace lived experience or play, and for children under 18 months, evidence for productive symbolic learning is weak. They are one useful tool among many — strongest when used with a present, engaged adult, on top of (not instead of) play, books, songs, and real-world experience.


Practical guidance for parents

Drawn from the research above, distilled into rules of thumb for daily use:


References

  1. Byers-Heinlein, K., Morin-Lessard, E., & Lew-Williams, C. (2017). Bilingual infants control their languages as they listen. Proceedings of the National Academy of Sciences, 114(34), 9032–9037. https://www.pnas.org/doi/abs/10.1073/pnas.1703220114
  2. Byers-Heinlein, K., & Werker, J. F. (2009). Monolingual, bilingual, trilingual: Infants' language experience influences the development of a word-learning heuristic. Developmental Science, 12(5), 815–823. https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-7687.2009.00902.x
  3. DeLoache, J. S. (1991). Symbolic functioning in very young children: Understanding of pictures and models. Child Development, 62(4), 736–752. https://academic.oup.com/chidev/article-abstract/62/4/736/8293693
  4. DeLoache, J. S. (2004). Becoming symbol-minded. Trends in Cognitive Sciences, 8(2), 66–70. https://www.cell.com/trends/cognitive-sciences/abstract/S1364-6613(03)00334-6
  5. Frank, M. C., Braginsky, M., Yurovsky, D., & Marchman, V. A. (2021). Variability and Consistency in Early Language Learning: The Wordbank Project. MIT Press. https://langcog.github.io/wordbank-book/ · https://wordbank.stanford.edu/ · https://mb-cdi.stanford.edu/
  6. Ganea, P. A., Allen, M. L., Butler, L., Carey, S., & DeLoache, J. S. (2009). Toddlers' referential understanding of pictures. Journal of Experimental Child Psychology, 104(3), 283–295. https://www.sciencedirect.com/science/article/pii/S0022096509001301
  7. Geraghty, K., Waxman, S. R., & Gelman, S. A. (2014). Learning words from pictures: 15- and 17-month-old infants appreciate the referential and symbolic links among words, pictures, and objects. Cognitive Development, 32, 1–11. https://www.sciencedirect.com/science/article/pii/S0885201414000458
  8. Gordon, K. R. (2020). The advantages of retrieval-based and spaced practice: Implications for word learning in clinical and educational contexts. Language, Speech, and Hearing Services in Schools, 51(4), 955–965. https://pubs.asha.org/doi/abs/10.1044/2020_LSHSS-19-00001
  9. Hirsh-Pasek, K., Zosh, J. M., Golinkoff, R. M., Gray, J. H., Robb, M. B., & Kaufman, J. (2015). Putting education in "educational" apps: Lessons from the science of learning. Psychological Science in the Public Interest, 16(1), 3–34. https://docslib.org/doc/11851924/putting-education-in-educational-apps-lesson-for-the-science-of-learning
  10. Leonard, L. B., & Deevy, P. (2020). Retrieval practice and word learning in children with specific language impairment and their typically developing peers. Journal of Speech, Language, and Hearing Research, 63(10), 3252–3262. https://pubs.asha.org/doi/abs/10.1044/2020_JSLHR-20-00006
  11. Leonard, L. B., Kueser, J. B., Deevy, P., Haebig, E., Karpicke, J. D., & Weber, C. (2022). The contributions of immediate retrieval and spaced retrieval to word learning in preschoolers with developmental language disorder. Autism & Developmental Language Impairments, 7. https://journals.sagepub.com/doi/abs/10.1177/23969415221077652
  12. Mol, S. E., Bus, A. G., de Jong, M. T., & Smeets, D. J. H. (2008). Added value of dialogic parent-child book readings: A meta-analysis. Early Education and Development, 19(1), 7–26. https://pubmed.ncbi.nlm.nih.gov/18194022/
  13. Pearson, B. Z., Fernández, S. C., & Oller, D. K. (1993). Lexical development in bilingual infants and toddlers: Comparison to monolingual norms. Language Learning, 43(1), 93–120. https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-1770.1993.tb00174.x
  14. Pearson, B. Z., Fernández, S. C., & Oller, D. K. (1995). Cross-language synonyms in the lexicons of bilingual infants: One language or two? Journal of Child Language, 22(2), 345–368. https://www.cambridge.org/core/journals/journal-of-child-language/article/crosslanguage-synonyms-in-the-lexicons-of-bilingual-infants-one-language-or-two/5F47F33EEDA12AA7A88493B1128841C1
  15. Preissler, M. A., & Carey, S. (2004). Do both pictures and words function as symbols for 18- and 24-month-old children? Journal of Cognition and Development, 5(2), 185–212. https://www.tandfonline.com/doi/abs/10.1207/s15327647jcd0502_2
  16. Reese, E., Sparks, A., & Leyva, D. (2010). A review of parent interventions for preschool children's language and emergent literacy. Journal of Early Childhood Literacy, 10(1), 97–117. https://pubmed.ncbi.nlm.nih.gov/20633892/
  17. Whitehurst, G. J., Falco, F. L., Lonigan, C. J., Fischel, J. E., DeBaryshe, B. D., Valdez-Menchaca, M. C., & Caulfield, M. (1988). Accelerating language development through picture book reading. Developmental Psychology, 24(4), 552–559. https://pubmed.ncbi.nlm.nih.gov/29595311/

Last updated 2026-05-31. Article generated by a multi-agent research workflow with 3-vote adversarial verification: 23 of 25 verified claims confirmed against primary sources, 2 refuted (claim about under-2 picture confusion being universal — overstated; claim about productive picture-word learning being reliable at 12-15 months — insufficient evidence). Where the verified evidence base is thin (notably session-length recommendations and the youngest 2–3-year-old end of the SRS literature), the article surfaces that limitation explicitly.


Picture credits. Card artwork uses OpenMoji — the open-source emoji and icon project — licensed under CC BY-SA 4.0.