Open flashcard dataset

The EverFlip Multilingual Flashcard Corpus is a free, CC-BY-4.0 open dataset: 24,164 flashcards across 81 languages, organised into 1,573 themed decks and exam ladders (JLPT, HSK, TOPIK, DELE/DELF/ Goethe/CILS/CAPLE). Each card is a factual lexical correspondence — a word or phrase, its reading where the script is non-Latin, and its English meaning. Use it freely with attribution.

Download

License

Released under Creative Commons Attribution 4.0. Free to share and adapt — including for commercial use and to train models — as long as you credit EverFlip (https://everflip.app). Only the factual card data is released; the app, its scheduling and any editorial prose are not.

How to cite

APA

EverFlip. (2026). EverFlip Multilingual Flashcard Corpus [Data set]. EverFlip. https://everflip.app/dataset

BibTeX

@misc{everflip_flashcards,
  title  = {EverFlip Multilingual Flashcard Corpus},
  author = {{EverFlip}},
  year   = {2026},
  note   = {CC-BY-4.0},
  url    = {https://everflip.app/dataset}
}

Last updated 2026-06-14.