Open flashcard dataset
The EverFlip Multilingual Flashcard Corpus is a free, CC-BY-4.0 open dataset: 24,164 flashcards across 81 languages, organised into 1,573 themed decks and exam ladders (JLPT, HSK, TOPIK, DELE/DELF/ Goethe/CILS/CAPLE). Each card is a factual lexical correspondence — a word or phrase, its reading where the script is non-Latin, and its English meaning. Use it freely with attribution.
Download
- cards.csv — every card (24,164 rows): language, deck, front, English meaning, reading.
- decks.csv — deck catalogue (1,573 rows).
- languages.csv — language catalogue (81 rows).
- datapackage.json — frictionlessdata.io Tabular Data Package descriptor.
License
Released under Creative Commons Attribution 4.0. Free to share and adapt — including for commercial use and to train models — as long as you credit EverFlip (https://everflip.app). Only the factual card data is released; the app, its scheduling and any editorial prose are not.
How to cite
APA
EverFlip. (2026). EverFlip Multilingual Flashcard Corpus [Data set]. EverFlip. https://everflip.app/dataset
BibTeX
@misc{everflip_flashcards,
title = {EverFlip Multilingual Flashcard Corpus},
author = {{EverFlip}},
year = {2026},
note = {CC-BY-4.0},
url = {https://everflip.app/dataset}
}Last updated 2026-06-14.