This project only contains our releases of processed kanji data.
The processed data files released by the PikaPikaGems team are licensed under CC BY-SA 4.0.
- Dmitry Shpika's projects under CC BY 4.0:
- Kanji Keys
- TopoKanji that used data from:
- KanjiVG project
- CJK Decompositions Data project
- Kanji Frequency project (see below)
- Kanji Frequency
- Redditor Nukemarine's data:
- David Gouveia's Kanji Data project
- Jouyou kanji list with relevant data from:
- Shirabe Jisho's data:
- Kanji lists by JLPT
- Common Words list
- from EDRDG's projects and Jonathan Waller's JLPT resources
- WaniKani (Terms of Service)
- Drew Edwards' Kanji School project
- kanjiapi.dev which uses data from:
- Usagi Chan Kanji Phonetics Deck by shoui520
- JmdictFurigana project under CC BY-SA 4.0
- Netflix Japanese Frequency List by OhTalkWho オタク (Dave Doebrick)
- Chris Kempson's Japanese Subtitles Word & Kanji Frequency Lists project under MIT
- Patrick Kandrac's 2242 Kanji Frequency List (1, 2) which sources data from:
- Kouji Shibano's Google Kanji Data
- Kanji Usage Frequency (KUF)
- Matsushita's Character Database (MCD)
- Japanese Agency for Cultural Affairs (文化庁)
- Alexandre Girardi's word frequency list
- public domain (see Girardi section in Monash FTP Archive)
- kanjidatabase.com
- Alex Yatskov's Wikipedia Kanji Frequency Report
Data owned by Electronic Dictionary Research and Development Group such as KANJIDIC are used under the Group's license.
Jonathan Waller's JLPT resources are licensed under CC BY 4.0.