My favorites | Sign in
Project Home Downloads Wiki Issues Source
Search
for
KanjiStats  
Updated Jul 23, 2011 by himse...@gmail.com

Parses text files (books) and builds a list of used kanji by frequency. It's possible to exclude already known kanjis from the list.

Syntax

kanjistats.exe <file1> [file2] [...] [-k known_file] [-o output_file] [-v]
-k   Kanjis from this file will be considered learned, and ignored.
-o   Output file. If none specified, outputs to console.
-v   Enable verbose output (kanji=occurences).

Purpose

Vocabularies of different authors differ greatly and generic "kanji frequency lists" found on the internet are often built from newspaper articles, mostly of political and economical profile. Not much help when you want to read this one specific book which has nothing to do with economy!

Feed the book to this tool and you'll get exactly the kanjis you need to learn to quickly improve your understanding of the text.

Note

Books must be in UTF16LE encoding.


Sign in to add a comment
Powered by Google Project Hosting