How to use Free Software to learn Japanese, and more.

Basic Vocabulary

January 31, 2021 — Tatsumoto Ren

After finishing kanji, kana and essential grammar, the bulk of your AJATT journey will consist of learning vocabulary. As the first step in this process, it makes sense to go through a basic vocabulary deck containing the most frequent words in Japanese. As before, you are going to use Anki to do it.


Motivation behind studying basic vocabulary

Not all words are created equal. In Japanese, the 1,000 most frequently used words account for around 75% of all written text. Similar ratios exist for virtually all languages. If you prioritize the most vital words, the ones that you hear and see a lot, you can rapidly acquire a language.

After memorizing the 1,000 or 2,000 most frequently used words, learning vocabulary from immersion becomes easier. When reading, you'll be able to recognize the majority of words in a given sentence. You will still have to look up many words per page, but the learning process is going to take less effort.

Focus on understanding

On this site I recommend that you learn to understand Japanese through immersion before trying to speak it. Once you can understand the language, learning to speak doesn't take any effort.

The Anki deck I'm about to introduce below is going to help you understand the language. Cards in the deck are designed to test your recognition instead of forcing you to recall words from memory.

Words and sentences

Our preferred way to learn new words is by reading and understanding sentences. We believe it's the most natural approach. When immersing we encounter sentences and phrases much more than isolated words. Besides, understanding sentences is easier than understanding words. The additional context triggers the memory at times when the meaning of a single word would slip away.

Of course, it's not possible to start reading full sentences if you know no words yet. To work around this temporary hindrance we employ targeted sentence cards. A targeted sentence card is a flashcard that gives you context, but knowing the context is not mandatory to pass the flashcard. Only the target word is taken into account. So, if you know the context, it helps you out. If not, you treat the flashcard as an isolated word. This idea quickly starts working in your favor. Once you've learned even just a few hundred words, your comprehension expands substantially. As you're beginning to understand not just isolated words, any extra exposure you get from reviewing sentences in Anki reinforces your memory. Sentences help you better understand how the words are being used in speech, what roles they play in a sentence and how they connect with other words. You don't get any of these benefits if you use isolated vocab cards (word cards).

Although learning sentences is the best way to get familiar with how the language is used and how grammar structures are formed, in practice a new learner who starts with one of the premade sentence decks available on AnkiWeb will rarely find that every sentence introduces only a single unknown word. One way to deal with this is to make isolated word cards (vocabulary cards). But word cards have a different problem: they don't teach usage. From a word card it's impossible to learn how to use a word, what collocations it typically appears in, and where it's appropriate to use it. Targeted sentence cards (TSCs) bypass the problem by making the target word the only tested piece of knowledge while retaining the context and the ability to read the full sentence.

Anki decks

There are two main Anki decks that can be used to learn basic vocabulary.

Ankidrone Essentials

Ankidrone Essentials is the universally recommended Anki deck for newcomers to Japanese who want to quickly learn basic vocabulary before they start mining. Since the first release in February 2020, it has helped many people and received positive feedback.

Ankidrone Essentials is covered in this article. You can download it from there.

Core10k

Ankidrone Core10k is an extra deck that contains sentences from iKnow. Before Ankidrone Essentials was released in 2020, this deck was commonly used for the same purpose of learning basic vocabulary.

Core10k is a famous deck among Japanese learners. Its format is roughly similar to the Tango decks. Each card contains a target word within an example sentence on the front. You have readings, translation, word meanings and audio on the back.

The additional 10,000 sentences are definitely overkill for someone who has completed Ankidrone Essentials. I include Core10k here for reference. Use this deck to supplement your sentence bank in the sentence mining phase. You can refer to it to find example sentences. Don't study it from start to finish because that will take a lot of time. Note that the words this deck teaches largely overlap with Ankidrone Essentials.

Core10k can be downloaded from here.

How to study

When learning basic vocabulary from a premade Anki deck, the goal is to learn just enough words to become able to learn the rest through immersion alone. As soon as you feel you have learned enough to understand large chunks of native Japanese, for example when you watch anime or Japanese movies, it is time to put premade decks aside and focus on learning new words from immersion, creating new Anki cards from the sentences you find there.

Typically, after you download a premade deck, learn the first 1,000 to 2,000 cards from it. Then start sentence mining using TV shows with Japanese subtitles, and later manga and novels. While sentence mining, you can continue learning new cards from premade decks at a reduced pace. It shouldn't be a priority. It is critical not to spend too much time on beginner decks and to focus on mining instead. The higher the level you reach, the less benefit you will get from premade decks. Refer to premade decks when you have trouble finding example sentences.

How to review

  • When a flashcard comes up, read the target word (usually marked bold), or the full sentence if you want.
  • If the target word contains kanji, try to recall the reading of the word from memory.
  • Try to recall the general meaning of the target word.
  • Use the context as an aid to understand how the target word connects with other words.
  • Reveal the back of the card. Confirm whether you recalled the correct meaning (and reading if the word contains kanji).
  • Press "Good" if your guess is correct. Otherwise, press "Again".
  • Don't use the "Hard" and "Easy" buttons. Install AJT Flexible Grading to hide them.

For more detailed instructions see How to review.

Note: The meaning does not have to be precise. All you need to do is have a very basic understanding of what each word means. English translations are not enough to fully teach you the nuances of what a word means and how it is used. That level of understanding can only be achieved through immersion.

Tips

  • If you can't seem to remember a card, try Mortician. The add-on postpones difficult cards, preventing them from stealing your time.
  • Feel free to skip words that exist in languages you know. Examples include many katakana words, such as タクシー, エアコン, イクラ. Even though they're pronounced differently in your language, you learn them easily from immersion alone. To suspend a card, press @ on the keyboard.
  • Don't do too many new cards a day. At first, it may seem easy, but eventually Anki will overwhelm you with reviews. Our recommended range is 10 to 30 new cards a day.
  • Don't take English translations literally. The English translations of example sentences often don't match word-by-word. To understand their meanings in Japanese you need to know the underlying grammar structures. Studying sentences can't completely replace a grammar guide. If there are grammar patterns that trip you up, look them up in a grammar guide, a dictionary like Jisho.org, or on Google.
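
To get a feel for what the daily limit means in practice, here is a rough sketch that combines the 10 to 30 new cards a day from the tips above with the 1,000 to 2,000 cards suggested in "How to study". The function name days_to_finish is made up for this illustration, and the estimate only counts new cards. It does not model the reviews that Anki schedules on top of them.

```shell
# days_to_finish CARDS PER_DAY
# Prints how many days it takes to see CARDS new cards at PER_DAY new
# cards a day, rounding up. Reviews of older cards are not modelled.
days_to_finish() {
        echo $(( ($1 + $2 - 1) / $2 ))
}

days_to_finish 1000 30   # prints 34
days_to_finish 2000 10   # prints 200
```

So the first 1,000 to 2,000 cards take somewhere between about one month and about seven months, depending on the pace you choose.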

How many words do I need to learn from a premade deck?

To maximize the benefits that premade structured decks bring, we recommend that you learn 1,000 to 2,000 cards from a premade deck of your choice. Once you do it, the share of words you recognize in written Japanese goes from 0% to around 75% or more, which is enough to start learning on your own. From that point on, you can simply grab a book, watch a movie or do anything else you like doing in your native language and learn new vocabulary as it pops up.

If you want, continue learning new words from premade decks. They can be handy even for intermediate learners.

Is 1,000 words enough?

Knowing 75% of the words does not mean that you'll understand 75% of the sentences. In every sentence there will be 2 or 3 unknown words. You will understand something, but it's not a comfortable level. A comfortable level is when you understand everything, or at least 99%, so you'll have to continue learning new words for quite some time. Learning 1,000 words or even completing an entire Anki deck won't make you fluent.
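
To see why word coverage and sentence comprehension differ so much, here is a rough illustration. It assumes that a sentence has 10 words and that each word is known independently of the others with a probability equal to your word coverage. Both are simplifications made only for this sketch, and the function name sentence_chance is made up. Under these assumptions, 75% coverage gives 10 × 0.25 = 2.5 unknown words per sentence on average, which matches the "2 or 3" above.

```shell
# sentence_chance COVERAGE LENGTH
# Prints the chance (in percent) that every word of a LENGTH-word sentence
# is known, ASSUMING each word is known independently with probability
# COVERAGE. This is a toy model, not a measurement.
sentence_chance() {
        awk -v p="$1" -v n="$2" 'BEGIN { printf "%.1f\n", (p ^ n) * 100 }'
}

sentence_chance 0.75 10   # prints 5.6
sentence_chance 0.99 10   # prints 90.4
```

In this toy model, only about 1 sentence in 18 contains no unknown words at 75% coverage, while at 99% coverage about 9 in 10 do.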

How to measure comprehension?

If you want to know your real comprehension, you can do the following. Take an episode of an anime with Japanese subtitles. Watch it and make a list of all sentences that contained at least one unknown word. Then count the total number of sentences in the episode. Your comprehension is the number of sentences that are not on the list, divided by the total number of sentences.
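
The calculation can be sketched in shell like this. The function name comprehension and the numbers in the example are made up for illustration. Plug in your own counts.

```shell
# comprehension TOTAL UNKNOWN
# Prints the share of fully understood sentences as a percentage.
#   TOTAL   - number of sentences in the episode
#   UNKNOWN - number of sentences with at least one unknown word
comprehension() {
        awk -v total="$1" -v unknown="$2" 'BEGIN {
                if (total <= 0) exit 1          # avoid dividing by zero
                printf "%.1f\n", (total - unknown) / total * 100
        }'
}

# Example: 320 sentences in the episode, 96 of them are on your list.
comprehension 320 96   # prints 70.0
```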

Can't I build my own vocabulary deck?

You should! No premade deck can replace mining your own sentences. In fact, the example mining deck introduced earlier contains a few dozen example targeted sentence cards to show you what your deck should look like.

However, it's not a secret that making your own cards from native Japanese content is too difficult in the beginning. Premade decks exist to give beginners a shortcut to understanding native media and help them quickly reach a point where they can start building their own mining decks easily.

You will learn about sentence mining later in this guide.

Intermission

From the BCCWJ語彙表 data set mentioned at the beginning of the article we find the following.

Most frequently used N words    % of written Japanese
1,000                           75%
2,000                           80%
3,000                           85%
6,000                           90%
10,000                          93%
15,000                          95%
32,000                          98%
50,000                          99%

How do you calculate it?

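If you have downloaded the frequency list, you can calculate the share covered by the first N=1000 words with this shell snippet. The result is a fraction, so 0.75 means 75%.

N=1000; {
        # Drop the header line, keep the first N rows, and sum column 8
        # (the per-million-words frequency, hence the division below).
        sed "1d;$((N+1))q" BCCWJ_frequencylist_suw_ver1_0.tsv | cut -f 8 | awk '{s+=$1} END {print s}'
        # Append the divisor so that bc evaluates "<sum>/1000000".
        echo '/1000000'
} | paste -s -d '' | bc -l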
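
If you want several rows of the table at once, a single awk pass can keep a running sum and print it at each milestone. This is only a sketch. The function name coverage_table is made up, and it assumes the same layout as the snippet above: one header line, rows sorted from most to least frequent, and column 8 holding the frequency per million words.

```shell
# coverage_table FILE N...
# For each N, prints N and the share of the corpus covered by the first N
# words of FILE, in the order the milestones are reached.
coverage_table() {
        file=$1; shift
        awk -F '\t' -v limits="$*" '
                BEGIN {
                        count = split(limits, want, " ")
                        for (i = 1; i <= count; i++) wanted[want[i]] = 1
                }
                NR > 1 {
                        sum += $8       # running total of column 8
                        if ((NR - 1) in wanted)
                                printf "%d\t%.1f%%\n", NR - 1, sum / 1000000 * 100
                }
        ' "$file"
}

# Usage:
# coverage_table BCCWJ_frequencylist_suw_ver1_0.tsv 1000 2000 3000 6000 10000
```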

You're expected to progress very quickly in your first months of doing AJATT. Unfortunately, the reality is that we quickly hit the point of diminishing returns. The more words you learn from that point, the slower your comprehension grows.

The bright side is that you can use the numbers as milestones. Reaching each milestone is like winning a small game, and it makes learning your target language feel less like a routine.

Other Anki decks

The Resources page contains a Vocabulary section with Anki decks suggested by our members.

AnkiWeb has a wide variety of premade decks for Japanese. When browsing the catalog, I recommend you prefer decks that contain audio recordings and example sentences. It is important that the example sentences always appear on the front of the cards because it is easier to learn words when you see them in context. If you download an incorrectly formatted deck, make sure to fix the card template in settings.

A little trick you can do to filter results is search AnkiWeb with Google.

Tags: guide, vocab