Skip to main content
Article

Recurrent neural network based language model

Tomáš MikolovBrno University of Technology, Brno, CzechiaMartin KarafiátBrno University of Technology, Brno, CzechiaLukáš BurgetBrno University of Technology, Brno, CzechiaJaň ČernockýSanjeev KhudanpurJohns Hopkins University, Baltimore, United States
2010en
ABI

Abstract

A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results indicate that it is possible to obtain around 50% reduction of perplexity by using mixture of several RNN LMs, compared to a state of the art backoff language model. Speech recognition experiments show around 18% reduction of word error rate on the Wall Street Journal task when comparing models trained on the same amount of data, and around 5% on the much harder NIST RT05 task, even when the backoff model is trained on much more data than the RNN LM. We provide ample empirical evidence to suggest that connectionist language models are superior to standard n-gram techniques, except their high computational (training) complexity. Index Terms: language modeling, recurrent neural networks, speech recognition

Identifiers

Citations and references

Cited by 40 references