Slide 1: Actively Transfer Domain Knowledge. Xiaoxiao Shi, Wei Fan, Jiangtao Ren. Sun Yat-sen University; IBM T. J. Watson Research Center. Transfer when you can, otherwise ask and…
n-gram Smoothing. Smoothing: take out some probability mass from seen n-grams and distribute it among unseen n-grams. Over 10 different smoothing techniques were proposed in the…
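The idea of moving probability mass from seen to unseen n-grams can be illustrated with a minimal sketch. It assumes bigrams, absolute discounting with a fixed discount, and uniform sharing of the freed mass among unseen words; the slide does not say which technique it has in mind, and the function name `discounted_bigram_probs` and the toy corpus are hypothetical.

```python
from collections import Counter, defaultdict


def discounted_bigram_probs(tokens, vocab, discount=0.5):
    """Return P(word | prev) for every prev that has at least one follower.

    Assumption (not from the slides): absolute discounting. Each seen
    bigram count is reduced by `discount`, and the freed probability mass
    is shared uniformly among the words never seen after `prev`.
    """
    follow = defaultdict(Counter)
    for prev, word in zip(tokens, tokens[1:]):
        follow[prev][word] += 1

    probs = {}
    for prev, counts in follow.items():
        total = sum(counts.values())
        unseen = [w for w in vocab if w not in counts]
        if unseen:
            # Mass taken from the seen bigrams of this history.
            freed = discount * len(counts) / total
            dist = {w: (c - discount) / total for w, c in counts.items()}
            for w in unseen:
                dist[w] = freed / len(unseen)
        else:
            # Every word was seen after prev: nothing to redistribute.
            dist = {w: c / total for w, c in counts.items()}
        probs[prev] = dist
    return probs


tokens = "the cat sat on the mat the cat ran".split()
vocab = sorted(set(tokens))
probs = discounted_bigram_probs(tokens, vocab)
```

In this toy corpus, "on" never follows "the", yet `probs["the"]["on"]` is positive, while `probs["the"]["cat"]` falls below its unsmoothed estimate of 2/3. Each conditional distribution still sums to 1.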
What Is a Language Model? A probability distribution over word sequences. Based on conditional probability distributions: probability of a word given its history (past words)…
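As a rough illustration of building a sequence probability from conditional word probabilities, the sketch below multiplies P(word | history) terms along a sequence. It assumes the history is truncated to the single previous word (a bigram approximation) and uses unsmoothed relative-frequency estimates; the helper names `train_bigram` and `sequence_prob` and the toy corpus are hypothetical.

```python
from collections import Counter


def train_bigram(tokens):
    """Return a function cond_prob(prev, word) estimated by relative frequency.

    Assumption (not from the slides): the history is cut to one word.
    """
    history_counts = Counter(tokens[:-1])
    bigram_counts = Counter(zip(tokens, tokens[1:]))

    def cond_prob(prev, word):
        if history_counts[prev] == 0:
            return 0.0
        return bigram_counts[(prev, word)] / history_counts[prev]

    return cond_prob


def sequence_prob(words, cond_prob):
    """Probability of words[1:] given words[0], as a product of conditionals."""
    p = 1.0
    for prev, word in zip(words, words[1:]):
        p *= cond_prob(prev, word)
    return p


tokens = "the cat sat on the mat the cat ran".split()
cond_prob = train_bigram(tokens)
p_seen = sequence_prob(["the", "cat", "sat"], cond_prob)
p_unseen = sequence_prob(["the", "ran"], cond_prob)
```

Here `p_seen` is the product P(cat | the) · P(sat | cat), and `p_unseen` is zero because "the ran" never occurs in the toy corpus, which is the situation smoothing is meant to address.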