Houjun Liu

Language Information Index

What makes language modeling hard: resolving ambiguity is hard.

“the chef made her duck”

Contents

Basic Text Processing

Edit Distance

DP costs \(O(nm)\), backtrace costs \(O(n+m)\).

Ngrams

Text Classification

Logistic Regression

Information Retrial

Ranked Information Retrial

Vector Semantics

POS and NER

Dialogue Systems

Recommender Systems

Dora

Neural Nets

The Web