Jinho D. Choi edited this page Jan 12, 2017 · 2 revisions

Language Generation

  • Download the case-insensitive 2-grams.
  • Build a bigram model by estimating P(w_{i+1} | w_i) for each bigram.
  • Starting from the first word "dear", generate 10 sentences of length 20 in which every bigram probability P(w_{i+1} | w_i) is greater than a given threshold.
  • Try at least 5 different thresholds and compare the results.
  • Write a report covering your approach, results, and analysis. Use the ACL LaTeX template.
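
The steps above can be sketched as follows. This is a minimal illustration, not a reference solution: the function names `build_bigram_model` and `generate`, the in-memory `toy_counts` dictionary, and the choice to pick uniformly among the words that pass the threshold are all assumptions made for the example. The format of the downloaded 2-gram file is not specified here, so loading it is left out. Probabilities are estimated as count(w_i, w_{i+1}) divided by the total count of bigrams that start with w_i.

```python
import random
from collections import defaultdict


def build_bigram_model(bigram_counts):
    """Return {w1: {w2: P(w2 | w1)}} from {(w1, w2): count}.

    Words are lowercased so that counts are merged case-insensitively.
    """
    merged = defaultdict(lambda: defaultdict(int))
    for (w1, w2), count in bigram_counts.items():
        merged[w1.lower()][w2.lower()] += count

    model = {}
    for w1, followers in merged.items():
        total = sum(followers.values())
        model[w1] = {w2: c / total for w2, c in followers.items()}
    return model


def generate(model, first_word, length, threshold, rng):
    """Generate up to `length` words, using only bigrams with P > threshold.

    Assumption: the next word is chosen uniformly among the candidates.
    Generation stops early if no follower passes the threshold.
    """
    sentence = [first_word]
    while len(sentence) < length:
        followers = model.get(sentence[-1], {})
        candidates = sorted(w for w, p in followers.items() if p > threshold)
        if not candidates:
            break
        sentence.append(rng.choice(candidates))
    return sentence


# Hypothetical toy counts standing in for the downloaded 2-gram data.
toy_counts = {
    ("dear", "friend"): 3,
    ("Dear", "sir"): 1,
    ("friend", "dear"): 2,
    ("sir", "dear"): 1,
}
model = build_bigram_model(toy_counts)

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for threshold in (0.0, 0.2, 0.5, 0.8, 1.0):
        for _ in range(10):
            words = generate(model, "dear", 20, threshold, rng)
            print(threshold, " ".join(words))
```

A low threshold leaves many candidates at each step, while a high one can leave none, in which case this sketch returns a sentence shorter than 20 words. How to handle such dead ends (stop, back off, or restart) is a design choice worth discussing in the report.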

Submission

CS571: Natural Language Processing

Instructor


Emory University
