Universität des Saarlandes
 

Spoken Language Systems

   
 

Statistical Natural Language Processing

Vorlesung:
Leitung: Dietrich Klakow
Location: Seminarraum  Building C7 2  
Time: Fr 8.30 - 10 Uhr (can be shifted if all participants agree)     
Starts: 25.04.2008
Geeignet für: M.Sc.

Exercises:
Tutor: Grzegorz Chrupala
Location and Time : Monday 16:15 in conference room 2.11
Starts: 19.5.2008

How does a search engine find the right web page? How can information be extracted from e-mails. The lecture will introduce statistical methods to process natural language. Also common themes like the design of classifiers will be treated.
The lecture covers the following topics:

  • language processing: basic terms
  • mathematical foundations
  • word sense disambiguation
  • part-of-speech tagging
  • named-entity recognition
  • statistical methods in information retrieval
  • text classification
Note: in the first lecture, we decided on a revised list of topics. This new list can be found on the last slide of chapter 1.

Slides
Chapter 1: Overview pdf
Chapter 2 Natural Language pdf
Chapter 3 Basics of Language Modeling pdf
Chapter 4 Entropy pdf
Chapter 5 Backing-Off Language Models pdf
Chapter 6 Text Classification pdf
Chapter 7 Word Sense Disambiguation pdf
Chapter 8 Information Retrieval pdf
Chapter 9 Named Entity Tagging pdf
Chapter 10 Topic Detection and Tracking pdf

Exercises

Exercise 1
Exercise 2
Exercise 3
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8