|
Forelesninger: INF-3710
Sitting 1: 13th November 2007
Place: Lille Auditorium, Realfagsbygget
Time: 10:15 – 12:00 and 13:15 – 15:00
Total four hours and two chapters
Reading material:
This is an initial reading material list. Additional reading material will be provided during the sitting.
CHAPTER 1, INTRODUCTION TO INFORMATON RETRIEVAL (2 hours, 13th November, 10:15 – 12:00 at Lille Auditorium)
This chapter gives a two hour introduction to the topic of Information Retrieval. We examine the history of the field and explore an overview taxonomy of the different areas of Information Retrieval that will be covered in this course. In addition, user issues and the effect of these issues on information retrieval systems development are discussed as well as query expansion and relevance feedback. Finally the chapter ends with a discussion of Information Retrieval evaluation methodologies and evaluation measures, from the original Cranfield collection to the present day TREC/CLEF style evaluations.
- Background and History of Information Retrieval
- A Taxonomy of Information Retrieval
- Comparing Information Retrieval to Data Retrieval
- Users Issues in Information Retrieval
- Query Expansion & Relevance Feedback
- Evaluation Issues for Information Retrieval (incl. TREC)
CHAPTER 2, MANAGING STRUCTURED DATA (2 Hours, 13th November 2007, 13:15 – 15:00 at Lille Auditorium)
This chapter gives an overview of how to manage structured data. The rational for including this chapter is because the field of Information Retrieval typically operates over unstructured data, and it is important to compare this to the more familiar structured data retrieval as exists in relational databases and similar.
- Relational Database Review
- XML Review
- Comparison to Information Retrieval
Subsequent Sittings:
Following from the first sitting, the following chapters will be delivered over the course of the following two sittings.
CHAPTER 3, MANAGING UNSTRUCTURED DATA (2 Hours, 17th December 2007, from 12:15 – 14:00 at Lille Auditorium)
- Introduction to Hypertext & Hypermedia
- Examination of IR issues on the WWW
CHAPTER 4, CONTENT RETRIEVAL OF TEXT (4 Hours, 18th December 2007, from 09:15 – 11:00 and from 12:15 – 14:00 at Lille Auditorium)
- Nature of Text (how to index)
- Boolean IR (Boolean Architecture)
- Extended Models of Information Retrieval
CHAPTER 5, WWW SEARCH ENGINES (4 Hours, Most likely February 2008, TBA)
- The Nature of Search Engines
- Web Directories
- Search Engine Architecture
- Linkage-Based Search
- PageRank & Kleinberg’s Algorithm
- Spidering, Indexing and Interfaces
- Personalisation & Recommendation on the WWW
- Novelty & Summarisation
- Personalisation & Recommendation for Text data
CHAPTER 6, CONTENT RETRIEVAL OF NON-TEXT (6 Hours, Most likely February 2008, TBA)
- The Nature of Non-Text IR
- Audio IR
- Image IR (Content & Context)
- Video IR
- Personalisation & Recommendation for Multimedia Data
- LifeLogging Multimedia
Link to the course notes can be found at:
http://www.computing.dcu.ie/~cgurrin/ir101/
|