06IS842 Information Retrieval syllabus for IS


Part A
Unit-1 Introduction, Retrieval Strategies 1 7 hours

Introduction; Retrieval Strategies: Vector Space Model; Probabilistic Retrieval strategies

Unit-2 Retrieval Strategies 2 6 hours

Some More Retrieval Strategies: Language Models; Inference Networks; Extended Boolean Retrieval; Latent Semantic Indexing; Neural Networks; Genetic Algorithms; Fuzzy Set Retrieval.

Unit-3 Retrieval Utilities 7 hours

Relevance feedback; Clustering; Passage-Based Retrieval; N-Grams; Regression Analysis; Thesauri; Semantic Networks; Parsing.

Unit-4 Indexing and Searching 6 hours

Introduction; Inverted Files; Other indices for text; Boolean queries; Sequential searching; Pattern matching; Structural queries; Compression.

Part B
Unit-5 Cross-Language Information Retrieval and Efficiency 6 hours

Introduction; Crossing the language barrier; Cross-Language retrieval strategies; Cross language utilities. Duplicate Document Detection.

Unit-6 Integrating Structured Data and Text 6 hours

Review of the relational model; A historical progression; Information retrieval as a relational application; Semi-structured search using a relational schema; Multi-dimensional data model.

Unit-7 Parallel Information Retrieval, Distributed Information Retrieval 7 hours

Parallel text scanning; Parallel indexing; Clustering and classification; Large parallel systems; A theoretic model of distributed information retrieval; Web search; Result fusion; Peer-to-Peer information systems; Other architectures.

Unit-8 Multimedia IR 7 hours

Introduction; data modeling; Query languages; Spatial access methods; A general multimedia indexing approach; One-dimensional time series; Two-dimensional color images; Automatic picture extraction.

Last Updated: Tuesday, January 24, 2023