20MCA265 Natural Language Processing syllabus for MCA



A d v e r t i s e m e n t

Module-1 Introduction 0 hours

Introduction, Morphology:

Knowledge in Speech & Lang Processing, Ambiguity, Models & Algorithms, Language, Thought & Understanding, Some Brief History, The State of the Art & Near-Term Future, Summary Morphology and Finite State Transducers: Survey of English Morphology, Finite state Morphological Parsing, Lexicon-Free FST: The Porter Stemmer, Human Morphological Parsing, Summary, Combining FST Lexicon and Rules.

Module-2 N-Grams 0 hours

N-Grams:

Counting Words in Corpora, Simple N-Grams, Smoothing, Back off, Deleted Interpolation, N-Grams for Spelling and Pronunciation, Entropy, Summary. Word Classes and Part-of- Speech Tagging: English Word Classes, Tag sets for English, Part-of-Speech Tagging.

Module-3 Context-Free Grammars and Predicate Calculus for English 0 hours

Context-Free Grammars and Predicate Calculus for English:

Constituency, Context-Free Rules and Trees, Sentence Level Constructions, Coordination, Agreement, The Verb Phrase Sub Categorization, Auxiliaries, Spoken Language Syntax, Grammar Equivalence and Normal Form, Finite –State and Context- Free Grammars, Grammars and Human Processing, The Early Algorithm, Finite-State Parsing Method, Summary Representing Meaning:

Module-4 Semantic Analysis 0 hours

Semantic Analysis:

Syntax-Driven Semantic Analysis, Attachments for a Fragment of English, Integrating Semantic Analysis into the Earley Parser, Idioms and Compositionality, Robust Semantic Analysis, Summary. Lexical Semantics: Relations Among Lexemes and Their Senses, WordNet: A Database of Lexical Relations, The Internal Structure of Words, Creativity and the Lexicon, Summary Word Sense Disambiguation and Information

Module-5 Retrieval 0 hours

Retrieval:

Selection Restriction Based Disambiguation, Robust Word Sense Disambiguation, Information Retrieval, Other Retrieval Tasks, and Summary. Case Study of Simple Text Recognition or Content Based Text Extraction System. Evolving Explanatory Novel Patterns for Semantically-Based Text Mining: Related Work, A Semantically Guided Model for Effective Text Mining.

 

Question Paper Pattern:

• The Question paper will have TEN questions

• Each full question will be for 20 marks • There will be 02 full questions (with maximum of four sub questions) from each module.

• Each full question will have sub questions covering all the topics under a module.

• The students will have to answer FIVE full questions, selecting one full question from each module.

 

Text books

1.DanielJurafsky and James H Martin, “Speech and Language Processing: An introduction to Natural Language Processing, Computational Linguistics and Speech Recognition”, 2nd Edition, Prentice Hall, 2009.

 

References

1. Christopher D.Manning and HinrichSchutze, “Foundations of Statistical Natural LanguageProcessing”, MIT Press, 1999.

2.TanveerSiddiqui, U.S. Tiwary, “Natural Language Processing and Information Retrieval”, Oxford University Press, 2008.

3.Anne Kao and Stephen R. Poteet (Eds), “Natural Language Processing and Text Mining”, Springer Verlag London Limited 2007.

 

Course Outcomes:

CO1: Apply parsing technique to the given problem and verify the output and give valid conclusions

CO2: Illustrate the approaches to syntax and semantics in NLP.

CO3: Formulate solutions for a range of natural language components using existing algorithms, techniques and frameworks, including part-of-speech tagging, language modelling, parsing and semantic role labelling.

CO4. Evaluate NLP solutions of the given problem and arrive at valid conclusions.

CO5: Illustrate information retrieval techniques.

Last Updated: Tuesday, January 24, 2023