Introduction, Morphology: Knowledge in Speech & Lang Processing, Ambiguity, Models & Algorithms, Language, Thought & Understanding, Some Brief History, The State of the Art & Near-Term Future, Summary Morphology and Finite State Transducers: Survey of English Morphology, Finite state Morphological Parsing, Lexicon-Free FST: The Porter Stemmer, Human Morphological Parsing, Summary, Combining FST Lexicon and Rules.
N-Grams:
Counting Words in Corpora, Simple N-Grams, Smoothing, Back off, Deleted Interpolation, N-Grams for Spelling and Pronunciation, Entropy, Summary. Word Classes and Part-of- Speech Tagging: English Word Classes, Tag sets for English, Part-of-Speech Tagging.
Context-Free Grammars and Predicate Calculus for English:
Constituency, Context-Free Rules and Trees, Sentence Level Constructions, Coordination, Agreement, The Verb Phrase Sub Categorization, Auxiliaries, Spoken Language Syntax, Grammar Equivalence and Normal Form, Finite –State and Context- Free Grammars, Grammars and Human Processing, The Early Algorithm, Finite-State Parsing Method, Summary Representing Meaning
Semantic Analysis:
Syntax-Driven Semantic Analysis, Attachments for a Fragment of English, Integrating Semantic Analysis into the Earley Parser, Idioms and Compositionality, 02.03.2021 updated 49/ 104 Robust Semantic Analysis, Summary. Lexical Semantics: Relations Among Lexemes and Their Senses, WordNet: A Database of Lexical Relations, The Internal Structure of Words, Creativity and the Lexicon, Summary Word Sense Disambiguation and Information.
Retrieval:
Selection Restriction Based Disambiguation, Robust Word Sense Disambiguation, Information Retrieval, Other Retrieval Tasks, and Summary. Case Study of Simple Text Recognition or Content Based Text Extraction System. Evolving Explanatory Novel Patterns for Semantically-Based Text Mining: Related Work, A Semantically Guided Model for Effective Text Mining.
Assessment Details (both CIE and SEE)
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 Marks or one Skill Development Activity of 40 marks to attain the COs and POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome defined for the course.
Semester End Examination:
1. The SEE question paper will be set for 100 marks and the marks scored will be proportionately reduced to 50.
2. The question paper will have ten full questions carrying equal marks.
3. Each full question is for 20 marks. There will be two full questions (with a maximum of four sub-questions) from each module.
4. Each full question will have a sub-question covering all the topics under a module.
5. The students will have to answer five full questions, selecting one full question from each module
Suggested Learning Resources:
Text Books:
1.DanielJurafsky and James H Martin, “Speech and Language Processing: An introduction to Natural Language Processing, Computational Linguistics and Speech Recognition”, 2nd Edition, Prentice Hall, 2009.
Reference books:
1. Christopher D.Manning and HinrichSchutze, “Foundations of Statistical Natural LanguageProcessing”, MIT Press, 1999.
2.TanveerSiddiqui, U.S. Tiwary, “Natural Language Processing and Information Retrieval”, Oxford University Press, 2008.
3.Anne Kao and Stephen R. Poteet (Eds), “Natural Language Processing and Text Mining”, Springer Verlag London Limited 2007.
Skill Development Activities Suggested
Course outcome (Course Skill Set)
At the end of the course the student will be able to :
CO1 Apply parsing technique to the given problem and verify the output and give valid conclusions. L2
CO2 Illustrate the approaches to syntax and semantics in NLP. L2
CO3 Formulate solutions for a range of natural language components using existing algorithms, techniques and frameworks, including part-of-speech tagging, language modelling, parsing and semantic role labelling. L2 CO4 Evaluate NLP solutions of the given problem and arrive at valid conclusions. L3
CO5 Illustrate information retrieval techniques. L2
Program Outcome of this course
1 Engineering knowledge: Apply the knowledge of mathematics, science, engineering fundamentals, and computer science and business systems to the solution of complex engineering and societal problems. PO1
2 Problem analysis: Identify, formulate, review research literature, and analyze complex engineering and business problems reaching substantiated conclusions using first principles of mathematics, natural sciences, and engineering sciences. PO2
3 Design/development of solutions: Design solutions for complex engineering problems and design system components or processes that meet the specified needs with appropriate consideration for the public health and safety, and the cultural, societal, and environmental considerations. PO3
4 Conduct investigations of complex problems: Use research-based knowledge and research methods including design of experiments, analysis and interpretation of data, and synthesis of the information to provide valid conclusions. PO4
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and modern engineering and IT tools including prediction and modeling to complex engineering activities with an understanding of the limitations PO5
6 The engineer and society: Apply reasoning informed by the contextual knowledge to assess societal, health, safety, legal and cultural issues and the consequent responsibilities relevant to the professional engineering and business practices. PO6
7 Environment and sustainability: Understand the impact of the professional engineering solutions in business societal and environmental contexts, and demonstrate the knowledge of, and need for sustainable development. PO7
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and norms of the engineering and business practices. PO8
9 Individual and team work: Function effectively as an individual, and as a member or leader in diverse teams, and in multidisciplinary settings. PO9
10 Communication: Communicate effectively on complex engineering activities with the engineering community and with society at large, such as, being able to comprehend and write effective reports and design documentation, make effective presentations, and give and receive clear instructions. PO10
11 Project management and finance: Demonstrate knowledge and understanding of the engineering, business and management principles and apply these to one‟s own work, as a member and leader in a team, to manage projects and in multidisciplinary environments. PO11
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage in independent and life-long learning in the broadest context of technological change. PO12