22MCA265 Natural Language Processing syllabus for MCA



A d v e r t i s e m e n t

Module-1 Introduction 0 hours

Introduction, Morphology: Knowledge in Speech & Lang Processing, Ambiguity, Models & Algorithms, Language, Thought & Understanding, Some Brief History, The State of the Art & Near-Term Future, Summary Morphology and Finite State Transducers: Survey of English Morphology, Finite state Morphological Parsing, Lexicon-Free FST: The Porter Stemmer, Human Morphological Parsing, Summary, Combining FST Lexicon and Rules.

Module-2 N-Grams 0 hours

N-Grams:

Counting Words in Corpora, Simple N-Grams, Smoothing, Back off, Deleted Interpolation, N-Grams for Spelling and Pronunciation, Entropy, Summary. Word Classes and Part-of- Speech Tagging: English Word Classes, Tag sets for English, Part-of-Speech Tagging.

Module-3 Context-Free Grammars and Predicate Calculus for English 0 hours

Context-Free Grammars and Predicate Calculus for English:

Constituency, Context-Free Rules and Trees, Sentence Level Constructions, Coordination, Agreement, The Verb Phrase Sub Categorization, Auxiliaries, Spoken Language Syntax, Grammar Equivalence and Normal Form, Finite –State and Context- Free Grammars, Grammars and Human Processing, The Early Algorithm, Finite-State Parsing Method, Summary Representing Meaning

Module-4 Semantic Analysis 0 hours

Semantic Analysis:

Syntax-Driven Semantic Analysis, Attachments for a Fragment of English, Integrating Semantic Analysis into the Earley Parser, Idioms and Compositionality, 02.03.2021 updated 49/ 104 Robust Semantic Analysis, Summary. Lexical Semantics: Relations Among Lexemes and Their Senses, WordNet: A Database of Lexical Relations, The Internal Structure of Words, Creativity and the Lexicon, Summary Word Sense Disambiguation and Information.

Module-5 Retrieval 0 hours

Retrieval:

Selection Restriction Based Disambiguation, Robust Word Sense Disambiguation, Information Retrieval, Other Retrieval Tasks, and Summary. Case Study of Simple Text Recognition or Content Based Text Extraction System. Evolving Explanatory Novel Patterns for Semantically-Based Text Mining: Related Work, A Semantically Guided Model for Effective Text Mining.

 

Assessment Details (both CIE and SEE)

  • The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%.
  • The minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the maximum marks of SEE.
  • A student shall be deemed to have satisfied the academic requirements and earned the credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.

 

Continuous Internal Evaluation:

1. Three Unit Tests each of 20 Marks

2. Two assignments each of 20 Marks or one Skill Development Activity of 40 marks to attain the COs and POs

The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks

CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome defined for the course.

 

Semester End Examination:

1. The SEE question paper will be set for 100 marks and the marks scored will be proportionately reduced to 50.

2. The question paper will have ten full questions carrying equal marks.

3. Each full question is for 20 marks. There will be two full questions (with a maximum of four sub-questions) from each module.

4. Each full question will have a sub-question covering all the topics under a module.

5. The students will have to answer five full questions, selecting one full question from each module

 

Suggested Learning Resources:

Text Books:

1.DanielJurafsky and James H Martin, “Speech and Language Processing: An introduction to Natural Language Processing, Computational Linguistics and Speech Recognition”, 2nd Edition, Prentice Hall, 2009.

 

Reference books:

1. Christopher D.Manning and HinrichSchutze, “Foundations of Statistical Natural LanguageProcessing”, MIT Press, 1999.

2.TanveerSiddiqui, U.S. Tiwary, “Natural Language Processing and Information Retrieval”, Oxford University Press, 2008.

3.Anne Kao and Stephen R. Poteet (Eds), “Natural Language Processing and Text Mining”, Springer Verlag London Limited 2007.

 

Skill Development Activities Suggested

  • The students with the help of the course teacher can take up technical –activities which will enhance their skill or the students should interact with industry (small, medium and large), understand their problems or foresee what can be undertaken for study in the form of research/testing/projects, and for creative and innovative methods to solve the identified problem. The prepared report shall be evaluated for CIE marks.

 

Course outcome (Course Skill Set)

At the end of the course the student will be able to :

CO1 Apply parsing technique to the given problem and verify the output and give valid conclusions. L2

CO2 Illustrate the approaches to syntax and semantics in NLP. L2

CO3 Formulate solutions for a range of natural language components using existing algorithms, techniques and frameworks, including part-of-speech tagging, language modelling, parsing and semantic role labelling. L2 CO4 Evaluate NLP solutions of the given problem and arrive at valid conclusions. L3

CO5 Illustrate information retrieval techniques. L2

 

Program Outcome of this course

1 Engineering knowledge: Apply the knowledge of mathematics, science, engineering fundamentals, and computer science and business systems to the solution of complex engineering and societal problems. PO1

2 Problem analysis: Identify, formulate, review research literature, and analyze complex engineering and business problems reaching substantiated conclusions using first principles of mathematics, natural sciences, and engineering sciences. PO2

3 Design/development of solutions: Design solutions for complex engineering problems and design system components or processes that meet the specified needs with appropriate consideration for the public health and safety, and the cultural, societal, and environmental considerations. PO3

4 Conduct investigations of complex problems: Use research-based knowledge and research methods including design of experiments, analysis and interpretation of data, and synthesis of the information to provide valid conclusions. PO4

5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and modern engineering and IT tools including prediction and modeling to complex engineering activities with an understanding of the limitations PO5

6 The engineer and society: Apply reasoning informed by the contextual knowledge to assess societal, health, safety, legal and cultural issues and the consequent responsibilities relevant to the professional engineering and business practices. PO6

7 Environment and sustainability: Understand the impact of the professional engineering solutions in business societal and environmental contexts, and demonstrate the knowledge of, and need for sustainable development. PO7

8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and norms of the engineering and business practices. PO8

9 Individual and team work: Function effectively as an individual, and as a member or leader in diverse teams, and in multidisciplinary settings. PO9

10 Communication: Communicate effectively on complex engineering activities with the engineering community and with society at large, such as, being able to comprehend and write effective reports and design documentation, make effective presentations, and give and receive clear instructions. PO10

11 Project management and finance: Demonstrate knowledge and understanding of the engineering, business and management principles and apply these to one‟s own work, as a member and leader in a team, to manage projects and in multidisciplinary environments. PO11

12 Life-long learning: Recognize the need for, and have the preparation and ability to engage in independent and life-long learning in the broadest context of technological change. PO12

Last Updated: Tuesday, January 24, 2023