POS Tagging
NAME: E GAYATHRI
REG NO: 21MIS0241
ABSTRACT
• POS tagging is a core task in NLP to label words based on their
syntactic roles.
• It plays a vital role in understanding the grammatical structure of
language.
• POS tagging contributes to more effective machine understanding
of language.
• POS tagging provides foundational grammatical information for
advanced NLP applications like machine translation, sentiment
analysis, and syntactic parsing.
INTRODUCTION
• POS tagging assigns labels like noun, verb, adjective, etc., to
each word in a sentence.
• It helps capture both syntactic and semantic meaning from text.
• POS tagging is context-dependent; the same word may have
different tags in different sentences.
• Key categories include nouns, verbs, adjectives, adverbs,
pronouns, conjunctions, and prepositions.
• Crucial for many downstream NLP applications, including
parsing and machine translation.
HISTORY OF POS TAGGING
• Early Research: Began in the 1950s with rule-based approaches to
classify word categories.
• Transformation-Based Learning: Introduced in the 1990s by Eric Brill
(Brill Tagger).
• Shift to Machine Learning: Transitioned from rule-based systems to
probabilistic and ML-based models (HMM, CRF).
• Deep Learning Era: Recent advances use neural networks and
embeddings for better accuracy.
• Corpus Annotation: Early efforts involved manually tagging large
corpora like the Penn Treebank.
IMPORTANCE OF POS TAGGING
1. Sentence Structure Understanding:
• Helps machines understand the grammatical structure of sentences, which is
essential for many NLP tasks.
2. Enhances Machine Translation:
• POS tags ensure that words are translated accurately according to their
grammatical roles.
3. Word Sense Disambiguation:
• Assists in determining the correct meaning of words with multiple meanings
(e.g., "lead" as a noun vs. verb).
METHODS OF POS TAGGING
1. Rule-Based Tagging: Uses predefined linguistic rules to assign POS tags
(early method).
2. Stochastic Tagging: Relies on probabilistic models like Hidden Markov
Models (HMM) to predict tags.
3. Machine Learning Approaches: Techniques such as Conditional Random
Fields (CRF) or Maximum Entropy (MaxEnt).
RULE-BASED POS TAGGING
Step 1: Tokenize the sentence into individual words.
Step 2: Use a lexicon (dictionary) to assign basic POS tags based on word
forms.
Step 3: Apply predefined linguistic rules (e.g., “If a word ends with 'ed',
it's likely a past-tense verb").
Step 4: Adjust tags based on word context (e.g., "He can swim" – 'can' is
tagged as a modal verb).
RULE-BASED POS TAGGING EXAMPLE
Example Rules:
• Nouns (NN): A word is tagged as a noun if it follows an article (e.g., "the," "a," "an").
• Verbs (VBG): A word ending with "ing" is tagged as a verb (e.g., "running," "swimming").
• Adverbs (RB): A word ending with "ly" is tagged as an adverb (e.g., "quickly," "silently").
Sample Sentence:
• Sentence: "The dog is running quickly."
Final Output:
• The sentence is tagged as follows:
"The" (DT), "dog" (NN), "is" (VBZ), "running" (VBG), "quickly" (RB).
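The four steps and example rules above can be sketched as a small tagger. The lexicon entries and the default-to-noun fallback are illustrative assumptions, not a complete grammar; the tags follow the Penn Treebank conventions used in the example.

```python
# Minimal rule-based POS tagger (illustrative lexicon and rules).
LEXICON = {"the": "DT", "a": "DT", "an": "DT", "is": "VBZ", "dog": "NN"}

def rule_based_tag(sentence):
    tokens = sentence.lower().rstrip(".").split()   # Step 1: tokenize
    tags = []
    for i, word in enumerate(tokens):
        if word in LEXICON:                         # Step 2: lexicon lookup
            tag = LEXICON[word]
        elif word.endswith("ing"):                  # Step 3: suffix rules
            tag = "VBG"
        elif word.endswith("ly"):
            tag = "RB"
        elif i > 0 and tags[i - 1] == "DT":         # Step 4: context rule
            tag = "NN"                              # noun after an article
        else:
            tag = "NN"                              # default fallback (assumption)
        tags.append(tag)
    return list(zip(tokens, tags))

print(rule_based_tag("The dog is running quickly."))
# [('the', 'DT'), ('dog', 'NN'), ('is', 'VBZ'), ('running', 'VBG'), ('quickly', 'RB')]
```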
MACHINE LEARNING POS TAGGING
Step 1: Import Necessary Libraries
Step 2: Prepare Sample Data
Step 3: Create Feature DataFrame
Step 4: Apply One-Hot Encoding
Step 5: Train the Decision Tree Classifier
Step 6: Prepare New Sentence for Classification
Step 7: Predict POS Tags for New Sentence
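The seven steps above can be sketched with pandas and scikit-learn. The toy training pairs and the suffix/length features are illustrative assumptions; a real tagger would be trained on a large annotated corpus such as the Penn Treebank.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Step 2: sample (word, tag) training data (made-up illustration)
train = [("the", "DT"), ("a", "DT"), ("dog", "NN"), ("cat", "NN"),
         ("runs", "VBZ"), ("is", "VBZ"), ("running", "VBG"),
         ("singing", "VBG"), ("quickly", "RB"), ("slowly", "RB")]

def features(word):
    # Step 3: simple surface features for each word
    return {"suffix2": word[-2:], "suffix3": word[-3:], "length": len(word)}

X = pd.DataFrame([features(w) for w, _ in train])
y = [t for _, t in train]

# Step 4: one-hot encode the categorical suffix features
X = pd.get_dummies(X, columns=["suffix2", "suffix3"])

# Step 5: train the decision tree classifier
clf = DecisionTreeClassifier(random_state=0).fit(X, y)

# Steps 6-7: featurize a new sentence and predict its tags
new_words = ["a", "bird", "is", "running", "loudly"]
X_new = pd.DataFrame([features(w) for w in new_words])
X_new = pd.get_dummies(X_new, columns=["suffix2", "suffix3"])
# align columns with the training matrix (unseen suffixes become 0)
X_new = X_new.reindex(columns=X.columns, fill_value=0)

print(list(zip(new_words, clf.predict(X_new))))
```

Note that the feature columns of the new sentence must be aligned with the training matrix, since one-hot encoding a new sentence can produce a different set of suffix columns.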
PROBABILISTIC POS TAGGING
HIDDEN MARKOV MODEL
Step 1: Define possible states (POS tags) and transitions between states.
Step 2: Calculate emission probabilities (likelihood of word given a POS
tag).
Step 3: Use the Viterbi algorithm to find the most probable sequence of POS
tags for the sentence.
Step 4: Output the best tag sequence based on maximum probability.
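A minimal Viterbi decoder following the four steps above. The transition and emission probabilities are tiny hand-set values for illustration; a real HMM tagger estimates them from a tagged corpus.

```python
# Step 1: states (POS tags) and transition probabilities P(tag_i | tag_{i-1})
states = ["DT", "NN", "VB"]
start_p = {"DT": 0.6, "NN": 0.3, "VB": 0.1}
trans_p = {
    "DT": {"DT": 0.05, "NN": 0.85, "VB": 0.10},
    "NN": {"DT": 0.10, "NN": 0.30, "VB": 0.60},
    "VB": {"DT": 0.40, "NN": 0.40, "VB": 0.20},
}
# Step 2: emission probabilities P(word | tag) (illustrative values)
emit_p = {
    "DT": {"the": 0.9, "dog": 0.0, "barks": 0.0},
    "NN": {"the": 0.0, "dog": 0.8, "barks": 0.1},
    "VB": {"the": 0.0, "dog": 0.1, "barks": 0.8},
}

def viterbi(words):
    # Step 3: dynamic programming over tag sequences
    V = [{s: (start_p[s] * emit_p[s][words[0]], [s]) for s in states}]
    for w in words[1:]:
        layer = {}
        for s in states:
            prob, path = max(
                (V[-1][prev][0] * trans_p[prev][s] * emit_p[s][w],
                 V[-1][prev][1] + [s])
                for prev in states)
            layer[s] = (prob, path)
        V.append(layer)
    # Step 4: return the tag sequence with maximum probability
    return max(V[-1].values())[1]

print(viterbi(["the", "dog", "barks"]))   # ['DT', 'NN', 'VB']
```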
APPLICATIONS OF POS TAGGING
1. Text-to-Speech (TTS): POS tags guide pronunciation, stress, and rhythm in
speech synthesis.
2. Information Extraction: Helps identify named entities, dates, places, and
important data from text.
3. Machine Translation: Improves grammatical correctness in translated
sentences.
4. Sentiment Analysis: Helps disambiguate words based on context (e.g., "like" as
a verb vs. preposition).
5. Question Answering Systems: Supports the understanding of syntax for more
accurate answers.
ADVANTAGES OF POS TAGGING
• Improved Language Understanding: Helps NLP models better understand the
structure of language.
• Enhanced Machine Translation: Results in more accurate and grammatically sound
translations.
• Supports Syntax Parsing: Essential for tasks like dependency parsing and syntactic
analysis.
• Improves Contextual Word Sense: Helps in identifying the correct sense of a word in
context.
• Facilitates Text Summarization: Assists in accurately summarizing and simplifying
text based on structure.
CHALLENGES OF POS TAGGING
• Ambiguity: Words can have multiple POS tags depending on context (e.g.,
"book" as noun vs. verb).
• Data Sparsity: Lack of sufficient labeled data for low-resource languages and
unseen words.
• Complex Sentences: Handling complex, unstructured, or idiomatic language is
difficult.
• Language Variability: Different languages have different grammatical rules,
requiring tailored models.
• Out-of-Vocabulary (OOV) Words: Handling new words that do not exist in the
training data.
EVALUATION OF POS TAGGING MODELS
• Accuracy: Measures the percentage of correctly tagged words in a corpus.
• Precision and Recall: Evaluates performance for specific tags, especially important
ones like nouns and verbs.
• F1 Score: Balances precision and recall to give a single performance metric.
• Cross-Validation: Common technique used to evaluate models on multiple test sets.
• Error Analysis: Identifying common errors to improve tagging performance.
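The accuracy, precision, recall, and F1 metrics above can be computed per tag from gold and predicted label sequences. The two label lists below are made-up illustration data.

```python
# Per-tag precision / recall / F1 plus overall accuracy.
def evaluate(gold, pred, tag):
    tp = sum(1 for g, p in zip(gold, pred) if g == tag and p == tag)
    fp = sum(1 for g, p in zip(gold, pred) if g != tag and p == tag)
    fn = sum(1 for g, p in zip(gold, pred) if g == tag and p != tag)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

gold = ["NN", "VB", "NN", "DT", "NN", "VB"]   # reference tags
pred = ["NN", "NN", "NN", "DT", "VB", "VB"]   # model output

accuracy = sum(g == p for g, p in zip(gold, pred)) / len(gold)
print(f"accuracy = {accuracy:.2f}")           # 4 of 6 words correct
print("NN:", evaluate(gold, pred, "NN"))      # precision, recall, F1 for NN
```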
CONCLUSION
In conclusion, POS tagging is a key technique in NLP that assigns grammatical
roles to words, enabling better language understanding for tasks like translation
and sentiment analysis. Over time, it has evolved from rule-based systems to
advanced deep learning models, improving accuracy and handling complex
contexts. Despite challenges like ambiguity, POS tagging remains vital for
various NLP applications. Ongoing advancements in AI continue to refine its
effectiveness.
THANK YOU