U4 NLP Notes

nlp notes ai ml

Uploaded by

salad10shark

Unit 4: Predicate Argument Structure, Meaning Representation Systems, Software

Predicate Argument Structure:

-Predicate-argument structure describes the roles that different parts of a sentence play (who did
what to whom). The task of identifying it automatically is known as semantic role labeling (SRL).
-The "predicate" is usually a verb (but can also be a noun, adjective, or preposition), and the
"arguments" are the entities that participate in the action or state described by the predicate.

Resources:

-These resources help computers understand the meaning of sentences by identifying the action and
who is involved.
-This is important for things like translating languages, answering questions, and even helping virtual
assistants understand commands better.
(i) FrameNet
(ii) PropBank

FRAMENET:

FrameNet looks at how words are used in different situations (frames) and identifies the roles that
other words play in these situations.
It is based on the theory of frame semantics, which suggests that the meaning of a word can be
understood in terms of the typical situations it describes.

Key Elements:

Frames: A frame is a type of situation or scenario. Each frame involves certain participants, which are
called frame elements.
Frame Elements: These are the roles played by the different participants in a frame.
Lexical Units (LUs): These are pairings of a word with one of its meanings, where the meaning is the
frame the word evokes. A word with several meanings therefore has several lexical units, one per frame.

Think of the word "break" in two different frames:

Frame 1: "Break" as in breaking a rule.
Roles:
Breaker (the person who breaks the rule)
Rule (the rule being broken)
Frame 2: "Break" as in breaking an object.
Roles:
Breaker (the person who breaks the object)
Object (the thing being broken)
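The "break" example above can be sketched as a small lookup table in which each lexical unit is a (word, frame) pair. This is only an illustration: the frame names Breaking_a_rule and Breaking_an_object and the helper functions are hypothetical labels for the two situations in these notes, not actual FrameNet identifiers.

```python
# Hypothetical frame inventory based on the "break" example in these notes.
# The frame names are illustrative, not real FrameNet frame names.
FRAMES = {
    "Breaking_a_rule": ["Breaker", "Rule"],
    "Breaking_an_object": ["Breaker", "Object"],
}

# A lexical unit pairs a word (lemma) with one frame it can evoke.
LEXICAL_UNITS = {
    ("break", "Breaking_a_rule"),
    ("break", "Breaking_an_object"),
}


def frames_for(lemma):
    """Return every frame that the given word can evoke, in sorted order."""
    return sorted(frame for word, frame in LEXICAL_UNITS if word == lemma)


def roles_for(lemma, frame):
    """Return the frame elements (roles) for one lexical unit."""
    if (lemma, frame) not in LEXICAL_UNITS:
        raise KeyError(f"no lexical unit for {lemma!r} in frame {frame!r}")
    return FRAMES[frame]


for frame in frames_for("break"):
    print(frame, "->", roles_for("break", frame))
```

The same word maps to two lexical units, and each one brings its own set of roles.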

Working:

1. Identify Frames: Researchers identify common situations (frames).
2. Assign Frame Elements: Each frame has specific roles.
3. Label Sentences: Sentences are tagged with these frames and frame elements to show how
words are used in context.

Example:

Frame: COMMERCE_BUY
Sentence: "John bought a car from Mary for $20,000."
Frame Elements:
Buyer: John
Goods: a car
Seller: Mary
Money: $20,000
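As a rough sketch, the annotation above could be stored as a small record that maps each frame element to the text it covers. The class name FrameAnnotation, its fields, and the check_spans helper are assumptions made for illustration and do not reflect FrameNet's own data format.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class FrameAnnotation:
    """One sentence labeled with a frame and its frame elements (illustrative)."""
    frame: str
    target: str                      # the word that evokes the frame
    sentence: str
    elements: Dict[str, str] = field(default_factory=dict)

    def check_spans(self) -> bool:
        """Verify that the target and every labeled span occur in the sentence."""
        spans = [self.target, *self.elements.values()]
        return all(span in self.sentence for span in spans)


annotation = FrameAnnotation(
    frame="COMMERCE_BUY",
    target="bought",
    sentence="John bought a car from Mary for $20,000.",
    elements={
        "Buyer": "John",
        "Goods": "a car",
        "Seller": "Mary",
        "Money": "$20,000",
    },
)

for role, text in annotation.elements.items():
    print(f"{role}: {text}")
```

Keeping the roles in a dictionary makes it easy to ask questions such as "who is the Seller?" without re-reading the sentence.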

PROPBANK:

-PropBank is a corpus of texts where each verb is annotated with its arguments, giving us a clear idea
of who is doing what to whom in a sentence.
-This helps in understanding the roles of different entities in relation to the verb.

Key Elements:
Predicate: Usually a verb, it represents an action or state.
Arguments: The participants involved in the action or state described by the predicate. Arguments
are categorized as core (essential to the meaning of the predicate) or adjunctive (providing
additional information).

For the verb "operate":

Sentence: "The doctor operates the machine."
Roles:
Operator (who is operating, e.g., "The doctor")
Thing being operated (what is being operated, e.g., "the machine")

Working:
1. Annotations: PropBank annotates verbs in the Wall Street Journal section of the Penn Treebank.
Each verb is tagged with its core arguments (like the subject and object) and adjunctive arguments
(like time and location).
2. Framesets: Each verb has a frameset that lists possible argument structures (roles) it can take,
along with descriptions of these roles.

Example of PropBank Annotations

Sentence: "John gave Mary a book."
Predicate: gave
Arguments:
ARG0 (Agent): John (the one who gives)
ARG1 (Theme): a book (the thing given)
ARG2 (Recipient): Mary (the one who receives)

In PropBank notation, this might be represented as:

[ARG0 John] [gave] [ARG2 Mary] [ARG1 a book].
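The bracketed form can be produced mechanically from token spans. The sketch below assumes a hypothetical input format, a token list plus (label, start, end) spans with an exclusive end index; the function name to_brackets is also made up for this example.

```python
def to_brackets(tokens, spans, predicate_index):
    """Render labeled spans in bracket notation.

    spans is a list of (label, start, end) tuples with end exclusive.
    The predicate token is wrapped in plain brackets.
    """
    starts = {start: (label, end) for label, start, end in spans}
    out = []
    i = 0
    while i < len(tokens):
        if i in starts:
            label, end = starts[i]
            out.append("[" + label + " " + " ".join(tokens[i:end]) + "]")
            i = end                      # skip past the whole argument span
        elif i == predicate_index:
            out.append("[" + tokens[i] + "]")
            i += 1
        else:
            out.append(tokens[i])        # unlabeled token, copied as-is
            i += 1
    return " ".join(out)


tokens = ["John", "gave", "Mary", "a", "book"]
spans = [("ARG0", 0, 1), ("ARG2", 2, 3), ("ARG1", 3, 5)]
bracketed = to_brackets(tokens, spans, predicate_index=1)
print(bracketed)  # [ARG0 John] [gave] [ARG2 Mary] [ARG1 a book]
```

Note that the labels follow the roles of the verb, not the word order: Mary comes before the book in the sentence but is still ARG2.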

Core Arguments:

These are essential participants directly involved with the predicate:

ARG0: Typically the agent or doer of the action.
ARG1: Typically the patient or theme (the entity undergoing the action).
ARG2, ARG3, ARG4: Other roles that vary depending on the verb's meaning.

Adjunctive Arguments:

These provide additional information about the action and are labeled as ARGM-XYZ, where XYZ
indicates the type of information:

ARGM-LOC: Location (e.g., "in the hotel")
ARGM-TMP: Time (e.g., "yesterday")
ARGM-MNR: Manner (e.g., "quickly")
ARGM-CAU: Cause (e.g., "because he was hungry")
ARGM-DIR: Direction (e.g., "to the store")
ARGM-PRP: Purpose (e.g., "to buy groceries")
ARGM-NEG: Negation (e.g., "not")
ARGM-MOD: Modality (e.g., "can," "might")

Example of a Complex Annotation:

Sentence: "The company operates stores mostly in Iowa and Nebraska."
Predicate: operates
Arguments:
ARG0 (Agent): The company
ARG1 (Theme): stores
ARGM-LOC (Location): mostly in Iowa and Nebraska
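A minimal sketch of how the two kinds of labels can be told apart in code, using the annotation above. The ARGM_TYPES table covers only the adjunct types listed in these notes, and the classify function is a hypothetical helper, not part of any PropBank tool.

```python
import re

# Adjunct types listed in these notes (not the full PropBank inventory).
ARGM_TYPES = {
    "LOC": "Location",
    "TMP": "Time",
    "MNR": "Manner",
    "CAU": "Cause",
    "DIR": "Direction",
    "PRP": "Purpose",
    "NEG": "Negation",
    "MOD": "Modality",
}


def classify(label):
    """Return ("core", label) for ARG0..ARG4 or ("adjunct", description) for ARGM-XYZ."""
    if re.fullmatch(r"ARG[0-4]", label):
        return ("core", label)
    match = re.fullmatch(r"ARGM-([A-Z]+)", label)
    if match and match.group(1) in ARGM_TYPES:
        return ("adjunct", ARGM_TYPES[match.group(1)])
    raise ValueError(f"unknown label: {label}")


annotation = {
    "ARG0": "The company",
    "ARG1": "stores",
    "ARGM-LOC": "mostly in Iowa and Nebraska",
}

core = {k: v for k, v in annotation.items() if classify(k)[0] == "core"}
adjuncts = {k: v for k, v in annotation.items() if classify(k)[0] == "adjunct"}
print("core:", core)
print("adjuncts:", adjuncts)
```

Removing the adjuncts leaves "The company operates stores", which is still a complete statement; removing a core argument would not.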

Other resources of predicate argument structure:

1. NomBank
2. VerbNet

SOFTWARE

The following software packages are available for semantic role labeling:

1. ASSERT (Automatic Statistical SEmantic Role Tagger)
A semantic role labeler trained on the English PropBank data.

2. C-ASSERT
An extension of ASSERT for the Chinese language.

3. SwiRL
Another semantic role labeler trained on PropBank data.

4. Shalmaneser (A Shallow Semantic Parser)
A toolchain for shallow semantic parsing based on the FrameNet data.

----------------------------------------------------------------------------------------------------

MEANING REPRESENTATION:

- Meaning representation is a deeper level of semantic interpretation aimed at converting natural
language into a format that machines can understand and act on.
- This process is similar to how programming languages are compiled into machine code that
computers execute.
- Unlike artificial languages, natural language is flexible and relies on context and general world
knowledge for understanding, which poses a challenge for machines.
- Researchers have been working for decades to develop methods to interpret and encode this context
and knowledge for machines.
- However, current techniques are limited to specific domains and problems and do not scale well to
arbitrary domains.

RESOURCES:

1. ATIS:

The ATIS (Air Travel Information System) project was one of the first major efforts to develop systems that convert natural language into
a form usable by applications for decision-making. Specifically, it focused on transforming user queries
about flight information into SQL queries to extract answers from a flight database.
Here’s how it worked:

1. A user would ask a question in natural speech using a restricted vocabulary.
2. The system would convert this query into a hierarchical frame representation, encoding the essential
semantic information.
3. This representation was then compiled into a SQL query to retrieve the required data from the
database.

The ATIS training corpus included over 7,300 spoken utterances from 137 subjects, with 2,900 of them
categorized and annotated, and around 600 treebanked for detailed syntactic analysis. This resource
helped promote experimentation in transforming natural language into machine-readable formats.

2. COMMUNICATOR

The Communicator program was the next step after the ATIS project. While ATIS focused on
user-initiated dialogs where users asked questions and machines provided answers, Communicator
introduced a mixed-initiative dialog system. This means both the user and the machine could actively
participate in the conversation.

3. GeoQuery

GeoQuery is a natural language interface (NLI) designed to interact with a geographic database called
Geobase. Geobase contains about 800 Prolog facts, which store geographic information such as
populations, neighboring states, major rivers, and major cities in a relational database.
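To give a feel for what querying such a fact base involves, here is a small sketch that stores a few facts as tuples and matches them against a pattern. It is written in Python rather than Prolog, and the predicate names (state, capital, next_to) and the query helper are assumptions for illustration, not Geobase's actual schema.

```python
# A handful of geographic facts as (predicate, arg1, arg2, ...) tuples.
FACTS = [
    ("state", "texas"),
    ("state", "oklahoma"),
    ("capital", "texas", "austin"),
    ("next_to", "texas", "oklahoma"),
    ("next_to", "texas", "new mexico"),
    ("next_to", "texas", "arkansas"),
    ("next_to", "texas", "louisiana"),
]


def query(*pattern):
    """Return all facts matching the pattern; None acts as a wildcard."""
    return [
        fact
        for fact in FACTS
        if len(fact) == len(pattern)
        and all(p is None or p == value for p, value in zip(pattern, fact))
    ]


# "Which states border Texas?" expressed as a pattern over the fact base.
neighbors = sorted(fact[2] for fact in query("next_to", "texas", None))
print(neighbors)  # ['arkansas', 'louisiana', 'new mexico', 'oklahoma']

# "What is the capital of Texas?"
print(query("capital", "texas", None)[0][2])  # austin
```

A natural language interface has to map the English question onto a pattern like this; the lookup itself is the easy part.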

4. RoboCup: CLang

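RoboCup is an international competition, organized by the artificial intelligence community, in which
teams of robots play soccer. The goal is to advance AI and robotics research through this challenging
and fun domain. CLang is the formal language used in RoboCup to encode coaching advice for the players,
which makes it a target representation for translating natural language instructions.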

Software

• WASP
• KRISPER
• CHILL
