The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)
Learning Neuro-Symbolic Abstractions for Robot Planning and Learning
Naman Shah
School of Computing and Augmented Intelligence
Arizona State University, Tempe, AZ, USA, 85281
[email protected]
Abstract
Although state-of-the-art hierarchical robot planning algorithms allow robots to efficiently compute long-horizon motion plans for achieving user-desired tasks, these methods typically rely upon environment-dependent state and action abstractions that need to be hand-designed by experts. On the other hand, non-hierarchical robot planning approaches fail to compute solutions for complex tasks that require reasoning over a long horizon. My research addresses these problems by proposing an approach for learning abstractions and developing hierarchical planners that efficiently use learned abstractions to boost robot planning performance and provide strong guarantees of reliability.

Figure 1: The overall approach of my research. (a) shows the ground input motion planning problem. The next step is to identify critical regions, as shown in (b), and to use them to synthesize abstract states and actions, as shown in (c) using colored cells and arrows respectively.
1 Introduction

Robots need to plan their actions in order to complete complex tasks in various real-world domains. E.g., consider the problem shown in Fig. 1(a). However, robot planning over a long horizon is challenging due to the continuous state and action spaces of the robot. Hierarchical approaches (Garrett, Lozano-Pérez, and Kaelbling 2020; Shah et al. 2020) have shown that state and action abstractions can be used for efficient robot planning. Unfortunately, these approaches require sound abstractions that are consistent with the motion planning of the robot, and designing such abstractions is non-intuitive, non-trivial, and requires a domain expert. Most related approaches require hand-coded abstractions (Garrett, Lozano-Pérez, and Kaelbling 2020; Shah et al. 2020) or require experience in the test domain (Bagaria and Konidaris 2020) to learn abstractions.

My research aims to answer two crucial research questions: (1) Can we automatically learn effective hierarchical state and action abstractions that enable hierarchical planning, and (2) Is it possible to develop efficient approaches that use these automatically generated hierarchical abstractions for robot planning? My research focuses on developing data-driven neuro-symbolic approaches for automatically learning such hierarchical state and action abstractions for complex long-horizon robot planning tasks in unseen environments. I also develop hierarchical planners that use these learned abstractions for efficient robot planning.

Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

2 Proposed Approach

My research develops data-driven neuro-symbolic approaches for learning to create hierarchical state and action abstractions for unseen environments. I use the concept of critical regions (Molina, Kumar, and Srivastava 2020) for constructing these hierarchical abstractions. Intuitively, critical regions generalize bottlenecks and hub or access points in an environment into a single concept. I propose to learn a critical region predictor using randomly generated motion plans in a few training environments and to use it to automatically identify critical regions in an unseen environment from an occupancy matrix of the environment.

My research uses these automatically identified critical regions to construct a region-based Voronoi diagram (RBVD), which partitions the configuration space into cells. Each cell defines an abstract state, inducing an abstraction function. High-level abstract actions are defined as transitions between the abstract states induced by the Voronoi cells. Fig. 1(c) shows an illustration of a region-based Voronoi diagram. Thus, we obtain a neuro-symbolic atomic abstract representation for an otherwise continuous configuration space of the robot.

Given an abstract representation with a discrete set of abstract states and abstract actions constructed in a bottom-up fashion as outlined above, I focus on developing hierarchical robot planning approaches that use these abstractions efficiently.
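The RBVD construction described above admits a compact sketch. The following is a minimal, illustrative example rather than the actual implementation: it assumes a 2-D occupancy grid and a set of already-identified critical region centers (standing in for the output of the learned predictor), partitions free cells by their nearest critical region to form the Voronoi cells, and reads the abstract states and actions off the partition. The function names and the toy grid are invented for illustration.

```python
from itertools import product

def rbvd_partition(occupancy, critical_regions):
    """Assign each free cell to its nearest critical region (squared
    Euclidean distance), yielding the cells of a region-based Voronoi
    diagram (RBVD). Each region index doubles as an abstract state."""
    assignment = {}
    for r, c in product(range(len(occupancy)), range(len(occupancy[0]))):
        if occupancy[r][c] == 1:  # 1 = obstacle, 0 = free
            continue
        assignment[(r, c)] = min(
            range(len(critical_regions)),
            key=lambda i: (critical_regions[i][0] - r) ** 2
                        + (critical_regions[i][1] - c) ** 2,
        )
    return assignment

def abstract_actions(assignment):
    """Abstract actions are transitions between adjacent Voronoi cells."""
    actions = set()
    for (r, c), s in assignment.items():
        for nbr in ((r + 1, c), (r, c + 1)):  # check down/right neighbors
            t = assignment.get(nbr)
            if t is not None and t != s:
                actions.add((s, t))
                actions.add((t, s))
    return actions

# Toy 4x4 environment with a wall and one doorway in the third row;
# the two critical region centers are assumed, not learned.
grid = [
    [0, 0, 1, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
]
cells = rbvd_partition(grid, critical_regions=[(0, 0), (0, 3)])
acts = abstract_actions(cells)
```

On this toy grid the two critical regions induce two abstract states, and the free cells straddling the doorway in the third row yield the single pair of abstract actions between them.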
My research proposes to develop hierarchical probabilistically-complete robot planning algorithms that interleave high-level symbolic reasoning with continuous low-level motion planning using learned state and action abstractions. Here, an interleaved approach implies that the developed algorithm iteratively searches for a high-level abstract plan that has a valid low-level refinement for all its symbolic actions. This yields a suite of hierarchical algorithms that provide strong theoretical guarantees of probabilistic completeness and downward refinability.

3 Preliminary Results

This section outlines multiple algorithms for hierarchical planning developed using the above-mentioned approach for solving robot planning problems. These approaches include a stochastic task and motion planning approach using hand-coded abstractions (Sec. 3.1) and hierarchical planning approaches using learned abstractions (Sec. 3.2 and 3.3).

3.1 Stochastic Task and Motion Planning

Shah et al. (2020) develop an interleaved algorithm for combined task and motion planning. It takes a continuous robot planning problem in the form of a stochastic shortest path (SSP) problem and an entity abstraction as input and uses them to compute a task and motion policy for the input SSP that the robot can execute at the low level. It iteratively computes a high-level policy and its refinements until it finds a policy that has valid motion planning refinements for all its actions.

The approach is evaluated in multiple settings where combined task and motion planning is necessary to compute feasible solutions. Refining each possible outcome in the policy can take a substantial amount of time. However, the ATAM algorithm (Shah et al. 2020) reduces the problem of selecting scenarios for refinement to a knapsack problem and uses a greedy approach to prioritize more likely outcomes for refinement. The empirical evaluation shows that this approach allows the robot to start executing actions much earlier compared to when actions are selected randomly. The detailed algorithm and experiments are available in the paper.

3.2 Robot Planning Using Learned Abstractions

Shah and Srivastava (2022b) develop a hierarchical planner -- the hierarchical abstraction-guided robot planner (HARP) -- that uses an automatically synthesized state abstraction in the form of a region-based Voronoi diagram and the action abstractions induced by it. The approach uses a multi-source multi-directional variant of beam search (Lowerre 1976) for computing a set of high-level plans and a multi-source multi-directional motion planner, LLP (Molina, Kumar, and Srivastava 2020), to simultaneously refine them into a motion plan. Multi-source approaches typically do not work for robot planning. However, critical regions provide crucial information about the states that the robot would potentially visit, allowing a multi-source approach to work for robot planning. In summary, Shah and Srivastava (2022b) develop the first approach for learning to create zero-shot state and action abstractions.

The approach is evaluated in multiple settings and compared against state-of-the-art sampling-based (Kavraki et al. 1996; LaValle et al. 1998) and learning-based (Molina, Kumar, and Srivastava 2020) motion planners. The results show that using hierarchical planning alongside learning significantly (∼10×) improves the efficiency.

3.3 Robot Planning Under Uncertainty

Shah and Srivastava (2022a) develop an approach -- the stochastic hierarchical abstraction-guided robot planner (SHARP) -- for computing motion policies for robots with stochastic dynamics. It uses the abstract states defined using an RBVD and defines options that make transitions between these abstract states. These options are multi-task, meaning the same set of options can be used for multiple problems in the same environment. SHARP uses A∗ search to compute a high-level plan by composing options and then uses an off-the-shelf DRL approach to compute policies for these options.

The approach is evaluated in 14 different settings and compared against a re-planning variant of RRT (LaValle et al. 1998), SAC (Haarnoja et al. 2018), and several HRL approaches. While most baselines failed to compute solutions, our approach significantly outperformed all the baselines owing to a dense auto-generated pseudo-reward and an effectively shorter horizon for learning reactive policies.

References

Bagaria, A.; and Konidaris, G. 2020. Option Discovery Using Deep Skill Chaining. In ICLR.

Garrett, C.; Lozano-Pérez, T.; and Kaelbling, L. 2020. PDDLStream: Integrating Symbolic Planners and Blackbox Samplers via Optimistic Adaptive Planning. In ICAPS.

Haarnoja, T.; Zhou, A.; Abbeel, P.; and Levine, S. 2018. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In ICML.

Kavraki, L. E.; Svestka, P.; Latombe, J.-C.; and Overmars, M. 1996. Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces. IEEE TRO, 12(4).

LaValle, S. M.; et al. 1998. Rapidly-Exploring Random Trees: A New Tool for Path Planning. Iowa State University.

Lowerre, B. T. 1976. The Harpy Speech Recognition System. Carnegie Mellon University.

Molina, D.; Kumar, K.; and Srivastava, S. 2020. Identifying Critical Regions for Motion Planning Using Auto-Generated Saliency Labels with Convolutional Neural Networks. In ICRA.

Shah, N.; Kala Vasudevan, D.; Kumar, K.; Kamojjhala, P.; and Srivastava, S. 2020. Anytime Integrated Task and Motion Policies for Stochastic Environments. In ICRA.

Shah, N.; and Srivastava, S. 2022a. Multi-Task Option Learning and Discovery for Stochastic Path Planning. arXiv preprint arXiv:2210.00068.

Shah, N.; and Srivastava, S. 2022b. Using Deep Learning to Bootstrap Abstractions for Hierarchical Robot Planning. In AAMAS.