Quantifying grasp quality using an inverse reinforcement learning algorithm

dc.contributor.advisor: Landsberger, Sheldon
dc.contributor.advisor: Pryor, Mitchell Wayne
dc.creator: Horn, Matthew William
dc.creator.orcid: 0000-0002-7202-287X
dc.date.accessioned: 2017-06-21T18:05:36Z
dc.date.available: 2017-06-21T18:05:36Z
dc.date.issued: 2017-05
dc.date.submitted: May 2017
dc.date.updated: 2017-06-21T18:05:36Z
dc.description.abstract: This thesis considers the problem of using a learning algorithm to recognize when a mechanical gripper and sensor combination has achieved a robust grasp. Robotic hands are continuously evolving with finer motor control and higher degrees of freedom, which can complicate an operator's ability to determine whether a gripper has achieved a successful grasp. Robots working in hazardous environments especially need confirmation of a successful grasp, as the cost of failure is often higher than in traditional factory environments. The object set found in a nuclear environment is the focus of this effort. Objects in this environment are typically expensive (or one-of-a-kind), rigid, radioactive (or toxic), dense, and susceptible to dents, scratches, and oxidation. To validate the robustness of a grasp option, an online inverse reinforcement learning approach is evaluated as a method to quantify grasp quality. This approach is applied to an industrial-grade under-actuated robotic hand equipped with 36 pressure sensors. An expert trains the inverse reinforcement learning algorithm to generate a reward function that scores each grasp and, when combined with fuzzy logic, provides a success or fail decision along with a confidence level. Using the trained inverse reinforcement learning algorithm in a glovebox environment reduces the number of potentially failing and untrustworthy grasps: executed grasps are scored, grasps similar to prior failed grasps are rejected, and further movement is allowed only when a grasp has been scored highly. The trained algorithm incorrectly classified grasps of insufficient quality less than 5% of the time in experimental hardware tests, showing that the algorithm can be applied to the glovebox environment to improve grasp safety. Thus the combination of grasp selection and pressure-sensor validation provides a more efficient, robust, and redundant method to ensure items can be safely handled during remote automation processes.
dc.description.department: Mechanical Engineering
dc.format.mimetype: application/pdf
dc.identifier: doi:10.15781/T2ZC7S092
dc.identifier.uri: http://hdl.handle.net/2152/47303
dc.language.iso: en
dc.subject: Robotics
dc.subject: Nuclear
dc.subject: Grasp
dc.subject: Grasping
dc.subject: Validation
dc.subject: Grasp validation
dc.subject: Radiation
dc.subject: Glovebox
dc.subject: Grasp quality
dc.subject: Machine learning
dc.subject: Inverse reinforcement learning
dc.subject: Learning
dc.subject: Algorithm
dc.subject: Reinforcement
dc.subject: Safety
dc.title: Quantifying grasp quality using an inverse reinforcement learning algorithm
dc.type: Thesis
dc.type.material: text
thesis.degree.department: Mechanical Engineering
thesis.degree.discipline: Mechanical Engineering
thesis.degree.grantor: The University of Texas at Austin
thesis.degree.level: Masters
thesis.degree.name: Master of Science in Engineering
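
The abstract above describes learning a reward function via inverse reinforcement learning over the hand's 36 pressure-sensor readings and then using fuzzy logic to turn the score into a success/fail decision with a confidence level. The following Python sketch only illustrates that scoring-and-decision idea; it is not the thesis's implementation. The linear reward form, the thresholds, and all names (grasp_reward, fuzzy_grasp_decision, N_SENSORS) are illustrative assumptions.

import numpy as np

# Number of pressure sensors on the under-actuated hand described in the abstract.
N_SENSORS = 36

def grasp_reward(pressures, weights, bias=0.0):
    # Assumed linear reward over the pressure-sensor feature vector.
    # In the thesis the reward function is produced by training an IRL
    # algorithm on expert-labeled grasps; the weights here are placeholders.
    pressures = np.asarray(pressures, dtype=float)
    return float(weights @ pressures + bias)

def fuzzy_grasp_decision(score, fail_thresh=0.3, pass_thresh=0.7):
    # Assumed stand-in for the fuzzy-logic step: map the reward score to a
    # success/fail label plus a confidence level using simple linear
    # membership between two illustrative thresholds.
    if score <= fail_thresh:
        return "fail", 1.0
    if score >= pass_thresh:
        return "success", 1.0
    frac = (score - fail_thresh) / (pass_thresh - fail_thresh)
    if frac >= 0.5:
        return "success", 2.0 * (frac - 0.5)
    return "fail", 2.0 * (0.5 - frac)

# Usage with random placeholder sensor data and weights.
rng = np.random.default_rng(0)
weights = rng.uniform(0.0, 1.0 / N_SENSORS, N_SENSORS)
pressures = rng.uniform(0.0, 1.0, N_SENSORS)
score = grasp_reward(pressures, weights)
label, confidence = fuzzy_grasp_decision(score)
print(f"score={score:.2f} -> {label} (confidence {confidence:.2f})")

In a workflow like the one described, a fail or low-confidence decision would block further manipulator motion, while a high-scoring grasp would permit it.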

Access full-text files

Original bundle

Name: HORN-THESIS-2017.pdf
Size: 30.01 MB
Format: Adobe Portable Document Format

License bundle

Name: PROQUEST_LICENSE.txt
Size: 4.45 KB
Format: Plain Text
Name: LICENSE.txt
Size: 1.84 KB
Format: Plain Text