A Case of Cups and a Ball: Utilizing Generative Artificial Intelligence for Human-robotic Collaboration in Task Execution

Nurdin Khoirurizka, Joy Chrissetyo Prajogo, Spencer Perkins, Chao Hsiang Kuo, Hsien I. Lin*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Within the domain of robotics, effective interaction between a human collaborator and robot is a topic of great importance. With the continued advancements in Generative Artificial Intelligence (GAI), we propose a prototype system that utilizes GAI and seeks to bridge the gap between the human and robot. This prototype utilizes a Large Language Model (LLM) combined with a vision system for detection and tracking, to allow the GAI system to control a robotic arm and manipulate objects within the action space. We model the action space within our experiments on the 3-cup Monte game where the user can prompt the system to manipulate the cups within the action space based on the ball's location. Experimentation is done on prompts of varying lengths of sequences, and we evaluate our system based on its ability to understand the prompt and correctly execute the desired task. With this collaboration of techniques and technologies, we hope to open the possibilities and extend human and robotic collaboration into new areas. The results of our experiments show promising results to this end.

Original languageEnglish
Title of host publication2024 International Automatic Control Conference, CACS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350354904
DOIs
StatePublished - 2024
Event2024 International Automatic Control Conference, CACS 2024 - Taoyuan, Taiwan
Duration: 31 Oct 20243 Nov 2024

Publication series

Name2024 International Automatic Control Conference, CACS 2024

Conference

Conference2024 International Automatic Control Conference, CACS 2024
Country/TerritoryTaiwan
CityTaoyuan
Period31/10/243/11/24

Keywords

  • Generative Artificial Intelligence (GAI)
  • Human Robot Collaboration
  • Large Language Model (LLM)
  • Multiple Object Tracking
  • Object Detection

Fingerprint

Dive into the research topics of 'A Case of Cups and a Ball: Utilizing Generative Artificial Intelligence for Human-robotic Collaboration in Task Execution'. Together they form a unique fingerprint.

Cite this