TY - GEN
T1 - A Case of Cups and a Ball
T2 - 2024 International Automatic Control Conference, CACS 2024
AU - Khoirurizka, Nurdin
AU - Prajogo, Joy Chrissetyo
AU - Perkins, Spencer
AU - Kuo, Chao Hsiang
AU - Lin, Hsien I.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Within the domain of robotics, effective interaction between a human collaborator and robot is a topic of great importance. With the continued advancements in Generative Artificial Intelligence (GAI), we propose a prototype system that utilizes GAI and seeks to bridge the gap between the human and robot. This prototype utilizes a Large Language Model (LLM) combined with a vision system for detection and tracking, to allow the GAI system to control a robotic arm and manipulate objects within the action space. We model the action space within our experiments on the 3-cup Monte game where the user can prompt the system to manipulate the cups within the action space based on the ball's location. Experimentation is done on prompts of varying lengths of sequences, and we evaluate our system based on its ability to understand the prompt and correctly execute the desired task. With this collaboration of techniques and technologies, we hope to open the possibilities and extend human and robotic collaboration into new areas. The results of our experiments show promising results to this end.
AB - Within the domain of robotics, effective interaction between a human collaborator and robot is a topic of great importance. With the continued advancements in Generative Artificial Intelligence (GAI), we propose a prototype system that utilizes GAI and seeks to bridge the gap between the human and robot. This prototype utilizes a Large Language Model (LLM) combined with a vision system for detection and tracking, to allow the GAI system to control a robotic arm and manipulate objects within the action space. We model the action space within our experiments on the 3-cup Monte game where the user can prompt the system to manipulate the cups within the action space based on the ball's location. Experimentation is done on prompts of varying lengths of sequences, and we evaluate our system based on its ability to understand the prompt and correctly execute the desired task. With this collaboration of techniques and technologies, we hope to open the possibilities and extend human and robotic collaboration into new areas. The results of our experiments show promising results to this end.
KW - Generative Artificial Intelligence (GAI)
KW - Human Robot Collaboration
KW - Large Language Model (LLM)
KW - Multiple Object Tracking
KW - Object Detection
UR - http://www.scopus.com/inward/record.url?scp=85214990308&partnerID=8YFLogxK
U2 - 10.1109/CACS63404.2024.10773301
DO - 10.1109/CACS63404.2024.10773301
M3 - Conference contribution
AN - SCOPUS:85214990308
T3 - 2024 International Automatic Control Conference, CACS 2024
BT - 2024 International Automatic Control Conference, CACS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 31 October 2024 through 3 November 2024
ER -