Control of a Ball Catch Robot Using Machine Learning
- https://doi.org/10.2991/jrnal.k.201215.005How to use a DOI?
- Machine learning, control, experiment, evaluation
Robots, such as industrial robots, have been used in the world of industry since the 1970s. There has been particularly rapid development in the field of robots in recent years, and there has been progress in robot research in industries such as communications and automobiles. For this reason, in the near future, robots with a diverse range of applications will be required around us. In this paper, as part of foundational research on robots and artificial intelligence, we propose a method for learning ball trajectories, using machine learning, to estimate target values for the distance in which robots move. In the proposed method, we use a linear regression model for supervised learning, and validate its effectiveness through experimentation.
- © 2020 The Authors. Published by Atlantis Press B.V.
- Open Access
- This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
Robots, such as industrial robots, have been used in the world of industry since the 1970s. There has been particularly rapid development in the field of robots in recent years, and there has been progress in robot research in industries such as communications and automobiles [1,2]. For this reason, it is thought that robots will exist all around us in the near future. Recently, we have also seen the birth of innovation, in which new technologies, such as Internet of Things (IoT) [3,4], Artificial Intelligence (AI) [5,6], and Big Data, are being incorporated into all industries and society and lifestyles, and Society 5.0 is being promoted to resolve social issues in a way that meets the needs of each individual.
Additionally, the RoboCup [7–9] project is being conducted on the theme of soccer, using autonomous mobile robots, to promote research into the fusion of robot engineering and AI. In this project, a significant amount of research has taken place into the fusion of robot engineering and AI, but most has concerned tracking of robot movement etc., and there have been few studies focusing on the control methods to estimate the trajectory of the ball. Other reports include learning of billiards using reinforcement learning  and cases of AI soccer . In this paper, as part of foundational research on robots and AI, we propose a method for learning ball trajectories, using machine learning, to estimate target values for the distance in which robots move.
2. DESIGN OF REFERENCE
Here, we explain the method for estimating target values using machine learning. The system concept diagram is shown in Figure 1. Here, a motor for moving a keeper robot is controlled and we use feed-forward control. Additionally, “y” is control output, “u” is control input, and r is the target value. The target value “r” refers to the position of the keeper robot, and the target value is estimated, using machine learning, based on the shoot angle of the attacker robot and the position of the ball.
2.1. Machine Learning
In the case of machine learning, we use a linear regression model with supervised learning. The linear regression model can be shown using the following formula.
The solution of is the least-squares estimate. Therefore, r is calculated by the following equation:
The estimated amount obtained here is used as reference.
3. EXPERIMENTAL SYSTEM
Figure 2 shows the configuration of the experiment system, and Figure 3 shows the circuit diagram of the experiment device. The experiment system, as shown in Figure 3, is comprised of an attacker robot and a keeper robot. As shown in Figure 4, to detect the position of the ball, 10 touch sensors are placed at equal intervals. The keeper robot is in front of the touch sensor, and he can catch the ball. The keeper robot can only move left or right. Next, the attacker robot has a potentiometer attached, and is able to measure the shoot angle of the ball. The attacker robot moves manually. Here, it obtains the voltage of the potentiometer to detect the shoot position, and the given control quantity is provided as an instruction to the motor of the keeper. The controller, as shown in Figure 5, uses the Raspberry Pi, as well as Python, which is often used in the programming of machine learning and AI. As the Raspberry Pi is equipped with A/D conversion, it is not possible to directly incorporate an analog sensor. For this reason, this is supported by attaching an A/D converter between the analog sensor and Raspberry Pi. There is a switch used to switch between “learning mode”, in which learning is performed, and “execution mode”, in which it actually moves, and two switches for starting. The details of the hardware are shown in Table 1.
|Hardware||Raspberry Pi 3 Model B|
|A/D converter||mcp3008 (10 bit 8 ch)|
|Microswitch (JF11210) × 10|
|Actuator||LEGO Mind storm RCX motor × 2|
|Other||Motor driver (TA7291P)|
The response variable (Y) in Subsection 2.1 expresses the angle of the attacker, and the predictor variable (X) shows the position of the limit switch. β in Expression 1 is calculated by machine learning. We can estimate the position of the keeper (target value) using these two variables. Estimation of learning and target values are all performed using the Raspberry Pi. Furthermore, the learned data and estimated target values are stored by the Raspberry Pi in SD memory in csv format.
4. EXPERIMENTAL RESULT
The results of learning the kicker robot angle and goal position are shown in Figures 6–8. Learning takes place five times in Figure 6, 10 times in Figure 7, and 100 times in Figure 8. We can see that, as the number of learning times increase, the more the positional relationship between the kicker robot angle and goal approaches a linear one.
The target values are estimated using this learning data, shown in Figure 9. The results of the number of learning times and the correct answer rate, in terms of catching the ball, are shown in Figure 10. Here, the ratio of catching the ball can be defined as shown in the following formula:
From these results, we can see that the correct answer rate increases as the number of learning times increases. Based on these results, the keeper robot, as in the beginning no learning was performed, was unable to catch the ball, but after performing learning, inappropriate actions gradually disappear, and eventually it becomes able to catch the ball.
In this paper, we proposed a method of estimating the target value for a robot controlling the trajectory of a ball using machine learning. The result of this is that, as the number of learning times increase, the accuracy can be increased, and we were able to confirm the effectiveness of the proposed method through experimentation. Moving forward, we plan to look into the application of the method in a separate device to create a versatile system. Additionally, we plan to provide support for models other than the linear regression model, such as reinforced learning and deep learning etc.
CONFLICTS OF INTEREST
The author declares no conflicts of interest.
This work was supported by enPiT. Thank you for the Smart SE: Smart Systems and Services innovative professional Education program. We asked students from Tokyo Gakugei University to cooperate in the development of this device.
Dr. Shinichi Imai
He graduated doctor course at department of engineering in Hiroshima University. He works at department of education in Tokyo Gakugei University. His research area is about control system design, educational engineering.
Cite this article
TY - JOUR AU - Shinichi Imai PY - 2020 DA - 2020/12 TI - Control of a Ball Catch Robot Using Machine Learning JO - Journal of Robotics, Networking and Artificial Life SP - 236 EP - 239 VL - 7 IS - 4 SN - 2352-6386 UR - https://doi.org/10.2991/jrnal.k.201215.005 DO - https://doi.org/10.2991/jrnal.k.201215.005 ID - Imai2020 ER -