Evaluation of Reinforcement and Deep Learning Algorithms in Controlling Unmanned Aerial Vehicles
release_qblc3c6io5gqhfefmfauszkfmq
by
Yalew Zelalem Jembre, Yuniarto Wimbo Nugroho, Muhammad Toaha Raza Khan, Muhammad Attique, Rajib Paul, Syed Hassan Ahmed Shah, Beomjoon Kim
Abstract
Unmanned Aerial Vehicles (UAVs) are abundantly becoming a part of society, which is a trend that is expected to grow even further. The quadrotor is one of the drone technologies that is applicable in many sectors and in both military and civilian activities, with some applications requiring autonomous flight. However, stability, path planning, and control remain significant challenges in autonomous quadrotor flights. Traditional control algorithms, such as proportional-integral-derivative (PID), have deficiencies, especially in tuning. Recently, machine learning has received great attention in flying UAVs to desired positions autonomously. In this work, we configure the quadrotor to fly autonomously by using agents (the machine learning schemes being used to fly the quadrotor autonomously) to learn about the virtual physical environment. The quadrotor will fly from an initial to a desired position. When the agent brings the quadrotor closer to the desired position, it is rewarded; otherwise, it is punished. Two reinforcement learning models, Q-learning and SARSA, and a deep learning deep Q-network network are used as agents. The simulation is conducted by integrating the robot operating system (ROS) and Gazebo, which allowed for the implementation of the learning algorithms and the physical environment, respectively. The result has shown that the Deep Q-network network with Adadelta optimizer is the best setting to fly the quadrotor from the initial to desired position.
In application/xml+jats
format
Archived Files and Locations
application/pdf 629.4 kB
file_x2r4pg4zo5gulfvhr5uvr2kh4q
|
res.mdpi.com (publisher) web.archive.org (webarchive) |
application/pdf 629.3 kB
file_ezaxpk4vdveq3pyrynj64ldhoy
|
mdpi-res.com (web) web.archive.org (webarchive) |
Web Captures
https://www.mdpi.com/2076-3417/11/16/7240/htm
2022-06-26 23:31:21 | 51 resources webcapture_kem6eb64zvbk7fldiynlzau4ma
|
web.archive.org (webarchive) |
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:
2076-3417
access all versions, variants, and formats of this works (eg, pre-prints)
Crossref Metadata (via API)
Worldcat
SHERPA/RoMEO (journal policies)
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar