[1] Robot Operating System (ROS), available from: https://www.ros.org.
[2] Base local planner, available from: http://wiki.ros.org/base_local_planner.
[3] Dieter Fox, Wolfram Burgard, Sebastian Thrun, 1997, "The Dynamic Window Approach to Collision Avoidance," IEEE Robotics & Automation Magazine, vol. 4, issue 1, pp. 23-33, March.
[4] DWA local planner, available from: http://wiki.ros.org/dwa_local_planner.
[5] Scott Fujimoto, Herke van Hoof, David Meger, 2018, "Addressing Function Approximation Error in Actor-Critic Methods", International Conference on Machine Learning, pp. 1582-1591, February.
[6] Zeynab Talebpour, Alcherio Martinoli, 2018, "Risk-Based Human-Aware Multi-Robot Coordination in Dynamic Environments Shared with Humans", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3365-3372.
[7] Qingyang Tan, Tingxiang Fan, Jia Pan, Dinesh Manocha, 2020, "DeepMNavigate: Deep Reinforced Multi-Robot Navigation Unifying Local & Global Collision Avoidance", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA (Virtual), October 25-29.
[8] Xiaoyun Lei, Zhian Zhang, Peifang Dong, 2018, "Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning", Journal of Robotics, vol. 2018, September.
[9] Matej Dobrevski, Danijel Skočaj, 2020, "Adaptive Dynamic Window Approach for Local Navigation", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 6930-6936.
[10] Daniel Zhang, Colleen P. Bailey, 2020, "Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping", arXiv: 2003.12863, March.
[11] Anton Maximilian Schäfer, 2008, "Reinforcement Learning with Recurrent Neural Networks", doctoral dissertation, University of Osnabrück, Institute for Computer Science, Neuroinformatics Group.
[12] Lu Wang, Wei Zhang, Xiaofeng He, Hongyuan Zha, 2018, "Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation", Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2447-2456, July.
[13] Steven Kapturowski, Georg Ostrovski, John Quan, Rémi Munos, Will Dabney, 2019, "Recurrent Experience Replay in Distributed Reinforcement Learning", International Conference on Learning Representations, January.
[14] Dynamic Programming, available from: https://en.wikipedia.org/wiki/Dynamic_programming.
[15] Supervised Learning, available from: https://en.wikipedia.org/wiki/Supervised_learning.
[16] DeepMind AlphaGo, available from: https://deepmind.com/research/case-studies/alphago-the-story-so-far.
[17] Microsoft Bonsai, available from: https://docs.microsoft.com/en-us/bonsai/product.
[18] What is the difference between model-based and model-free reinforcement learning?, available from: https://www.quora.com/What-is-the-difference-between-model-based-and-model-free-reinforcement-learning.
[19] RL survey, available from: https://github.com/AI4Finance-LLC/ElegantRL/blob/master/figs/RL_survey_2020.pdf.
[20] Mean Squared Error, available from: https://en.wikipedia.org/wiki/Mean_squared_error.
[21] Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra, 2016, "Continuous Control with Deep Reinforcement Learning", arXiv: 1509.02971, September.
[22] Hado van Hasselt, 2010, "Double Q-learning", Advances in Neural Information Processing Systems 23, pp. 2613-2621.
[23] Markov Decision Process, available from: https://en.wikipedia.org/wiki/Markov_decision_process.
[24] Matthew F. Dixon, Igor Halperin, Paul Bilokon, 2020, "Inverse Reinforcement Learning and Imitation Learning", Machine Learning in Finance, Springer, Cham.
[25] Felipe Codevilla, Eder Santana, Antonio Lopez, Adrien Gaidon, 2019, "Exploring the Limitations of Behavior Cloning for Autonomous Driving", IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9328-9337.
[26] ROBOTIS TurtleBot3, available from: https://emanual.robotis.com/docs/en/platform/turtlebot3/overview.
[27] turtlebot3_simulations open-source code, available from: https://github.com/ROBOTIS-GIT/turtlebot3_simulations.
[28] Shreyansh Daftry, J. Andrew Bagnell, Martial Hebert, 2016, "Learning Transferable Policies for Monocular Reactive MAV Control", arXiv: 1608.00627, August.
[29] Internet of Vehicles (車聯網), available from: https://zh.wikipedia.org/wiki/%E8%BB%8A%E8%81%AF%E7%B6%B2.