臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.103) 您好！臺灣時間：2025/06/20 09:23

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
目次
參考文獻
電子全文
QR Code

本論文永久網址:

研究生:

朱世傑

研究生(外文):

ZHU,SHI-JIE

論文名稱:

使用二維卷積激勵框架與注意力長短期記憶模型於行為辨識

論文名稱(外文):

A Distilled 2D CNN-LSTM Framework with Temporal Attention Mechanism for Action Recognition

指導教授:

陳洳瑾

指導教授(外文):

CHEN,JU-CHIN

口試委員:

林威成、陳朝鈞、楊孟翰、梁廷宇

口試委員(外文):

LIN,WEI-CHENG、CHEN,CHAO-CHUN、YANG,MENG-HAN、LIANG,TYNG-YEU

口試日期:

2021-08-13

學位類別:

碩士

校院名稱:

國立高雄科技大學

系所名稱:

資訊工程系

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2021

畢業學年度:

109

語文別:

中文

論文頁數:

中文關鍵詞:

行為辨識、激勵框架、長短期記憶、注意力機制

外文關鍵詞:

Action Recognition、Distilled Framework、Long Short-Term Memory、Temporal Attention

相關次數:

被引用:0
點閱:184
評分:
下載:0
書目收藏:0

[1]R. Gao, T. H. Oh, K. Grauman, and L. Torresani, “Listen to look: Action recognition by previewing audio,” CVPR, pp. 10457-10467, 2020.
[2]J. Carreira and A. Zisserman, “Quo vadis, action recognition? A new model and the kinetics dataset,” CVPR, pp. 4724-4733, 2017.
[3]J. Donahue, L.A. Hendricks, M. Rohrbach, S. Venugopalan, S. Guadarrama, K. Saenko, and T. Darrell, “Long-term recurrent convolutional networks for visual recognition and description,” CVPR, 2017.
[4]Y.H. Ng, M. Hausknecht, S. Vijayanarasimhan, O. Vinyals, R. Monga, and G. Toderici, “Beyond short snippets: deep networks for video classification,”CVPR, 2015.
[5]Z. Qiu, T. Yao, and T. Mei, “Learning spatio temporal representation with pseudo3d residual networks,” ICCV, pp. 5534-5542, 2017.
[6]G. Thung and H. Jiang, “A torch library for action recognition and detection using CNNs and LSTMs,” 2016.
[7]D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “Learning spatiotemporal features with 3d convolutional networks,” CVPR, pp. 4489-4497, 2015.
[8]S. Ji, W. Xu, M. Yang, and K. Yu, “3D convolutional neural networks for human action recognition.,” TPAMI, Vol.35, No.1, pp. 221-231, 2012.
[9]D. Tran, H. Wang, L. Torresani, J. Ray, Y. LeCun, and M. Paluri, “A closer look at spatiotemporal convolutions for action recognition,” CVPR, pp. 6450-6459, 2018.
[10]K. Simonyan and A. Zisserman, “Two-stream convolutional networks for action recognition in videos,” NIPS, 2014.
[11]C. Feichtenhofer, A. Pinz, and A. Zisserman, “Convolutional two-stream network fusion for video action recognition,” CVPR, pp. 1933-1941, 2016.
[12]L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool, “Temporal segment networks: Towards good practices for deep action recognition,” CVPR, pp. 20-36, 2016.
[13]S. Karaman, L. Sei denari, and A.D. Bimbo, “Fast saliency based pooling of Fisher encoded dense trajectories,” ECCV, 2014.
[14]D. Oneata, J. Verbeek, and C. Schmid, “The LEAR submission at Thumos 2014,” ECCV, 2014.
[15]L. Wang, Y. Yu Qiao, and X. Tang, “Action recognition and detection by combining motion and appearance features,” ECCV, 2014.
[16]R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” CVPR, 2014.
[17]S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards real time object detection with region proposal networks” NIPS, pp. 91-99, 2015.
[18]Y. Xiong, Y. Zhao, L. Wang, D. Lin, and X. Tang, “A pursuit of temporal accuracy in general activity detection,” arXiv:1703.02716, 2017.
[19]R. Hou, C. Chen, and M. Shah, “Tube convolutional neural network (T-CNN) for action detection in videos,” ICCV, 2017.
[20]T. Lin, X. Liu, X. Li, E. Ding, and S. Wen, “Bmn: Boundary-matching network for temporal action proposal generation,” ICCV, pp. 3889-3898, 2019.
[21]Z. Shou, D. Wang, and S.F. Chang, “Temporal action localization in untrimmed videos via multi-stage cnns,” CVPR, pp. 1049-1058, 2016.
[22]V. Escorcia, F. Heilbron, J. Niebles, and B. Ghanem, “DAPs: deep action Proposals for Action Understanding,” ECCV, pp. 768-784, 2016.
[23]A. Montes, A. Salvador, and X. Nieto, “Temporal activity detection in untrimmed videos with recurrent neural networks,” arXiv:1608.08128, 2016.
[24]B. Singh, T. Marks, M. Jones, O. Tuzel, and M. Shao, “A Multi stream bidirectional recurrent neural network for fine-grained action detection,” CVPR, 2016.
[25]S. Yeung, O. Russakovsky, G. Mori, and L. FeiFei, “End-to-end learning of action detection from frame glimpses in videos,” CVPR, pp. 2678-2687, 2016.
[26]S. Ma, L. Sigal, and S. Sclaroff, “Learning activity progression in LSTMs for activity detection and early detection,” CVPR, pp. 1942-1950, 2016.
[27]S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Comput, Vol.9, No.8, pp. 1735-1780, 1997.
[28]National Taiwan University Ph.D Hung-yi Lee Official website, https://speech.ee.ntu.edu.tw/~hylee/.
[29]V. Mnih, N. Heess, and A. Graves, “Recurrent models of visual attention,” NIPS, pp. 2204-2212, 2014.
[30]D. Bandanna, K. Cho, and Y. Bengio, “Neural machine translation by jointly learning to align and translate,” arXiv: 1409.0473, 2014.
[31]A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, and I. Polosukhin, “Attention is all you need,” NIPS, pp. 5998-6008, 2017.
[32]A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, and N. Houlsby, “An image is worth 16x16 words: Transformers for image recognition at scale,” arXiv: 2010.11929, 2020.
[33]pytorch Official website, https://pytorch.org/
[34]K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” CVPR, pp. 770-778, 2016.
[35]K. Soomro, A.R. Zamir, and M. Shah, “UCF101: A dataset of 101 human actions classes from videos in the wild,” arXiv: 1212.0402, 2012.
[36]F. Caba Heilbron, V. Escorcia, B. Ghanem, and J. Carlos Niebles, “Activitynet: A large-scale video benchmark for human activity understanding,” CVPR, pp. 961-970, 2015.
[37]H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre, “HMDB: a large video database for human motion recognition,” ICCV, 2011.
[38]Kingma, P. Diederik, and B. Jimmy. "Adam: A method for stochastic optimization," arXiv:1412.6980, 2014.

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	卷積注意力機制長短期記憶深度學習用於軸承剩餘可用壽命預估
2.	使用影片與慣性測量單元的牛隻行為辨識深度學習演算法研發
3.	透過強化視覺問答模型進行微影製程熱點偵測之佈局圖案擷取
4.	基於注意力機制之階層式強化學習應用於晶圓測試路徑規劃
5.	以注意力機制結合強化學習實現多代理人合作平台
6.	應用深度學習與注意力機制分析網路社群文本之情緒：以日本動畫社群Twitter資料為例
7.	具注意力機制的多模式神經網路跨年齡臉部驗證
8.	基於OpenPose骨架及LSTM/GRU模型之跌倒檢測研究
9.	應用注意力機制於深度學習之行為辨識
10.	使用骨架序列的走路姿態之身份辨識
11.	基於注意力機制長短期記憶深度學習之機器剩餘可用壽命預估
12.	基於長短期記憶深層學習方法之動作辨識

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室