|
[1] A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, "Improving language understanding by generative pre-training," 2018. [2] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016, pp. 779-788. [3] A. Vaswani et al., "Attention is all you need," Advances in neural information processing systems, vol. 30, 2017. [4] A. Ramesh, P. Dhariwal, A. Nichol, C. Chu, and M. Chen, "Hierarchical text-conditional image generation with clip latents," arXiv:2204.06125 [cs.CV], 2022. [5] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, "High-resolution image synthesis with latent diffusion models," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10684-10695. [6] A. Kirillov et al., "Segment anything," arXiv:2304.02643[cs.CV], 2023. [7] OpenAI, "GPT-4 Technical Report," arXiv:2303.08774 [cs.CL], 2023. [8] C.-Y. Wang, A. Bochkovskiy, and H.-Y. M. Liao, "YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 7464-7475. [9] Q. Yu et al., "k-means Mask Transformer," in Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXIX, 2022: Springer, pp. 288-307. [10] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016, pp. 770-778. [11] A. Kirillov, K. He, R. Girshick, C. Rother, and P. Dollár, "Panoptic segmentation," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 9404-9413. [12] 張家銘. "最新的物件偵測王者 YOLOv7 介紹." https://medium.com/ai-academy-taiwan/%E6%9C%80%E6%96%B0%E7%9A%84%E7%89%A9%E4%BB%B6%E5%81%B5%E6%B8%AC%E7%8E%8B%E8%80%85-yolov7-%E4%BB%8B%E7%B4%B9-206c6adf2e69 (accessed June 7th, 2023). [13] 秋庭伸也、杉山阿聖、寺田學, 零基礎入門的機器學習圖鑑. 采實文化, 2021, pp. 64-72. [14] 維基百科. "范德蒙矩陣." https://zh.wikipedia.org/zh-tw/%E8%8C%83%E5%BE%B7%E8%92%99%E7%9F%A9%E9%99%A3 (accessed June 7th, 2023). [15] Kukil. "Mean Average Precision (mAP) in Object Detection." https://learnopencv.com/mean-average-precision-map-object-detection-model-evaluation-metric/ (accessed June 7th, 2023). [16] T.-Y. Lin et al., "Microsoft coco: Common objects in context," in Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, 2014: Springer, pp. 740-755. [17] Nitfianr. "Python中使用YOLOv7进行实例分割以及Detectron2的使用." https://news.sangniao.com/p/4277861659 (accessed June 7th, 2023).
|