Responsive image
博碩士論文 etd-0623124-232650 詳細資訊
Title page for etd-0623124-232650
論文名稱
Title
多任務機器學習圖形辨識於藥物辨識之研究
Drug Identification Using Multi-tasking Learning
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
31
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2024-07-11
繳交日期
Date of Submission
2024-07-23
關鍵字
Keywords
多任務學習、卷積神經網路、遷移學習、ResNeSt、Grad-CAM、LIME
Multi-Task Learning, Convolutional Neural Networks, Transfer Learning, ResNeSt, Grad-CAM, LIME
統計
Statistics
本論文已被瀏覽 378 次,被下載 5
The thesis/dissertation has been browsed 378 times, has been downloaded 5 times.
中文摘要
在台灣,藥盒外部必須清楚標示藥品成分及注意事項,以保障用藥安全。然而,感冒藥的外包裝經常因損壞而導致辨識困難。為解決此問題,本研究提出利用影像辨識技術,通過多任務學習模型同時識別藥物類別和名稱。
本研究旨在開發一個基於多任務學習(Multi-Task Learning)的影像辨識模型,以提升藥物辨識的準確性和效率。實驗設計並訓練了三種模型:藥物類別辨識模型、藥物名稱辨識模型和多任務學習綜合辨識模型。實驗結果顯示,藥物類別辨識模型在辨識上具備極高的準確性和精確度。然而,藥物名稱辨識模型在準確率方面有待進一步提高。
應用多任務學習技術的綜合辨識模型展示了在同時辨識藥物類別和名稱方面的顯著優勢。其結果顯示,在多任務環境下,模型能夠更有效地共享資源和特徵,提升了藥名識別的整體表現。
綜上所述,本研究所開發的藥物辨識模型在藥物類別和名稱分類上均取得了較顯著的性能提升。單任務模型在特定任務上表現優異,而多任務模型則在綜合任務場景中展現了更大的潛力,尤其在提升藥名識別準確性上有明顯改進。未來的工作將進一步優化訓練資料和模型算法,以提升模型的泛化能力和實際應用效果,並為多樣的藥物辨識需求提供全面的技術支持。
Abstract
In Taiwan, the outer packaging of medication boxes must clearly indicate the drug's ingredients and precautions to ensure safe usage. However, the packaging of cold medications often gets damaged, making identification difficult. To address this issue, this study proposes using image recognition technology through a multi-task learning model to simultaneously identify drug categories and names.
This research aims to develop an image recognition model based on Multi-Task Learning (MTL) to enhance the accuracy and efficiency of drug identification. Three models were designed and trained: a drug category identification model, a drug name identification model, and a comprehensive multi-task learning model. The experimental results show that the drug category identification model exhibits extremely high accuracy and precision in identification. However, the drug name identification model requires further improvement in accuracy.
The comprehensive identification model utilizing multi-task learning technology demonstrates significant advantages in simultaneously identifying drug categories and names. The results indicate that in a multi-task environment, the model can effectively share resources and features, enhancing the overall performance of drug name recognition.
In conclusion, the drug identification model developed in this study achieved significant performance improvements in the classification of drug categories and names. The single-task models performed excellently in specific tasks, while the multi-task model displayed greater potential in comprehensive task scenarios, especially in improving the accuracy of drug name recognition. Future work will further optimize the training data and model algorithms to enhance the model's generalization ability and practical application effect, providing comprehensive technical support for various drug identification needs.
目次 Table of Contents
論文審定書 i
誌謝 ii
摘要 iii
Abstract iv
目錄 v
圖次 vii
表次 viii
第一章 緒論 1
第一節 研究背景 1
第二節 研究動機 2
第三節 研究目的 3
第二章 文獻探討 4
第一節 分類模型 4
第二節 卷積神經網路 4
第三節 遷移學習 5
第四節 數據增強 6
第五節 多任務學習 7
第三章 研究方法與步驟 8
第一節 研究流程 8
第二節 研究方法 9
第三節 評估模型與標準 11
第四章 研究結果與分析 13
第一節 資料來源蒐集 13
第二節 資料清理 13
第三節 評估模型 14
第四節 解釋模型 15
第五章 研究結論與建議 19
第一節 研究結論 19
第二節 研究限制與建議 19
參考文獻 21

參考文獻 References
Caruana, R. (1997). Multitask Learning. Machine Learning, 28(1), 41–75. https://doi.org/10.1023/A:1007379606734
He, K., Zhang, X., Ren, S., & Sun, J. (2015). Deep Residual Learning for Image Recognition (arXiv:1512.03385). arXiv. https://doi.org/10.48550/arXiv.1512.03385
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017). ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90. https://doi.org/10.1145/3065386
Perez, L., & Wang, J. (2017). The Effectiveness of Data Augmentation in Image Classification using Deep Learning (arXiv:1712.04621). arXiv. https://doi.org/10.48550/arXiv.1712.04621
PRIDE政策研究指標資料庫. (n.d.). Retrieved June 12, 2024, from https://pride.stpi.narl.org.tw/index
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why Should I Trust You?”: Explaining the Predictions of Any Classifier (arXiv:1602.04938). arXiv. https://doi.org/10.48550/arXiv.1602.04938
Schmidt, R. M. (2019). Recurrent Neural Networks (RNNs): A gentle Introduction and Overview (arXiv:1912.05911). arXiv. https://doi.org/10.48550/arXiv.1912.05911
Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2020). Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. International Journal of Computer Vision, 128(2), 336–359. https://doi.org/10.1007/s11263-019-01228-7
Xie, S., Girshick, R., Dollár, P., Tu, Z., & He, K. (2017). Aggregated Residual Transformations for Deep Neural Networks (arXiv:1611.05431). arXiv. https://doi.org/10.48550/arXiv.1611.05431
Zhai, S., Cheng, Y., Lu, W., & Zhang, Z. (2016). Doubly Convolutional Neural Networks (arXiv:1610.09716). arXiv. https://doi.org/10.48550/arXiv.1610.09716
Zhang, H., Wu, C., Zhang, Z., Zhu, Y., Lin, H., Zhang, Z., Sun, Y., He, T., Mueller, J., Manmatha, R., Li, M., & Smola, A. (2020). ResNeSt: Split-Attention Networks (arXiv:2004.08955). arXiv. https://doi.org/10.48550/arXiv.2004.08955

電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:校內校外完全公開 unrestricted
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code