[논문]딥러닝 기반의 의료 OCR 기술 동향

윤성연; 최아린; 김채원; 오수민; 손서영; 김지연; 이현희; 한명은; 박민서

doi:10.17703/jcct.2024.10.2.453

딥러닝 기반의 의료 OCR 기술 동향
Trends in Deep Learning-based Medical Optical Character Recognition 원문보기

Journal of the convergence on culture technology : JCCT = 문화기술의 융합, v.10 no.2, 2024년, pp.453 - 458

윤성연 (서울여자대학교 데이터사이언스학과) , 최아린 (서울여자대학교 데이터사이언스학과) , 김채원 (서울여자대학교 데이터사이언스학과) , 오수민 (서울여자대학교 데이터사이언스학과) , 손서영 (서울여자대학교 데이터사이언스학과) , 김지연 (서울여자대학교 디지털미디어학과) , 이현희 (서울여자대학교 데이터과학전공) , 한명은 (서울여자대학교 데이터과학전공) , 박민서 (서울여자대학교 데이터사이언스학과)

초록
AI-Helper

광학 문자 인식(Optical Character Recognition, OCR)은 이미지 내의 문자를 인식하여 디지털 포맷(Digital Format)의 텍스트로 변환하는 기술이다. 딥러닝(Deep Learning) 기반의 OCR이 높은 인식률을 보여줌에 따라 대량의 기록 자료를 보유한 많은 산업 분야에서 OCR을 활용하고 있다. 특히, 의료 산업 분야는 의료 서비스 향상을 위해 딥러닝 기반의 OCR을 적극 도입하였다. 본 논문에서는 딥러닝 기반 OCR 엔진(Engine) 및 의료 데이터에 특화된 OCR의 동향을 살펴보고, 의료 OCR의 발전 방향에 대해 제시한다. 현재의 의료 OCR은 검출한 문자 데이터를 자연어 처리(Natural Language Processing, NLP)하여 인식률을 개선하였다. 그러나, 정형화되지 않은 손글씨(Handwriting)나 변형된 문자에서는 여전히 인식 정확도에 한계를 보였다. 의료 데이터의 데이터베이스(Database)화, 이미지 전처리(Pre-processing), 특화된 자연어 처리를 통해 더욱 고도화된 의료 OCR을 발전시키는 것이 필요하다.

Abstract ▼ AI-Helper

Optical Character Recognition is the technology that recognizes text in images and converts them into digital format. Deep learning-based OCR is being used in many industries with large quantities of recorded data due to its high recognition performance. To improve medical services, deep learning-based OCR was actively introduced by the medical industry. In this paper, we discussed trends in OCR engines and medical OCR and provided a roadmap for development of medical OCR. By using natural language processing on detected text data, current medical OCR has improved its recognition performance. However, there are limits to the recognition performance, especially for non-standard handwriting and modified text. To develop advanced medical OCR, databaseization of medical data, image pre-processing, and natural language processing are necessary.

주제어

참고문헌 (34)

J. Memon, M. Sami, R. A. Khan, and M. Uddin,？"Handwritten Optical Character Recognition？(OCR): A Comprehensive Systematic Literature？Review (SLR)," IEEE Access, Vol. 8, pp.？142642-142668, 2020. DOI:10.1109/ACCESS.2020.3012542

상세보기
Z. Raisi, M. A. Naiel, P. Fieguth, S. Wardell,？and J. Zelek, "Text detection and recognition in？the wild: A review," arXiv preprint？arXiv:2006.04305, 30 Jun 2020. DOI:？10.48550/arXiv.2006.04305
S. Ahn, H. Hwang, and J. Hee, "A Case Study？on the Application of AI-OCR for Data？Transformation of Paper Records," Journal of？Information Management Society, Vol. 39, No. 3,？pp. 165-193, 2022. DOI:10.3743/KOSIM.2022.3.165
G. Min, A. Lee, K. S. Kim, J. E. Kim, H. S.？Kang, and G. H. Lee, "Recent Trends in Deep？Learning-Based Optical Character Recognition,"？Electronics and Telecommunications Trends,？Vol. 37, No. 5, pp. 22-32, Oct 2022. DOI:？10.22648/ETRI.2022.J.370503

원문보기 상세보기
D. Gifu, "AI-backed OCR in Healthcare,"？Procedia Computer Science, Vol. 207, No. 2, pp.？1134-1143, Oct 2022. DOI: 10.1016/j.procs.2022.09.169

상세보기
J. M. Park, S. K, Choi, J. Y. Kim, S. H. Jung,？and C. B. Sim, "Implementation of a Drug？Information Retrieval System Through OCR API？pErformance Comparison," The J ournal of The？Korea Institute of Electronic Communication？Sciences (KIECS), Vol. 18, No. 5, pp. 989-998,？31 Oct 2023. DOI: 10.13067/JKIECS.2023.18.5.989

원문보기 상세보기
E. Hsu, I. Malagaris, Y. F. Kuo, R. Sultana, and？K. Roberts, "Deep learning-based NLP data？pipeline for EHR-scanned document information？extraction," JAMIA open, Vol. 5, No. 2, pp.？1-12, 2022. DOI: 10.1093/jamiaopen/ooac045

상세보기
W. A. Qader and M. M. Ameen, "Diagnosis of？Diseases from Medical Check-up Test Reports？Using OCR Technology with BoW and？AdaBoost algorithms," In 2019 International？Engineering Conference (IEC), pp. 205-210,？23-25 June 2019. DOI: 10.1109/IEC47844.2019.8950605
W. A. J. R. Silva, H. M. K. Shirantha, L. J. M.？V. N. Balalla, R. A. D. V. K., N.？Kuruwitaarachchi, and D. Kasthurirathna,？"Predicting Diabetes Mellitus Using Machine？Learning and Optical Character Recognition," In？2021 6th International Conference for？Convergence in Technology (I2CT), pp. 1-6,？02-04 Apr 2021. DOI: 10.1109/I2CT51068.2021.9417941
N. Lee, M. Jeong, Y. Kim, J. Shin, I. Joe, S.？Jeon, and B. Ko, "IoT-based Architecture and？Implementation for Automatic Shock Treatment,"？KSII Transactions on Internet and Information？Systems, Vol. 16, No. 7, pp. 2209-2224, June？2022. DOI: 10.3837/tiis.2022.07.005

원문보기 상세보기
P. Batra, N. Pulkit, D. Kurmi, J. Tembhurne,？P. Sahare, and T. Diwan, "OCR-MRD:？performance analysis of different optical？character recognition engines for medical report？digitization," International Journal of Information？Technology, Vol. 16, No.1, pp. 447-455, 24？November 2023. DOI:10.21203/rs.3.rs-2513255/v1
S. Tangkawanit, J. Pooksook, J. Ieamsaard, and？P. Sornkhom, "OCR Application for Cancer？Care," 2022 Asia-Pacific Signal and Information？Processing Association Annual Summit and？Conference (APSIPA ASC), 7-10 November？2022. DOI:10.23919/APSIPAASC55919.2022.9980078
L. Drukker, R. Droste, C. Ioannou, L. Impey, J.？A. Noble, and A. T. Papageorghiou, "Function？and safety of SlowFlowHD ultrasound Doppler？in obstetrics," Ultrasound in Medicine &？Biology, Vol. 48, No. 6, pp. 1157-1162. DOI:？10.1016/j.ultrasmedbio.2022.02.012

상세보기
R. Smith, "An overview of the Tesseract OCR？engine," In Proceedings of the Ninth？International Conference on Document Analysis？and Recognition (ICDAR 2007), Vol. 2, pp.？629-633, 23-26 Sep 2007.
EasyOCR, JaidedAI, 2023. Available online:？https://github.com/JaidedAI/EasyOCR (accessed？on 20 February 2024)
H. Feng, Y. Wang, W. Zhou, J. Deng, and H.？Li, "Doctr: Document image transformer for？geometric unwarping and illumination？correction," Proceedings of the 29th ACMInternational Conference on Multimedia, pp.？273-281, October 2021. DOI:10.48550/arXiv.2110.12942
Keras-ocr, 2019. Available online: https://keras-ocr.readthedocs.io/en/latest/ (accessed on 20？February 2024)
Naver CLOVA OCR, 2023. Available online:？https://clova.ai/ocr/?langko (accessed on 20 Feb？ruary 2024)
Cloud Vision API, Detect text in images, 2023.？Available online: https://cloud.google.com/vision/docs/ocr?hlko (accessed on 20 February 2024)
Y. Wang, M. Huang, L. Zhao, and X. Zhu,？"Attention-based LSTM for aspect-level？sentiment classification," Proceedings of the？2016 Conference on Empirical Methods in？Natural Language Processing, pp. 606-315,？January 2016. DOI:10.18653/v1/D16-1058
Y. Baek, B. Lee, D. Han, S. Yun, and H Lee,？"Character region awareness for text detection,"？In Proceedings of the IEEE/CVF conference on？computer vision and pattern recognition, pp.？9365-9374, DOI: 10.48550/arXiv.1904.01941
A. Graves, S. Fernandez, F. Gomez, and J.？Schmidhuber, "Connectionist temporal？classification: labelling unsegmented sequence？data with recurrent neural networks, Proceedings of the 23rd international conference？on Machine learning, pp. 369-376, January 2006.？DOI:10.1145/1143844.1143891
A. Chaurasia and E. Culurciello, "Linknet:？Exploiting encoder representations for efficient？semantic segmentation," In 2017 IEEE visual？communications and image processing (VCIP),？pp. 1-4, 10-13 December 2017. DOI:？10.1109/VCIP.2017.8305148
M. Liao, Z. Wan, C. Yao, K. Chen, and X. Bai,？"Real-time scene text detection with differentiable？binarization," In Proceedings of the AAAI？conference on artificial intelligence, Vol. 34, No. 7,？pp. 11474-11481, April 2020. DOI:？10.1609/aaai.v34i07.6812

상세보기
B. Shi, X. Bai, and C. Yao, "An end-to-end？trainable neural network for image-based？sequence recognition and its application to scene？text recognition," IEEE transactions on pattern？analysis and machine intelligence, Vol. 39, No. 11,？pp. 2298-2304, November 2017. DOI:？10.1109/TPAMI.2016.2646371

상세보기
H. Li, P. Wang, C. Shen, and G. Zhang, "Show,？attend and read: A simple and strong baseline for？irregular text recognition," In Proceedings of the？AAAI conference on artificial intelligence, Vol. 33,？No. 01, pp. 8610-8617, July 2019. DOI:？10.1609/aaai.v33i01.33018610

상세보기
L. Ning, Y. Wenwen, Q. Xianbiao, C. Yihao, G.？Ping, X. Rong, and B. Xiang, "Master:？Multi-aspect non-local network for scene text？recognition," Pattern Recognition, 15 April 2021.？DOI:10.1016/j.patcog.2021.107980

상세보기
J. Devlin, M. W. Chang, K. Lee, K. Toutanova,？"BERT: Pre-training of Deep Bidirectional？Transformers for Language Understanding,"？arXiv preprint arXiv:1810.04805, 2018. DOI:？10.48550/arXiv.1810.04805
E. Alsentzer, J. R. Murphy, W. Boag, W. H.？Weng, D. Jin, T. Naumann, and M. B. A.？McDermott, "Publicly available clinical BERT？embeddings," In Proceedings of the 2nd Clinical？Natural Language Processing (ClinicalNLP)？Workshop in North American Chapter of the？Association for Computational Linguistics？(NAACL), 2019. DOI: 10.48550/arXiv.1904.03323
J. Polpinij and A. K. Ghose, "An？ontology-based sentiment classification？methodology for online consumer reviews," In？2008 IEEE/WIC/ACM International Conference？on Web Intelligence and Intelligent Agent？Technology, Vol. 1, pp. 518-524, 09-12 Dec 2008.？DOI: 10.1109/WIIAT.2008.68
P. Viola and M. Jones, "Rapid object detection？using a boosted cascade of simple features," In？Proceedings of the 2001 IEEE Computer？Society Conference on Computer Vision and？Pattern Recognition (CVPR), Vol. 1, 8-14？December 2001. DOI: 10.1109/CVPR.2001.990517
S. Yoon and M. Park, "Media-based Analysis？of Gasoline Inventory with Korean Text？Summarization," The Journal of the？Convergence on Culture Technology (JCCT),？Vol. 9, No. 5, pp. 509-515, Oct 2023. DOI:？10.17703/JCCT.2023.9.5.509

원문보기 상세보기
H. S. Lee, "Rearch of Late Adolcent Activity？based on Using Big Data Analysis," The？International J ournal of Advanced Culture？Technology (IJACT), Vol. 10, No. 4, pp. 361-368, Dec 2022. DOI: 10.17703/IJACT.2022.10.4.361

원문보기 상세보기
M. R. Segal, "Machine learning benchmarks？and random forest regression," Center for？Bioinformatics and Molecular Biostatistics, 2004.？

저자의 다른 논문 :

표제어: PCR

동의어: Packet Collision Rate

용어 설명 출처 목록 (6)

용어 설명: PCR은 세균 특이성이 있는 primer를 이용하여 적은 수의 세균이 있을지라도 쉽게 검출할 수 있는 유용한 방법이며, 이를 이용하여 구강 내 치면세균막이나 타액에서 직접 세균을 검출할 수 있게 되었다[8].

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증