[Paper] Structure Recognition Method of Invoice Document Image for Document Processing Automation

이동석 (Lee, D. S.); 권순각 (Kwon, S. K.)

doi:10.9723/jksiis.2023.28.2.011

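Abstract

This paper proposes a document structure recognition method for applying document processing automation to invoice document images, together with a method for outputting the recognition result as a spreadsheet document. A deep-learning OCR engine yields the word blocks in the document and the character recognition result for each block. Word blocks lying in the same row and in the same column are detected from the position information of the blocks. The document area is then divided using the arrangement information of the word blocks. Based on the document structure obtained from this region information, the character recognition results are entered at the appropriate positions in a spreadsheet. In experiments, item placement by the proposed method achieves an average accuracy of 92.30%.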

Abstract (English original)

In this paper, we propose methods for recognizing the structure of invoice documents and for generating a spreadsheet document from the result. The text and location of each word block are obtained from a deep-learning optical character recognition engine. Word blocks in the same row and in the same column are found from their coordinates. The document area is divided using the arrangement information of the word blocks. The character recognition results are entered into the spreadsheet according to the document structure. In simulation results, item placement by the proposed method shows an average accuracy of 92.30%.
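The row and column grouping summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `WordBlock` type, the function names, the 50% vertical-overlap threshold, and the 10-pixel left-edge tolerance are all assumptions made for illustration, and only left-aligned columns are handled here.

```python
from dataclasses import dataclass


@dataclass
class WordBlock:
    """Hypothetical OCR output: recognized text plus a bounding box in pixels."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def group_rows(blocks, min_overlap=0.5):
    """Group blocks into rows when their vertical extents overlap enough."""
    rows = []
    for b in sorted(blocks, key=lambda blk: blk.y):
        if rows:
            top = min(r.y for r in rows[-1])
            bottom = max(r.y + r.h for r in rows[-1])
            overlap = min(bottom, b.y + b.h) - max(top, b.y)
            if overlap >= min_overlap * b.h:
                rows[-1].append(b)
                continue
        rows.append([b])
    return rows


def group_columns(blocks, tol=10.0):
    """Group blocks into columns when their left edges lie within `tol` pixels."""
    cols = []
    for b in sorted(blocks, key=lambda blk: blk.x):
        # Compare against the first block of the column to avoid drift.
        if cols and abs(b.x - cols[-1][0].x) <= tol:
            cols[-1].append(b)
        else:
            cols.append([b])
    return cols


def to_grid(blocks, tol=10.0):
    """Place each block's text in a (row, column) cell of a 2-D list."""
    rows = group_rows(blocks)
    cols = group_columns(blocks, tol)
    col_of = {id(b): c for c, col in enumerate(cols) for b in col}
    grid = [["" for _ in cols] for _ in rows]
    for r, row in enumerate(rows):
        for b in sorted(row, key=lambda blk: blk.x):
            c = col_of[id(b)]
            grid[r][c] = (grid[r][c] + " " + b.text).strip()
    return grid


blocks = [
    WordBlock("Item", 10, 10, 40, 12),
    WordBlock("Qty", 200, 11, 30, 12),
    WordBlock("Price", 300, 10, 40, 12),
    WordBlock("Widget", 10, 40, 50, 12),
    WordBlock("2", 202, 41, 10, 12),
    WordBlock("9.99", 301, 40, 30, 12),
]
grid = to_grid(blocks)
print(grid)
```

The resulting `grid` is a plain list of rows that could be written to a spreadsheet cell by cell. Center-aligned and right-aligned columns, which the figure captions mention, would need the same comparison applied to block centers and right edges.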

Tables/Figures (13)

Fig. 1 Table structure types: (a) table with dividing lines and (b) table without dividing lines
Fig. 2 Flow of proposed method
Fig. 3 Word block detection result
Fig. 4 Detection of word blocks on a column: (a) left aligned, (b) center aligned, and (c) right aligned
Fig. 5 Detection results of word blocks on same column
Fig. 6 Detection of column items: (a) detection based on horizontal alignment, (b) detection based on line space, and (c) histogram of line spaces
Fig. 7 Row item detection: (a) detection based on y-coordinate, (b) y-coordinate collision between two word blocks, and (c) detection based on y-coordinate collision
Fig. 8 Document division result
Fig. 9 Generated spreadsheet document through proposed method
Fig. 10 Invoice document image dataset for simulation: (a) samples of electronic invoices and (b) RVL-CDIP dataset
Fig. 11 Result of spreadsheet document generation: (a) original document image, (b) based on top-left point coordinates of word blocks, and (c) by applying proposed method
Fig. 12 Wrong word block placement due to rotated document image
Table 1 Accuracies of word block placement
