[논문]ChatGPT 및 거대언어모델의 추론 능력 향상을 위한 프롬프트 엔지니어링 방법론 및 연구 현황 분석

박상언; 강주영

doi:10.13088/jiis.2023.29.4.287

ChatGPT 및 거대언어모델의 추론 능력 향상을 위한 프롬프트 엔지니어링 방법론 및 연구 현황 분석
Analysis of Prompt Engineering Methodologies and Research Status to Improve Inference Capability of ChatGPT and Other Large Language Models 원문보기

지능정보연구 = Journal of intelligence and information systems, v.29 no.4, 2023년, pp.287 - 308

박상언 (경기대학교 산업경영정보공학과) , 강주영 (아주대학교 e비즈니스학부)

초록
AI-Helper

ChatGPT는 2022년 11월에 서비스를 시작한 후 급격하게 사용자 수가 늘어나며 인공지능의 역사에서 큰 전환점을 가져올 정도로 사회 곳곳에 많은 영향을 미치고 있다. 특히 ChatGPT와 같은 거대언어모델의 추론 능력은 프롬프트 엔지니어링 기법을 통해 빠른 속도로 그 성능이 발전하고 있다. 인공지능을 워크플로우에 도입하려고 하는 기업이나 활용하려고 하는 개인에게 이와 같은 추론 능력은 중요한 요소로 고려될 수 있다. 본 논문에서는 거대언어모델에서 추론을 가능하게 한 문맥내 학습에 대한 이해를 시작으로 하여 프롬프트 엔지니어링의 개념과 추론 유형 및 벤치마크 데이터에 대해 설명하고, 이를 기반으로 하여 최근 거대언어모델의 추론 성능을 급격히 향상시킨 프롬프트 엔지니어링 기법들에 대해 조사하고 발전과정과 기법들 간의 연관성에 대해 상세히 알아보고자 한다.

Abstract ▼ AI-Helper

After launching its service in November 2022, ChatGPT has rapidly increased the number of users and is having a significant impact on all aspects of society, bringing a major turning point in the history of artificial intelligence. In particular, the inference ability of large language models such as ChatGPT is improving at a rapid pace through prompt engineering techniques. This reasoning ability can be considered as an important factor for companies that want to adopt artificial intelligence into their workflows or for individuals looking to utilize it. In this paper, we begin with an understanding of in-context learning that enables inference in large language models, explain the concept of prompt engineering, inference with in-context learning, and benchmark data. Moreover, we investigate the prompt engineering techniques that have rapidly improved the inference performance of large language models, and the relationship between the techniques.

주제어

참고문헌 (28)

김맹근. (2023). ChatGPT 활용 사례 및 전망. 디지털 비즈온. http://www.digitalbizon.com/news/articleView.html?idxno2331610？
Brown, T., Mann, B., Ryder, N., Subbiah, M.,？Kaplan, J. D., Dhariwal, P., ... & Amodei, D.？(2020). Language models are few-shot learners.？Advances in Neural Information Processing？Systems, 33, 1877-1901.
Chowdhery, A., Narang, S., Devlin, J., Bosma, M.,？Mishra, G., Roberts, A., ... & Fiedel, N.？(2023). Palm: Scaling language modeling？with pathways. Journal of Machine Learning？Research, 24(240), 1-113.
Cobbe, K., Kosaraju, V., Bavarian, M., Chen, M.,？Jun, H., Kaiser, L., ... & Schulman, J. (2021).？Training verifiers to solve math word problems.？arXiv preprint arXiv:2110.14168.
Diao, S., Wang, P., Lin, Y., & Zhang, T. (2023).？Active prompting with chain-of-thought for？large language models. arXiv preprint arXiv:2302.12246.
Fan, A., Lewis, M., & Dauphin, Y. (2018).？Hierarchical neural story generation. Annual？Meeting of the Association for Computational？Linguistics, 56(1), 889-898.
Ficler, J., & Goldberg, Y. (2017). Controlling？linguistic style aspects in neural language？generation. arXiv preprint arXiv:1707.02633.
Geva, M., Khashabi, D., Segal, E., Khot, T., Roth,？D., & Berant, J. (2021). Did aristotle use a？laptop? a question answering benchmark with？implicit reasoning strategies. Transactions of？the Association for Computational Linguistics,？9, 346-361.

상세보기
Holtzman, A., Buys, J., Du, L., Forbes, M., & Choi,？Y. (2019). The curious case of neural text？degeneration. arXiv preprint arXiv:1904.09751.
Kojima, T., Gu, S. S., Reid, M., Matsuo, Y., &？Iwasawa, Y. (2022). Large language models？are zero-shot reasoners. Advances in Neural？Information Processing Systems, 35, 22199-22213.
Koncel-Kedziorski, R., Hajishirzi, H., Sabharwal, A.,？Etzioni, O., & Ang, S. D. (2015). Parsing algebraic？word problems into equations. Transactions？of the Association for Computational Linguistics,？3, 585-597.

상세보기
Koncel-Kedziorski, R., Roy, S., Amini, A., Kushman,？N., & Hajishirzi, H. (2016, June). MAWPS: A？math word problem repository. Proceedings？of the 2016 Conference of the North American？Chapter of the Association for Computational？Linguistics: Human Language Technologies？(pp. 1152-1157).
Lampinen, A. K., Dasgupta, I., Chan, S. C.,？Matthewson, K., Tessler, M. H., Creswell,？A., ... & Hill, F. (2022). Can language？models learn from explanations in context?.？arXiv preprint arXiv:2204.02329.
Liu, J., Liu, A., Lu, X., Welleck, S., West, P., Bras,？R. L., ... & Hajishirzi, H. (2021). Generated？knowledge prompting for commonsense？reasoning. arXiv preprint arXiv:2110.08387.
Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H.,？& Neubig, G. (2023). Pre-train, prompt, and？predict: A systematic survey of prompting？methods in natural language processing. ACM？Computing Surveys, 55(9), 1-35.
Ouyang, L., Wu, J., Jiang, X., Almeida, D.,？Wainwright, C., Mishkin, P., ... & Lowe, R.？(2022). Training language models to follow？instructions with human feedback. Advances？in Neural Information Processing Systems,？35, 27730-27744.
Patel, A., Bhattamishra, S., & Goyal, N. (2021).？Are NLP models really able to solve simple？math word problems?. arXiv preprint arXiv:2103.07191.
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D.,？& Sutskever, I. (2019). Language models are？unsupervised multitask learners. OpenAI blog,？1(8), 9.
Reynolds, L., & McDonell, K. (2021, May). Prompt？programming for large language models:？Beyond the few-shot paradigm. Extended？Abstracts of the 2021 CHI Conference on？Human Factors in Computing Systems (pp. 1-7).
Roy, S., & Roth, D. (2016). Solving general arithmetic？word problems. arXiv preprint arXiv:1608.01413.
Srivastava, A., Rastogi, A., Rao, A., Shoeb, A. A. M.,？Abid, A., Fisch, A., ... & Wang, G. (2022).？Beyond the imitation game: Quantifying and？extrapolating the capabilities of language models.？arXiv preprint arXiv:2206.04615.
Talmor, A., Herzig, J., Lourie, N., & Berant, J.？(2018). Commonsenseqa: A question answering？challenge targeting commonsense knowledge.？arXiv preprint arXiv:1811.00937.
Thoppilan, R., De Freitas, D., Hall, J., Shazeer, N.,？Kulshreshtha, A., Cheng, H. T., ... & Le, Q.？(2022). Lamda: Language models for dialog？applications. arXiv preprint arXiv:2201.08239.
Wang, X., Wei, J., Schuurmans, D., Le, Q., Chi,？E., Narang, S., ... & Zhou, D. (2022).？Self-consistency improves chain of thought？reasoning in language models. arXiv preprint？arXiv:2203.11171.
Wei, J., Wang, X., Schuurmans, D., Bosma, M.,？Xia, F., Chi, E., ... & Zhou, D. (2022).？Chain-of-thought prompting elicits reasoning？in large language models. Advances in Neural？Information Processing Systems, 35, 24824-24837.
Yao, S., Yu, D., Zhao, J., Shafran, I., Griffiths, T.？L., Cao, Y., & Narasimhan, K. (2023). Tree？of thoughts: Deliberate problem solving with？large language models. arXiv preprint arXiv:2305.10601.
Zhang, T., Kishore, V., Wu, F., Weinberger, K.？Q., & Artzi, Y. (2019). Bertscore: Evaluating？text generation with bert. arXiv preprint arXiv:1904.09675.
Zhang, Z., Zhang, A., Li, M., & Smola, A. (2022).？Automatic chain of thought prompting in large？language models. arXiv preprint arXiv:2210.03493.

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 논문명, 저널/프로시딩명, 저자 , 발행년, 권, 호, 시작페이지, 끝페이지, 발행기관 관리번호, 논문명, 대등논문명, 저자 , 저널/프로시딩명, 발행기관, 발행년, 발행언어, 권, 호, 시작페이지, 끝페이지, ISBN, ISSN, 주제분야, 키워드, 초록(한글), 초록(영문), 저자(소속기관)
저장형식	Text(ASCII format) Excel format RefWorks Direct Export RIS format (for Reference Manager, ProCite, EndNote), Scholar's Aids, Mendeley
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

ChatGPT 및 거대언어모델의 추론 능력 향상을 위한 프롬프트 엔지니어링 방법론 및 연구 현황 분석
Analysis of Prompt Engineering Methodologies and Research Status to Improve Inference Capability of ChatGPT and Other Large Language Models 원문보기

초록
AI-Helper

Abstract ▼ AI-Helper

주제어

참고문헌 (28)

이 논문을 인용한 문헌

저자의 다른 논문 :

관련 콘텐츠

원문 보기

원문 URL 링크

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

ChatGPT 및 거대언어모델의 추론 능력 향상을 위한 프롬프트 엔지니어링 방법론 및 연구 현황 분석 Analysis of Prompt Engineering Methodologies and Research Status to Improve Inference Capability of ChatGPT and Other Large Language Models 원문보기

초록 용어보기논문에서 용어와 풀이말을 자동 추출한 결과로, 시범 서비스 중입니다. AI-Helper

Abstract ▼ AI-Helper

주제어

참고문헌 (28)

이 논문을 인용한 문헌

저자의 다른 논문 :

박상언 (16) 강주영 (51)

관련 콘텐츠

원문 보기

원문 URL 링크

이 논문과 함께 이용한 콘텐츠

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

ChatGPT 및 거대언어모델의 추론 능력 향상을 위한 프롬프트 엔지니어링 방법론 및 연구 현황 분석
Analysis of Prompt Engineering Methodologies and Research Status to Improve Inference Capability of ChatGPT and Other Large Language Models 원문보기

초록
AI-Helper