서비스연구 (Journal of Service Research and Studies), v.14 no.1, 2024, pp. 13-26
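임정현 (School of AI, Daegu University), 차경애 (Department of AI, Daegu University), 고재필 (Department of Computer Engineering, Kumoh National Institute of Technology), 홍원기 (School of Computer and Information Engineering, Daegu University)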
Multi-modal generation is the process of generating results based on a variety of information, such as text, images, and audio. With the rapid development of AI technology, there is a growing number of multi-modal based systems that synthesize different types of data to produce results. In this pape...