$\require{mediawiki-texvc}$

연합인증

연합인증 가입 기관의 연구자들은 소속기관의 인증정보(ID와 암호)를 이용해 다른 대학, 연구기관, 서비스 공급자의 다양한 온라인 자원과 연구 데이터를 이용할 수 있습니다.

이는 여행자가 자국에서 발행 받은 여권으로 세계 각국을 자유롭게 여행할 수 있는 것과 같습니다.

연합인증으로 이용이 가능한 서비스는 NTIS, DataON, Edison, Kafe, Webinar 등이 있습니다.

한번의 인증절차만으로 연합인증 가입 서비스에 추가 로그인 없이 이용이 가능합니다.

다만, 연합인증을 위해서는 최초 1회만 인증 절차가 필요합니다. (회원이 아닐 경우 회원 가입이 필요합니다.)

연합인증 절차는 다음과 같습니다.

최초이용시에는
ScienceON에 로그인 → 연합인증 서비스 접속 → 로그인 (본인 확인 또는 회원가입) → 서비스 이용

그 이후에는
ScienceON 로그인 → 연합인증 서비스 접속 → 서비스 이용

연합인증을 활용하시면 KISTI가 제공하는 다양한 서비스를 편리하게 이용하실 수 있습니다.

[국내논문] 비디오 시각적 관계 이해 기술 동향
Trends in Video Visual Relationship Understanding 원문보기

전자통신동향분석 = Electronics and telecommunications trends, v.38 no.6, 2023년, pp.12 - 21  

권용진 (시각지능연구실) ,  김대회 (시각지능연구실) ,  김종희 (시각지능연구실) ,  오성찬 (시각지능연구실) ,  함제석 (시각지능연구실) ,  문진영 (시각지능연구실)

Abstract AI-Helper 아이콘AI-Helper

Visual relationship understanding in computer vision allows to recognize meaningful relationships between objects in a scene. This technology enables the extraction of representative information within visual content. We discuss the technology of visual relationship understanding, specifically focus...

주제어

표/그림 (2)

참고문헌 (38)

  1. J. Johnson et al., "Image retrieval using scene graphs,"?in Proc. IEEE/CVF CVPR, (Boston, MA, USA), June?2015, pp. 3668-3678. 

  2. C. Lu et al., "Visual relationship detection with language?priors," in Proc. ECCV, Oct. 2016, pp. 852-569. 

  3. R. Krishna et al., "Visual genome: Connecting?language and vision using crowdsourced dense image?annotations," Int. J. Comput. Vis., vol. 123, no. 1, May?2017, pp. 32-73. 

  4. J. Ji et al., "Action genome: actions as compositions?of spatio-temporal scene graphs," in Proc. IEEE/CVF?CVPR, June 2020, pp. 10233-10244. 

  5. Y. Zhong et al., "Comprehensive image captioning via?scene graph decomposition," in Proc. ECCV, Aug. 2020,?pp. 211-229. 

  6. X. Yang et al., "Auto-encoding and distilling scene?graphs for image captioning," IEEE Trans. Pattern Anal.?Mach. Intell., vol. 44, no. 5, May 2022, pp. 2313-2327. 

  7. X. Lu and Y. Gao, "Guide and interact: SceneGraph?based generation and control of video captions,"?Multimed. Syst., vol. 29, no. 2, Apr. 2023, pp. 797-809. 

  8. C. Zhang et al., "An empirical study on leveraging scene?graphs for visual question answering," in Proc. BMVC,?Sept. 2019. 

  9. L. Li et al., "Relation-aware graph attention network for?visual question answering," in Proc. IEEE/CVF ICCV,?Oct. 2019, pp. 10312-10321. 

  10. J. Mao et al., "Dynamic multistep reasoning based on?video scene graph for video question answering," in?Proc. NAACL, Jul. 2022, pp. 3894-3904. 

  11. M. Qi et al., "Online cross-modal scene retrieval by?binary representation and semantic graph," in Proc.?ACM MM, Oct. 2017, pp. 744-752. 

  12. M. Daum et al., "VOCAL: Video organization and?interactive compositional analytics," in Proc. CIDR, Jan.?2022. 

  13. X. Chang et al., "A Comprehensive survey of scene?graphs: generation and application," IEEE Trans. Pattern?Anal. Mach. Intell., vol. 45, no. 1, 2023, pp. 1-26. 

  14. O. Russakovsky et al., "ImageNet large scale visual?recognition challenge," Int. J. Comput. Vis., vol. 115,?no. 3, 2015, pp. 211-252. 

  15. C. Liu et al., "Beyond short-term snippet: Video relation?detection with spatio-temporal global context," in Proc.?IEEE/CVF CVPR, June 2020, pp. 10837-10846. 

  16. Y. Li et al., "Interventional video relation detection," in?Proc. ACM MM, Oct. 2021, pp. 4091-4099. 

  17. X. Shang et al., "Video visual relation detection," in?Proc. ACM MM, Oct. 2017, pp. 1300-1308. 

  18. A. Vaswani et al., "Attention is all you need," in Proc.?NIPS, Dec. 2017, pp. 5998-6008. 

  19. Y.H.H. Tsai et al., "Video relationship reasoning using?gated spatio-temporal energy graph," in Proc. IEEE/CVF?CVPR, June 2019, pp. 10416-10425. 

  20. X. Qian et al., "Video relation detection with spatiotemporal graph," in Proc. ACM MM, Oct. 2019, pp. 84-93. 

  21. T. N. Kipf and M. Welling, "Semi-supervised classification with graph convolutional networks," in Proc.?ICLR, Apr. 2017. 

  22. L. Bertinetto et al., "Fully-connected siamese networks?for object tracking," in Proc. ECCVW, Oct. 2016, pp.?850-865. 

  23. Q. Cao et al., "3-D relation network for visual relation?recognition in videos," Neurocomputing, vol. 432, 2021,?pp. 91-100. 

  24. X. Shang et al., "Video visual relation detection via?iterative inference," in Proc. ACM MM, Oct. 2021, pp.?3654-3663. 

  25. S. Chen et al., "Social fabric: tubelet compositions for?video relation detection," in Proc. IEEE/CVF ICCV, Oct.?2021, pp. 13465-13474. 

  26. K. Gao et al., "Classification-then-grounding: Reformulating video scene graphs as temporal bipartite?graphs," in Proc. IEEE/CVF CVPR, June 2022, pp.?19475-19484. 

  27. C. Lu et al., "DEBUG: A dense bottom-up grounding?approach for natural language video localization," in?Proc. EMNLP-IJCNLP, Nov. 2019, pp. 5144-5153. 

  28. Y. Teng et al., "Target adaptive context aggregation for?video scene graph generation," in Proc. IEEE/CVF ICCV,?Oct. 2021, pp. 13668-13677. 

  29. Y. Cong et al., "Spatial-temporal transformer for?dynamic scene graph generation," in Proc. IEEE/CVF?ICCV, Oct. 2021, pp. 16352-16363. 

  30. Y. Li et al., "Dynamic scene graph generation via?anticipatory pre-training," in Proc. IEEE/CVF CVPR,?June 2022, pp. 13864-13873. 

  31. S. Feng et al., "Exploiting long-term dependencies for?generating dynamic scene graphs," in Proc. IEEE/CVF?WACV, Jan. 2023, pp. 5119-5128. 

  32. S. Nag et al., "Unbiased Scene graph generation in?videos," in Proc. IEEE/CVF CVPR, June 2023, pp.?22803-22813. 

  33. L. Xu et al., "Meta spatio-temporal debiasing for video?scene graph generation," in Proc. ECCV, Oct. 2022, pp.?374-390. 

  34. X. Shang et al., "Annotating objects and relations in?user-generated videos," in Proc. ACM ICMR, June?2019, pp. 279-287. 

  35. B. Thomee et al., "YFCC100M: The new data in?multimedia research," Commun. ACM, vol. 59, no. 2,?2016, pp. 64-73. 

  36. J. Ji et al., "Action genome: actions as compositions?of spatio-temporal scene graphs," in Proc. IEEE/CVF?CVPR, June 2020, pp. 10233-10244. 

  37. G. A. Sigurdsson et al., "Hollywood in homes: Crowdsourcing data collection for activity understanding," in?Proc. ECCV, Oct. 2016, pp. 510-526. 

  38. J. Yang et al., "Panoptic video scene graph generation,"?in Proc. IEEE/CVF CVPR, June 2023, pp. 18675-18685. 

섹션별 컨텐츠 바로가기

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

AI-Helper 아이콘
AI-Helper
안녕하세요, AI-Helper입니다. 좌측 "선택된 텍스트"에서 텍스트를 선택하여 요약, 번역, 용어설명을 실행하세요.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.

선택된 텍스트

맨위로