Bibliographic / IPC Classification Information

Country/Status | United States (US) Patent, Granted
International Patent Classification (IPC, 7th ed.) | (not listed)
Application No. | UP-0221552 (2008-08-04)
Registration No. | US-7742623 (2010-07-12)
Inventors / Address | Moon, Hankyu; Sharma, Rajeev; Jung, Namsoon
Applicant / Address | (not listed)
Citation Information | Times cited: 47 / Patents cited: 7
Abstract
The present invention is a method and system to estimate the visual target that people are looking at, based on automatic image measurements. The system utilizes image measurements from both face-view cameras and top-down view cameras. The cameras are calibrated with respect to the site and the visual target, so that the gaze target is determined from the estimated position and gaze direction of a person. Face detection and two-dimensional pose estimation locate and normalize the face of the person so that the eyes can be accurately localized and the three-dimensional facial pose can be estimated. The eye gaze is estimated based either on the positions of the localized eyes and irises or on the eye image itself, depending on the quality of the image. The gaze direction is estimated from the eye gaze measurement in the context of the three-dimensional facial pose. From the top-down view, the body of the person is detected and tracked, so that the position of the head is estimated using a body blob model that depends on the body position in the view. The gaze target is determined based on the estimated gaze direction, the estimated head pose, and the camera calibration. The gaze target estimation can provide a gaze trajectory of the person or a collective gaze map from many instances of gaze.
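The abstract describes a concrete geometric chain: a top-down camera fixes the head position on the floor, the face-view camera yields a three-dimensional facial pose plus an eye-in-head gaze offset, and the composed gaze ray is intersected with the calibrated target plane. Below is a minimal sketch of that chain, assuming a planar homography for the top-down calibration and a simple additive yaw/pitch composition of head pose and eye gaze; all function names, variable names, and the plane parameterization are illustrative, not taken from the patent.

```python
import numpy as np

def head_position_on_floor(blob_centroid_px, H_topdown_to_floor):
    """Map a tracked body-blob centroid from top-down image pixels to
    floor coordinates via a planar homography (standing in for the
    'top-down view calibration'). Both argument names are illustrative."""
    u, v = blob_centroid_px
    q = H_topdown_to_floor @ np.array([u, v, 1.0])
    return q[:2] / q[2]

def gaze_direction(head_yaw, head_pitch, eye_yaw, eye_pitch):
    """Compose the 3-D facial pose with the eye-in-head gaze (radians)
    into a unit gaze vector; additive composition of the angles is a
    small-angle assumption, not the patent's stated method."""
    yaw, pitch = head_yaw + eye_yaw, head_pitch + eye_pitch
    return np.array([np.cos(pitch) * np.sin(yaw),
                     np.sin(pitch),
                     np.cos(pitch) * np.cos(yaw)])

def gaze_target(head_pos, gaze_dir, plane_point, plane_normal):
    """Intersect the gaze ray with the visual-target plane (e.g. a shelf
    front); returns None when the person looks away from the plane."""
    denom = float(gaze_dir @ plane_normal)
    if abs(denom) < 1e-9:
        return None
    t = float((plane_point - head_pos) @ plane_normal) / denom
    return head_pos + t * gaze_dir if t > 0 else None
```

In such a sketch, a nominal head height would lift the two-dimensional floor position into the three-dimensional head position, and a full implementation would replace the additive angle composition with rotation matrices and carry the per-estimate confidence levels that the claims describe.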
Representative Claims
What is claimed is:

1. A method for estimating a gaze target within a visual target that a person is looking based on automatic image measurements, comprising the following steps of: a) processing calibrations for at least a first means for capturing images for face-view and at least a second means for capturing images for top-down view, b) determining a target grid of the visual target, c) detecting and tracking a face of the person from first input images captured by the first means for capturing images, d) estimating a two-dimensional pose and a three-dimensional pose of the face, e) localizing facial features to extract an eye image of the face, f) estimating eye gaze of the person and estimating gaze direction of the person based on the estimated eye gaze and the three-dimensional facial pose of the person, g) detecting and tracking the person from second input images captured by the second means for capturing images, h) estimating a head position using the top-down view calibration, and i) estimating the gaze target of the person from the estimated gaze direction and the head position of the person using the face-view calibration.

2. The method according to claim 1, wherein the method further comprises a step of taking geometric measurements of the site and the visual target to come up with specifications and the calibrations for the means for capturing images.

3. The method according to claim 1, wherein the method further comprises steps of: a) estimating a gaze direction estimation error distribution, and b) determining the target grid based on the gaze direction estimation error distribution and average distance between the person and the visual target.

4. The method according to claim 1, wherein the method further comprises a step of determining a mapping from the estimated head position and the estimated gaze direction to the target grid.

5. The method according to claim 1, wherein the method further comprises a step of determining the mapping from the second input image coordinate to the floor coordinate, based on the position and orientation of the first means for capturing images.

6. The method according to claim 1, wherein the method further comprises a step of training a plurality of first machines for estimating the three-dimensional pose of the face.

7. The method according to claim 1, wherein the method further comprises a step of training a plurality of second machines for estimating the two-dimensional pose of the face.

8. The method according to claim 1, wherein the method further comprises a step of training a plurality of third machines for localizing each facial feature of the face.

9. The method according to claim 1, wherein the method further comprises a step of training at least a fourth machine for estimating the eye gaze from the eye image.

10. The method according to claim 9, wherein the method further comprises a step of annotating the eye images with both the eye gaze and a confidence level of the eye gaze annotation.

11. The method according to claim 10, wherein the method further comprises a step of training the fourth machine so that the machine outputs both the eye gaze and the confidence level of the eye gaze estimate.

12. The method according to claim 1, wherein the method further comprises a step of training at least a fifth machine for estimating the gaze direction.
13. The method according to claim 12, wherein the method further comprises a step of training the fifth machine for estimating the gaze direction from the eye gaze and the three-dimensional facial pose.

14. The method according to claim 12, wherein the method further comprises a step of employing the fifth machine for estimating the gaze direction from the eye image and the three-dimensional facial pose.

15. The method according to claim 12, wherein the method further comprises a step of training the fifth machine so that the machine outputs both the gaze direction and the confidence level of the gaze direction estimate.

16. The method according to claim 15, wherein the method further comprises a step of estimating a gaze map by weighting each of the gaze target estimates with the confidence levels corresponding to the gaze direction estimates.

17. The method according to claim 1, wherein the method further comprises a step of selecting a stream of first input images among a plurality of streams of first input images when the person's face appears to more than one stream of first input images, based on the person's distance to each of the plurality of first means for capturing images and the three-dimensional facial poses relative to each of the plurality of first means for capturing images.

18. The method according to claim 1, wherein the method further comprises a step of utilizing a view-based body blob model to estimate the head position of the person.

19. The method according to claim 1, wherein the method further comprises a step of constructing a gaze trajectory and a gaze map based on the estimated gaze target.

20. An apparatus for estimating a gaze target within a visual target that a person is looking based on automatic image measurements, comprising: a) means for processing calibrations for at least a first means for capturing images for face-view and at least a second means for capturing images for top-down view, b) means for determining a target grid of the visual target, c) means for detecting and tracking a face of the person from first input images captured by the first means for capturing images, d) means for estimating a two-dimensional pose and a three-dimensional pose of the face, e) means for localizing facial features to extract an eye image of the face, f) means for estimating eye gaze of the person and estimating gaze direction of the person based on the estimated eye gaze and the three-dimensional facial pose of the person, g) means for detecting and tracking the person from second input images captured by the second means for capturing images, h) means for estimating a head position using the top-down view calibration, and i) means for estimating the gaze target of the person from the estimated gaze direction and the head position of the person using the face-view calibration.

21. The apparatus according to claim 20, wherein the apparatus further comprises means for taking geometric measurements of the site and the visual target to come up with specifications and the calibrations for the means for capturing images.

22. The apparatus according to claim 20, wherein the apparatus further comprises: a) means for estimating a gaze direction estimation error distribution, and b) means for determining the target grid based on the gaze direction estimation error distribution and average distance between the person and the visual target.
23. The apparatus according to claim 20, wherein the apparatus further comprises means for determining a mapping from the estimated head position and the estimated gaze direction to the target grid.

24. The apparatus according to claim 20, wherein the apparatus further comprises means for determining the mapping from the second input image coordinate to the floor coordinate, based on the position and orientation of the first means for capturing images.

25. The apparatus according to claim 20, wherein the apparatus further comprises means for training a plurality of first machines for estimating the three-dimensional pose of the face.

26. The apparatus according to claim 20, wherein the apparatus further comprises means for training a plurality of second machines for estimating the two-dimensional pose of the face.

27. The apparatus according to claim 20, wherein the apparatus further comprises means for training a plurality of third machines for localizing each facial feature of the face.

28. The apparatus according to claim 20, wherein the apparatus further comprises means for training at least a fourth machine for estimating the eye gaze from the eye image.

29. The apparatus according to claim 28, wherein the apparatus further comprises means for annotating the eye images with both the eye gaze and a confidence level of the eye gaze annotation.

30. The apparatus according to claim 29, wherein the apparatus further comprises means for training the fourth machine so that the machine outputs both the eye gaze and the confidence level of the eye gaze estimate.

31. The apparatus according to claim 20, wherein the apparatus further comprises means for training at least a fifth machine for estimating the gaze direction.

32. The apparatus according to claim 31, wherein the apparatus further comprises means for training the fifth machine for estimating the gaze direction from the eye gaze and the three-dimensional facial pose.

33. The apparatus according to claim 31, wherein the apparatus further comprises means for employing the fifth machine for estimating the gaze direction from the eye image and the three-dimensional facial pose.

34. The apparatus according to claim 31, wherein the apparatus further comprises means for training the fifth machine so that the machine outputs both the gaze direction and the confidence level of the gaze direction estimate.

35. The apparatus according to claim 34, wherein the apparatus further comprises means for estimating a gaze map by weighting each of the gaze target estimates with the confidence levels corresponding to the gaze direction estimates.

36. The apparatus according to claim 20, wherein the apparatus further comprises means for selecting a stream of first input images among a plurality of streams of first input images when the person's face appears to more than one stream of first input images, based on the person's distance to each of the plurality of first means for capturing images and the three-dimensional facial poses relative to each of the plurality of first means for capturing images.

37. The apparatus according to claim 20, wherein the apparatus further comprises means for utilizing a view-based body blob model to estimate the head position of the person.

38. The apparatus according to claim 20, wherein the apparatus further comprises means for constructing a gaze trajectory and a gaze map based on the estimated gaze target.
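Claims 3 and 22 above tie the target grid to the gaze-direction error distribution and the average viewing distance. A minimal sketch of one plausible reading, in which a grid cell is sized to the positional spread that the angular error induces at that distance; the scale factor k and both function names are assumptions, not the patent's formula.

```python
import math

def target_grid_cell_size(avg_distance_m, gaze_error_std_rad, k=2.0):
    # Cell edge matched to the positional spread that the angular
    # gaze-direction error induces at the average viewing distance;
    # k (how many error std devs one cell covers) is an assumption.
    return k * avg_distance_m * math.tan(gaze_error_std_rad)

def target_grid_shape(target_width_m, target_height_m, cell_m):
    # Grid resolution (rows, cols) for the visual target of claim 1, step b).
    return (max(1, round(target_height_m / cell_m)),
            max(1, round(target_width_m / cell_m)))
```

For example, a 2.0 m wide, 1.5 m tall shelf viewed from 2.5 m with a 5-degree error standard deviation gives roughly 0.44 m cells, i.e. a 3-by-5 grid; a finer grid would mostly resolve estimation noise rather than gaze.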
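Claims 15-16 and 34-35 have each gaze-target estimate carry a confidence level, with the gaze map built by confidence-weighted accumulation rather than plain hit counts, so low-quality eye images contribute less. A sketch under assumed input shapes:

```python
import numpy as np

def accumulate_gaze_map(grid_shape, cell_estimates, confidences):
    # Confidence-weighted voting: each gaze-target estimate adds its
    # gaze-direction confidence to its grid cell instead of a count of 1.
    # cell_estimates is a list of (row, col) pairs (assumed format).
    gaze_map = np.zeros(grid_shape)
    for (row, col), w in zip(cell_estimates, confidences):
        if 0 <= row < grid_shape[0] and 0 <= col < grid_shape[1]:
            gaze_map[row, col] += w
    return gaze_map

# e.g. accumulate_gaze_map((3, 5), [(1, 2), (1, 2), (0, 4)],
#                          [0.9, 0.7, 0.2]) puts 1.6 in cell (1, 2).
```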
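Claims 17 and 36 select among multiple face-view streams using the person's distance to each camera and the facial pose relative to each camera. One way to realize that selection is a simple score preferring the nearest, most frontal view; the linear form and yaw_weight below are illustrative assumptions, since the patent does not state a scoring rule here:

```python
def select_face_view_stream(candidates, yaw_weight=2.0):
    # candidates: (stream_id, distance_m, relative_yaw_rad) per camera in
    # which the face appears (assumed format). Lower score is better:
    # closer to the camera and more frontal toward it.
    def score(c):
        _, distance_m, relative_yaw_rad = c
        return distance_m + yaw_weight * abs(relative_yaw_rad)
    return min(candidates, key=score)[0]
```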