IPC Classification Information

Country / Type | United States (US) Patent, Granted
International Patent Classification (IPC, 7th ed.) |
Application No. | US-0533398 (2000-03-22)
Priority | GB-9908545 (1999-04-14)
Inventors / Address | Taylor, Michael James; Rowe, Simon Michael
Applicant / Address |
Agent / Address | Fitzpatrick, Cella, Harper &
Citation Info | Cited by: 38 / Citations: 9
Abstract
Image data from a plurality of cameras 2-1, 2-2, 2-3 showing the movements of a number of people, for example in a meeting, and sound data from a directional microphone array 4 are processed by a computer processing apparatus 24 to archive the data in a meeting archive database 60. The image data is processed to determine the three-dimensional position and orientation of each person's head and to determine at whom each person is looking. The sound data is processed to determine the direction from which the sound came. Processing is carried out to determine who is speaking by determining which person has his head in a position corresponding to the direction from which the sound came. Having determined which person is speaking, the personal speech recognition parameters for that person are selected and used to convert the sound data to text data. Image data to be archived is chosen by selecting the camera which best shows the speaking participant and the participant to whom he is speaking. Image data, sound data, text data and data defining at whom each person is looking are stored in the meeting archive database 60.
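The speaker-determination step described above — matching the sound's direction of arrival against each tracked head position — can be sketched as follows. This is a minimal illustration, not the patent's implementation; the function and data layout (a dict of head positions, a unit vector from the microphone array) are hypothetical, and a real system would also use head orientation and temporal smoothing.

```python
import math

def identify_speaker(head_positions, mic_position, sound_direction):
    """Pick the person whose head lies closest to the direction from
    which the microphone array reports the sound arriving.

    head_positions : dict of person -> (x, y, z) head position
    mic_position   : (x, y, z) position of the microphone array
    sound_direction: unit vector (x, y, z) of the incoming sound
    """
    best_person, best_dot = None, -1.0
    for person, pos in head_positions.items():
        # Direction from the microphone array to this person's head.
        v = tuple(p - m for p, m in zip(pos, mic_position))
        norm = math.sqrt(sum(c * c for c in v))
        if norm == 0:
            continue
        # Cosine of the angle between the sound direction and this person;
        # the largest cosine means the smallest angular mismatch.
        dot = sum((c / norm) * d for c, d in zip(v, sound_direction))
        if dot > best_dot:
            best_person, best_dot = person, dot
    return best_person

heads = {"A": (1.0, 0.0, 0.0), "B": (0.0, 1.0, 0.0)}
print(identify_speaker(heads, (0.0, 0.0, 0.0), (0.0, 1.0, 0.0)))  # prints B
```

The same cosine test generalizes to any number of participants, which is why the abstract's approach needs only one directional microphone array rather than one microphone per person.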
Representative Claims
What is claimed is:

1. Image processing apparatus, comprising: an image data receiver for receiving image data recorded by a plurality of cameras showing the movements of a plurality of people; a speaker identifier for determining which of the people is speaking; a speech recipient identifier for determining at whom the speaker is looking; a position calculator for determining the position of the speaker and the position of the person at whom the speaker is looking; and camera selection means for selecting image data from the received image data on the basis of the determined positions of the speaker and the person at whom the speaker is looking, said camera selection means being arranged to select image data in which both the speaker and the person at whom the speaker is looking appear, and wherein the camera selection means is arranged to generate quality values representing a quality of the views that at least some of the cameras have of the speaker and the person at whom the speaker is looking, and to select the image data on the basis of which camera has the quality value representing the highest quality.

2. Apparatus according to claim 1, wherein the camera selection means is arranged to determine which of the cameras have a view of the speaker and the person at whom the speaker is looking, and to generate a respective quality value for each camera which has a view of the speaker and the person at whom the speaker is looking.

3. Apparatus according to claim 1, wherein the camera selection means is arranged to generate each quality value in dependence upon the position and orientation of the head of the speaker and the position and orientation of the head of the person at whom the speaker is looking.

4. Apparatus according to claim 1, wherein the camera selection means comprises: a data store for storing data defining a camera from which image data is to be selected for respective pairs of positions; and an image data selector arranged to use data stored in the data store to select the image data in dependence upon the positions of the speaker and the person at whom the speaker is looking.

5. Apparatus according to claim 1, wherein the speech recipient identifier and the position calculator comprise an image processor for processing the image data from at least one of the cameras to determine at whom the speaker is looking and the positions.

6. Apparatus according to claim 5, wherein the image processor is arranged to determine the position of each person and at whom each person is looking by processing the image data from the at least one camera.

7. Apparatus according to claim 5, wherein the image processor is arranged to track the position and orientation of each person's head in three dimensions.

8. Apparatus according to claim 1, wherein the speaker identifier is arranged to receive speech data from a plurality of microphones each of which is allocated to a respective one of the people, and to determine which of the people is speaking on the basis of the microphone from which the speech data was received.

9. Apparatus according to claim 1, further comprising a sound processor for processing sound data defining words spoken by the people to generate text data therefrom in dependence upon the result of the processing performed by the speaker identifier.

10. Apparatus according to claim 9, wherein the sound processor has associated therewith a store for storing respective voice recognition parameters for each of the people, and a parameter selector for selecting the voice recognition parameters to be used to process the sound data in dependence upon the person determined to be speaking by the speaker identifier.

11. Apparatus according to claim 9, further comprising a database for storing at least some of the received image data, the sound data, the text data produced by the sound processor and viewing data defining at whom at least the person who is speaking is looking, the database being arranged to store the data such that corresponding text data and viewing data are associated with each other and with the corresponding image data and sound data.

12. Apparatus according to claim 11, further comprising a data compressor for compressing the image data and the sound data for storage in the database.

13. Apparatus according to claim 12, wherein the data compressor comprises an encoder for encoding the image data and the sound data as MPEG data.

14. Apparatus according to claim 11, further comprising a gaze time data generator for generating gaze time data defining, for a predetermined period, the proportion of time spent by a given person looking at each of the other people during the predetermined period, and wherein the database is arranged to store the gaze time data so that it is associated with the corresponding image data, sound data, text data and viewing data.

15. Apparatus according to claim 14, wherein the predetermined period comprises a period during which the given person was talking.

16. A method of processing image data recorded by a plurality of cameras showing the movements of a plurality of people to select image data for storage, the method comprising: a speaker identification step of determining which of the people is speaking; a step of determining at whom the speaker is looking; a step of determining the position of the speaker and the position of the person at whom the speaker is looking; and a camera selection step for selecting image data on the basis of the determined positions of the speaker and the person at whom the speaker is looking, wherein, in the camera selection step, image data is selected in which both the speaker and the person at whom the speaker is looking appear, quality values are generated representing a quality of the views that at least some of the cameras have of the speaker and the person at whom the speaker is looking, and the image data is selected on the basis of which camera has the quality value representing the highest quality.

17. A method according to claim 16, wherein, in the camera selection step, processing is performed to determine which of the cameras have a view of the speaker and the person at whom the speaker is looking, and to generate a respective quality value for each camera which has a view of the speaker and the person at whom the speaker is looking.

18. A method according to claim 16, wherein, in the camera selection step, each quality value is generated in dependence upon the position and orientation of the head of the speaker and the position and orientation of the head of the person at whom the speaker is looking.

19. A method according to claim 16, wherein, in the camera selection step, pre-stored data defining a camera from which image data is to be selected for respective pairs of positions is used to select the image data in dependence upon the positions of the speaker and the person at whom the speaker is looking.

20. A method according to claim 16, wherein, in the steps of determining at whom the speaker is looking and determining the positions of the speaker and the person at whom the speaker is looking, image data from at least one of the cameras is processed to determine at whom the speaker is looking and the positions.

21. A method according to claim 20, wherein the image data from that at least one camera is processed to determine the position of each person and at whom each person is looking.

22. A method according to claim 20, wherein image data is processed to track the position and orientation of each person's head in three dimensions.

23. A method according to claim 16, wherein speech data is received from a plurality of microphones each of which is allocated to a respective one of the people, and, in the speaker identification step, it is determined which of the people is speaking on the basis of the microphone from which the speech data was received.

24. A method according to claim 16, further comprising a sound processing step of processing sound data defining words spoken by the people to generate text data therefrom in dependence upon the result of the processing performed in the speaker identification step.

25. A method according to claim 24, wherein the sound processing step includes selecting, from among stored respective voice recognition parameters for each of the people, the voice recognition parameters to be used to process the sound data in dependence upon the person determined to be speaking in the speaker identification step.

26. A method according to claim 24, further comprising the step of storing in a database at least some of the received image data, the sound data, the text data produced in the sound processing step and viewing data defining at whom at least the person who is speaking is looking, the data being stored in the database such that corresponding text data and viewing data are associated with each other and with the corresponding image data and sound data.

27. A method according to claim 26, wherein the image data and the sound data are stored in the database in compressed form.

28. A method according to claim 27, wherein the image data and the sound data are stored as MPEG data.

29. A method according to claim 26, further comprising the steps of generating data defining, for a predetermined period, the proportion of time spent by a given person looking at each of the other people during the predetermined period, and storing the data in the database so that it is associated with the corresponding image data, sound data, text data and viewing data.

30. A method according to claim 29, wherein the predetermined period comprises a period during which the given person was talking.

31. A method according to claim 26, further comprising the step of generating a signal conveying the database with data therein.

32. A method according to claim 31, further comprising the step of recording the signal either directly or indirectly to generate a recording thereof.

33. A method according to claim 16, further comprising the step of generating a signal conveying information defining the image data selected in the camera selection step.
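Claims 1-2 describe selecting the camera with the highest quality value among those that see both the speaker and the person being addressed. The claims leave the quality function open, so the sketch below uses an illustrative stand-in (inverse distance from the camera to the midpoint of the speaker/listener pair); the `cameras` data layout and the `sees` visibility set are hypothetical simplifications.

```python
import math

def select_camera(cameras, speaker_pos, listener_pos):
    """Return the id of the camera with the highest quality value for a
    view containing both the speaker and the person being addressed.

    cameras: dict of camera_id -> {'position': (x, y, z),
                                   'sees': set of positions in view}
    """
    midpoint = tuple((s + l) / 2 for s, l in zip(speaker_pos, listener_pos))
    best_id, best_quality = None, -1.0
    for cam_id, cam in cameras.items():
        # Per claim 1, only cameras in which BOTH participants appear
        # are candidates for selection.
        if speaker_pos not in cam["sees"] or listener_pos not in cam["sees"]:
            continue
        # Illustrative quality value: closer to the pair's midpoint is better.
        dist = math.dist(cam["position"], midpoint)
        quality = 1.0 / (1.0 + dist)
        if quality > best_quality:
            best_id, best_quality = cam_id, quality
    return best_id
```

Claim 3 suggests a richer quality function that also weighs the head orientations of both participants, e.g. penalizing cameras that see only the backs of heads; that would replace the distance term above.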
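The gaze time data of claims 14 and 29 — the proportion of a period that a given person spent looking at each other participant — reduces to counting per-frame viewing records. A minimal sketch, assuming a hypothetical per-frame viewing log rather than the patent's actual viewing-data format:

```python
from collections import Counter

def gaze_proportions(viewing_log, person, start, end):
    """For one person, compute the fraction of observed frames in
    [start, end) spent looking at each other participant.

    viewing_log: dict of frame index -> dict of person -> person looked at
    (a simplified stand-in for the patent's stored viewing data).
    """
    counts = Counter()
    total = 0
    for frame in range(start, end):
        target = viewing_log.get(frame, {}).get(person)
        if target is not None:
            counts[target] += 1
            total += 1
    # Normalize counts into proportions; empty if no observations.
    return {t: n / total for t, n in counts.items()} if total else {}
```

Per claim 15, the period of interest would typically be one during which the given person was talking, so the proportions indicate whom the speaker was addressing.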