[US Patent]
Method and apparatus for processing commands directed to a media center
IPC classification information
Country / Type
United States(US) Patent
Granted
International Patent Classification (IPC, 7th edition)
H04N-021/4223
H04N-005/44
H04N-021/45
H04N-021/422
H04N-021/439
H04N-021/44
H04N-021/4402
H04N-021/4415
H04N-021/47
G06F-003/01
G06F-003/03
G06F-003/038
G06F-003/16
H04N-021/442
Application number
US-0589013
(2017-05-08)
Registration number
US-10219021
(2019-02-26)
Inventors / Address
Dimitriadis, Dimitrios
Schroeter, Horst Juergen
Applicant / Address
AT&T INTELLECTUAL PROPERTY I, L.P.
Agent / Address
Guntin & Gust, PLC
Citation information
Times cited: 0
Cited patents: 11
Abstract
A system that incorporates teachings of the subject disclosure may include, for example, a method for controlling a steering of a plurality of cameras to identify a plurality of potential sources, identifying the plurality of potential sources according to image data provided by the plurality of cameras, assigning a beam of a plurality of beams of a plurality of microphones to each of the plurality of potential sources, detecting a first command comprising one of a first audible cue based on signals from a portion of the plurality of microphones, a first visual cue based on image data from one of the plurality of cameras, or both for controlling a media center, and configuring the media center according to the first command. Other embodiments are disclosed.
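The abstract's step of assigning one microphone beam to each camera-identified source can be illustrated with a minimal sketch. It is not taken from the patent: it assumes a linear microphone array, far-field sources, and delay-and-sum steering, and the names `steering_delays` and `assign_beams`, the source identifiers, and the azimuth values are all hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate value for air near 20 °C


def steering_delays(mic_positions_m, azimuth_deg):
    """Return per-microphone delays (seconds) that steer a linear array
    toward a far-field source at azimuth_deg, measured from broadside.

    Assumption: delay-and-sum steering; the patent does not specify this.
    """
    theta = math.radians(azimuth_deg)
    raw = [x * math.sin(theta) / SPEED_OF_SOUND_M_S for x in mic_positions_m]
    base = min(raw)
    # Shift so that every delay is non-negative (causal).
    return [d - base for d in raw]


def assign_beams(source_azimuths_deg, mic_positions_m):
    """Map each source id (hypothetically derived from image data)
    to its own beam, represented here as a list of steering delays."""
    return {
        source_id: steering_delays(mic_positions_m, azimuth)
        for source_id, azimuth in source_azimuths_deg.items()
    }


if __name__ == "__main__":
    mics = [0.0, 0.05, 0.10, 0.15]                 # microphone positions in metres
    sources = {"viewer_a": -30.0, "viewer_b": 45.0}  # hypothetical camera estimates
    for source_id, delays in assign_beams(sources, mics).items():
        print(source_id, [round(d * 1e6, 1) for d in delays], "microseconds")
```

In this sketch the camera side is reduced to a dictionary of azimuth estimates; a real system would update those estimates as the tracked sources move and recompute the delays.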
Representative claims
1. A method, comprising: associating, by a processing system comprising a processor, a first gesture of a first object of a plurality of objects based on images from image data with a first command for controlling a media center, wherein the image data is captured by a plurality of image sensors; associating, by the processing system, a second gesture of a second object of the plurality of objects based on the images from the image data with a second command for controlling the media center, wherein the second gesture is based on images detected by the plurality of image sensors; modifying, by the processing system, the first command to a modified command according to a characteristic of the first gesture; determining, by the processing system, a conflict between the first command and the second command; presenting, by the processing system, a notification indicating the conflict and requesting a resolution to the conflict; determining, by the processing system, if a response to the notification indicates the resolution is to perform the first command or the second command; and responsive to a first determination that the response indicates the resolution is to perform the first command processing, by the processing system, the modified command to control the media center responsive to determining the response indicates the resolution is to perform the first command.
2. The method of claim 1, further comprising: obtaining, by the processing system, directional information associated with the plurality of objects according to the image data provided by the plurality of image sensors; and steering, by the processing system, a microphone array towards one of the plurality of objects according to the directional information to detect speech signals generated by the plurality of objects.
3. The method of claim 2, wherein the steering of the microphone array comprises steering, by the processing system, a plurality of beams of the microphone array according to a beamforming process.
4. The method of claim 3, further comprising using, by the processing system, the image data provided by the plurality of image sensors to assign a beam of the plurality of beams to each of the plurality of objects.
5. The method of claim 2, further comprising: tracking, by the processing system, movements of the plurality of objects according to the image data provided by the plurality of image sensors; steering, by the processing system, the plurality of image sensors according to the movements of the plurality of objects; and steering, by the processing system, the microphone array according to the movements of the plurality of objects.
6. The method of claim 1, further comprising determining a quality of performance for the first command for controlling the media center according to the characteristic of the first gesture that is determined.
7. The method of claim 6, wherein the quality of performance of the modified command comprises a magnitude of performance.
8. The method of claim 1, further comprising selecting, by the processing system, the first command provided by the first object based on a conflict resolution strategy, wherein the conflict resolution strategy comprises assigning a first priority to the first object and assigning a second priority to the second object, and wherein selecting the first command comprises selecting the first command based on the first priority having a higher priority than the second priority.
9. The method of claim 1, wherein responsive to a second determination that the response indicates the resolution is to perform the second command processing, by the processing system, the second command to control the media center responsive to determining the response indicates the resolution is to perform the second command.
10. The method of claim 1, further comprising identifying, by the processing system, each of the plurality of objects by comparing the image data with biometric profiles.
11. The method of claim 10, wherein the biometric profiles comprise one of facial data, body contour data, body color data, body surface data, or combinations thereof.
12. The method of claim 1, wherein the characteristic of the first gesture comprises a speed of the first gesture.
13. The method of claim 1, further comprising: assigning, by the processing system, one of an acoustic model, a visual model, or a combination thereof to each of the plurality of objects; and processing, by the processing system, one of audible cues, visual cues, or a combination thereof generated by the plurality of objects according to the acoustic model, the visual model or both assigned to each of the plurality of objects.
14. A non-transitory computer-readable storage medium, comprising executable instructions that, when executed by a processing system comprising a processor, facilitate performance of operations, comprising: detecting a first gesture of a first object of a plurality of objects captured by one of a plurality of cameras as image data for controlling a media center, wherein the first gesture is associated with a first command; detecting a second gesture of a second object of the plurality of objects based on the image data from one of the plurality of cameras, wherein the second gesture is associated with a second command for controlling the media center; determining a conflict between the first command and the second command; presenting a notification indicating the conflict and requesting a resolution to the conflict; determining if a response to the notification indicates the resolution is to perform the first command; modifying the first command according to a quality of performance to reflect a characteristic of the first gesture to generate a modified command responsive to determining the response indicates the resolution is to perform the first command; and controlling the media center according to the modified command according to the quality of performance responsive to determining the response indicates the resolution is to perform the first command.
15. The non-transitory computer-readable storage medium of claim 14, wherein the operations further comprise directing a plurality of microphones to steer a plurality of beams to each of the plurality of objects, and wherein the characteristic of the first gesture comprises speed of the first gesture, duration of the first gesture, or any combination thereof.
16. The non-transitory computer-readable storage medium of claim 14, wherein the processor further performs operations comprising: selecting the first command according to a conflict resolution strategy, and wherein the quality of performance of the modified command comprises a magnitude of performance, a speed of performance, or any combination thereof.
17. A system, comprising: a processing system including a processor; and a memory that stores executable instructions that, when executed by the processing system, facilitate performance of operations, comprising: detecting a first gesture of a first object of a plurality of objects based on image data captured by a plurality of image sensors, wherein the first gesture is associated with a first command for controlling a media center; detecting a second gesture of a second object of the plurality of objects based on images captured by the plurality of image sensors, wherein the second gesture is associated with a second command for controlling the media center; determining a conflict between the first command and the second command of a second object of the plurality of objects; presenting a notification requesting a resolution to the conflict; determining if a response to the notification indicates the resolution is to perform the first command; modifying the first command according to a quality of performance to reflect a characteristic of the first gesture to generate a modified command responsive to determining the response indicates the resolution is to perform the first command; and controlling the media center to execute the modified command according to the quality of performance responsive to determining the response indicates the resolution is to perform the first command.
18. The system of claim 17, wherein the operations further comprise: identifying the plurality of objects according to image data provided by the plurality of image sensors; and exchanging messages with one of the plurality of objects that generated the first command, wherein the messages comprise one of audible speech, visual messages presented by way of a display device coupled to the processor, or a combination thereof.
19. The system of claim 17, wherein the operations further comprise controlling the media center to execute the second command responsive to determining the response indicates the resolution is to perform the second command.
20. The system of claim 17, operations further comprise: tracking movements of the plurality of objects according to the image data provided by the plurality of image sensors; steering the plurality of image sensors according to the movements of the plurality of objects; and steering a microphone array according to the movements of the plurality of objects.
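The flow recited in the independent claims (modify the first command according to a gesture characteristic, detect a conflict, request a resolution, then perform the chosen command) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the `Command` fields, the linear speed-to-magnitude scaling, the "same target, different action" conflict rule, and the `ask` callback are all hypothetical.

```python
from dataclasses import dataclass, replace
from typing import Callable, List


@dataclass(frozen=True)
class Command:
    user: str               # who made the gesture (hypothetical field)
    target: str             # media-center setting, e.g. "volume"
    action: str             # e.g. "up" or "down"
    magnitude: float = 1.0  # how strongly the command is performed


def modify_by_speed(cmd: Command, gesture_speed: float,
                    reference_speed: float = 1.0) -> Command:
    """Scale the command magnitude by gesture speed.
    Assumption: a linear mapping; the claims only say 'according to
    a characteristic of the first gesture'."""
    return replace(cmd, magnitude=cmd.magnitude * gesture_speed / reference_speed)


def in_conflict(a: Command, b: Command) -> bool:
    """Hypothetical conflict rule: same target, different action."""
    return a.target == b.target and a.action != b.action


def resolve(first: Command, second: Command, gesture_speed: float,
            ask: Callable[[str], str]) -> List[Command]:
    """Return the commands to perform. `ask` presents the notification
    and returns 'first' or 'second' (any other answer performs nothing)."""
    modified = modify_by_speed(first, gesture_speed)
    if not in_conflict(first, second):
        return [modified, second]
    answer = ask(f"{first.user} and {second.user} issued conflicting "
                 f"{first.target} commands. Perform 'first' or 'second'?")
    if answer == "first":
        return [modified]
    if answer == "second":
        return [second]
    return []


if __name__ == "__main__":
    up = Command("alice", "volume", "up")
    down = Command("bob", "volume", "down")
    print(resolve(up, down, gesture_speed=2.0, ask=lambda prompt: "first"))
```

A priority-based strategy of the kind described in claim 8 could be modeled by passing an `ask` callback that returns the answer for the higher-priority user instead of prompting anyone.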
Zacks, Carolyn A.; Harel, Dan; Marino, Frank; Taxier, Karen M.; Telek, Michael J.; Wertheimer, Alan L.; Archie, William C., Display system and method with multi-person presentation function.
Bell, Matthew; Chennavasin, Tipatat; Clanton, Charles H.; Hulme, Michael; Ophir, Eyal; Vieta, Matthew, Gesture-based user interactions with status indicators for acceptable inputs in volumetric zones.