IPC분류정보
국가/구분 |
United States(US) Patent
등록
|
국제특허분류(IPC7판) |
|
출원번호 |
US-0611782
(2000-07-07)
|
발명자
/ 주소 |
|
대리인 / 주소 |
|
인용정보 |
피인용 횟수 :
82 인용 특허 :
18 |
초록
▼
A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a mea
A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a means for juxtaposing and assembling search results. The desired media is then placed onto a platform means for manipulating and editing the media files. Any retrieved media files and/or images can be manipulated and assembled on-screen using commands such as "zoom" or "move left" by having corners and borders read by the grid of the platform means. The image(s) are also capable of being stacked, or overlay one another to define re-proportioned backgrounds. The image(s) from the platform means are displayed without the grid using an image platter as a means of providing a preliminary view of the presentation prior to projection. The system allows for the hand-free assembly and editing of music and movies, and provides a means for verbally assembling pre-planned or impromptu presentations comprising video or audio clips, digital images, or text retrieved from multiple local and remote databases, such as a DVD movie-base or the World Wide Web.
대표청구항
▼
A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a mea
A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a means for juxtaposing and assembling search results. The desired media is then placed onto a platform means for manipulating and editing the media files. Any retrieved media files and/or images can be manipulated and assembled on-screen using commands such as "zoom" or "move left" by having corners and borders read by the grid of the platform means. The image(s) are also capable of being stacked, or overlay one another to define re-proportioned backgrounds. The image(s) from the platform means are displayed without the grid using an image platter as a means of providing a preliminary view of the presentation prior to projection. The system allows for the hand-free assembly and editing of music and movies, and provides a means for verbally assembling pre-planned or impromptu presentations comprising video or audio clips, digital images, or text retrieved from multiple local and remote databases, such as a DVD movie-base or the World Wide Web. east as great as the energy threshold value, said energy detector controls said switch to connect said normalizer and said frequency detector, such that the frequency detector may classify the input signal as either voice or data. 6. An apparatus according to claim 1, wherein said classification unit further comprises an energy detector in electrical communication with said energy estimator, wherein said energy detector initially compares the energy estimate value to an energy threshold value defining a minimum energy level, and wherein said energy detector classifies the input signal as a voice signal if the energy estimate value is no greater than the energy threshold value. 7. An apparatus according to claim 1, wherein said classification unit comprises: a frequency detector in electrical communication with said frequency estimator; an energy detector in electrical communication with both said energy estimator and said frequency detector; and a memory device in electrical communication with said energy detector, said memory device storing a table that includes a plurality of differing data threshold values, wherein said energy detector receives the energy estimate value and selects a data threshold value from the plurality of differing threshold values in said table based on the amplitude of the energy estimation value, and wherein said frequency detector receives the data threshold value from said energy detector and classifies the input signal as a data signal if the frequency estimate value is at least as great as the selected data threshold value. 8. An apparatus according to claim 1, wherein said frequency estimator comprises: a delay that receives the input signal and generates a delayed input signal; a difference integrator in electrical communication with said delay and the input signal, wherein said difference integrator receives the delayed and input signals and generates a frequency estimate value representing both the estimated central frequency of the input signal and the estimated energy of the input signal. 9. An apparatus according to claim 8, wherein said frequency estimator further comprises an absolute value device in electrical communication with said delay, wherein said absolute value device receives the frequency estimate value and generates an absolute value of the frequency estimate value. 10. An apparatus according to claim 8, wherein said frequency estimator further comprises an accumulator device in electrical communication with said delay, wherein said accumulator device receives the frequency estimate value and generates a frequency estimate value. 11. An apparatus according to claim 1, wherein said energy estimator comprises: an absolute value device that receives the input signal and generates an absolute value of the input signal; and an accumulator device in electrical communication with said absolute value device, wherein said accumulator device generates an energy estimate value representing the amplitude of the input signal. 12. A method for classifying an input signal as either a voice or data signal in a communication system, wherein said method comprises the steps of: receiving an input signal; generating a frequency estimate value representing both an estimated central frequency of the input signal and an estimated energy of the input signal; generating an energy estimate value representing an estimate of the energy level of the input signal; and classifying the input signal as either a voice or data signal based on the frequency and energy estimate values and a data threshold value. 13. A method according to claim 12, wherein said classifying step further comprises the step of initially normalizing the frequency estimate value based on the energy estimate value to thereby generate a normalized frequency estimate value representing an estimate of the central frequency of the input signal. 14. A method according to claim 13, wherein said normalizing step co mprises dividing the frequency estimate value by the energy estimate value to thereby generate a normalized frequency estimate value representing an estimate of the central frequency of the input signal. 15. A method according to claim 13, wherein the data threshold value defines a frequency value, wherein said classifying step comprises the steps of: comparing the normalized frequency estimate value to the data threshold value; and classifying the input signal as a data signal if the normalized frequency estimate value is at least as great as the data threshold value. 16. A method according to claim 13, wherein said classifying step further comprises initially comparing the energy estimate value to an energy threshold value defining a minimum energy level, wherein if the energy estimate value is at least as great as the energy threshold value, said classifying step comprises comparing the normalized frequency estimate value to the data threshold value, and classifying the input signal as a data signal if the normalized frequency estimate value is at least as great as the data threshold value. 17. A method according to claim 12, wherein said classifying step further comprises initially comparing the energy estimate value to an energy threshold value defining a minimum energy level, and wherein said classifying step classifies the input signal as a voice signal if the energy estimate value is no greater than the energy threshold value. 18. A method according to claim 12, wherein said classifying step comprises the steps of: receiving the energy estimate value; selecting a data threshold value from a plurality of differing threshold values based on the amplitude of the energy estimate value; and classifying the input signal as a data signal if the frequency estimate value is at least as great as the selected data threshold value. 19. A method according to claim 12, wherein said generating a frequency estimate value step comprises the steps of: delaying the input signal to thereby generate a delayed input signal; determining a difference between the delayed input signal and the input signal; and generating a frequency estimate value representing both the estimated central frequency of the input signal and the estimated energy of the input signal. 20. A method according to claim 19, wherein said generating a frequency estimate value step further comprises the step of generating an absolute value of the frequency estimate value. 21. A method according to claim 19, wherein said generating a frequency estimate value step further comprises the step of accumulating the frequency estimate value. 22. A method according to claim 12, wherein said generating an energy estimate value step comprises the steps of: generating an absolute value of the input signal; and generating an energy estimate value representing the amplitude of the input signal. 23. An apparatus for classifying an input signal as either a voice or data signal in a communication system, wherein said apparatus comprises: a delay that receives the input signal and generates a delayed input signal; a difference integrator in electrical communication with said delay and the input signal, wherein said difference integrator receives the delayed and input signals and generates a frequency estimate value representing both the estimated central frequency of the input signal and the estimated energy of the input signal; and a classification unit in electrical communication with said difference integrator, wherein said classification unit classifies the input signal as either a voice or data signal based on the frequency estimate value and a data threshold value. 24. An apparatus according to claim 23, wherein said classification unit comprises a normalizer in electrical communication with said difference integrator, wherein said normalizer normalizes the frequency estimate value based on an energy estimate value representing the energy of the input si
※ AI-Helper는 부적절한 답변을 할 수 있습니다.