Systems and methods for determining microphone position
IPC classification information
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC 7th edition)
H04R-029/00
G10L-015/01
G10L-015/06
G10L-015/26
G10L-015/16
G10L-015/02
Application number
US-0209145
(2016-07-13)
Registration number
US-10085101
(2018-09-25)
Inventor / Address
Hardek, David D.
Applicant / Address
Hand Held Products, Inc.
Agent / Address
Additon, Higgins & Pendleton, P.A.
Citation information
Times cited: 0
Cited patents: 236
Abstract
A method for determining a relative position of a microphone may include capturing speech audio from a user's mouth with the microphone so that the microphone outputs an electrical signal indicative of the speech audio; determining an indication of a position of the microphone relative to the user's mouth, which may include providing a plurality of inputs to a computerized discriminative classifier, wherein an input of the plurality of inputs is derived from the electrical signal, and wherein an output from the computerized discriminative classifier is indicative of the position of the microphone relative to the user's mouth.
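As a rough illustration of the pipeline the abstract describes, the sketch below derives inputs from a sampled signal with a Fourier transform, appends one piece of contextual data, and passes them to a simple discriminative model. It is not the patented implementation: the function names, the two-band energy features, the use of a gain setting as the contextual input, and the logistic-regression form are all assumptions made for this example, and the weights would have to come from training data that includes measured microphone positions.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Naive O(n^2) discrete Fourier transform; magnitudes of bins 0..n//2."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples)))
        for k in range(n // 2 + 1)
    ]


def derive_inputs(samples, gain_db):
    """Hypothetical feature set: low/high band log-energies plus a gain setting."""
    mags = dft_magnitudes(samples)
    split = len(mags) // 2
    low = sum(m * m for m in mags[:split])
    high = sum(m * m for m in mags[split:])
    eps = 1e-12  # avoids log(0) for silent bands
    return [math.log(low + eps), math.log(high + eps), gain_db]


def classify(inputs, weights, bias):
    """Logistic-regression score in (0, 1); weights are assumed to be learned."""
    z = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1.0 / (1.0 + math.exp(-z))


# Example: a 32-sample sine wave with 4 cycles, recorded at a 6 dB gain setting.
signal = [math.sin(2 * math.pi * 4 * t / 32) for t in range(32)]
features = derive_inputs(signal, gain_db=6.0)
score = classify(features, weights=[0.1, 0.05, -0.02], bias=0.0)  # illustrative weights
```

A larger weight on one input makes it count more heavily in the score, which is one simple way to read the input weighting mentioned in the claims.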
Representative claims
1. A method for determining a relative position of a microphone, the method comprising: capturing speech audio from a user's mouth with the microphone so that the microphone outputs an electrical signal indicative of the speech audio; a computer deriving a plurality of inputs from the electrical signal; determining a derived value of an approximate position of the microphone relative to the user's mouth comprising providing, to a discriminative classifier implemented on the computer: at least the plurality of inputs and at least some contextual data, the contextual data originating and/or representing conditions occurring at a time when the speech audio is captured; the discriminative classifier comprising a model derived from training data and/or test data, the training data and/or the test data comprising a manually measured actual position of a microphone relative to a user's mouth; the computer receiving an output from the discriminative classifier, the output providing the derived value of the approximate position of the microphone relative to the user's mouth based at least in part on the plurality of inputs and the contextual data; and the computer determining whether the derived value of the approximate position of the microphone is unacceptable at least in part by comparing the derived value to a value or range indicative of the microphone being an acceptable distance relative to the user's mouth, and providing a signal to a user if the derived value of the approximate position of the microphone is unacceptable.
2. The method of claim 1, comprising the computer calculating a Fourier transformation on data selected from the group consisting of the electrical signal and data derived from the electrical signal.
3. The method of claim 2, wherein an input of the plurality of inputs is derived from results from the calculating of the Fourier transformation.
4. The method of claim 1, comprising the computer decoding a phoneme from data selected from the group consisting of the electrical signal and data derived from the electrical signal, wherein an input of the plurality of inputs comprises the phoneme.
5. The method of claim 4, comprising the computer decoding the phoneme using a text-to-phoneme engine.
6. The method of claim 1, comprising: deriving first and second inputs of the plurality of inputs from the electrical signal; and weighting the first input more heavily than any weighting of the second input in the discriminative classifier.
7. The method of claim 6, comprising providing first and second phenomes that are different from one another, comprising performing text-to-phenome conversions, wherein: the first input comprises the first phenome; and the second input comprises the second phenome.
8. The method of claim 1, wherein the contextual data comprises at least one of a gain setting, and/or a classification of background noise.
9. A method for determining a relative position of a microphone, the method comprising: capturing speech audio from a user's mouth with the microphone so that the microphone outputs an electrical signal indicative of the speech audio; determining a derived value of an approximate position of the microphone relative to the user's mouth, comprising providing a plurality of inputs to a computerized discriminative classifier, the discriminative classifier comprising a model derived from training data and/or test data, the training data and/or the test data comprising a manually measured actual position of a microphone relative to a user's mouth wherein: a first input of the plurality of inputs is derived from the electrical signal, and a second input of the plurality of inputs comprises contextual data, the contextual data originating and/or representing conditions occurring at a time when the speech audio is captured; and an output from the computerized discriminative classifier is the derived value of the approximate position of the microphone relative to the user's mouth, the output derived at least in part from the plurality of inputs.
10. The method of claim 9, comprising: a computer determining whether the derived value of the approximate position of the microphone is unacceptable; and the computer providing a signal in response to the computer determining that the derived value of the approximate position of the microphone is unacceptable.
11. The method of claim 9, comprising a computer deriving the input from the electrical signal.
12. The method of claim 11, comprising calculating a Fourier transformation on data selected from the group consisting of the electrical signal and data derived from the electrical signal.
13. The method of claim 12, wherein the input comprises results from the calculating of the Fourier transformation.
14. The method of claim 12, wherein the input is derived from results from the calculating of the Fourier transformation.
15. The method of claim 11, comprising decoding a phoneme from data selected from the group consisting of the electrical signal and data derived from the electrical signal.
16. The method of claim 15, wherein the input comprises the phoneme, and the decoding of the phoneme is comprised of using a text-to-phoneme engine.
17. The method of claim 9, comprising: deriving first and second inputs of the plurality of inputs from the electrical signal; and weighting the first input more heavily than any weighting of the second input in the computerized discriminative classifier.
18. The method of claim 17, comprising providing first and second phenomes that are different from one another, comprising performing text-to-phenome conversions, wherein: the first input comprises the first phenome; and the second input comprises the second phenome.
19. A method for determining a relative position of a microphone, the method comprising: providing a plurality of inputs to a discriminative classifier implemented on a computer, the discriminative classifier comprising a model derived from training data and/or test data, the training data and/or the test data comprising a manually measured actual position of a microphone relative to a user's mouth, and the plurality of inputs comprising: (i) an electrical signal output from the microphone in response to the microphone capturing speech audio from a user's mouth while the microphone is at a position relative to the user's mouth, and/or data derived from the electrical signal; and (ii) contextual data, the contextual data originating and/or representing conditions occurring at a time when the speech audio is captured; the computer receiving an output from the discriminative classifier, the output providing a derived value of an approximate position of the microphone relative to the user's mouth, the derived value based at least in part on the plurality of inputs; and the computer determining whether the derived value of the approximate position of the microphone is unacceptable at least in part by comparing the derived value output from the discriminative classifier to a value or range indicative of the microphone being an acceptable distance relative to the user's mouth, and providing a signal if the derived value of the approximate position of the microphone is unacceptable.
20. The method of claim 19, wherein the microphone is part of a head set that further comprises a speaker, and the method comprises the speaker providing an audio indication that the position of the microphone is unacceptable, wherein the speaker providing the audio indication is in response to the computer providing the signal.
21. The method of claim 19, comprising deriving the input from the electrical signal, wherein the input is selected from the group consisting of a Fourier transform and a phenome.
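The final step recited in the independent claims, comparing the derived value to an acceptable value or range and signaling when it falls outside, can be sketched as follows. This is only one possible reading: the centimeter unit, the 1.0 to 5.0 range, the function names, and the callback used to deliver the signal are hypothetical and do not come from the patent text.

```python
def is_unacceptable(derived_value_cm, acceptable_range_cm=(1.0, 5.0)):
    """True when the derived distance lies outside the (assumed) acceptable range."""
    low, high = acceptable_range_cm
    return not (low <= derived_value_cm <= high)


def check_microphone(derived_value_cm, notify, acceptable_range_cm=(1.0, 5.0)):
    """Send a signal through `notify` only when the position is unacceptable.

    `notify` is any callable taking a message string, for example a function
    that plays an audio prompt through a headset speaker.
    """
    if is_unacceptable(derived_value_cm, acceptable_range_cm):
        notify("Please adjust the microphone position.")
        return True
    return False


# Example: collect signals in a list instead of playing audio.
messages = []
too_far = check_microphone(12.0, messages.append)
in_range = check_microphone(3.0, messages.append)
```

Passing the notifier as a callable keeps the comparison separate from how the signal is delivered, so an audio indication as in claim 20 and a visual one could share the same check.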
Patents cited by this patent (236)
Woodburn, William, Access door with integrated switch actuator.
Caballero, Aldo M.; French, Daniel Brant; Hinson, Douglas M.; Kosecki, James C.; Mangicaro, David; Reynolds, Scott; Yeakley, Daniel Duane, Apparatus and methods for monitoring one or more portable data terminals.
Havens, William H.; Barber, Charles P.; Gannon, Colleen; Gardiner, Robert C.; Hennick, Robert J.; Pettinelli, John A., Apparatus operative for capture of image data.
Horn, Erik Van; Giordano, Patrick Anthony; Amundsen, Thomas; Olson, Daniel James; Brady, Robert Hugh; Colavito, Stephen; Saber, Kevin; Haggerty, Thomas; Wilz, Sr., David M., Bar code symbol reading system employing an extremely elongated laser scanning beam capable of reading poor and damaged quality bar code symbols with improved levels of performance.
Xian, Tao; Ellis, Duane; Good, Timothy; Zhu, Xiaoxun, Bar code symbol reading system supporting visual or/and audible display of product scan speed for throughput optimization in point of sale (POS) environments.
Todeschini, Erik; Deloge, Stephen Patrick; Meier, Timothy; Anderson, Donald; Hejl, Benjamin; Koziol, Thomas, Cloud-based system for reading of decodable indicia.
Kearney, Sean Philip; Giordano, Patrick Anthony; Cunningham, Charles Joseph; Bond, Desmond; Amundsen, Thomas, Decodable indicia reading terminal with combined illumination.
Biss, Charles E.; Havens, William H.; Robinson, Michael D.; Balschweit, Paul; Fitch, Timothy R.; McCall, Melvin D.; Gomez, Garrison; McClaude, Mark A.; Longacre, Andrew; Sonneville, Eunice, Device and system for processing image data representing bar codes.
Edmonds, Shane Michael; Keaney, Sean Philip, Hybrid-type bioptical laser scanning and digital imaging system supporting automatic object motion detection at the edges of a 3D scanning volume.
Edmonds, Shane Michael; Kearney, Sean Philip, Hybrid-type bioptical laser scanning and digital imaging system supporting automatic object motion detection at the edges of a 3D scanning volume.
Kearney, Sean Philip, Hybrid-type bioptical laser scanning and imaging system supporting digital-imaging based bar code symbol reading at the surface of a laser scanning window.
Barber, Charles P.; Gerst, Carl W.; Smith, George S.; Hussey, Robert M.; Gardiner, Robert C.; Pankow, Matthew W., Imaging apparatus having imaging assembly.
Barber, Charles P.; Gerst, III, Carl W.; Smith, II, George S.; Hussey, Robert M.; Gardiner, Robert C.; Pankow, Matthew W., Imaging apparatus having imaging assembly.
Havens, William H.; Pitou, David Stewart; McColloch, Laurence Ray; Barber, Charles Paul; Gannon, Colleen Patricia, Imaging module having lead frame supported light source or sources.
Wang, Ynjiun P.; Ahearn, Kevin; Deloge, Stephen P.; Ehrhart, Michael A.; Havens, William H.; Hussey, Robert M.; Koziol, Thomas J.; Li, Jianhua; Li, Jingquan; Montoro, James; Powilleit, Sven M. A., Indicia reading terminal having spatial measurement functionality.
Havens, William H.; Wang, Ynjiun P.; Hennick, Robert J.; Gannon, Colleen; Anderson, Donald; Hunter, Vivian L.; Bremer, Edward C.; Feng, Chen, Indicia reading terminal including focus element with expanded range of focus distances.
Wang, Ynjiun P.; Bremer, Edward C.; Feng, Chen; Gannon, Colleen P.; Havens, William H.; Li, Jianhua; Meier, Timothy P., Indicia reading terminal processing plurality of frames of image data responsively to trigger signal activation.
Hennick, Robert J.; Havens, William H.; Meier, Timothy; McCloskey, Scott; Anderson, Donald; Wang, Ynjiun P.; Hussey, Robert M.; Van Horn, Erik; Kearney, Sean P., Indicia reading terminals and methods for decoding decodable indicia employing light field imaging.
Wilz, Sr., David M., Laser scanning bar code symbol reading system having intelligent scan sweep angle adjustment capabilities over the working range of the system for optimized bar code symbol reading performance.
Xian, Tao; Wang, Ynjiun P.; Liu, Yong; Feng, Chen, Laser scanning code symbol reading system employing multi-channel scan data signal processing with synchronized digital gain control (SDGC) for full range scanning.
Brady, Robert Hugh; Colavito, Stephen; Wilz, Sr., David; Teng, Zhipeng; Dixon, Myron Levon, Laser scanning code symbol reading system providing improved control over the length and intensity characteristics of a laser scan line projected therefrom using laser source blanking control.
Fritz, Bernard; Cox, James Allen; Reutiman, Peter L., Laser scanning system employing an optics module capable of forming a laser beam having an extended depth of focus (DOF) over the laser scanning field.
Havens, William; Kearney, Sean Philip, Laser scanning system using laser beam sources for producing long and short wavelengths in combination with beam-waist extending optics to extend the depth of field thereof while resolving high resolution bar code symbols having minimum code element widths.
Todeschini, Erik, Method and application for scanning a barcode with a smart device while continuously running and displaying an application on the smart device display.
Braho, Keith; El-Jaroudi, Amro; Pike, Jeffrey, Method and system for considering information about an expected response when performing speech recognition.
Van Horn, Erik; Olson, Daniel James, Method of and apparatus for managing and redeeming bar-coded coupons displayed from the light emitting display surfaces of information display devices.
Amundsen, Thomas; Kearney, Sean Philip; Edmonds, Shane Michael; Wang, Ynjiun Paul; Good, Timothy; Miraglia, Michael; Cunningham, IV, Charles Joseph; Zhu, Xiaoxun; Giordano, Patrick Anthony, Method of and system for detecting object weighing interferences.
Amundsen, Thomas; Kearney, Sean Philip; Edmonds, Shane Michael; Wang, Ynjiun Paul; Good, Timothy; Miraglia, Michael; Cunningham, IV, Charles Joseph; Zhu, Xiaoxun; Giordano, Patrick Anthony, Method of and system for detecting produce weighing interferences in a POS-based checkout/scale system.
Van Horn, Erik; Kearney, Sean Philip, Method of and system for reading visible and/or invisible code symbols in a user-transparent manner using visible/invisible illumination source switching during data capture and processing operations.
Berthiaume, Guy H.; Caballero, Aldo M.; Cairns, James A.; Havens, William H.; Koziol, Thomas J.; Stewart, James W.; Wang, Ynjiun P.; Yeakley, Daniel D., Methods and apparatus to change a feature set on data collection devices.
Plesko, George, Molded elastomeric flexural elements for use in a laser scanning assemblies and scanners, and methods of manufacturing, tuning and adjusting the same.
Van Horn, Erik; Kearney, Sean Philip; Giordano, Patrick Anthony; Good, Timothy; Dickinson, Chandler; Au, Ka Man; Wilz, Sr., David; Furlong, John A.; Hejl, Benjamin; Walczyk, Joseph A.; Coyle, Larry; Rosetti, James; Haggerty, Thomas, Multifunction point of sale system.
Good, Timothy, Omnidirectional laser scanning bar code symbol reader generating a laser scanning pattern with a highly non-uniform scan density with respect to line orientation.
Kotlarsky, Anatoly; Zhu, Xiaoxun; Veksland, Michael; Au, Ka Man; Giordano, Patrick; Yan, Weizhen; Ren, Jie; Smith, Taylor; Miraglia, Michael V.; Knowles, C. Harry; Mandal, Sudhin; De Foney, Shawn; Allen, Christopher; Wilz, Sr., David M., Optical code symbol reading system employing a LED-driven optical-waveguide structure for illuminating a manually-actuated trigger switch integrated within a hand-supportable system housing.
Kotlarsky, Anatoly; Zhu, Xiaoxun; Veksland, Michael; Au, Ka Man; Giordano, Patrick; Yan, Weizhen; Ren, Jie; Smith, Taylor; Miraglia, Michael V.; Knowles, C. Harry; Mandal, Sudhin; De Foney, Shawn; Allen, Christopher; Wilz, Sr., David M., Optical code symbol reading system employing an acoustic-waveguide structure for coupling sonic energy, produced from an electro-transducer, to sound wave ports formed in the system housing.
Kotlarsky, Anatoly; Zhu, Xiaoxun; Veksland, Michael; Au, Ka Man; Giordano, Patrick; Yan, Weizhen; Ren, Jie; Smith, Taylor; Miraglia, Michael V.; Knowles, C. Harry; Mandal, Sudhin; De Foney, Shawn; Allen, Christopher; Wilz, Sr., David M., Optical scanning system having an extended programming mode and method of unlocking restricted extended classes of features and functionalities embodied therewithin.
Barten, Henri Jozef Maria, POS-based code symbol reading system with integrated scale base and system housing having an improved produce weight capturing surface design.
Cunningham, Charles; Good, Timothy; Kearney, Sean Philip; Miraglia, Michael; Amundsen, Thomas; Giordano, Patrick; Wang, Yujiun Paul; Zhu, Xiaoxun, Point of sale (POS) based checkout system supporting a customer-transparent two-factor authentication process during product checkout operations.
Barber, Charles P.; Gerst, III, Carl W.; Smith, II, George S.; Hussey, Robert M.; Gardiner, Robert C.; Pankow, Matthew W., Reading apparatus having partial frame operating mode.
Murawski, Mark David; Russell, Philip E., Receiving application specific individual battery adjusted battery use profile data upon loading of work application for managing remaining power of a mobile device.
Soule, III, Robert M.; Berthiaume, Guy H.; Caballero, Aldo Mario; Conti, Brian V.; Harper, Jeffrey Dean; Hooks, Larry K.; Meggitt, Adam Edward; Sauerwein, James T.; Yeakley, Daniel D., Reprogramming system and method for devices including programming symbol.
Maloy, James D.; Kusar, Michael; Mranca, Alexander; Narayan, Venkatesh; Thorsen, Jeffrey, System and method for generating and updating location check digits.
Gomez, Garrison; Siegler, Thomas A.; Soule, III, Robert M.; Daddabbo, Nick; Sperduti, David, System and method to store and retrieve identifier associated information content.
Furlong, John A.; Hernandez, Mark Jose Antonio; Koch, Craig; Nahill, James; Cunningham, IV, Charles Joseph; Kearney, Sean Philip; Smith, Taylor, System having imaging assembly for use in output of image data.
Hendrickson, James; Scott, Debra Drylie; Littleton, Duane; Pecorari, John; Slusarczyk, Arkadiusz, Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment.
Pease, Michael; Bouchat, Christopher; Dobeck, Brian Roman; Sauerwein, Jr., James T.; Youngblood, Eric, Terminal configurable for use within an unknown regulatory domain.
Harding, Andrew C.; Suhr, Jeffrey K.; Allen, Nicholas P., Testing automatic data collection devices, such as barcode, RFID and/or magnetic stripe readers.
Essinger, Steven; Zhu, Xiaoxun; Schnee, Michael; Liu, JiBin; Shen, Xin; Chen, LiangLiang; Lu, Jun, Wireless dual-function network device dynamically switching and reconfiguring from a wireless network router state of operation into a wireless network coordinator state of operation in a wireless communication network.