System and method for recognizing audio pieces via audio fingerprinting
IPC classification information
Country / Type
United States(US) Patent
Granted
International Patent Classification (IPC 7th edition)
G06F-017/30
G06F-017/00
Application number
US-0345548
(2006-01-31)
Registration number
US-7487180
(2009-02-03)
Inventors / Address
Holm, Frode
Hicken, Wendell T.
Applicant / Address
MusicIP Corporation
Agent / Address
Christie, Parker & Hale, LLP.
Citation information
Times cited: 12
Cited patents: 78
Abstract
An audio fingerprinting system and method. A server receives an audio fingerprint of a first audio piece, searches a database for the audio fingerprint, retrieves an audio profile vector associated with the audio fingerprint, updates user preference information based on the audio profile vector, and selects a second audio piece based on the user preference information. The audio fingerprint is generated by creating a matrix based on the frequency measurements of the audio piece, and performing a singular value decomposition of the matrix. To expedite the search of the database and to increase matching accuracy, a subset of candidates in the database is identified based on the most prominent musical notes of the audio piece, and the search is limited to the identified subset. One of the attributes of the audio profile vector is a particular audio class. An identifier for the audio class is generated based on an average of audio fingerprints of the audio pieces belonging to the audio class.
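The matching flow summarized in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the bucket key (a set of note names), the Frobenius-style distance, the element-wise average used for the class fingerprint, and every name in the code (`FingerprintIndex`, `distance`, `class_fingerprint`, the piece identifiers) are assumptions made for the example. Fingerprints are treated as small matrices given as lists of rows.

```python
import math
from collections import defaultdict
from typing import Dict, FrozenSet, Iterable, List, Optional, Tuple

Matrix = List[List[float]]  # a fingerprint as a matrix of vectors (rows)


def distance(a: Matrix, b: Matrix) -> float:
    """Single scalar distance between two equally shaped fingerprints.

    Assumption: Frobenius norm of the element-wise difference.
    """
    return math.sqrt(
        sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    )


def class_fingerprint(fingerprints: List[Matrix]) -> Matrix:
    """Element-wise average of the fingerprints belonging to one audio class."""
    n = len(fingerprints)
    rows, cols = len(fingerprints[0]), len(fingerprints[0][0])
    return [
        [sum(fp[i][j] for fp in fingerprints) / n for j in range(cols)]
        for i in range(rows)
    ]


class FingerprintIndex:
    """Stores fingerprints in buckets keyed by their most prominent notes."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold
        # Hypothetical layout: one bucket per set of prominent note names.
        self.buckets: Dict[FrozenSet[str], List[Tuple[str, Matrix]]] = defaultdict(list)

    def add(self, piece_id: str, notes: Iterable[str], fp: Matrix) -> None:
        self.buckets[frozenset(notes)].append((piece_id, fp))

    def match(self, notes: Iterable[str], fp: Matrix) -> Optional[str]:
        """Search only the bucket for `notes`; return the closest piece
        whose distance is within the threshold, or None if there is none."""
        best_id, best_d = None, self.threshold
        for piece_id, stored in self.buckets.get(frozenset(notes), []):
            d = distance(fp, stored)
            if d <= best_d:
                best_id, best_d = piece_id, d
        return best_id


index = FingerprintIndex(threshold=0.1)
index.add("piece-a", {"C", "E", "G"}, [[1.0, 0.0], [0.0, 1.0]])
index.add("piece-b", {"D", "F", "A"}, [[0.0, 1.0], [1.0, 0.0]])
print(index.match({"C", "E", "G"}, [[0.99, 0.0], [0.0, 1.0]]))  # piece-a
```

Restricting the comparison to one bucket is what keeps the search from scanning every stored fingerprint. A query whose notes select a bucket with no sufficiently close entry returns `None`, which in the claimed method is the case where the client is prompted to generate the audio profile vector.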
Representative claims
What is claimed is:
1. An audio recognition method comprising: receiving an audio fingerprint of a musical piece from a client device; comparing the received audio fingerprint against a plurality of stored audio fingerprints for a match; determining if the received audio fingerprint corresponds to a particular one of the stored audio fingerprints; if the received audio fingerprint corresponds to the particular one of the stored audio fingerprints; retrieving an audio profile vector stored in association with the particular one of the stored audio fingerprints, the audio profile vector including at least N numerical values quantifying N acoustic attributes of the musical piece, wherein N>0, and wherein at least one of the acoustic attributes is tempo, and the associated numerical value quantifies the tempo of the musical piece based on an automatic processing of audio signals of the musical piece by a music profiling engine; and transmitting information stored in association with the retrieved audio profile vector to the client device for doing at least one of generating a music playlist, making music related recommendations, and making other music-related selections; and if the received audio fingerprint does not correspond to the particular one of the stored audio fingerprints, prompting the client device for generating the audio profile vector.
2. The method of claim 1, wherein the audio profile vector is generated based on an automatic processing of audio signals of the audio piece.
3. The method of claim 1, wherein one of the plurality of acoustic attributes included in the audio profile vector is associated with a particular audio class, and the numerical value indicates a distance of the audio piece to the audio class.
4. The method of claim 3, wherein the audio class is identified based on an audio class fingerprint, the audio class fingerprint being an average of audio fingerprints of audio pieces associated with the audio class.
5. The method of claim 4, wherein the numerical value indicating the distance of the audio piece to the audio class is determined based on a distance calculation of the received audio fingerprint and the audio class fingerprint.
6. The method of claim 1 further comprising: identifying an index of a subset of the plurality of stored audio fingerprints, the index identifying a plurality of musical notes determined to be most prominent for the audio fingerprints in the subset; and searching the identified subset for the match.
7. The method of claim 1, wherein if none of the stored audio fingerprints correspond to the received audio fingerprint, invoking the client device to generate the audio profile vector of the audio piece.
8. The method of claim 1, wherein the received and stored audio fingerprints are each represented as a matrix of vectors.
9. The method of claim 8, wherein the received audio fingerprint corresponds to the particular one of the stored audio fingerprints if a distance computation between the matrix representing the received audio fingerprint and the matrix representing the particular one of the stored audio fingerprints results in a single scalar distance value that satisfies a threshold distance.
10. The method of claim 1, wherein the information transmitted to the client device is the retrieved audio profile vector.
11. The method of claim 1 further comprising: recommending a music item based on the audio profile vector.
12. The method of claim 1 further comprising: receiving from the client device a user rating with the received audio fingerprint; modifying user preference information based on the user rating and the retrieved audio profile vector; and recommending a music item based on the user preference information.
13. The method of claim 1, wherein none of the N numerical values quantifying the N acoustic attributes of the musical piece is determined based on human analysis of the musical piece.
14. An audio recognition method comprising: receiving an audio fingerprint of an audio piece; comparing the received audio fingerprint against a plurality of stored audio fingerprints for a match; identifying the audio piece responsive to a match of the audio fingerprint; and retrieving information stored in association with the identified audio piece, wherein the audio fingerprint is a representation of matrix V^T generated from a singular value decomposition (SVD) of an N×M matrix A, the matrix A being built based on frequency measurements of audio signals associated with the audio piece, wherein SVD(A)=USV^T, where U is an N×M orthogonal matrix, S is an M×M diagonal matrix, and V^T is a transpose of an M×M orthogonal matrix.
15. The method of claim 14, wherein rows of the matrix A represent time, and columns of the matrix A represent the frequency measurements.
16. An audio recognition system comprising: a first data store storing a plurality of audio fingerprints for a plurality of audio pieces; one or more processors; one or more memory devices operably coupled to the one or more processors storing program instructions therein, each of the one or more processors being operable to execute one or more of the program instructions, the program instructions including: receiving an audio fingerprint of a particular musical piece from a client device; comparing the received audio fingerprint against the plurality of stored audio fingerprints for a match; determining if the received audio fingerprint corresponds to a particular one of the stored audio fingerprints; retrieving an audio profile vector stored in association with the particular one of the stored audio fingerprints if the received audio fingerprint corresponds to the particular one of the stored audio fingerprints, the audio profile vector including at least N numerical values quantifying N acoustic attributes of the musical piece, wherein N>0, and wherein at least one of the acoustic attributes is tempo, and the associated numerical value quantifies the tempo of the musical piece based on an automatic processing of audio signals of the audio piece by a music profiling engine; transmitting information stored in association with the retrieved audio profile vector to the client device for doing at least one of generating a music playlist, making music related recommendations, and making other music-related selections; and prompting the client device for generating the audio profile vector if the received audio fingerprint does not correspond to the particular one of the stored audio fingerprints.
17. The system of claim 16 further comprising: a second data store storing the audio profile vector in association with the particular one of the stored audio fingerprints, the audio profile vector being generated based on an automatic processing of audio signals of the audio piece.
18. The system of claim 17, wherein one of the plurality of acoustic attributes included in the audio profile vector is associated with a particular audio class, and the numerical value indicates a distance of the audio piece to the audio class.
19. The system of claim 18, wherein the audio class is identified based on an audio class fingerprint, the audio class fingerprint being an average of audio fingerprints of audio pieces associated with the audio class.
20. The system of claim 16, wherein the program instructions further include: identifying an index of a subset of the plurality of stored audio fingerprints based on the audio fingerprint, the index identifying a plurality of musical notes determined to be most prominent for the audio fingerprints in the subset; and searching the identified subset for the match.
21. An audio recognition system comprising: a first data store storing a plurality of audio fingerprints for a plurality of audio pieces; one or more processors; one or more memory devices operably coupled to the one or more processors storing program instructions therein, each of the one or more processors being operable to execute one or more of the program instructions, the program instructions including: receiving an audio fingerprint of a particular audio piece; comparing the received audio fingerprint against the plurality of stored audio fingerprints for a match; identifying the audio piece responsive to a match of the audio fingerprint; and retrieving information stored in association with the identified audio piece, wherein the audio fingerprint is a representation of matrix V^T generated from a singular value decomposition (SVD) of an N×M matrix A, the matrix A being built based on frequency measurements of audio signals associated with the audio piece, wherein SVD(A)=USV^T, where U is an N×M orthogonal matrix, S is an M×M diagonal matrix, and V^T is a transpose of an M×M orthogonal matrix.
22. The system of claim 21, wherein rows of the matrix A represent time, and columns of the matrix A represent the frequency measurements.
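For reference, the decomposition recited in claims 14 and 21 can be written out with its dimensions as below. This is a reading of the claim language rather than text from the patent: it assumes real-valued frequency measurements, at least as many time rows as frequency columns (N ≥ M), and the reduced form of the SVD, in which the N×M matrix U is "orthogonal" in the sense of having orthonormal columns. The symbols σ_i and I_M are introduced here for illustration.

```latex
A \in \mathbb{R}^{N \times M}, \qquad A = U S V^{T},
\qquad
U \in \mathbb{R}^{N \times M},\; U^{T} U = I_M,
\qquad
S = \operatorname{diag}(\sigma_1, \dots, \sigma_M),
\qquad
V \in \mathbb{R}^{M \times M},\; V^{T} V = V V^{T} = I_M
```

Under this reading, the rows of A index time and its columns index the frequency measurements, so the M×M matrix V^T, from which the fingerprint is derived, has a size fixed by the number of frequency measurements and independent of the number of time rows.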
Patents cited by this patent (78)
Sass, Jonathan, Apparatus for distributing and playing audio information.
Cifra, Christopher G.; Joffrain, Philippe G.; Moriat, Alain G.; Restat, Christophe A.; Pasquarette, John A.; Fletcher, J. Clinton, Automatic configuration of function blocks in a signal analysis system.
Hinman, Jeremy D. F.; Kaiser, Rolf W.; Stanfield, Geoffrey R.; Deeds, Paul; Flannery, Frank; Rivelli, David, Classification of information and use of classifications in searching and retrieval of information.
Heckerman David E. (Bellevue WA) Breese John S. (Mercer Island WA) Horvitz Eric (Kirkland WA) Chickering David Maxwell (Los Angeles CA), Collaborative filtering utilizing a belief network.
Bellegarda, Jerome; Naik, Devang; Neeracher, Matthias; Silverman, Kim, Combined dual spectral and temporal alignment method for user authentication by voice.
Bergh Christopher P. ; Metral Max E. ; Ritter David Henry ; Sheena Jonathan Ari ; Sullivan James J., Distributed system for facilitating exchange of user information and opinion using automated collaborative filtering.
Adam K. Kolawa ; Arthur Ray Hicken ; Stephen Frankini ; Skirmantas Kligys ; Emelina Gesheshyen ; Maxim Troukhan, Method and apparatus for automated selection, organization, and recommendation of items.
Gaske, T. Paul; Kepley, Walter R.; Casavant, Scott; Joseph, Kuriacose, Method and apparatus for background caching of encrypted programming data for later playback.
Atcheson John (San Francisco CA) Miller ; III James R. (Stanford CA), Method and apparatus for recommending selections based on preferences in a multi-user system.
Blum Thomas L. ; Keislar Douglas F. ; Wheaton James A. ; Wold Erling H., Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information.
Kretsch Mary J. (Vallejo CA) Gunn Moira A. (San Francisco CA) Fong Alice K. (San Francisco CA), Method and system for measurement of intake of foods, nutrients and other food components in the diet.
Alcalde, Vicenç Gaitan; Ullod, Carlos María López; Bonet, Antonio Trias; Llopis, Antonio Trias; Marcos, Jesús Sanz; Ysern, Daniel Caldentey; Arkwright, Dominic, Method and system for music recommendation.
Tracy William X. ; Roslak Thomas K. ; Murrah Judith ; Riso Francis ; Beach Robert ; Sandler Robert ; Klein John, Method and system for presenting item information using a portable data terminal.
Ellis Michael D. (Boulder CO) Dunn Stephen M. (Boulder CO) Fellinger Michael W. (Boulder CO) Younglove Fancy B. (Boulder CO) James David M. (Fort Collins CO) Clifton David L. (Boulder CO) Land Richar, Method and system for producing a signature characterizing an audio broadcast signal.
Gjerdingen, Robert O.; Khan, Rehan M.; Mathys, Marc; Pirkner, Christian D.; Rice, Pete W.; Sulzer, Thomas R., Method for creating a database for comparing music.
Breese John S. ; Kadie Carl M., Methods and apparatus for matching entities and for predicting an attribute of an entity based on an attribute frequency value.
David M. Bebiak ; Sandeep Bhatnagar ; Timothy Gleason ; Cathryn Fritz-Jung ; Karl Kettinger ; Daryl F. Klas ; Bhajmohan Singh ; Donald Speck ; Jill Stoll, Methods for customizing pet food.
Tagawa, Junichi; Yamane, Hiroaki; Misaki, Masayuki, Sound features extracting apparatus, sound data registering apparatus, sound data retrieving apparatus, and methods and programs for implementing the same.
Herz Frederick ; Ungar Lyle ; Zhang Jian ; Wachob David ; Salganicoff Marcos, System and method for scheduling broadcast of and access to video programs and other data using customer profiles.
Atkins Charles Agee (Charleston SC), System for the operation and management of one or more financial accounts through the use of a digital communication and.
Kanevsky Dimitri ; Zadrozny Wlodek Wlodzimierz, Training speech recognition by matching audio segment frequency of occurrence with frequency of words and letter combinations in a corpus.
Kurozumi, Takayuki; Nagano, Hidehisa; Kashino, Kunio, Sound signal detection system, sound signal detection server, image signal search apparatus, image signal search method, image signal search program and medium, signal search apparatus, signal search method and signal search program and medium.