[특허]System and method for keyword spotting using representative dictionary

System and method for keyword spotting using representative dictionary 원문보기

IPC분류정보
국가/구분	United States(US) Patent 등록
국제특허분류(IPC7판)	G06F-017/27 G06F-017/30 G06F-021/55
출원번호	US-0704702 (2017-09-14)
등록번호	US-10198427 (2019-02-05)
우선권정보	IL-224482 (2013-01-29)
발명자 / 주소	Yishay, Yitshak
출원인 / 주소	VERINT SYSTEMS LTD.
대리인 / 주소	Meunier Carlin & Curfman LLC
인용정보	피인용 횟수 : 0 인용 특허 : 69

초록 ▼

Methods and systems for keyword spotting, i.e., for identifying textual phrases of interest in input data. In the embodiments described herein, the input data comprises communication packets exchanged in a communication network. The disclosed keyword spotting techniques can be used, for example, in applications such as Data Leakage Prevention (DLP), Intrusion Detection Systems (IDS) or Intrusion Prevention Systems (IPS), and spam e-mail detection. A keyword spotting system holds a dictionary of textual phrases for searching input data. In a communication analytics system, for example, the dictionary defines textual phrases to be located in communication packets—such as e-mail addresses or Uniform Resource Locators (URLs).

대표청구항 ▼

1. A method for searching input data for textual phrases, the method comprising: providing a system having an external memory containing a first dictionary of first textual phrases and a cache memory containing a second dictionary of second textual phrases, wherein the cache memory has a faster access speed than the external memory, and wherein the second dictionary represents the first dictionary but has a smaller data size than the first dictionary because the second textual phrases are sub-strings derived from the first textual phrases that are shorter than the first textual phrases;receiving input data using the system;searching the input data with the second dictionary;in response to identifying in the input data a second textual phrase from the second dictionary, locating in the input data a first textual phrase from the first dictionary corresponding to the identified second textual phrase; andusing the located first textual phrase to perform one of data leakage prevention, intrusion detection, intrusion prevention, spam e-mail detection, or detection of inappropriate content. 2. The method according to claim 1, wherein each first textual phrase in the first dictionary corresponds to at least one of the second textual phrases in the second dictionary. 3. The method according to claim 1, wherein the first textual phrases are strings of characters that include wildcard characters. 4. The method according to claim 3, wherein a string of characters corresponds to a data communication packet. 5. The method according to claim 4, wherein the second dictionary comprises rectangles, wherein each rectangle comprises a list of sub-strings. 6. The method according to claim 5, wherein each sub-string in a rectangle has the same number of characters. 7. The method according to claim 1, wherein a plurality of first textual phrases in the first dictionary correspond to a single second textual phrase in the second dictionary. 8. The method according to claim 1, wherein the first textual phrases include commonly found sub-strings that are common to a majority of the first textual phrases, and wherein the second textual phrases do not include the commonly found sub-strings. 9. The method according to claim 1, wherein the cache memory is large enough to contain the second dictionary but is too small to contain the first dictionary. 10. A system for searching input data for textual phrases, the system comprising: an external memory containing a first dictionary of first textual phrases;a cache memory containing a second dictionary of second textual phrases, wherein the cache memory has a faster access speed than the external memory, and wherein the second dictionary represents the first dictionary but has a smaller data size than the first dictionary because the second textual phrases are sub-strings derived from the first textual phrases that are shorter than the first textual phrases;a network interface card (NIC) that receives input data from a network; anda processor that is communicatively coupled to the external memory, the cache memory, and the NIC, wherein the processor is configured by software to:receive the input data from the NIC,search the input data with the second dictionary,in response to identifying in the input data a second textual phrase from the second dictionary, locating in the input data a first textual phrase from the first dictionary corresponding to the identified second textual phrase, andusing the located first textual phrase to perform one of data leakage prevention,intrusion detection, intrusion prevention, spam e-mail detection, or detection of inappropriate content. 11. The system according to claim 10, wherein the textual phrases comprise e-mail addresses and/or uniform resource locators (URLs). 12. The system according to claim 10, wherein each first textual phrase in the first dictionary corresponds to at least one of the second textual phrases in the second dictionary. 13. The system according to claim 10, wherein the first textual phrases are strings of characters that include wildcard characters. 14. The system according to claim 13, wherein a string of characters corresponds to a data communication packet. 15. The system according to claim 14, wherein the second dictionary comprises rectangles, wherein each rectangle comprises a list of sub-strings. 16. The system according to claim 15, wherein each sub-string in a rectangle has the same number of characters. 17. The system according to claim 10, wherein a plurality of first textual phrases in the first dictionary correspond to a single second textual phrase in the second dictionary. 18. The system according to claim 10, wherein the first textual phrases include commonly found sub-strings that are common to a majority of the first textual phrases, and wherein the second textual phrases do not include the commonly found sub-strings. 19. The system according to claim 10, wherein the cache memory is large enough to contain the second dictionary but is too small to contain the first dictionary. 20. The system according to claim 10, wherein the cache memory is a level-two (L2) cache of the processor.

이 특허에 인용된 특허 (69)

Baarman David W. (Zeeland MI) Richards David M. (Littleton CO 4), Apparatus and method for effecting data compression.
상세보기
Kalinichenko, Michael, Application of nested behavioral rules for anti-malware processing.
상세보기
Blanksteen, Scott I., Automated removal of personally identifiable information.
상세보기
Hicks, Matthew J.; Bieda, Teresa M., Automated workflow generation.
상세보기
Cowan, Joe; Esposito, Robert Edward; Dawson, Travis Edward; Ranjan, Supranamaya, Botnet beacon detection.
상세보기
Toshimi Yokota JP; Hiroshi Shojima JP; Soshiro Kuzunuki JP; Toshifumi Arai JP; Masaki Miura JP; Keiko Gunji JP; Yasushi Fukunaga JP, Collaborative learning system and pattern recognition method.
상세보기
Bass Vance R. (Austin TX) Bonebrake Veronica A. (Leander TX) Garrison David A. (Austin TX) Landis James K. (Austin TX) Neff Mary S. (Montrose NY) Urquhart Robert J. (Austin ; both of TX) Williams Sus, Compound word spelling verification.
상세보기
Bass Vance R. (Austin TX) Bonebrake Veronica A. (Leander TX) Garrison David A. (Austin TX) Landis James K. (Austin TX) Neff Mary S. (Montrose NY) Urquhart Robert J. (Austin TX) Williams Susan C. (Aus, Compound word suitability for spelling verification.
상세보기
Njemanze, Hugh S.; Kothari, Pravin S.; Dash, Debabrata; Wang, Shijie, Correlation engine with support for time-based rules.
상세보기
Rubin, Gregory A., Detection of and responses to network attacks.
상세보기
Jordan, Christopher J., Device, system and method for defending a computer network.
상세보기
Swanson Daniel R. (Dallas TX) Moen Jerry M. (Plano TX) Tate Bradley M. (Carrollton TX), Event surveillance system.
상세보기
McFadden, Brian D., Flexible rule-based communication system and method for controlling the flow of and access to information between computer users.
상세보기
Aragon David B. (Berkeley CA), Fuzzy string matcher.
상세보기
Church Kenneth Ward ; Dagan Ido,ILX, Glossary construction tool.
상세보기
Mayer Laurance W. ; Spear Daniel S., High speed data searching for information in a computer system.
상세보기
Baker, Steven D.; Lamping, John O., Identifying a synonym with N-gram agreement for a query phrase.
상세보기
Cerna, Michael D.; Nagle, James C.; Ruan, Qing; Schmidt, Darren R.; Wenzel, Lothar, Identifying randomly distributed microparticles in images to sequence a polynucleotide.
상세보기
Liang,Yung Chang, Innoculation of computing devices against a selected computer virus.
상세보기
Maruyama Fuyuki,JPX ; Sai Akira,JPX, Language processing apparatus and method.
상세보기
Ranjan, Supranamaya, Machine learning based botnet detection using real-time extracted traffic features.
상세보기
Ranjan, Supranamaya; Chen, Feilong, Machine learning based botnet detection with dynamic adaptation.
상세보기
Carus Alwin B. ; Good Kathleen, Method and apparatus for automatic identification of word boundaries in continuous text and computation of word boundary scores.
상세보기
Chiang, Hui-Hwa; Lee, Kuo-Chun; Chen, Tsung-Yen (Eric); Han, Ching-Chih (Jason), Method and apparatus for automatically recording snapshots of a computer screen during a computer session for later playback.
상세보기
Datta Utpal ; Carlson David G., Method and apparatus for digital data compression.
상세보기
Barger, Richard, Method and apparatus for disrupting the command and control infrastructure of hostile programs.
상세보기
Zolotov, Moshe, Method and system for creating real time integrated Call Details Record (CDR) databases in management systems of telecommunication networks.
상세보기
Ichbiah Jean D. (58 Lexington St. Essex MA 01803), Method and system for entering text in computer equipment.
상세보기
Czarnecki,David Anthony; Bufi,Corey Nicholas; Simmons,Melvin Kurt, Method and system for event phrase identification.
상세보기
Kaufman Ilia,CAX, Method and system for retrieving relevant documents from a database.
상세보기
McIllwaine, John C. C.; McConnell, Matthew G. A., Method and system for scheduled delivery of training to call center agents.
상세보기
Sykes, Mark; Baldock, George Ronald, Method for converting speech to text, performing natural language processing on the text output, extracting data values and matching to an electronic ticket form.
상세보기
Fritchman, Barry Lynn, Method for execution of query to search strings of characters that match pattern with a target string utilizing bit vector.
상세보기
Zamora Antonio (Bethesda MD), Method for isolation of Chinese words from connected Chinese text.
상세보기
Amit,Noah; Amit,Yoni; Eadan,Zvi, Method of surveilling internet communication.
상세보기
Toma Peter P. (5467 Bahia La. La Jolla CA 92037), Method using a programmed digital computer system for translation between natural languages.
상세보기
Kanter, Max L., Method, medium, and system for online ordering using sign language.
상세보기
Bates,Cary Lee; Day,Paul Reuben; Santosuosso,John Matthew, Method, system, and program for checking contact information.
상세보기
Panigrahy,Rina; Nelson,William; Nguyen,Anh Tien, Methods and apparatus for regular expression matching.
상세보기
Hession,Patrick; McCormack,Tony; Hickey,James, Methods of monitoring communications sessions in a contact centre.
상세보기
Potter Terry W. (Acton MA) Worrell Glen C. (Auburn MA), Parallel associative memory having improved selection and decision mechanisms for recognizing and sorting relevant patte.
상세보기
Hostetter, Mathew; Steele, Kenneth M.; Aggarwal, Vijay, Pattern matching.
상세보기
Wenzel,Lothar; Vazquez,Nicolas; Schultz,Kevin L.; Nair,Dinesh, Pattern matching using multiple techniques.
상세보기
Karttunen, Lauri J, Region-matching transducers for natural language processing.
상세보기
Lu X. Allan (Centerville OH) Klein Timothy M. (Miamisburg OH), Short case name generating method and apparatus.
상세보기
Blair, Christopher Douglas; Keenan, Roger Louis, Signal monitoring apparatus analyzing voice communication content.
상세보기
Christopher Douglas Blair GB; Roger Louis Keenan GB, Signal monitoring apparatus for analyzing communications.
상세보기
Hermansen, John Christian; Shaefer, Jr., Leonard Arthur; McCallum-Bayliss, Heather; Lutz, Richard D., System and method for adaptive multi-cultural searching and matching of personal names.
상세보기
Blair,Christopher Douglas, System and method for analysing communication streams.
상세보기
Blair, Christopher Douglas, System and method for analysing communications streams.
상세보기
Kushler,Clifford A.; Marsden,Randal J., System and method for continuous stroke word-based text input.
상세보기
Zaitsev, Oleg V., System and method for establishing rules for filtering insignificant events for analysis of software program.
상세보기
Crosbie,Mark; Shepley,Rosemarie; Kuperman,Benjamin; Frayman,Leonard L., System and method for host and network based intrusion detection and response.
상세보기
Rockwood, Troy Dean, System and method for interactive correlation rule design in a network security system.
상세보기
Yishay, Yitshak, System and method for keyword spotting using representative dictionary.
상세보기
Nair, Dinesh; Lin, Siming; Schmidt, Darren; Vazq?ez, Nicolas, System and method for locating color and pattern match regions in a target image.
상세보기
Fallon, James J.; Bo, Steven L., System and method for lossless data compression and decompression.
상세보기
Vogel, Claude, System and method for parsing a document.
상세보기
Rozman, Allen F.; Cioffi, Alfonso J., System and method for protecting a computer system from malicious software.
상세보기
Rozman, Allen F.; Cioffi, Alfonso J., System and method for protecting a computer system from malicious software.
상세보기
Rozman, Allen F.; Cioffi, Alfonso J., System and method for protecting a computer system from malicious software.
상세보기
Rozman, Allen F.; Cioffi, Alfonso J., System and method for protecting a computer system from malicious software.
상세보기
Honig,Andrew; Howard,Andrew; Eskin,Eleazar; Stolfo,Salvatore J., System and methods for adaptive model generation for detecting intrusions in computer systems.
상세보기
Dewey, David Bryan; Freeman, Robert G.; Griswold, Paul Elliott, System, method and program product for detecting computer attacks.
상세보기
Goldfarb, Eithan; Altman, Yuval; Horovitz, Itsik; Yaari, Gur, Systems and methods for efficient keyword spotting in communication traffic.
상세보기
Zhilyaev Maxim, Test classification system and method.
상세보기
Nachenberg, Carey S., Using temporal attributes to detect malware.
상세보기
Blair,Christopher Douglas; Keenan,Roger Louis, Voice interaction analysis module.
상세보기
Miller John W., Word prediction system.
상세보기

IPC	Description
A	생활필수품
A62	인명구조; 소방(사다리 E06C)
A62B	인명구조용의 기구, 장치 또는 방법(특히 의료용에 사용되는 밸브 A61M 39/00; 특히 물에서 쓰이는 인명구조 장치 또는 방법 B63C 9/00; 잠수장비 B63C 11/00; 특히 항공기에 쓰는 것, 예. 낙하산, 투출좌석 B64D; 특히 광산에서 쓰이는 구조장치 E21F 11/00)
A62B-1/08	.. 윈치 또는 풀리에 제동기구가 있는 것

내보내기 구분	파일저장 인쇄 메일전송
구성항목	기본정보 상세정보 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표IPC 관리번호, 국가코드, 자료구분, 상태, 출원번호, 출원일자, 공개번호, 공개일자, 공고번호, 공고일자, 등록번호, 등록일자, 발명명칭(한글), 발명명칭(영문), 출원인(한글), 출원인(영문), 출원인코드, 대표출원인, 출원인국적, 출원인주소, 발명자, 발명자E, 발명자코드, 발명자주소, 발명자 우편번호, 발명자국적, 대표IPC, IPC코드, 요약, 미국특허분류, 대리인주소, 대리인코드, 대리인(한글), 대리인(영문), 국제공개일자, 국제공개번호, 국제출원일자, 국제출원번호, 우선권, 우선권주장일, 우선권국가, 우선권출원번호, 원출원일자, 원출원번호, 지정국, Citing Patents, Cited Patents
저장형식	Text(ASCII format) Excel format PIAS분석(.xls)
메일정보	받는사람 (필수) @ 보내는사람 (선택) @ 제목 내용 KISTI 검색결과 이메일 서비스
안내	총 건의 자료가 검색되었습니다. 다운받으실 자료의 인덱스를 입력하세요. (1-10,000) 검색결과의 순서대로 최대 10,000건 까지 다운로드가 가능합니다. 데이타가 많을 경우 속도가 느려질 수 있습니다.(최대 2~3분 소요) 다운로드 파일은 UTF-8 형태로 저장됩니다. 파일의 내용이 제대로 보이지 않을실 때는 웹브라우저 상단의 보기 -> 인코딩 -> 자동선택 여부를 확인하십시오. ~ Text(ASCII format) Excel format

연합인증

System and method for keyword spotting using representative dictionary 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

이 특허에 인용된 특허 (69)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트

연합인증

System and method for keyword spotting using representative dictionary 원문보기

초록 ▼

대표청구항 ▼

연구과제 타임라인

전체(0) 논문(0) 특허(0) 보고서(0)

전체(0) 논문(0) 특허(0) 보고서(0)

이 특허에 인용된 특허 (69)

관련 콘텐츠

특허 원문 보기

IPC 상위 출원인

AI-Helper ※ AI-Helper는 오픈소스 모델을 사용합니다.

선택된 텍스트