[US Patent]
Video scene classification by activity
IPC Classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06K-009/00
H04N-005/232
H04N-005/77
G11B-027/031
H04N-021/8549
G11B-027/00
G11B-027/22
G11B-027/34
H04N-005/91
H04N-009/82
Application Number
US-0513151 (2014-10-13)
Registration Number
US-9984293 (2018-05-29)
Inventors / Address
Hodulik, Nick
Taylor, Jonathan
Applicant / Address
GoPro, Inc.
Attorney / Address
Sheppard Mullin Richter & Hampton LLP
Citation Information
Times cited: 0
Cited patents: 31
Abstract
Video and corresponding metadata are accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video clips selected from among sets of candidate video clips. Best scenes can also be identified by receiving an indication of an event of interest within video from a user during the capture of the video. Metadata patterns representing activities identified within video clips can be identified within other videos, which can subsequently be associated with the identified activities.
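The abstract describes a pipeline: metadata recorded alongside the video (e.g., velocity or acceleration) is scanned for events of interest, best scenes are selected around those events, and a summary is assembled by filling a template's slots with candidate clips. Below is a minimal Python sketch of that flow; the acceleration threshold, padding values, dataclass fields, and helper names are illustrative assumptions, not the patent's actual implementation.

```python
from dataclasses import dataclass

@dataclass
class MetadataSample:
    time_s: float         # timestamp within the video (seconds)
    acceleration: float   # e.g., a reading captured alongside the video

def find_events_of_interest(samples, accel_threshold=15.0):
    """Flag moments whose metadata suggests an event of interest.

    The threshold is an illustrative assumption; the patent leaves the
    criteria open (and varies them by activity type, per claim 3).
    """
    return [s.time_s for s in samples if abs(s.acceleration) >= accel_threshold]

def best_scenes(event_times, before_s=2.0, after_s=3.0):
    """Turn each event moment into a candidate scene (start, end)."""
    return [(max(0.0, t - before_s), t + after_s) for t in event_times]

def fill_summary_template(slots, candidate_scenes):
    """Assign one candidate scene to each template slot, in order.

    A real implementation would rank candidates per slot; this sketch
    simply takes the first len(slots) scenes.
    """
    return dict(zip(slots, candidate_scenes))

samples = [MetadataSample(1.0, 2.0), MetadataSample(5.2, 18.5), MetadataSample(9.7, 21.0)]
events = find_events_of_interest(samples)      # [5.2, 9.7]
scenes = best_scenes(events)                   # [(3.2, 8.2), (7.7, 12.7)]
print(fill_summary_template(["opening", "peak"], scenes))
```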
Representative Claims
1. A method for identifying scenes in videos, the method comprising: obtaining one or more electronic files defining a video captured with a camera, the one or more electronic files including event of interest information indicating reception of verbal input during the capture of the video, the verbal input identifying an occurrence of an event of interest within the video, the event of interest occurring at an event moment within the video, the event of interest information identifying (i) a given input type of the verbal input, input types of the verbal input including a first input type, a second input type, and a third input type, and (ii) an input moment during the capture of the video at which the verbal input was received, wherein the first input type indicates the input moment occurring before the event moment, the second input type indicates the input moment occurring during the event moment, and the third input type indicates the input moment occurring after the event moment; identifying the input moment based on the event of interest information; identifying the given input type of the verbal input based on the event of interest information; identifying the event moment based on the input moment and the given input type, wherein the event moment is identified to occur before the input moment based on the given input type being the first type, the event moment is identified to occur during the input moment based on the given input type being the second type, and the event moment is identified to occur after the input moment based on the given input type being the third type; identifying a portion of the video as a video clip associated with the event of interest based on the event of interest information, the video clip comprising a first time amount of the video occurring before the event moment and a second time amount of the video occurring after the event moment, the first time amount and the second time amount being determined based on a type of an activity captured within the video; and storing clip information indicating the association of the video clip with the event of interest and the portion of the video included in the video clip.
2. The method of claim 1, wherein the occurrence of the event of interest is identified further based on metadata associated with the video, the metadata captured during the capture of the video, the metadata characterizing velocity or acceleration of the activity captured within the video.
3. The method of claim 2, wherein a metadata criteria for identifying the occurrence of the event of interest based on the metadata associated with the video is based on the type of the activity captured within the video such that a first metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a first type and a second metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a second type, the first metadata criteria being different from the second metadata criteria.
4. The method of claim 1, wherein the verbal input comprises a spoken command associated with tagging events of interest.
5. The method of claim 1, wherein the event moment includes a point in time within the video or a duration of time within the video.
6. The method of claim 1, further comprising: identifying one or more non-event moments within the video, the one or more non-event moments not associated with any event of interest within the video; and storing non-event information indicating the one or more non-event moments.
7. The method of claim 6, wherein the one or more non-event moments are identified based on matching metadata associated with the video with a metadata pattern determined to not be of interest to a user, the metadata characterizing velocity of the activity captured within the video, acceleration of the activity captured within the video, visuals captured within the video, or audio captured within the video.
8. A system for identifying scenes in videos, the system comprising: one or more physical processors configured by computer-readable instructions to: obtain one or more electronic files defining a video captured with a camera, the one or more electronic files including event of interest information indicating reception of verbal input during the capture of the video, the verbal input identifying an occurrence of an event of interest within the video, the event of interest occurring at an event moment within the video, the event of interest information identifying (i) a given input type of the verbal input, input types of the verbal input including a first input type, a second input type, and a third input type, and (ii) an input moment during the capture of the video at which the verbal input was received, wherein the first input type indicates the input moment occurring before the event moment, the second input type indicates the input moment occurring during the event moment, and the third input type indicates the input moment occurring after the event moment; identify the input moment based on the event of interest information; identify the given input type of the verbal input based on the event of interest information; identify the event moment based on the input moment and the given input type, wherein the event moment is identified to occur before the input moment based on the given input type being the first type, the event moment is identified to occur during the input moment based on the given input type being the second type, and the event moment is identified to occur after the input moment based on the given input type being the third type; identify a portion of the video as a video clip associated with the event of interest based on the event of interest information, the video clip comprising a first time amount of the video occurring before the event moment and a second time amount of the video occurring after the event moment, the first time amount and the second time amount being determined based on a type of an activity captured within the video; and store clip information indicating the association of the video clip with the event of interest and the portion of the video included in the video clip.
9. The system of claim 8, wherein the occurrence of the event of interest is identified further based on metadata associated with the video, the metadata captured during the capture of the video, the metadata characterizing velocity or acceleration of the activity captured within the video.
10. The system of claim 9, wherein a metadata criteria for identifying the occurrence of the event of interest based on the metadata associated with the video is based on the type of the activity captured within the video such that a first metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a first type and a second metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a second type, the first metadata criteria being different from the second metadata criteria.
11. The system of claim 8, wherein the verbal input comprises a spoken command associated with tagging events of interest.
12. The system of claim 8, wherein the event moment includes a point in time within the video or a duration of time within the video.
13. The system of claim 8, wherein the one or more physical processors are further configured by the computer-readable instructions to: identify one or more non-event moments within the video, the one or more non-event moments not associated with any event of interest within the video; and store non-event information indicating the one or more non-event moments.
14. The system of claim 13, wherein the one or more non-event moments are identified based on matching metadata associated with the video with a metadata pattern determined to not be of interest to a user, the metadata characterizing velocity of the activity captured within the video, acceleration of the activity captured within the video, visuals captured within the video, or audio captured within the video.
15. A non-transitory computer-readable storage medium storing instructions for identifying scenes in videos, the instructions, when executed by one or more physical processors, configured to cause the one or more physical processors to: obtain one or more electronic files defining a video captured with a camera, the one or more electronic files including event of interest information indicating reception of verbal input during the capture of the video, the verbal input identifying an occurrence of an event of interest within the video, the event of interest occurring at an event moment within the video, the event of interest information identifying (i) a given input type of the verbal input, input types of the verbal input including a first input type, a second input type, and a third input type, and (ii) an input moment during the capture of the video at which the verbal input was received, wherein the first input type indicates the input moment occurring before the event moment, the second input type indicates the input moment occurring during the event moment, and the third input type indicates the input moment occurring after the event moment; identify the input moment based on the event of interest information; identify the given input type of the verbal input based on the event of interest information; identify the event moment based on the input moment and the given input type, wherein the event moment is identified to occur before the input moment based on the given input type being the first type, the event moment is identified to occur during the input moment based on the given input type being the second type, and the event moment is identified to occur after the input moment based on the given input type being the third type; identify a portion of the video as a video clip associated with the event of interest based on the event of interest information, the video clip comprising a first time amount of the video occurring before the event moment and a second time amount of the video occurring after the event moment, the first time amount and the second time amount being determined based on a type of an activity captured within the video; and store clip information indicating the association of the video clip with the event of interest and the portion of the video included in the video clip.
16. The computer-readable storage medium of claim 15, wherein the occurrence of the event of interest is identified further based on metadata associated with the video, the metadata captured during the capture of the video, the metadata characterizing velocity or acceleration of the activity captured within the video.
17. The computer-readable storage medium of claim 16, wherein a metadata criteria for identifying the occurrence of the event of interest based on the metadata associated with the video is based on the type of the activity captured within the video such that a first metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a first type and a second metadata criteria is used to identify the occurrence of the event of interest based on the activity being of a second type, the first metadata criteria being different from the second metadata criteria.
18. The computer-readable storage medium of claim 15, wherein the verbal input comprises a spoken command associated with tagging events of interest.
19. The computer-readable storage medium of claim 15, wherein the event moment includes a point in time within the video or a duration of time within the video.
20. The computer-readable storage medium of claim 15, wherein the instructions, when executed by the one or more physical processors, are further configured to cause the one or more physical processors to: identify one or more non-event moments within the video, the one or more non-event moments not associated with any event of interest within the video; and store non-event information indicating the one or more non-event moments.
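The core logic of claim 1 is a two-step inference: the verbal input's type tells the system whether the tagged event happened before, during, or after the moment the command was spoken, and the activity type determines how much video to keep on either side of the inferred event moment. A hedged Python sketch follows; the offset and padding values, activity names, and example command phrases are invented for illustration (the claim specifies only that such values exist and that the clip amounts depend on the activity type).

```python
from enum import Enum

class InputType(Enum):
    FIRST = 1   # event moment identified to occur before the input moment
    SECOND = 2  # event moment identified to occur during the input moment
    THIRD = 3   # event moment identified to occur after the input moment

# Illustrative offsets (seconds) from the input moment to the event moment.
EVENT_OFFSET_S = {
    InputType.FIRST: -4.0,  # e.g., "that was great" -> event already happened
    InputType.SECOND: 0.0,  # e.g., "this is great"  -> event is happening now
    InputType.THIRD: 4.0,   # e.g., "get ready"      -> event is about to happen
}

# Illustrative per-activity clip amounts (seconds before, seconds after);
# the claim only requires that these amounts depend on the activity type.
CLIP_AMOUNTS_S = {
    "surfing": (5.0, 8.0),
    "skiing": (3.0, 5.0),
}

def identify_event_moment(input_moment_s: float, input_type: InputType) -> float:
    """Infer when the event occurred from when the verbal input was received."""
    return input_moment_s + EVENT_OFFSET_S[input_type]

def identify_video_clip(input_moment_s: float, input_type: InputType, activity: str):
    """Return the (start, end) of the clip around the inferred event moment."""
    event_moment = identify_event_moment(input_moment_s, input_type)
    before_s, after_s = CLIP_AMOUNTS_S[activity]
    return (max(0.0, event_moment - before_s), event_moment + after_s)

# A user says "that was great" 42 s into a surfing video:
print(identify_video_clip(42.0, InputType.FIRST, "surfing"))  # (33.0, 46.0)
```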
Cited Patents
Edwards, Jeffrey L.; Ahmad, Subutai, Automatic editing of a visual recording to eliminate content of unacceptably low quality and/or very little or no interest.
Chuang, Daniel B.; Candell, Lawrence M.; Ross, William D.; Beattie, Mark E.; Fang, Cindy Y.; Ren, Bobby; Blanchard, Jonathan P., Imaging system for immersive surveillance.
Swenson, Anne; Agnoli, Giovanni; Rodriguez, Enrique; Lyons, Charles; Meaney, Brian; Cerf, Dave; Stern, Mike, Media editing application for auditioning different types of media clips.
Meaney, Brian; Pendergast, Colleen; Matsuda, Ken; Agnoli, Giovanni; Khan, Itrat U.; Minjack, Zachury Bryant; Stern, Mike; De Marco, Vincenzo; Diephouse, Matthew D.; LaSalle, Louis; Fleischhauer, Michael; McCommons, Jordan P., Media editing with multi-camera media clips.
Matsuda, Ken; Cerf, Dave; Khan, Itrat U.; Diephouse, Matthew D.; Meaney, Brian; De Marco, Vincenzo; McCommons, Jordan P.; LaSalle, Louis, Media-editing application with anchored timeline.
Curcio, Igor Danilo Diego; Mate, Sujeet Shyamsundar; Cricri, Francesco; Roininen, Mikko Joonas; Sathish, Sailesh, Method and apparatus for semantic extraction and video remix creation.
Evans, Matt; Lagemann, Ole; Danty, John; Helms, Jan-Hinnerk; Lengeling, Gerhard; Soren, Alexander; Martin, Timothy Benjamin; Pillhofer, Stefan, Method and system to process digital audio data.