Systems, methods, and media for transcoding video data according to encoding parameters indicated by received metadata
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
H04N-011/02
H04N-019/40
H04N-019/139
H04N-019/12
H04N-019/14
H04N-019/157
H04N-019/177
출원번호
US-0841943
(2013-03-15)
등록번호
US-9906785
(2018-02-27)
발명자
/ 주소
Naletov, Ivan Vladimirovich
Zurpal, Sergey
출원인 / 주소
Sonic IP, Inc.
대리인 / 주소
KPPB LLP
인용정보
피인용 횟수 :
5인용 특허 :
129
초록▼
Methods, systems, and computer readable media for transcoding video data based on metadata are provided. Methods for transcoding video data using meta data are provided that include receiving portions of encoded video encoded according to varying encoding schemes. The portions of encoded video can b
Methods, systems, and computer readable media for transcoding video data based on metadata are provided. Methods for transcoding video data using meta data are provided that include receiving portions of encoded video encoded according to varying encoding schemes. The portions of encoded video can be re-encoded according to different encoding schemes. Different encoding schemes can be selected according to received metadata. The received metadata can indicate transcoding parameters for use in re-encoding portions of encoded vide. The portions of encoded video can be received from media content sources and the metadata can be received from a media metadata source. In various embodiments, the methods can be performed by a transcoding device and/or implemented via a computer readable media and at least one processing unit.
대표청구항▼
1. A method for transcoding video data using a transcoding device, the method comprising: receiving a portion of encoded video that includes a first plurality of encoded images encoded according to a first encoding scheme from a media content source using a transcoding device;decoding the portion of
1. A method for transcoding video data using a transcoding device, the method comprising: receiving a portion of encoded video that includes a first plurality of encoded images encoded according to a first encoding scheme from a media content source using a transcoding device;decoding the portion of encoded video based on the first encoding scheme to generate a decoded portion of video including a plurality of decoded images using the transcoding device;receiving media metadata at the transcoding device over a communications network from a remote media metadata source, the media metadata being generated and stored on the remote media metadata source prior to decoding the portion of encoded video, wherein the media metadata comprises: a set of transcoding parameters that define a second encoding scheme, wherein the first encoding scheme and the second encoding scheme have different bit rates; andat least two types of data from the group consisting of motion data, picture complexity information, a number of frames between two scene changes, a set of motion vectors, and a set of intra-prediction modes for use in encoding the decoded portion of video according to a second encoding scheme using the transcoding device; andencoding the plurality of decoded images of the decoded portion of video into a second plurality of encoded images based on the second encoding scheme defined by the received media metadata using the transcoding device by performing at least the following operations for each particular decoded image in the plurality of decoded images: generating a prediction image for each of a plurality of coding units of the particular decoded image using at least one of the motion data, the set of motion vectors, or a set of intra-prediction modes within the received media metadata according to the second encoding scheme;performing transforms on residual images of the plurality of coding units to generate sets of transform coefficients based on the second encoding scheme;performing quantization on the sets of transform coefficients by: determining a number of bits to encode a group of pictures (GOP) based at least in part on at least one of the picture complexity information or the number of frames between two scene changes;determining a quantization parameter based on the number of bits; andquantizing the generated set of transform coefficients according to the second encoding scheme using the quantization parameter; andperforming entropy encoding on the sets of quantized transform coefficients to generate images for the second plurality of encoded images. 2. The method of claim 1, wherein the media metadata source is not an encoder, and wherein the set of transcoding parameters within the received metadata include at least one of a number of B-frames between two P-frames, and a scene change detection signal. 3. The method of claim 2, further comprising determining the number of bits based at least in part on the number of B-frames between two P-frames. 4. The method of claim 3, further comprising determining the number of bits based at least in part on the scene detection signal indicated by the received media metadata. 5. The method of claim 1, wherein the first encoding scheme and the second encoding scheme have different resolutions. 6. The method of claim 1 further comprising dividing each particular decoded image in the plurality of decoded images into a plurality of coding units based on the second encoding scheme. 7. A system for transcoding video data, the system comprising: a non-transitory memory storing a transcoding application;a processing circuitry; andwherein the transcoding application directs the processing circuitry to: receive a portion of encoded video that includes a first plurality of encoded images encoded according to a first encoding scheme from a media content source;decode the portion of encoded video based on the first encoding scheme to generate a decoded portion of video including a plurality of decoded images;receive media metadata over a communications network from a remote media metadata source, the media metadata being generated and stored on the remote media metadata source prior to decoding the portion of encoded video, wherein the media metadata comprises: a set of transcoding parameters that define a second encoding scheme, wherein the first encoding scheme and the second encoding scheme have different bit rates; andat least two types of data from the group consisting of motion data, picture complexity information, a number of frames between two scene changes, a set of motion vectors, and a set of intra-prediction modes for use in encoding the decoded portion of video according to a second encoding scheme; andencode the plurality of decoded images of the decoded portion of video into a second plurality of encoded images based on the second encoding scheme defined by the received media metadata by performing at least the following operations for each particular decoded image in the plurality of decoded images: generating a prediction image for each of a plurality of coding units of the particular decoded image using at least one of the motion data, the set of motion vectors, or a set of intra-prediction modes within the received media metadata according to the second encoding scheme;performing transforms on residual images of the plurality of coding units to generate sets of transform coefficients based on the second encoding scheme;performing quantization on the sets of transform coefficients by: determining a number of bits to encode a group of pictures (GOP) based at least in part on at least one of the picture complexity information or the number of frames between two scene changes;determining a quantization parameter based on the number of bits; andquantizing the generated set of transform coefficients according to the second encoding scheme using the quantization parameter; andperforming entropy encoding on the sets of quantized transform coefficients to generate images for the second plurality of encoded images. 8. The system of claim 7, wherein the media metadata source is not an encoder, and wherein the set of transcoding parameters within the received metadata include at least one of a number of B-frames between two P-frames, and a scene change detection signal. 9. The system of claim 8, wherein the processing circuitry is further configured to determine the number of bits based at least in part on the number of B-frames between two P-frames. 10. The system of claim 9, wherein the processing circuitry is further configured to determine the number of bits based at least in part on the scene detection signal indicated by the received media metadata. 11. The system of claim 7 further comprising dividing each particular decoded image in the plurality of decoded images into a plurality of coding units based on the second encoding scheme. 12. The system of claim 7, wherein the first encoding scheme and the second encoding scheme correspond to different video encoding standards. 13. A non-transitory computer-readable medium containing computer-executable instructions that, when executed by a processing circuitry, cause the processing circuitry to perform a method for transcoding video data, the method comprising: receiving a portion of encoded video that includes a first plurality of encoded images encoded according to a first encoding scheme from a media content source;decoding the portion of encoded video based on the first encoding scheme to generate a decoded portion of video including a plurality of decoded images;receiving media metadata over a communications network from a remote media metadata source, the media metadata being generated and stored on the remote media metadata source prior to decoding the portion of encoded video, wherein the media metadata comprises: a set of transcoding parameters that define a second encoding scheme, wherein the first encoding scheme and the second encoding scheme have different bit rates; andat least two types of data from the group consisting of motion data, picture complexity information, a number of frames between two scene changes, a set of motion vectors, and a set of intra-prediction modes for use in encoding the decoded portion of video according to a second encoding scheme; andencoding the plurality of decoded images of the decoded portion of video into a second plurality of encoded images based on the second encoding scheme defined by the received media metadata by performing at least the following operations for each particular decoded image in the plurality of decoded images: generating a prediction image for each of a plurality of coding units of the particular decoded image using at least one of the motion data, the set of motion vectors, or a set of intra-prediction modes within the received media metadata according to the second encoding scheme;performing transforms on residual images of the plurality of coding units to generate sets of transform coefficients based on the second encoding scheme;performing quantization on the sets of transform coefficients by: determining a number of bits to encode a group of pictures (GOP) based at least in part on at least one of the picture complexity information or the number of frames between two scene changes;determining a quantization parameter based on the number of bits; andquantizing the generated set of transform coefficients according to the second encoding scheme using the quantization parameter; andperforming entropy encoding on the sets of quantized transform coefficients to generate images for the second plurality of encoded images. 14. The non-transitory computer-readable medium of claim 13, wherein the media metadata source is not an encoder, and wherein the set of transcoding parameters within the received metadata include at least one of a number of B-frames between two P-frames, and a scene change detection signal. 15. The non-transitory computer-readable medium of claim 14, wherein the method further comprises determining the number of bits based at least in part on the number of B-frames between two P-frames. 16. The non-transitory computer-readable medium of claim 15, wherein the method further comprises determining the number of bits based at least in part on the scene detection signal indicated by the received media metadata. 17. The non-transitory computer-readable medium of claim 13 further comprising dividing each particular decoded image in the plurality of decoded images into a plurality of coding units based on the second encoding scheme. 18. The non-transitory computer-readable medium of claim 13, wherein the first encoding scheme and the second encoding scheme have different resolutions. 19. The non-transitory computer-readable medium of claim 13, wherein the first encoding scheme and the second encoding scheme correspond to different video coding standards.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (129)
Boroczky, Lilla; Ngai, Agnes Y.; Westermann, Edward F., Adaptively encoding multiple streams of video data in parallel for multiplexing onto a constant bit rate channel.
Carmel, Sharon; Gill, Dror; Shoham, Tamar, Apparatus and methods for re-compression having a monotonic relationship between extent of compression and quality of compressed image.
Kollmyer,Brad; Baker,Brian; Shapiro,Eric; Kollmyer,Aric; Rutman,Mike; MacLean,Duncan; Robertson,Dan; Taylor,Neal; Hunsche,Dick; Walker,Amanda, Apparatus, system and method for selectively encrypting different portions of data sent over a network.
Boroczky, Lilla; Ngai, Agnes Y.; Westermann, Edward F., Control strategy for dynamically encoding multiple streams of video data in parallel for multiplexing onto a constant bit rate channel.
Katayama Hironobu,JPX ; Nogami Hiroaki,JPX ; Shiraishi Kenichi,JPX, Digital recording and reproducing apparatus which multiplexes and records HDTV, SDTV and trick play data together on a magnetic tape.
Pensak David A. ; Cristy John J. ; Singles Steven J., Information security architecture for encrypting documents for remote access while maintaining access control.
Kwon, O-hoon; Ha, Ho-jin; Zhang, Guanhua; Choi, Hyung-tak; Kim, Sun-bal; Keum, Ji-Eun; Im, Sung-bin, Method and apparatus for providing trick play service.
Donald F. Gordon ; Ted Ludvig, Method and apparatus for simultaneously producing compressed play and trick play bitstreams from a video frame sequence.
Boyce Jill MacDonald ; Lane Frank Anton, Method and apparatus for the selection of data for use in VTR trick playback operation in a system using intra-coded vid.
Suzuki Hideharu (Tokyo JPX) Ishibashi Yutaka (Aichi JPX) Nishimura Kazutoshi (Tokyo JPX), Method and apparatus for visual search control of digital video files.
Woo,Je Hak; Lee,Hwan Chul; Cho,Sang Young; Jeong,Seong Ho; Ha,Young Soo; Shin,Seog Kyoon; Kim,Seong Il, Method and system for the information protection of digital content.
Carmel Sharon,ILX ; Daboosh Tzur,ILX ; Reifman Eli,ILX ; Shani Naftali,ILX ITX 62917, Method and system of building and transmitting a data file for real time play of multimedia, particularly animation, and.
Yoshida Masayuki (Yokohama JPX) Nakanishi Keiko (Tokyo JPX), Method of commonly using font information for outputting information in a system having a plurality of information proce.
Foehr,Oliver H.; Dunietz,Jerry; Leonov,Mikhail V.; Brown,David C.; Daniels,Simon, Methods for providing an accurate visual rendition of a text element formatted with an unavailable font.
Okubo, Masafumi; Oashi, Masahiro; Ohto, Hidetaka, Playback device, playback method, and computer-readable recording medium for ensuring stable application execution in synchronism with video data playback.
Pantos, Roger; Biderman, David; May, Jr., William; Flick, Christopher; Bushell, John Samuel; Calhoun, John Kevin, Playlists for real-time or near real-time streaming.
Tanaka Yasuyuki,JPX ; Fujii Akio,JPX ; Kato Masato,JPX, Recording apparatus for newly recording a second encoded data train on a recording medium on which an encoded data train is recorded.
Nogami Hiroaki,JPX ; Katayama Hironobu,JPX, Recording apparatus which adaptively records amounts of low resolution video inversely to amounts of high resolution video which has the same program content.
Kollmyer,Brad; Baker,Brian A.; Shapiro,Eric Bradley; Kollmyer,Aric; Rutman,Mike; MacLean,Charles Duncan; Robertson,Dan; Taylor,Neal; Hunsche,Dick; Walker,Amanda, Selectively encrypting different portions of data sent over a network.
Braness, Jason; van der Schaar, Auke Sjoerd; Soroushian, Kourosh, Systems and methods for adaptive bitrate streaming of media stored in matroska container files using hypertext transfer protocol.
Soroushian, Kourosh; Douglas, Scott; Bramwell, Steve; Braness, Jason; Shivadas, Abhishek, Systems and methods for encoding media including subtitles for adaptive bitrate streaming.
Braness, Jason; Soroushian, Kourosh; van der Schaar, Auke Sjoerd, Systems and methods for performing smooth visual search of media encoded for adaptive bitrate streaming via hypertext transfer protocol using trick play streams.
Ginter Karl L. ; Shear Victor H. ; Sibert W. Olin ; Spahn Francis J. ; Van Wie David M., Systems and methods for secure transaction management and electronic rights protection.
Eerenberg, Onno; Rijckaert, Albert M. A.; Bruls, Wilhelmus H. A., Trick play signal generation for a digital video recorder using retrieved intra-encoded pictures and generated inter-encoded pictures.
Sakamoto Hideki,JPX ; Kawaguchi Tomoaki,JPX ; Suzuki Hideharu,JPX ; Mori Tatsuo,JPX ; Kimiyama Hiroyuki,JPX ; Nishimura Kazutoshi,JPX ; Itoh Kouichi,JPX ; Maruyama Mitsuru,JPX ; Sugiyama Hiroshi,JPX , Video data playback system using effective scheme for producing coded video data for fast playback mode.
Kiefer, Michael George; Grab, Eric William; Braness, Jason, Playback devices and methods for playing back alternative streams of content protected using a common set of cryptographic keys.
Kiefer, Michael George; Grab, Eric William; Braness, Jason, Systems and methods for playing back alternative streams of protected content protected using common cryptographic information.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.