Real-time capturing and generating stereo images and videos with a monoscopic low power mobile device
IPC Classification
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
H04N-013/02
H04N-013/00
H04N-005/222
G06T-007/00
H04N-005/232
Application number
US-0497906
(2006-08-01)
Registration number
US-8970680
(2015-03-03)
Inventors / Address
Wang, Haohong
Li, Hsiang-Tsun
Manjunath, Sharath
Applicant / Address
Qualcomm Incorporated
Agent / Address
Boyd, Brent A.
Citation information
Cited by: 2
Patents cited: 12
Abstract
A monoscopic low-power mobile device is capable of creating real-time stereo images and videos from a single captured view. The device uses statistics from an autofocusing process to create a block depth map of the single captured view. Artifacts in the block depth map are reduced and an image depth map is created. Stereo three-dimensional (3D) left and right views are created from the image depth map using a Z-buffer based 3D surface recover process and a disparity map which is a function of the geometry of binocular vision.
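The first stage described in the abstract can be pictured as follows. This is a minimal sketch, assuming the autofocus sweep records one focus (sharpness) value per block per lens position and that lens positions have been calibrated to approximate object distances; the function name and array shapes are illustrative and not taken from the patent.

```python
import numpy as np

def block_depth_from_autofocus(focus_values, lens_distances):
    """Sketch of the first stage: a block-level depth map from autofocus statistics.

    focus_values   : (num_lens_positions, H_blocks, W_blocks) sharpness score
                     per block at each lens position of the focus sweep.
    lens_distances : (num_lens_positions,) approximate object distance assumed
                     for each lens position (hypothetical calibration).
    """
    # The lens position that maximizes a block's focus value is taken as the
    # position where the dominant object in that block is in focus, so its
    # calibrated distance serves as the block's depth value.
    best_idx = np.argmax(focus_values, axis=0)   # (H_blocks, W_blocks)
    return lens_distances[best_idx]
```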
Representative claims
1. A monoscopic low-power mobile device comprising: a single-sensor camera sensor module operable to capture an image and having an autofocusing sub-module operable to determine a best focus position by moving a lens through an entire focusing range via a focusing process and to select the focus position with a maximum focus value when capturing the image; a depth map generator assembly operable to: in a first stage, develop a block-level depth map automatically using statistics from the autofocusing sub-module; in a second stage, develop an image depth map, the block-level depth map including a depth value for each of a plurality of portions of the captured image, the image depth map including a pixel depth value for a pixel in a portion of the plurality of portions; and during the second stage, obtain a depth value for corner pixels of each block included in the block-level depth map, the depth value for a corner pixel based at least in part on an average of depth values for middle points of neighboring blocks of a respective block, the neighboring blocks included in the block-level depth map, the depth map generator assembly including a bilinear filter configured to generate the pixel depth value for the pixel based at least in part on depth values for each corner pixel, the depth values for each corner pixel weighted based on a ratio between a distance of the pixel to a respective corner pixel and a total distance of the pixel to each corner pixel, the bilinear filter configured to generate the pixel depth value during the second stage; and an image pair generator module operable to create a missing second view from the captured image to create three-dimensional (3D) stereo left and right views.

2. The device of claim 1, wherein the image pair generator module comprises: a disparity map sub-module which calculates a disparity map based on a distance in pixels between image points in the left and right views of binocular vision geometry for the captured image, wherein the captured image represents the left view; a Z-buffer 3D surface recover sub-module operable to construct a 3D visible surface for the captured image from the right view; and a stereo view generator sub-module operable to project the 3D surface of the right view onto a projection plane.

3. The device of claim 1, wherein the focusing process of the autofocusing sub-module in a still image mode performs an exhaustive search focusing process to capture a still image, and in a video mode, to achieve real-time capturing of a video clip, is initiated with the exhaustive search focusing process and follows with a climbing-hill focusing process.

4. The device of claim 3, wherein the depth map generator assembly in the second stage is operable to reduce artifacts with the bilinear filter.
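One way to read the corner-depth step of claim 1 (a corner's depth is an average of depth values for middle points of neighboring blocks) is sketched below. Treating every block that touches a corner as a "neighboring block" is an assumption, and the function is illustrative rather than the patent's exact rule.

```python
import numpy as np

def corner_depths(block_depth):
    """Second-stage sketch: depth values for the corner pixels of each block.

    block_depth: (H_blocks, W_blocks) block-level depth map, where each entry
                 is read as the depth at the block's middle point.
    Returns a (H_blocks + 1, W_blocks + 1) grid of corner depth values, each
    the average of the middle-point depths of the blocks meeting at that corner.
    """
    hb, wb = block_depth.shape
    corners = np.zeros((hb + 1, wb + 1))
    counts = np.zeros((hb + 1, wb + 1))
    for i in range(hb):
        for j in range(wb):
            # Block (i, j) contributes its middle-point depth to its four corners.
            corners[i:i + 2, j:j + 2] += block_depth[i, j]
            counts[i:i + 2, j:j + 2] += 1
    return corners / counts
```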
5. The device of claim 1, wherein generating the pixel depth value for the pixel based at least in part on depth values for each corner pixel comprises generating the pixel depth value $d_p$ for pixel $P(x_p, y_p, d_p)$ in accordance with the equation

$$d_p = \sum_{K \in \{A,B,C,D\}} \frac{(x_p - x_K)^4 + (y_p - y_K)^4}{\sum_{J \in \{A,B,C,D\}} \left[(x_p - x_J)^4 + (y_p - y_J)^4\right]}\, d_K$$

where position values and the depth values for the corner pixels (A, B, C, and D) of the block are denoted as $(x_A, y_A, d_A)$, $(x_B, y_B, d_B)$, $(x_C, y_C, d_C)$, $(x_D, y_D, d_D)$.

6. The device of claim 3, further comprising a video coding module for coding the video clip captured and providing statistics information for calculating the block-level depth map, the video coding module being operable to determine motion estimation, and the depth map generator assembly being operable in the second stage to detect and estimate depth information for real-time capturing and generation of stereo video using the statistics information from the motion estimation, the focusing process, and history data plus heuristic rules to obtain a final block depth map from which the image depth map is derived.

7. The device of claim 1, further comprising a display and a 3D effects generator module for displaying on the display the 3D stereo left and right views.

8. The device of claim 7, wherein the 3D effects generator module is operable to produce a red-blue anaglyph image of the 3D stereo left and right views on the display.

9. The device of claim 1, wherein the monoscopic low-power mobile device comprises one of a hand-held digital camera, a camcorder, and a single-sensor camera phone.

10. A monoscopic low-power mobile device comprising: means for capturing an image with a single sensor; means for autofocusing a lens and determining a best focus position by moving the lens through an entire focusing range and for selecting the focus position with a maximum focus value when capturing the image; means for generating in a first stage a block-level depth map automatically using statistics from the autofocusing means and in a second stage an image depth map, the block-level depth map including a depth value for each of a plurality of portions of the captured image, the image depth map including a pixel depth value for a pixel in a portion of the plurality of portions, wherein during the second stage, a depth value for corner pixels of each block included in the block-level depth map is obtained, the depth value for a corner pixel based at least in part on an average of depth values for middle points of neighboring blocks of a respective block, the neighboring blocks included in the block-level depth map, the means for generating including means for reducing artifacts configured to, during the second stage, generate the pixel depth value for the pixel based at least in part on depth values for each corner pixel, the depth values for each corner pixel weighted based on a ratio between a distance of the pixel to a respective corner pixel and a total distance of the pixel to each corner pixel; and means for creating a missing second view from the captured image to create three-dimensional (3D) stereo left and right views.
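Read literally, the equation recited in claims 5, 13, 21 and 28 weights each corner depth by the quantity $(x_p - x_K)^4 + (y_p - y_K)^4$ normalized over the four corners. A small sketch of that computation follows; variable names are illustrative.

```python
def pixel_depth(xp, yp, corners):
    """Pixel depth d_p from the four block corners A, B, C, D.

    corners: list of four (x_K, y_K, d_K) tuples, one per corner.
    Each corner's depth is weighted by ((x_p - x_K)**4 + (y_p - y_K)**4)
    divided by the sum of that quantity over all four corners, following
    the claim's equation as written.
    """
    weights = [(xp - x) ** 4 + (yp - y) ** 4 for x, y, _ in corners]
    total = sum(weights)
    if total == 0:                      # degenerate case: pixel on every corner
        return corners[0][2]
    return sum(w / total * d for w, (_, _, d) in zip(weights, corners))

# Example: a pixel inside a 16-pixel block whose corners carry depths 10, 10, 20, 20.
print(pixel_depth(4, 4, [(0, 0, 10), (16, 0, 10), (0, 16, 20), (16, 16, 20)]))
```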
11. The device of claim 10, wherein the creating means comprises: means for calculating a disparity map based on a distance in pixels between image points in the left and right views of binocular vision geometry for the captured image, wherein the captured image represents the left view; means for 3D surface recovering with Z-buffering for constructing a 3D visible surface for the captured image from a missing right viewpoint; and means for generating stereo views by projecting the constructed 3D surface onto a projection plane.

12. The device of claim 10, wherein the autofocusing means includes means for performing an exhaustive search focusing process to capture a still image in a still image mode; means for initiating the exhaustive search focusing process in a video mode; and means for climbing-hill focusing in a video mode to capture a real-time video clip.

13. The device of claim 10, wherein generating the pixel depth value for the pixel based at least in part on depth values for each corner pixel comprises generating the pixel depth value $d_p$ for pixel $P(x_p, y_p, d_p)$ in accordance with the equation

$$d_p = \sum_{K \in \{A,B,C,D\}} \frac{(x_p - x_K)^4 + (y_p - y_K)^4}{\sum_{J \in \{A,B,C,D\}} \left[(x_p - x_J)^4 + (y_p - y_J)^4\right]}\, d_K$$

where position values and the depth values for the corner pixels (A, B, C, and D) of the block are denoted as $(x_A, y_A, d_A)$, $(x_B, y_B, d_B)$, $(x_C, y_C, d_C)$, $(x_D, y_D, d_D)$.

14. The device of claim 12, further comprising means for video coding the video clip captured and providing statistics information; wherein the means for video coding includes means for motion estimating; and wherein the generating means includes means for detecting and estimating depth information for real-time capturing and generation of stereo video using statistics information from the motion estimating means, the autofocusing means, and history data plus some heuristic rules to obtain a final block depth map from which the image depth map is derived.

15. The device of claim 10, further comprising a display and means for generating 3D effects of the 3D stereo left and right views on the display.

16. The device of claim 15, wherein the 3D effects generating means produces a red-blue anaglyph image of the 3D stereo left and right views on the display.

17. The device of claim 10, wherein the monoscopic low-power mobile device comprises one of a hand-held digital camera, a camcorder, and a single-sensor camera phone.
18. A method for generating real-time stereo images, the method comprising: capturing an image with a single sensor; autofocusing a lens and determining a best focus position by moving the lens through an entire focusing range and selecting the focus position with a maximum focus value when capturing the image; generating in a first stage a block-level depth map automatically using statistics from the autofocusing and in a second stage generating an image depth map, the block-level depth map including a depth value for each of a plurality of portions of the captured image, the image depth map including a pixel depth value for a pixel in a portion of the plurality of portions, the second-stage generating including: obtaining a depth value for corner pixels of each block included in the block-level depth map, the depth value for a corner pixel based at least in part on an average of depth values for middle points of neighboring blocks of a respective block, the neighboring blocks included in the block-level depth map; and generating the pixel depth value for the pixel based at least in part on depth values for each corner pixel, the depth values for each corner pixel weighted based on a ratio between a distance of the pixel to a respective corner pixel and a total distance of the pixel to each corner pixel; and creating a missing second view from the captured image to create three-dimensional (3D) stereo left and right views.

19. The method of claim 18, wherein creating the missing second view comprises: calculating a disparity map based on a distance in pixels between image points in the left and right views of binocular vision geometry for the captured image, wherein the captured image represents the left view; 3D surface recovering with Z-buffering for constructing a 3D visible surface for the captured image from a missing right viewpoint; and generating a missing right view by projecting the constructed 3D surface onto a projection plane.

20. The method of claim 18, wherein autofocusing includes: performing an exhaustive search focusing process to capture a still image in a still image mode; initiating the exhaustive search focusing process in a video mode; and climbing-hill focusing in a video mode to capture a real-time video clip.

21. The method of claim 18, wherein generating the pixel depth value for the pixel based at least in part on depth values for each corner pixel comprises generating pixel depth value $d_p$ for pixel $P(x_p, y_p, d_p)$ in accordance with the equation

$$d_p = \sum_{K \in \{A,B,C,D\}} \frac{(x_p - x_K)^4 + (y_p - y_K)^4}{\sum_{J \in \{A,B,C,D\}} \left[(x_p - x_J)^4 + (y_p - y_J)^4\right]}\, d_K$$

where position values and the depth values for the corner pixels (A, B, C, and D) of the block are denoted as $(x_A, y_A, d_A)$, $(x_B, y_B, d_B)$, $(x_C, y_C, d_C)$, $(x_D, y_D, d_D)$.

22. The method of claim 20, further comprising video coding the video clip and motion estimating, wherein generating the block-level depth map includes detecting and estimating depth information for real-time capturing and generation of stereo video using statistics from the motion estimating, the autofocusing, and history data plus heuristic rules to obtain a final block depth map from which the image depth map is derived.
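Claims 2, 11 and 19 treat the captured image as the left view and derive the missing right view from the depth map via binocular geometry and Z-buffering. Below is a hedged sketch, assuming the common pinhole relation disparity = focal_px × baseline / depth (the patent only states that the disparity map is a function of the geometry of binocular vision) and using a simple Z-buffered forward warp in place of the full 3D surface recovery and projection steps.

```python
import numpy as np

def synthesize_right_view(left, depth, focal_px, baseline):
    """Sketch: disparity map plus a z-buffered warp of the left view.

    left    : (H, W, 3) captured image, treated as the left view.
    depth   : (H, W) image depth map (larger = farther).
    focal_px: focal length in pixels; baseline: assumed eye separation.
    """
    h, w = depth.shape
    disparity = focal_px * baseline / np.maximum(depth, 1e-6)
    right = np.zeros_like(left)
    zbuf = np.full((h, w), np.inf)              # nearest surface wins (Z-buffer)
    for y in range(h):
        for x in range(w):
            xr = int(round(x - disparity[y, x]))   # shift pixel into the right view
            if 0 <= xr < w and depth[y, x] < zbuf[y, xr]:
                zbuf[y, xr] = depth[y, x]
                right[y, xr] = left[y, x]
    return right, disparity
```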
23. The method of claim 18, further comprising generating 3D effects of the 3D stereo left and right views on a display.

24. The method of claim 23, wherein generating 3D effects includes producing a red-blue anaglyph image of the 3D stereo left and right views on the display.

25. A method of operating a still image processing device, the method comprising: autofocusing processing a captured still image and estimating depth information of remote objects in the image to generate a block-level depth map, the block-level depth map including a depth value for each of a plurality of portions of the captured still image; obtaining a depth value for corner pixels of each block included in the block-level depth map, the depth value for a corner pixel based at least in part on an average of depth values for middle points of neighboring blocks of a respective block, the neighboring blocks included in the block-level depth map; and generating an image depth map based on the block-level depth map, the image depth map including a pixel depth value for a pixel of a portion of the plurality of portions, the pixel depth value of the pixel based at least in part on depth values for each corner pixel, the depth values for each corner pixel weighted based on a ratio between a distance of the pixel to a respective corner pixel and a total distance of the pixel to each corner pixel.

26. The method of claim 25, wherein the autofocusing processing includes processing the image using a coarse-to-fine depth detection process.

27. The method of claim 25, wherein generating the image depth map comprises bilinear filtering the block-level depth map to derive an approximated image depth map.

28. The method of claim 25, wherein position values and the depth values for the corner pixels (A, B, C, and D) of the block are denoted as $(x_A, y_A, d_A)$, $(x_B, y_B, d_B)$, $(x_C, y_C, d_C)$, $(x_D, y_D, d_D)$, and wherein for a respective pixel denoted by a point $P(x_p, y_p, d_p)$, generating the pixel depth value $d_p$ of the respective pixel is in accordance with the equation

$$d_p = \sum_{K \in \{A,B,C,D\}} \frac{(x_p - x_K)^4 + (y_p - y_K)^4}{\sum_{J \in \{A,B,C,D\}} \left[(x_p - x_J)^4 + (y_p - y_J)^4\right]}\, d_K$$
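Claims 8, 16 and 24 display the stereo pair as a red-blue anaglyph. A minimal sketch of one common construction follows; the exact channel mixing is an assumption, since the claims only require a red-blue anaglyph of the left and right views.

```python
import numpy as np

def red_blue_anaglyph(left, right):
    """Combine stereo left/right RGB views into a single red-blue anaglyph image."""
    anaglyph = np.empty_like(left)
    anaglyph[..., 0] = left[..., 0]        # red channel from the left view
    anaglyph[..., 1] = right[..., 1]       # green channel from the right view
    anaglyph[..., 2] = right[..., 2]       # blue channel from the right view
    return anaglyph
```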
29. A still image capturing device comprising: an autofocusing module operable to: process a captured still image and estimate depth information of remote objects in the image to detect a block-level depth map, the block-level depth map including a depth value for each of a plurality of portions of the captured still image; and obtain a depth value for corner pixels of each block included in the block-level depth map, the depth value for a corner pixel based at least in part on an average of depth values for middle points of neighboring blocks of a respective block, the neighboring blocks included in the block-level depth map; an image depth map module operable to approximate from the block-level depth map an image depth map using bilinear filtering, the image depth map including a pixel depth value for a pixel of a portion of the plurality of portions, the pixel depth value of the pixel based at least in part on depth values for each corner pixel, the depth values for each corner pixel weighted based on a ratio between a distance of the pixel to a respective corner pixel and a total distance of the pixel to each corner pixel; and an image pair generator module operable to create a missing second view from the captured image to create three-dimensional (3D) stereo left and right views.

30. The device of claim 29, further comprising a 3D effects generator module operable to display 3D effects of the 3D stereo left and right views.

31. The device of claim 29, wherein a focusing process of the autofocusing module performs an exhaustive search focusing process to capture the still image.

32. The device of claim 29, wherein the image depth map module is operable to reduce artifacts with the bilinear filtering.

33. A video image capturing device comprising: an autofocusing module operable to process a captured video clip and estimate depth information of remote objects in a scene, the autofocusing module configured to generate an autofocusing block depth map and an autofocusing focus value map for frames of the captured video clip; a video coding module operable to code the video clip captured, provide statistics information and determine motion estimation, the video coding module configured to generate a video coding block depth map and a video coding focus value map for frames of the captured video clip based at least in part on the motion estimation; and an image depth map module operable to detect and estimate depth information for real-time capturing and generation of stereo video based on a final block depth map from which an image depth map is derived, the block depth map including a depth value for each of a plurality of portions of the captured video clip, the image depth map including a pixel depth value for a pixel in a portion of the plurality of portions, the image depth map module configured to generate the final block depth map based on values included in the autofocusing block depth map, the autofocusing focus value map, the video coding block depth map, and the video coding focus value map.

34. The device of claim 33, wherein a focusing process of the autofocusing module to achieve real-time capturing of a video clip is initiated with the exhaustive search focusing process and follows with a climbing-hill focusing process.

35. The device of claim 33, further comprising an image pair generator module operable to create a missing second view from the captured image to create three-dimensional (3D) stereo left and right views.
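Claims 3, 20, 31 and 34 distinguish an exhaustive focus search (used for still images and as the first pass in video mode) from a climbing-hill search used afterwards for real-time video. A sketch of the two strategies, assuming a callback measure(p) that returns the focus value at lens position p; both function names are illustrative, not the patent's.

```python
def exhaustive_focus(measure, positions):
    """Exhaustive-search focusing: evaluate the focus value at every lens
    position in the range and keep the position with the maximum value."""
    return max(positions, key=measure)

def hill_climbing_focus(measure, start, step, lo, hi):
    """Climbing-hill refinement: from the previous best position, move the lens
    in whichever direction keeps increasing the focus value, then stop.
    A generic sketch, not the patent's exact update rule."""
    pos, best = start, measure(start)
    for direction in (+step, -step):
        nxt = pos + direction
        while lo <= nxt <= hi and measure(nxt) > best:
            pos, best = nxt, measure(nxt)
            nxt = pos + direction
    return pos
```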
36. The device of claim 35, further comprising a 3D effects generator module operable to display 3D effects of the 3D stereo left and right views.

37. The device of claim 33, wherein the depth map module is operable to predict an internal block depth map $P_n(i, j)$ and a focus value map $T_n(i, j)$ of a current frame $n$ from those of a previous frame by the following equations: $P_n(i,j) = \{\, D_{n-1}(a,b) \ \text{if}\ V_n(i,j) - F_{n-1}(a,b)$ …
Patents cited by this patent (12)
Oh, Teik; Flack, Julien; Harman, Philip Victor, "3D image synthesis from depth encoded source view."
Murata, Haruhiko (JP); Mori, Yukio (JP); Yamashita, Shuugo (JP); Maenaka, Akihiro (JP); Okada, Seiji (JP); Ihara, Kanji (JP), "Device and method for converting two-dimensional video into three-dimensional video."
Nakagawa, Yasuo (Chigasaki, JP); Nayar, Shree K. (Pittsburgh, PA), "Method of detecting solid shape of object with autofocusing and image detection at each focus level."
Eleftheriadis, Alexandros; Anastassiou, Dimitris; Chang, Shih-Fu; Nayar, Shree, "Methods and apparatus for performing digital image and video segmentation and compression using 3-D depth information."