Content-based detection and three dimensional geometric reconstruction of objects in image and video data
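IPC classification information
Country / type: United States (US) Patent, granted
International Patent Classification (IPC, 7th edition): G06K-009/48; G06K-009/00; G06T-007/00; G06T-005/00
Application number: US-0234969 (2016-08-11)
Registration number: US-9779296 (2017-10-03)
Inventors / address: Ma, Jiyong; Thompson, Stephen Michael; Amtrup, Jan W.
Applicant / address: Kofax, Inc.
Agent / address: Zilka-Kotab, P.C.
Citation information
Times cited: 9
Patents cited: 338
Abstract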
Systems, computer program products, and techniques for detecting objects depicted in digital image data are disclosed, according to various exemplary embodiments. The inventive concepts uniquely utilize internal features to accomplish object detection, thereby avoiding reliance on detecting object edges and/or transitions between the object and other portions of the digital image data, e.g. background textures or other objects. The inventive concepts thus provide an improvement over conventional object detection since objects may be detected even when edges are obscured or not depicted in the digital image data. In one aspect, a computer-implemented method of detecting an object depicted in a digital image includes: detecting a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; and projecting a location of one or more edges of the object based at least in part on the plurality of identifying features.
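As an illustration of how edge locations might be projected from internal features, the following minimal sketch fits a planar homography to four matched key points and maps the corners of a reference image into the input image. The abstract does not specify the form of the mapping, so the homography model, the use of exactly four correspondences, and all names (`fit_homography`, `project`, `project_corners`) are assumptions made for this example only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a * x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point correspondences")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= f * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_homography(ref_pts: Sequence[Point], img_pts: Sequence[Point]) -> List[float]:
    """Return h0..h7 (with h8 fixed to 1) mapping reference points to image points.

    Assumes exactly four correspondences with no three points collinear.
    """
    a: List[List[float]] = []
    b: List[float] = []
    for (x, y), (u, v) in zip(ref_pts, img_pts):
        a.append([x, y, 1.0, 0.0, 0.0, 0.0, -u * x, -u * y])
        b.append(u)
        a.append([0.0, 0.0, 0.0, x, y, 1.0, -v * x, -v * y])
        b.append(v)
    return solve_linear(a, b)


def project(h: Sequence[float], pt: Point) -> Point:
    """Apply the homography to one point in reference coordinates."""
    x, y = pt
    w = h[6] * x + h[7] * y + 1.0
    return ((h[0] * x + h[1] * y + h[2]) / w, (h[3] * x + h[4] * y + h[5]) / w)


def project_corners(ref_pts: Sequence[Point], img_pts: Sequence[Point],
                    ref_size: Tuple[float, float]) -> List[Point]:
    """Project the four corners of the reference object into the image."""
    width, height = ref_size
    h = fit_homography(ref_pts, img_pts)
    corners = [(0.0, 0.0), (width, 0.0), (width, height), (0.0, height)]
    return [project(h, c) for c in corners]


# Hypothetical data: four internal features of a 100 x 60 reference object,
# and where the same features were found in the input image.
reference_features = [(20.0, 10.0), (80.0, 10.0), (80.0, 50.0), (20.0, 50.0)]
image_features = [(70.0, 60.0), (190.0, 60.0), (190.0, 140.0), (70.0, 140.0)]
projected = project_corners(reference_features, image_features, (100.0, 60.0))
```

Because the corners are computed from interior features alone, they can be estimated even when the object's edges are occluded or fall outside the frame. A practical system would likely use many more correspondences with a robust estimator rather than an exact four-point fit.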
Representative claims
1. A computer-implemented method of detecting an object depicted in a digital image, the method comprising: detecting, using a hardware processor, a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; and projecting, using the hardware processor, a location of one or more edges of the object based at least in part on the plurality of identifying features; and outputting the projected location of the one or more edges of the object to at least one of: a display of a computer, and a non-transitory computer readable medium.
2. The computer-implemented method as recited in claim 1, wherein detecting the plurality of identifying features comprises analyzing a plurality of feature vectors each corresponding to pixels within a patch of the digital image to determine whether the patch includes a sharp transition in intensity.
3. The computer-implemented method as recited in claim 1, wherein detecting the plurality of identifying features comprises automatic feature zone discovery; and wherein automatic feature zone discovery comprises: matching a plurality of pixels in the digital image to a plurality of corresponding pixels in a plurality of reference images to form a set of matching pairs, each matching pair including one pixel from the digital image and one pixel from one of the plurality of reference images; and determining a subset of the matching pairs exhibiting a frequency within the set of matching pairs that is greater than a predetermined frequency threshold.
4. The computer-implemented method as recited in claim 1, comprising transforming the digital image to display the projected location of the one or more edges of the object.
5. The computer-implemented method as recited in claim 4, wherein at least portions of at least one of the one or more edges displayed in the transformed digital image are missing from the digital image.
6. The computer-implemented method as recited in claim 4, wherein the digital image is characterized by a complex background comprising a plurality of sharp intensity transitions not corresponding to edges of the object.
7. The computer-implemented method as recited in claim 1, wherein the plurality of identifying features comprise boilerplate content.
8. The computer-implemented method as recited in claim 1, comprising identifying a plurality of distinctive pixels within the plurality of identifying features of the object, wherein the distinctive pixels are located at positions within the digital image characterized by a sharp transition in intensity.
9. The computer-implemented method as recited in claim 1, comprising matching the digital image depicting the object to one of a plurality of reference images each depicting a known object type, wherein the matching comprises determining whether the object includes distinctive pixels that correspond to distinctive pixels present in one or more of the plurality of reference images.
10. The computer-implemented method as recited in claim 1, comprising: matching the digital image depicting the object to one of a plurality of reference images each depicting a known object type; and designating as an outlier a candidate match between a distinctive pixel in the digital image and one or more candidate corresponding distinctive pixels present in one of the plurality of reference images; wherein the outlier is designated in response to determining a distance ratio is greater than a predetermined distance ratio threshold, wherein the distance ratio is a ratio describing: a first distance between the distinctive pixel in the digital image and a first of the one or more candidate corresponding distinctive pixels; and a second distance between the distinctive pixel in the digital image and a second of the one or more candidate corresponding distinctive pixels.
11. The computer-implemented method as recited in claim 1, comprising: matching the digital image depicting the object to one of a plurality of reference images each depicting a known object type; and designating as an outlier a candidate match between a distinctive pixel in the digital image and a candidate corresponding distinctive pixel present in one of the plurality of reference images in response to determining the candidate match is not unique.
12. The computer-implemented method as recited in claim 1, wherein at least a portion of one or more edges of the object for which the location is projected is missing in the digital image.
13. The computer-implemented method as recited in claim 1, wherein projecting the location of the one or more edges of the object is based on a mapping of key points within some or all of the plurality of identifying features to key points of a reference image depicting an object belonging to a same class as the object depicted in the digital image.
14. The computer-implemented method as recited in claim 1, comprising cropping the digital image based at least in part on the projected location of the one or more edges of the object; wherein the cropped digital image depicts a portion of a background of the digital image surrounding the object; and wherein the method comprises detecting one or more transitions between the background and the object within the cropped digital image.
15. The computer-implemented method as recited in claim 1, comprising: cropping the digital image based at least in part on the projected location of the one or more edges of the object; and classifying the object depicted within the cropped digital image.
16. The computer-implemented method as recited in claim 1, comprising: generating a plurality of scaled images based on the digital image, each scaled image being characterized by a different resolution; extracting one or more feature vectors from each scaled image; and matching one or more of the scaled images to one of a plurality of reference images, each reference image depicting a known object type and being characterized by a known resolution.
17. The computer-implemented method as recited in claim 1, comprising: attempting to detect the object within the digital image using a plurality of predetermined object detection models each corresponding to a known object type; and determining a classification of the object based on a result of attempting to detect the object within the digital image using the plurality of predetermined object detection models; and wherein the classification of the object is determined to be the known object type corresponding to one of the object detection models for which the attempt to detect the object within the digital image was successful.
18. A computer program product for detecting an object depicted in a digital image, comprising a non-transitory computer readable medium having stored thereon computer readable program instructions configured to cause a processor, upon execution thereof, to: generate, using the processor, a plurality of scaled images based on the digital image, each scaled image being characterized by a different resolution; extract, using the processor, one or more feature vectors from each scaled image; match, using the processor, one or more of the scaled images to one of a plurality of reference images based on the one or more feature vectors, each reference image depicting a known object type and being characterized by a known resolution; detect, using the processor, a plurality of identifying features of the object within the scaled image matched to the one of the plurality of reference images, wherein the plurality of identifying features are located internally with respect to the object; and project, using the processor, a location of one or more edges of the object based at least in part on the plurality of identifying features.
19. The computer program product as recited in claim 18, comprising computer readable program instructions configured to cause the processor, upon execution thereof, to: match the digital image depicting the object to one of a plurality of reference images each depicting a known object type.
20. A system for detecting an object depicted in a digital image, comprising a processor and logic embodied with and/or executable by the processor, the logic being configured to cause the processor, upon execution thereof, to: detect a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; and project a location of one or more edges of the object based at least in part on the plurality of identifying features, wherein at least portions of at least one of the one or more edges of the object displayed in the digital image are missing from the digital image.
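The outlier tests recited in claims 10 and 11 can be sketched roughly as follows. The claims do not say how distance is measured or which candidates count as "first" and "second", so this example assumes Euclidean distance between feature vectors, takes the nearest and second-nearest candidates, and uses an illustrative threshold of 0.8. The function name `match_or_outlier` and the sample data are hypothetical.

```python
import math
from typing import Optional, Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_or_outlier(query: Sequence[float],
                     candidates: Sequence[Sequence[float]],
                     max_ratio: float = 0.8) -> Optional[int]:
    """Return the index of the accepted candidate, or None for an outlier.

    A match is rejected when the nearest candidate is not clearly closer
    than the second-nearest one (distance ratio above max_ratio), which
    also covers the case where the match is not unique.
    """
    if not candidates:
        return None
    ranked = sorted(range(len(candidates)),
                    key=lambda i: euclidean(query, candidates[i]))
    if len(ranked) == 1:
        return ranked[0]
    first = euclidean(query, candidates[ranked[0]])
    second = euclidean(query, candidates[ranked[1]])
    if second == 0.0:
        return None  # two candidates coincide with the query: not unique
    if first / second > max_ratio:
        return None  # ambiguous match, designated as an outlier
    return ranked[0]


# One clearly best candidate is accepted.
accepted = match_or_outlier((0.0, 0.0), [(1.0, 0.0), (10.0, 0.0)])
# Two nearly equidistant candidates are rejected as an outlier.
rejected = match_or_outlier((0.0, 0.0), [(1.0, 0.0), (1.1, 0.0)])
```

Here `accepted` is `0` and `rejected` is `None`. A lower threshold discards more ambiguous matches at the cost of keeping fewer correspondences overall.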
Patents cited by this patent (338)
Kawasaki, Somei; Goden, Tatsuhito, Active matrix type display apparatus and driving method thereof.
Gaborski Roger Stephen (Pittsford NY) Pawlicki Thaddeus Francis (Rochester NY), Apparatus and method for identifying specific bone regions in digital X-ray images.
Nakatsuka Kimihiro,JPX, Apparatus for determining image processing parameter, method of the same, and computer program product for realizing the method.
Barrett Terence W. (Vienna VA), Automata networks and methods for obtaining optimized dynamically reconfigurable computational architectures and control.
Block, James; Graef, H. Thomas; Magee, Paul D.; Nelson, Donald S.; Meek, James; McIntyre, Daniel S.; DiPietro, Mark; Ramachandran, Natarajan, Automated banking machine with remote user assistance.
Sang ; Jr. Henry W. (Cupertio CA) Tahn Whei-Tsu H. (Sunnyvale CA) Zhang Xiao B. (Foster City CA), Automated method for creating templates in a forms recognition and processing system.
Iwai, Yoshiaki; Yoshigahara, Takayuki, Camera calibration apparatus and method, image processing apparatus and method, program providing medium, and camera.
McElroy, John F.; Chorvat, Robert J., Cannabinoid receptor antagonists/inverse agonists useful for treating metabolic disorders, including obesity and diabetes.
Nishimura Kazuyuki (Ichikawa JPX) Sato Shinichi (Yokohama JPX), Color picture processing apparatus for reproducing a color picture having a smoothly changed gradation.
Suzuki,Masahiro; Tamune,Michihiro; Chen,Zhe Hong; Juen,Masahiro, Digital camera, storage medium for image signal processing, carrier wave and electronic camera.
Rowe Edward R. ; Priyadarshan Eswar ; Anderson Kenneth S. ; Al-Shamma Nabeel A. ; Taft Edward A. ; McQuarrie Elizabeth M. ; Cohn Richard, Displaying electronic documents with substitute fonts.
Nagatsuka,Tetsuro; Miyachi,Tatsuo; Shimada,Atsuo; Takeya,Kazutoshi; Kemmochi,Eiji; Nakajima,Akiko; Yamasaki,Makoto; Fujita,Katsuhiko, Document classification system and method for classifying a document according to contents of the document.
Borrey Roland G. (19251 Canyon Dr. Villa Park CA 92667) Borrey Daniel G. (19251 Canyon Dr. Villa Park CA 92667), Document identification by characteristics matching.
Clark ; Jr. Louis George (St. Charles MO) Gummow ; Jr. Donald Romaine (O'Fallon MO) Vanacht Marc (St. Louis MO), Hand-held GUI PDA with GPS/DGPS receiver for collecting agronomic and GPS position data.
LeBrun Thomas Q. (Dallas TX) Cage Kerry (Carrollton TX) Arnold Dennis D. (Carrollton TX), Image based document processing and information management system and apparatus.
Mino, Kazuhiro; Yoda, Akira; Ohtsuka, Shuichi; Ono, Shuji; Ito, Wataru; Yamada, Masahiko, Image displaying system and apparatus for displaying images by changing the displayed images based on direction or direction changes of a displaying unit.
Naofumi Yamamoto JP; Haruko Kawakami JP; Gururaj Rao JP, Image processing apparatus for discriminating image field of original document plural times and method therefor.
Appelt, Douglas E.; Arnold, James Frederick; Bear, John S.; Hobbs, Jerry Robert; Israel, David J.; Kameyama, Megumi; Martin, David L.; Myers, Karen Louise; Ravichandran, Gopalan; Stickel, Mark Edward, Information retrieval by natural language querying.
David L. Patton ; John R. Fredlund ; John D. Buhr, Method and apparatus for modifying a portion of an image in accordance with colorimetric parameters.
Walnut David Francis ; Berenstein Carlos Alberto ; Liu K. J. Ray ; Rashid-Farrokhi Farrokh, Method and apparatus for processing data from a tomographic imaging system.
Withers,William Douglas, Method and apparatus for recognizing a digitized form, extracting information from a filled-in form, and generating a corrected filled-in form.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Polyakov Vladislav G. (Moscow RUX) Ryleev Mikhail A. (Moscow RUX), Method and apparatus for representing image data using polynomial approximation method and iterative transformation-repa.
Green, Stephen J.; Lamere, Paul B.; Alexander, Jeffrey L.; Haberl, Karl R., Method and apparatus for searching and resource discovery in a distributed enterprise system.
Winkelman Kurt-Helfried (Kiel DEX), Method and apparatus for the automatic analysis of density range, color cast, and gradation of image originals on the Ba.
Berman, Arie; Vlahos, Paul; Dadourian, Arpag, Method and apparatus for the automatic generation of subject to background transition area boundary lines and subject shadow retention.
Verstraelen,Boudewijn Joseph Angelus; Verstraelen,Sebastiaan Paul, Method and apparatus for visualization of biological structures with use of 3D position information from segmentation results.
Ejiri Koichi,JPX ; Guan Haike,JPX ; Aoki Shin,JPX, Method and system for generating a composite image from partially overlapping adjacent images taken along a plurality of axes.
Tischler, Karl M., Method arrangement and computer software for the printing of a separator sheet by means of an electrophotographic printer or copier.
Raskar, Ramesh; Willwacher, Thomas H.; van Baar, Jeroen, Method for determining a largest inscribed rectangular image within a union of projected quadrilateral images.
Kanda Shinji (Kawasaki JPX) Wakitani Jun (Kawasaki JPX) Maruyama Tsugito (Kawasaki JPX) Morita Toshihiko (Kawasaki JPX), Method for determining orientation of contour line segment in local area and for determining straight line and corner.
Kurosu Yasuo (Yokosuka JPX) Yokoyama Yoshihiro (Yokohama JPX) Nishikawa Kenichi (Yokohama JPX) Masuzaki Hidefumi (Hadano JPX) Fujinawa Masaaki (Tokyo JPX), Method for determining the amount of skew of image, method for correcting the same, and image data processing system.
Henderson Todd R. ; Spaulding Kevin E. ; Couwenhoven Douglas W., Method for segmenting a digital image into a foreground region and a key color region.
Kohchi Tsukasa JP, Method of and system for extracting predetermined elements from input document based upon model which is adaptively modified according to variable amount in the input document.
Beaulieu Dennis N. (Churchville NY) Compton John T. (LeRoy NY) Wojtanik Eugene R. (Plano TX), Method of calibration of image scanner signal processing circuits.
Dumais Susan T. ; Heckerman David ; Horvitz Eric ; Platt John Carlton ; Sahami Mehran, Methods and apparatus for classifying text and for building a text classifier.
Cheong, Cheol Ho; Han, Tack Don; Kim, Jong Young; Kim, Eui Jae; Jeong, Seong Hun; Kim, Jae Yun; Choi, Han Yeong, Mixed code, and method and apparatus for generating the same.
Fast Bruce B. (2600 Prindle Rd. Belmont CA 94402) Allen Dana R. (1745 Hunt Dr. Burlingame CA 94010), OCR image preprocessing method for image enhancement of scanned documents.
Michimoto Yasuyuki,JPX ; Onda Katsumasa,JPX ; Nishizawa Masato,JPX, Object detecting apparatus in which the position of a planar object is estimated by using hough transform.
Wong, Patrick, System and a method for web-based editing of documents online with an editing interface and concurrent display to webpages and print documents.
Ellis, Stephen M.; Kennedy, Michael J.; Kurani, Ashish Bhoopen; Lowry, Melissa; Meyyappan, Uma; Sahni, Bipin; Stroke, Nikolai, System and method for a mobile wallet.
Woolf,Susan D.; Baird,Andrew; Jiang,Sheng; Beezer,John L.; Rubin,Darryl E., System and method for annotating an electronic document independently of its content.
Pizano Arturo (Milpitas CA) Tan May-Inn (Saratoga CA) Gambo Naoto (Tanashi JPX), System and method for automatically classifying heterogeneous business forms.
Vazquez, Nicolas; Kodosky, Jeffrey L.; Kudukoli, Ram; Schultz, Kevin L.; Nair, Dinesh; Caltagirone, Christophe, System and method for automatically generating a graphical program to perform an image processing algorithm.
Oppenlander, Timothy J.; Underhill, James; Jackson, Elizabeth; Cook, Rebecca Ann; Dimel, Gary R.; Ortize, Carlos, System and method for electronic document generation and delivery.
Emerson,Geoffrey A.; Moon,Rodney G.; Rector,Gerald C.; Stokes,Raymond F.; Sutton,Andrew H., System and method of sorting document images based on image quality.
Heidenreich,James R.; Higgins,Linda S., System and method to customize the facilitation of development of user thinking about and documenting of an arbitrary problem.
Sampath, Meera; Nichols, Stephen J.; Richenderfer, Elizabeth A., Systems and methods for automated image quality based diagnostics and remediation of document processing systems.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Stephen Michael; Ma, Jiyong, Systems and methods for classifying objects in digital images captured using mobile devices.
Amtrup, Jan Willers; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.
Amtrup, Jan W.; Ma, Jiyong; Kilby, Steven; Macciola, Anthony, Systems and methods for identification document processing and business workflow integration.
Amtrup, Jan W.; Thompson, Stephen Michael; Kilby, Steven; Macciola, Anthony, Systems and methods for identification document processing and business workflow integration.
Ferlitsch,Andrew Rodney; DeVore,Darwin Alan, Systems and methods for manipulating electronic information using a three-dimensional iconic representation.
Amtrup, Jan Willers; Macciola, Anthony; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for mobile image capture and processing.
Macciola, Anthony; Amtrup, Jan Willers; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for mobile image capture and processing.
Roach, John J.; Nepomniachtchi, Grisha; Couch, Robert; Avergun, Mikhail, Systems and methods for obtaining financial offers using mobile image capture.
Macciola, Anthony; Amtrup, Jan W.; Ma, Jiyong; Borrey, Roland G.; Schmidtler, Mauritius A. R.; Asuri, Hari S.; Fechter, Joel S.; Taylor, Robert A., Systems and methods for processing video data.
Gorski, Nikolai D.; Semenov, Andrey V.; Anisimov, Valery; Maksimov, Sergey K.; Sashov, Sergey N., Systems and methods for recognizing information in objects using a mobile device.
Macciola, Anthony; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W.; Amtrup, Jan, Systems and methods for three dimensional geometric reconstruction of captured image data.
Borrey, Roland G.; Schmidtler, Mauritius A. R.; Taylor, Robert A.; Fechter, Joel S.; Asuri, Hari S., Systems and methods of accessing random access cache for rescanning.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods, and computer program products for determining document validity.
Macciola, Anthony; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher; Amtrup, Jan W., Determining distance between an object and a capture device based on captured image data.
Thrasher, Christopher W.; Shustorovich, Alexander; Thompson, Stephen Michael; Amtrup, Jan W.; Macciola, Anthony, Iterative recognition-guided thresholding and data extraction.
Burgos, Xavier; Kabbaj, Nezha; Le Clerc, Francois, Method and device for determining the shape of an object represented in an image, corresponding computer program product and computer-readable medium.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.