Systems and methods for mobile image capture and processing
IPC classification information
Country / Type
United States (US) Patent
Granted
International Patent Classification (IPC, 7th edition)
G06K-009/00
G06T-007/40
H04N-001/40
Application number
US-0334558
(2014-07-17)
Patent number
US-8971587
(2015-03-03)
Inventor / Address
Macciola, Anthony
Shustorovich, Alexander
Thrasher, Christopher W.
Applicant / Address
Kofax, Inc.
Agent / Address
Zilka-Kotab, PC
Citation information
Times cited: 45
Cited patents: 257
Abstract
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner, or multifunction peripheral. Particularly advantageous features include robustly detecting edges of one or more documents depicted in the digital image data, and defining/locating document pages at least partially on this basis. The statistical approaches employed enable robust yet computationally efficient techniques to accomplish page detection, and associated functions, using hardware typically included in mobile devices and within practical (especially temporal) limits imposed by device manufacturers, users, associated and/or downstream computational and/or business processes.
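The statistical edge detection summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `find_candidate_edge`, the window size, the number of background windows, the z-score test, and the threshold are all assumptions chosen for illustration. The sketch scans one row of brightness values from the outer region of the image toward the center, estimates distributions of window minima and maxima from the first few windows (assumed to contain background only), and reports the first pixel whose window deviates significantly from those distributions.

```python
from statistics import mean, stdev


def find_candidate_edge(row, window=4, background_windows=5, z_threshold=4.0):
    """Return the index of a candidate edge point in one row, or None.

    row: brightness values ordered from the outer region toward the center.
    All parameters are hypothetical tuning values, not taken from the patent.
    background_windows must be at least 2 so a standard deviation exists.
    """
    # Sliding analysis windows, advancing one pixel at a time.
    windows = [row[i:i + window] for i in range(len(row) - window + 1)]
    if len(windows) <= background_windows:
        return None

    # Estimate the background distributions of window minima and maxima.
    background = windows[:background_windows]
    bg_min = [min(w) for w in background]
    bg_max = [max(w) for w in background]
    # A floor of 1.0 keeps a perfectly flat background from producing
    # a zero standard deviation (assumed floor, one brightness level).
    min_mu, min_sd = mean(bg_min), max(stdev(bg_min), 1.0)
    max_mu, max_sd = mean(bg_max), max(stdev(bg_max), 1.0)

    # Scan inward and flag the first statistically significant deviation.
    for i in range(background_windows, len(windows)):
        w = windows[i]
        z_min = abs(min(w) - min_mu) / min_sd
        z_max = abs(max(w) - max_mu) / max_sd
        if max(z_min, z_max) > z_threshold:
            # The pixel that just entered the window caused the change.
            return i + window - 1
    return None
```

Running the same scan along every row and column, from each side of the image, would yield one set of candidate edge points per document side. Checking both minima and maxima lets the sketch respond to a document that is either darker or brighter than its background.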
Representative claims
1. A method, comprising: capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; defining a plurality of candidate edge points within the image data; removing one or more outlier candidate edge points from the plurality of candidate edge points; defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document, and wherein the tetragon bounds the digital representation of the document.
2. The method as recited in claim 1, wherein defining the plurality of candidate edge points comprises, for each of a plurality of portions of the image data: calculating one or more statistics corresponding to the portion; estimating one or more distributions of statistics corresponding to the portion; determining whether a statistically significant difference exists between one or more of the statistics calculated for the portion and the distribution of statistics estimated for the portion; and designating a point corresponding to the statistically significant difference as a candidate edge point in response to determining the statistically significant difference exists.
3. The method as recited in claim 2, wherein the statistics calculated for each portion comprise one or more of: a minimum brightness value; a maximum brightness value; and a brightness value range; and wherein the distributions of statistics estimated for each portion comprise one or more of: a distribution of color channel value minima; a distribution of color channel value maxima; and a distribution of color channel value ranges.
4. The method as recited in claim 2, wherein at least one portion comprises a corner region of the image.
5. The method as recited in claim 2, wherein each portion is defined along a path proceeding from an outer region of the image toward a center of the image.
6. The method as recited in claim 5, wherein the path proceeds along one or more of rows and columns of the digital image.
7. The method as recited in claim 2, wherein portion is characterized by a substantially rectangular shape, and wherein each small analysis window is characterized by a substantially rectangular shape.
8. The method as recited in claim 2, wherein each portion is characterized by a single center pixel, and wherein the center pixel is designated as a candidate edge point upon determining the statistically significant difference exists.
9. The method as recited in claim 1, further comprising bypassing one or more variations in a texture of a background of the digital image.
10. The method as recited in claim 1, wherein each side of the tetragon is characterized by a polynomial equation, and wherein the defining comprises determining one or more coefficients for each polynomial equation.
11. The method as recited in claim 1, further comprising defining one or more corners of the tetragon, the defining comprising: calculating one or more intersections between two adjacent sides; and designating an appropriate intersection from the one or more calculated intersections.
12. The method as recited in claim 1, wherein a corner of the tetragon comprises an intersection of two adjacent sides of the tetragon, wherein the two adjacent sides are selected from: one substantially straight line and one substantially curved line; one substantially straight line and one substantially parabolic curve; and two substantially parabolic curves.
13. The method as recited in claim 1, further comprising defining one or more tetragon corners, the defining comprising solving one or more of: a first degree polynomial equation; a second degree polynomial equation; a third degree polynomial equation; and a fourth degree polynomial equation.
14. The method as recited in claim 1, wherein an area of the tetragon comprises at least a threshold percentage of a total area of the digital image.
15. The method as recited in claim 1, wherein a first line connects a calculated top left corner of the tetragon to a calculated bottom right corner of the tetragon, wherein a second line connects a calculated top right corner of the tetragon and a calculated bottom left corner of the tetragon, and wherein the first line and the second line intersect inside the tetragon.
16. The method as recited in claim 1, further comprising: determining whether the tetragon satisfies one or more quality control metrics; and rejecting the tetragon upon determining the tetragon does not satisfy one or more of the quality control metrics, wherein the quality control metrics comprise one or more of: a Least Mean Squares (LMS) support metric, a minimum tetragon area metric, a tetragon corner location metric; and a tetragon diagonal intersection location metric.
17. The method as recited in claim 1, further comprising outputting the digital representation of the document and the tetragon to a display of a mobile device.
18. A system, comprising: a processor configured to execute logic; and logic configured to cause the processor executing the logic to: capture one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; define a plurality of candidate edge points within the image data, remove one or more outlier candidate edge points from the plurality of candidate edge points; define a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and define four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document; and output the digital representation of the document and the tetragon to a display of a mobile device, wherein the tetragon bounds the digital representation of the document.
19. A computer program product comprising a computer readable storage medium having computer readable program code stored thereon, the computer readable program code comprising: computer readable program code configured to cause a processor to: capture one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; define a plurality of candidate edge points within the image data, remove one or more outlier candidate edge points from the plurality of candidate edge points; define a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and define four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document; and output the digital representation of the document and the tetragon to a display of a mobile device, wherein the tetragon bounds the digital representation of the document.
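The steps recited in claims 1, 10, 11 and 15 (outlier removal, polynomial sides, corner intersections, and the diagonal check) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented method: every function name is hypothetical, each side is restricted to a first degree polynomial, the fit is ordinary least squares, and the outlier rule (a multiple of the median residual, with a one-pixel floor) is an assumed stand-in for whatever criterion the patent actually uses.

```python
from statistics import median


def fit_line(points):
    """Least-squares fit v = a*u + b to (u, v) pairs; returns (a, b)."""
    n = len(points)
    su = sum(u for u, _ in points)
    sv = sum(v for _, v in points)
    suu = sum(u * u for u, _ in points)
    suv = sum(u * v for u, v in points)
    a = (n * suv - su * sv) / (n * suu - su * su)
    return a, (sv - a * su) / n


def remove_outliers(points, k=2.5, floor=1.0):
    """Drop points whose residual from a first fit exceeds k * median residual.

    k and floor are assumed tuning values, not taken from the patent.
    """
    a, b = fit_line(points)
    residuals = [abs(v - (a * u + b)) for u, v in points]
    limit = max(k * median(residuals), floor)
    return [p for p, r in zip(points, residuals) if r <= limit]


def corner(horizontal, vertical):
    """Intersect y = a*x + b (top/bottom) with x = c*y + d (left/right)."""
    a, b = horizontal
    c, d = vertical
    x = (c * b + d) / (1 - a * c)
    return x, a * x + b


def _orient(p, q, r):
    """Signed area test: which side of line p->q the point r lies on."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])


def diagonals_cross(tl, tr, br, bl):
    """True if diagonal tl-br and diagonal tr-bl properly intersect."""
    return (_orient(tl, br, tr) * _orient(tl, br, bl) < 0
            and _orient(tr, bl, tl) * _orient(tr, bl, br) < 0)


def fit_tetragon(top, bottom, left, right):
    """Return corners (tl, tr, br, bl), or None if the diagonal check fails.

    Each argument is a list of (x, y) candidate edge points for one side.
    """
    t = fit_line(remove_outliers(top))
    b = fit_line(remove_outliers(bottom))
    # Near-vertical sides are fitted as x = c*y + d, so swap the coordinates.
    l = fit_line(remove_outliers([(y, x) for x, y in left]))
    r = fit_line(remove_outliers([(y, x) for x, y in right]))
    corners = (corner(t, l), corner(t, r), corner(b, r), corner(b, l))
    return corners if diagonals_cross(*corners) else None
```

Fitting the left and right sides as x in terms of y avoids the infinite slope a vertical edge would have in the usual y in terms of x form. Extending the sketch to the curved sides mentioned in claims 12 and 13 would require higher degree fits and a polynomial root solver for the corners, which are omitted here.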
Patents cited by this patent (257)
Kawasaki, Somei; Goden, Tatsuhito, Active matrix type display apparatus and driving method thereof.
Nakatsuka Kimihiro,JPX, Apparatus for determining image processing parameter, method of the same, and computer program product for realizing the method.
Barrett Terence W. (Vienna VA), Automata networks and methods for obtaining optimized dynamically reconfigurable computational architectures and control.
Sang ; Jr. Henry W. (Cupertino CA) Tahn Whei-Tsu H. (Sunnyvale CA) Zhang Xiao B. (Foster City CA), Automated method for creating templates in a forms recognition and processing system.
McElroy, John F.; Chorvat, Robert J., Cannabinoid receptor antagonists/inverse agonists useful for treating metabolic disorders, including obesity and diabetes.
Nishimura Kazuyuki (Ichikawa JPX) Sato Shinichi (Yokohama JPX), Color picture processing apparatus for reproducing a color picture having a smoothly changed gradation.
Suzuki,Masahiro; Tamune,Michihiro; Chen,Zhe Hong; Juen,Masahiro, Digital camera, storage medium for image signal processing, carrier wave and electronic camera.
Rowe Edward R. ; Priyadarshan Eswar ; Anderson Kenneth S. ; Al-Shamma Nabeel A. ; Taft Edward A. ; McQuarrie Elizabeth M. ; Cohn Richard, Displaying electronic documents with substitute fonts.
Nagatsuka,Tetsuro; Miyachi,Tatsuo; Shimada,Atsuo; Takeya,Kazutoshi; Kemmochi,Eiji; Nakajima,Akiko; Yamasaki,Makoto; Fujita,Katsuhiko, Document classification system and method for classifying a document according to contents of the document.
Borrey Roland G. (19251 Canyon Dr. Villa Park CA 92667) Borrey Daniel G. (19251 Canyon Dr. Villa Park CA 92667), Document identification by characteristics matching.
Clark ; Jr. Louis George (St. Charles MO) Gummow ; Jr. Donald Romaine (O'Fallon MO) Vanacht Marc (St. Louis MO), Hand-held GUI PDA with GPS/DGPS receiver for collecting agronomic and GPS position data.
LeBrun Thomas Q. (Dallas TX) Cage Kerry (Carrollton TX) Arnold Dennis D. (Carrollton TX), Image based document processing and information management system and apparatus.
Mino, Kazuhiro; Yoda, Akira; Ohtsuka, Shuichi; Ono, Shuji; Ito, Wataru; Yamada, Masahiko, Image displaying system and apparatus for displaying images by changing the displayed images based on direction or direction changes of a displaying unit.
Naofumi Yamamoto JP; Haruko Kawakami JP; Gururaj Rao JP, Image processing apparatus for discriminating image field of original document plural times and method therefor.
Appelt, Douglas E.; Arnold, James Frederick; Bear, John S.; Hobbs, Jerry Robert; Israel, David J.; Kameyama, Megumi; Martin, David L.; Myers, Karen Louise; Ravichandran, Gopalan; Stickel, Mark Edward, Information retrieval by natural language querying.
Walnut David Francis ; Berenstein Carlos Alberto ; Liu K. J. Ray ; Rashid-Farrokhi Farrokh, Method and apparatus for processing data from a tomographic imaging system.
Withers,William Douglas, Method and apparatus for recognizing a digitized form, extracting information from a filled-in form, and generating a corrected filled-in form.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Polyakov Vladislav G. (Moscow RUX) Ryleev Mikhail A. (Moscow RUX), Method and apparatus for representing image data using polynomial approximation method and iterative transformation-repa.
Green, Stephen J.; Lamere, Paul B.; Alexander, Jeffrey L.; Haberl, Karl R., Method and apparatus for searching and resource discovery in a distributed enterprise system.
Winkelman Kurt-Helfried (Kiel DEX), Method and apparatus for the automatic analysis of density range, color cast, and gradation of image originals on the Ba.
Berman, Arie; Vlahos, Paul; Dadourian, Arpag, Method and apparatus for the automatic generation of subject to background transition area boundary lines and subject shadow retention.
Verstraelen,Boudewijn Joseph Angelus; Verstraelen,Sebastiaan Paul, Method and apparatus for visualization of biological structures with use of 3D position information from segmentation results.
Ejiri Koichi,JPX ; Guan Haike,JPX ; Aoki Shin,JPX, Method and system for generating a composite image from partially overlapping adjacent images taken along a plurality of axes.
Tischler, Karl M., Method arrangement and computer software for the printing of a separator sheet by means of an electrophotographic printer or copier.
Kanda Shinji (Kawasaki JPX) Wakitani Jun (Kawasaki JPX) Maruyama Tsugito (Kawasaki JPX) Morita Toshihiko (Kawasaki JPX), Method for determining orientation of contour line segment in local area and for determining straight line and corner.
Kurosu Yasuo (Yokosuka JPX) Yokoyama Yoshihiro (Yokohama JPX) Nishikawa Kenichi (Yokohama JPX) Masuzaki Hidefumi (Hadano JPX) Fujinawa Masaaki (Tokyo JPX), Method for determining the amount of skew of image, method for correcting the same, and image data processing system.
Henderson Todd R. ; Spaulding Kevin E. ; Couwenhoven Douglas W., Method for segmenting a digital image into a foreground region and a key color region.
Kohchi Tsukasa JP, Method of and system for extracting predetermined elements from input document based upon model which is adaptively modified according to variable amount in the input document.
Beaulieu Dennis N. (Churchville NY) Compton John T. (LeRoy NY) Wojtanik Eugene R. (Plano TX), Method of calibration of image scanner signal processing circuits.
Dumais Susan T. ; Heckerman David ; Horvitz Eric ; Platt John Carlton ; Sahami Mehran, Methods and apparatus for classifying text and for building a text classifier.
Cheong, Cheol Ho; Han, Tack Don; Kim, Jong Young; Kim, Eui Jae; Jeong, Seong Hun; Kim, Jae Yun; Choi, Han Yeong, Mixed code, and method and apparatus for generating the same.
Michimoto Yasuyuki,JPX ; Onda Katsumasa,JPX ; Nishizawa Masato,JPX, Object detecting apparatus in which the position of a planar object is estimated by using hough transform.
Ellis, Stephen M.; Kennedy, Michael J.; Kurani, Ashish Bhoopen; Lowry, Melissa; Meyyappan, Uma; Sahni, Bipin; Stroke, Nikolai, System and method for a mobile wallet.
Woolf,Susan D.; Baird,Andrew; Jiang,Sheng; Beezer,John L.; Rubin,Darryl E., System and method for annotating an electronic document independently of its content.
Vazquez, Nicolas; Kodosky, Jeffrey L.; Kudukoli, Ram; Schultz, Kevin L.; Nair, Dinesh; Caltagirone, Christophe, System and method for automatically generating a graphical program to perform an image processing algorithm.
Emerson,Geoffrey A.; Moon,Rodney G.; Rector,Gerald C.; Stokes,Raymond F.; Sutton,Andrew H., System and method of sorting document images based on image quality.
Heidenreich,James R.; Higgins,Linda S., System and method to customize the facilitation of development of user thinking about and documenting of an arbitrary problem.
Sampath, Meera; Nichols, Stephen J.; Richenderfer, Elizabeth A., Systems and methods for automated image quality based diagnostics and remediation of document processing systems.
Ferlitsch,Andrew Rodney; DeVore,Darwin Alan, Systems and methods for manipulating electronic information using a three-dimensional iconic representation.
Roach, John J.; Nepomniachtchi, Grisha; Couch, Robert; Avergun, Mikhail, Systems and methods for obtaining financial offers using mobile image capture.
Gorski, Nikolai D.; Semenov, Andrey V.; Anisimov, Valery; Maksimov, Sergey K.; Sashov, Sergey N., Systems and methods for recognizing information in objects using a mobile device.
Borrey, Roland G.; Schmidtler, Mauritius A. R.; Taylor, Robert A.; Fechter, Joel S.; Asuri, Hari S., Systems and methods of accessing random access cache for rescanning.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods, and computer program products for determining document validity.
Ma, Jiyong; Thompson, Stephen Michael; Amtrup, Jan W., Content-based detection and three dimensional geometric reconstruction of objects in image and video data.
Macciola, Anthony; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher; Amtrup, Jan W., Determining distance between an object and a capture device based on captured image data.
Thrasher, Christopher W.; Shustorovich, Alexander; Thompson, Stephen Michael; Amtrup, Jan W.; Macciola, Anthony, Iterative recognition-guided thresholding and data extraction.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving a broadcast radio service offer from an image.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving a broadcast radio service offer from an image.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving a location of a vehicle service center from an image.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving a location of a vehicle service center from an image.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving vehicle information from an image and posting the vehicle information to a website.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for receiving vehicle information from an image and posting the vehicle information to a website.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for recovering a vehicle identification number from an image.
Wilbert, Anthony Russell; Wach, Hans Brandon; Chung, David Ching-Chien, Method and apparatus for recovering a vehicle identification number from an image.
Shustorovich, Alexander; Thrasher, Christopher W.; Ma, Jiyong; Macciola, Anthony; Amtrup, Jan W., Mobile document detection and orientation based on reference object characteristics.
Wilbert, Anthony Russell; Chung, David Ching-Chien; Wach, Hans Brandon; Rauker, Goran Matko; White, Solomon John, System and method for electronic processing of vehicle transactions based on image detection of vehicle license plate.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.
Macciola, Anthony; Amtrup, Jan W.; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W.; Thompson, Stephen Michael, Systems and methods for classifying objects in digital images captured using mobile devices.
Macciola, Anthony; Amtrup, Jan Willers; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for mobile image capture and processing.
Thrasher, Christopher W.; Shustorovich, Alexander; Thompson, Stephen Michael; Amtrup, Jan W.; Macciola, Anthony; Borrey, Roland G.; Schmidtler, Mauritius A. R.; Taylor, Robert A.; Fechter, Joel S.; Asuri, Hari S., Systems and methods of processing scanned data.