Systems and methods for mobile image capture and processing
원문보기
IPC분류정보
국가/구분
United States(US) Patent
등록
국제특허분류(IPC7판)
H04M-001/00
G06K-009/00
G06T-011/20
G06F-003/00
H04N-001/40
G06T-005/00
G06T-007/00
G06T-003/00
G06F-017/40
H04N-001/387
G06T-007/40
G06K-009/32
G06K-009/38
G06K-009/36
출원번호
US-0740145
(2013-01-11)
등록번호
US-9342742
(2016-05-17)
발명자
/ 주소
Amtrup, Jan Willers
Macciola, Anthony
Shustorovich, Alexander
Thrasher, Christopher W.
출원인 / 주소
Kofax, Inc.
대리인 / 주소
Zilka-Kotab, PC
인용정보
피인용 횟수 :
15인용 특허 :
265
초록▼
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impracti
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner or multifunction peripheral.
대표청구항▼
1. A method, comprising: generating a case management interface using a processor;outputting a case management interface to a display of a mobile device;receiving one or more instructions corresponding to one or more case actions via the displayed case management interface, each case action being as
1. A method, comprising: generating a case management interface using a processor;outputting a case management interface to a display of a mobile device;receiving one or more instructions corresponding to one or more case actions via the displayed case management interface, each case action being associated with at least one case;detecting a digital representation of a document depicted in a digital image, the document relating to one or more of the case(s); andperforming one or more of the case actions in response to receiving the instructionswherein the case actions comprise: creating a new case;opening an existing case;deleting one or more of the existing cases;designating a case type;capturing case information;capturing data corresponding to a digital representation of a document the data comprising either image data or video data;outputting the captured data to the display of the mobile device;associating the captured data with one or more of the new case and the existing case;dissociating the captured data from one or more of the new case and the existing case;processing the captured data;outputting the processed data to the display of the mobile device;receiving user input via the display of the mobile device; andsubmitting one or more of the new case and the existing case to a remote server, andwherein detecting the digital representation of the document comprises: defining a plurality of candidate edge points corresponding to the digital representation of the document; anddefining four sides of a tetragon based on the plurality of candidate edge points. 2. The method as recited in claim 1, further comprising: outputting a data capture interface to the display of the mobile device for capturing an image comprising a digital representation of a document;receiving a request from a user to capture the image via the data capture interface;capturing the image in response to receiving the request, the capturing being performed using a capture component of the mobile device; andassociating the captured image with one or more cases. 3. The method as recited in claim 2, further comprising: detecting the digital representation of the document. 4. The method as recited in claim 1, wherein defining the plurality of candidate edge points comprises: defining one or more large analysis windows within a digital image;defining a plurality of small analysis windows within the digital image;estimating one or more distributions of statistics for each large analysis window;calculating one or more statistics for each small analysis window;determining whether a statistically significant difference exists between one or more of the statistics calculated for one of the small analysis windows and a corresponding distribution of statistics estimated for one of the large analysis windows; anddesignating a point in the small analysis window as a candidate edge point upon determining the statistically significant difference exists. 5. The method as recited in claim 2, further comprising: correcting one or more distortions of one or more portions of the digital representation of the document. 6. The method as recited in claim 5, wherein correcting the one or more distortions comprises transforming a tetragon corresponding to the digital representation of the document into a rectangle, wherein the tetragon is characterized by a plurality of equations, wherein each equation corresponds to a side of the tetragon, and wherein each equation is selected from a chosen class of functions. 7. The method as recited in claim 2, further comprising determining whether one or more illumination problems exist in the digital representation of the document. 8. The method as recited in claim 7, wherein determining whether the one or more illumination problems exist comprises: dividing, using a processor, a tetragon comprising a detected digital representation of a document within the digital image into a plurality of sections, each section comprising a plurality of pixels;determining a distribution of brightness values of each section;determining a brightness value range of each section;determining a variability of brightness values of each section;determining whether each section is oversaturated;determining whether each section is undersaturated;determining a number of oversaturated sections;determining a number of undersaturated sections;determining that the digital image is oversaturated upon determining that a ratio of the number of oversaturated sections to a total number of sections exceeds an oversaturation threshold;determining that the digital image is undersaturated upon determining that a ratio of the number of undersaturated sections to the total number of sections exceeds an undersaturation threshold; anddetermining that the illumination problem exists in the digital image upon determining that the digital image is either undersaturated or oversaturated. 9. The method as recited in claim 2, further comprising correcting unevenness of illumination in one or more regions of the digital representation of the document. 10. The method as recited in claim 9, wherein the correcting comprises: deriving a two-dimensional illumination model from the digital image; andapplying the two-dimensional illumination model to each pixel in the digital image. 11. The method as recited in claim 2, further comprising estimating a resolution of the digital representation of the document. 12. The method as recited in claim 11, the estimating comprising: detecting a plurality of connected components of a plurality of non-background elements in the digital image;determining a plurality of likely characters based on the plurality of connected components;determining one or more average character dimensions based on the plurality of likely text characters; andestimating the resolution of the digital image based on the one or more average character dimensions. 13. The method as recited in claim 2, further comprising determining whether the image comprises one or more blurred regions. 14. The method as recited in claim 13, wherein determining whether the image comprises one or more blurred regions comprises: dividing, using a processor, a tetragon comprising a digital representation of a document in a digital image into a plurality of sections, each section comprising a plurality of pixels;for each section: determining whether the section contains one or more sharp pixel-to-pixel transitions in a first direction;counting a total number of first direction sharp pixel-to-pixel transitions for the section (SS1);determining whether the section contains one or more blurred pixel-to-pixel transitions in the first direction;counting a total number of first-direction blurred pixel-to-pixel transitions for the section (SB1);determining whether the section contains one or more sharp pixel-to-pixel transitions in a second direction;counting a total number of second direction sharp pixel-to-pixel transitions for the section (SS2);determining whether the section contains one or more blurred pixel-to-pixel transitions in the second direction;counting a total number of second-direction blurred pixel-to-pixel transitions for the section (SB2);determining the section is blank upon determining: SS1 is less than a predetermined sharp transition threshold,SB1 is less than a predetermined blurred transition threshold,SS2 is less than a predetermined sharp transition threshold, andSB2 is less than a predetermined blurred transition threshold; anddetermining, for all non-blank sections, a first direction blur ratio r1=SS1/SB1;determining, for all non-blank sections, a second direction blur ratio r2=SS2/SB2;determining that a non-blank section is blurred in the first direction upon determining that r1 is less than a predefined section blur ratio threshold; anddetermining that a non-blank section is blurred in the second direction upon determining that r2 is less than the predefined section blur ratio threshold; anddetermining that a non-blank section is blurred upon determining one or more of: the section is blurred in the first direction, andthe section is blurred in the section direction; anddetermining a total number of blurred sections;calculating an image blur ratio R comprising: the total number blurred sections to a total number of sections; anddetermining the digital image is blurred upon determining the image blur ratio is greater than a predetermined image blur threshold. 15. The method as recited in claim 2, further comprising: outputting an authentication interface to a display of the mobile device;receiving authentication data via the authentication interface;receiving an authentication request;determining whether the authentication data is valid in response to receiving the authentication request;granting access to one or more secure case actions upon determining the authentication data is valid; anddenying access to the secure case actions upon determining the authentication data is not valid. 16. The method as recited in claim 1, wherein the capturing comprises: receiving user input via the display of the mobile device; andcapturing data using a capture component of the mobile device in response to the user input. 17. The method as recited in claim 1, wherein the capturing comprises: receiving user input via the display of the mobile device; andreading the data from a computer-readable storage medium of the mobile device in response to the user input. 18. The method as recited in claim 1, wherein the user input corresponds to metadata related to the one or more of the new case and the existing case; and further comprising associating the metadata with one or more of the new case and the existing case, the metadata comprising one or more of: one or more alphanumeric characters,one or more symbols,one or more signatures; and one or more pointers. 19. The method as recited in claim 1, wherein designating the case type comprises: displaying a plurality of potential case types on the display of the mobile device;receiving user input via the display of the mobile device, the user input indicating one of the plurality of the potential case types is a desired case type; anddesignating the case type as the desired case type. 20. The method as recited in claim 19, wherein the potential case types comprise: an insurance claim; a loan application; a proof of delivery; an undefined case type; a new account opening; an educational program application; a medical record; an expense management; an accident report; and a user-defined case type. 21. The method as recited in claim 1, wherein the case information comprises one or more of a digital signature, an electronic signature, and a signature created by hand. 22. The method as recited in claim 1, wherein the processing comprises: detecting the digital representation of the document in the captured data;determining whether the digital representation of the document is characterized by a rectangular shape;determining whether the digital representation of the document is characterized by a skew angle≠0°; andtransforming the digital representation of the document into a substantially rectangular digital representation of the document characterized by a skew angle=0° upon determining the digital representation of the document is not characterized by a rectangular shape and the skew angle≠0°. 23. The method as recited in claim 22, the processing further comprising: estimating a resolution of the digital representation of the document; andmatching the digital representation of the document to a known document type based on the estimated resolution. 24. The method as recited in claim 22, the processing further comprising: determining whether the substantially rectangular, deskewed digital representation of the document is characterized by one or more problematic illumination regions;outputting a preview interface to the display of the mobile device upon determining the substantially rectangular, deskewed digital representation of the document is characterized by one or more problematic illumination regions;receiving user input relating to the one or more problematic illumination regions via the display of the mobile device; andcorrecting one or more unevenly illuminated regions in response to the user input relating to the one or more unevenly illuminated regions. 25. The method as recited in claim 22, the processing further comprising: determining whether the substantially rectangular, deskewed digital representation of the document is characterized by one or more blurred regions;outputting a preview interface to the display of the mobile device upon determining the substantially rectangular, deskewed digital representation of the document is characterized by one or more blurred regions;receiving user input relating to the one or more blurred regions via the display of the mobile device; andaccepting, rejecting, or reprocessing the substantially rectangular, deskewed digital representation characterized by one or more blurred regions in response to the user input relating to the one or more blurred regions. 26. The method as recited in claim 1, the case actions comprising printing one or more documents related to one or more cases, the printing comprising: submitting a print request from the mobile device to a remote resource, the remote resource and the mobile device not being in communication via a local network; and printing the one or more documents at a remote location using the remote resource in response to the print request. 27. The method as recited in claim 1, wherein performing the one or more case actions utilizes a processor of the mobile device, the mobile device having an integrated camera. 28. A system, comprising: a processor configured to execute logic;logic for outputting a case management interface to a display of a mobile device;logic for receiving one or more instructions corresponding to one or more case actions via the displayed case management interface, each case action being associated with at least one case; andlogic for performing one or more of the case actions in response to receiving the instructions; andlogic for detecting a digital representation of a document depicted in a digital image, the document relating to one or more of the case(s);wherein the case actions comprise: creating a new case;opening an existing case;deleting one or more of the existing cases;designating a case type;capturing case information;capturing data corresponding to a digital representation of a document the data comprising either image data or video data;outputting the captured data to the display of the mobile device;associating the captured data with one or more of the new case and the existing case;dissociating the captured data from one or more of the new case and the existing case;processing the captured data;outputting the processed data to the display of the mobile device;receiving user input via the display of the mobile device; andsubmitting one or more of the new case and the existing case to a remote server, andwherein detecting the digital representation of the document comprises: defining a plurality of candidate edge points corresponding to the digital representation of the document; anddefining four sides of a tetragon based on the plurality of candidate edge points. 29. A computer program product comprising a non-transitory computer readable storage medium having computer readable program code stored thereon, the computer readable program code comprising: computer readable program code configured to output a case management interface to a display of a mobile device;computer readable program code configured to receive one or more instructions corresponding to one or more case actions via the displayed case management interface, each case action being associated with at least one case;computer readable program code configured to detect a digital representation of a document depicted in a digital image, the document relating to one or more of the case(s); andcomputer readable program code configured to perform one or more of the case actions in response to receiving the instructions, andwherein the case actions comprise: creating a new case;opening an existing case;deleting one or more of the existing cases;designating a case type;capturing case information;capturing data corresponding to a digital representation of a document the data comprising either image data or video data;outputting the captured data to the display of the mobile device;associating the captured data with one or more of the new case and the existing case;dissociating the captured data from one or more of the new case and the existing case;processing the captured data;outputting the processed data to the display of the mobile device;receiving user input via the display of the mobile device; andsubmitting one or more of the new case and the existing case to a remote server, andwherein detecting the digital representation of the document comprises: defining a plurality of candidate edge points corresponding to the digital representation of the document; anddefining four sides of a tetragon based on the plurality of candidate edge points.
연구과제 타임라인
LOADING...
LOADING...
LOADING...
LOADING...
LOADING...
이 특허에 인용된 특허 (265)
Kawasaki, Somei; Goden, Tatsuhito, Active matrix type display apparatus and driving method thereof.
Nakatsuka Kimihiro,JPX, Apparatus for determining image processing parameter, method of the same, and computer program product for realizing the method.
Barrett Terence W. (Vienna VA), Automata networks and methods for obtaining optimized dynamically reconfigurable computational architectures and control.
Block, James; Graef, H. Thomas; Magee, Paul D.; Nelson, Donald S.; Meek, James; McIntyre, Daniel S.; DiPietro, Mark; Ramachandran, Natarajan, Automated banking machine with remote user assistance.
Sang ; Jr. Henry W. (Cupertio CA) Tahn Whei-Tsu H. (Sunnyvale CA) Zhang Xiao B. (Foster City CA), Automated method for creating templates in a forms recognition and processing system.
McElroy, John F.; Chorvat, Robert J., Cannabinoid receptor antagonists/inverse agonists useful for treating metabolic disorders, including obesity and diabetes.
Nishimura Kazuyuki (Ichikawa JPX) Sato Shinichi (Yokohama JPX), Color picture processing apparatus for reproducing a color picture having a smoothly changed gradation.
Suzuki,Masahiro; Tamune,Michihiro; Chen,Zhe Hong; Juen,Masahiro, Digital camera, storage medium for image signal processing, carrier wave and electronic camera.
Rowe Edward R. ; Priyadarshan Eswar ; Anderson Kenneth S. ; Al-Shamma Nabeel A. ; Taft Edward A. ; McQuarrie Elizabeth M. ; Cohn Richard, Displaying electronic documents with substitute fonts.
Nagatsuka,Tetsuro; Miyachi,Tatsuo; Shimada,Atsuo; Takeya,Kazutoshi; Kemmochi,Eiji; Nakajima,Akiko; Yamasaki,Makoto; Fujita,Katsuhiko, Document classification system and method for classifying a document according to contents of the document.
Borrey Roland G. (19251 Canyon Dr. Villa Park CA 92667) Borrey Daniel G. (19251 Canyon Dr. Villa Park CA 92667), Document identification by characteristics matching.
Clark ; Jr. Louis George (St. Charles MO) Gummow ; Jr. Donald Romaine (O\Fallon MO) Vanacht Marc (St. Louis MO), Hand-held GUI PDA with GPS/DGPS receiver for collecting agronomic and GPS position data.
LeBrun Thomas Q. (Dallas TX) Cage Kerry (Carrollton TX) Arnold Dennis D. (Carrollton TX), Image based document processing and information management system and apparatus.
Naofumi Yamamoto JP; Haruko Kawakami JP; Gururaj Rao JP, Image processing apparatus for discriminating image field of original document plural times and method therefor.
Appelt, Douglas E.; Arnold, James Frederick; Bear, John S.; Hobbs, Jerry Robert; Israel, David J.; Kameyama, Megumi; Martin, David L.; Myers, Karen Louise; Ravichandran, Gopalan; Stickel, Mark Edward, Information retrieval by natural language querying.
Walnut David Francis ; Berenstein Carlos Alberto ; Liu K. J. Ray ; Rashid-Farrokhi Farrokh, Method and apparatus for processing data from a tomographic imaging system.
Withers,William Douglas, Method and apparatus for recognizing a digitized form, extracting information from a filled-in form, and generating a corrected filled-in form.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Guberman Shelja A. (Moscow RUX) Lossev Ilia (Moscow RUX) Pashintsev Alexander V. (Moscow RUX), Method and apparatus for recognizing cursive writing from sequential input information.
Polyakov Vladislav G. (Moscow RUX) Ryleev Mikhail A. (Moscow RUX), Method and apparatus for representing image data using polynomial approximation method and iterative transformation-repa.
Green, Stephen J.; Lamere, Paul B.; Alexander, Jeffrey L.; Haberl, Karl R., Method and apparatus for searching and resource discovery in a distributed enterprise system.
Winkelman Kurt-Helfried (Kiel DEX), Method and apparatus for the automatic analysis of density range, color cast, and gradation of image originals on the Ba.
Berman, Arie; Vlahos, Paul; Dadourian, Arpag, Method and apparatus for the automatic generation of subject to background transition area boundary lines and subject shadow retention.
Verstraelen,Boudewijn Joseph Angelus; Verstraelen,Sebastiaan Paul, Method and apparatus for visualization of biological structures with use of 3D position information from segmentation results.
Tischler, Karl M., Method arrangement and computer software for the printing of a separator sheet by means of an electrophotographic printer or copier.
Kanda Shinji (Kawasaki JPX) Wakitani Jun (Kawasaki JPX) Maruyama Tsugito (Kawasaki JPX) Morita Toshihiko (Kawasaki JPX), Method for determining orientation of contour line segment in local area and for determining straight line and corner.
Kurosu Yasuo (Yokosuka JPX) Yokoyama Yoshihiro (Yokohama JPX) Nishikawa Kenichi (Yokohama JPX) Masuzaki Hidefumi (Hadano JPX) Fujinawa Masaaki (Tokyo JPX), Method for determining the amount of skew of image, method for correcting the same, and image data processing system.
Henderson Todd R. ; Spaulding Kevin E. ; Couwenhoven Douglas W., Method for segmenting a digital image into a foreground region and a key color region.
Kohchi Tsukasa JP, Method of and system for extracting predetermined elements from input document based upon model which is adaptively modified according to variable amount in the input document.
Beaulieu Dennis N. (Churchville NY) Compton John T. (LeRoy NY) Wojtanik Eugene R. (Plano TX), Method of calibration of image scanner signal processing circuits.
Dumais Susan T. ; Heckerman David ; Horvitz Eric ; Platt John Carlton ; Sahami Mehran, Methods and apparatus for classifying text and for building a text classifier.
Cheong, Cheol Ho; Han, Tack Don; Kim, Jong Young; Kim, Eui Jae; Jeong, Seong Hun; Kim, Jae Yun; Choi, Han Yeong, Mixed code, and method and apparatus for generating the same.
Michimoto Yasuyuki,JPX ; Onda Katsumasa,JPX ; Nishizawa Masato,JPX, Object detecting apparatus in which the position of a planar object is estimated by using hough transform.
Ellis, Stephen M.; Kennedy, Michael J.; Kurani, Ashish Bhoopen; Lowry, Melissa; Meyyappan, Uma; Sahni, Bipin; Stroke, Nikolai, System and method for a mobile wallet.
Woolf,Susan D.; Baird,Andrew; Jiang,Sheng; Beezer,John L.; Rubin,Darryl E., System and method for annotating an electronic document independently of its content.
Vazquez, Nicolas; Kodosky, Jeffrey L.; Kudukoli, Ram; Schultz, Kevin L.; Nair, Dinesh; Caltagirone, Christophe, System and method for automatically generating a graphical program to perform an image processing algorithm.
Emerson,Geoffrey A.; Moon,Rodney G.; Rector,Gerald C.; Stokes,Raymond F.; Sutton,Andrew H., System and method of sorting document images based on image quality.
Heidenreich,James R.; Higgins,Linda S., System and method to customize the facilitation of development of user thinking about and documenting of an arbitrary problem.
Sampath, Meera; Nichols, Stephen J.; Richenderfer, Elizabeth A., Systems and methods for automated image quality based diagnostics and remediation of document processing systems.
Ferlitsch,Andrew Rodney; DeVore,Darwin Alan, Systems and methods for manipulating electronic information using a three-dimensional iconic representation.
Roach, John J.; Nepomniachtchi, Grisha; Couch, Robert; Avergun, Mikhail, Systems and methods for obtaining financial offers using mobile image capture.
Gorski, Nikolai D.; Semenov, Andrey V.; Anisimov, Valery; Maksimov, Sergey K.; Sashov, Sergey N., Systems and methods for recognizing information in objects using a mobile device.
Borrey, Roland G.; Schmidtler, Mauritius A. R.; Taylor, Robert A.; Fechter, Joel S.; Asuri, Hari S., Systems and methods of accessing random access cache for rescanning.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods and computer program products for determining document validity.
Schmidtler, Mauritius A. R.; Borrey, Roland G.; Amtrup, Jan W.; Thompson, Stephen Michael, Systems, methods, and computer program products for determining document validity.
Ma, Jiyong; Thompson, Stephen Michael; Amtrup, Jan W., Content-based detection and three dimensional geometric reconstruction of objects in image and video data.
Macciola, Anthony; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher; Amtrup, Jan W., Determining distance between an object and a capture device based on captured image data.
Thrasher, Christopher W.; Shustorovich, Alexander; Thompson, Stephen Michael; Amtrup, Jan W.; Macciola, Anthony, Iterative recognition-guided thresholding and data extraction.
Shustorovich, Alexander; Thrasher, Christopher W.; Ma, Jiyong; Macciola, Anthony; Amtrup, Jan W., Mobile document detection and orientation based on reference object characteristics.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.
Amtrup, Jan W.; Macciola, Anthony; Thompson, Steve; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for classifying objects in digital images captured using mobile devices.
Macciola, Anthony; Amtrup, Jan W.; Ma, Jiyong; Shustorovich, Alexander; Thrasher, Christopher W.; Thompson, Stephen Michael, Systems and methods for classifying objects in digital images captured using mobile devices.
Macciola, Anthony; Amtrup, Jan Willers; Shustorovich, Alexander; Thrasher, Christopher W., Systems and methods for mobile image capture and processing.
Thrasher, Christopher W.; Shustorovich, Alexander; Thompson, Stephen Michael; Amtrup, Jan W.; Macciola, Anthony; Borrey, Roland G.; Schmidtler, Mauritius A. R.; Taylor, Robert A.; Fechter, Joel S.; Asuri, Hari S., Systems and methods of processing scanned data.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.