G06K9/18

Text realization

In one example, a graphics processing unit may use an optimized geometric realization to render a text shape as a scalable geometry. The graphics processing unit may generate an inner geometry for a text shape. The graphics processing unit also may generate a tessellated edge geometry abutting the inner geometry for an edge of the text shape. The graphics processing unit further may assign a coverage gradient to the tessellated edge geometry to create an anti-aliased edge for the text shape.
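
The coverage gradient described above can be pictured as a per-vertex coverage value interpolated across each triangle of the tessellated edge strip. The following is a minimal sketch under stated assumptions, not the patented implementation: vertices on the inner boundary are assumed to carry coverage 1.0, vertices on the outer boundary 0.0, and the interpolation is plain barycentric interpolation as a GPU rasterizer would perform it. The function names `barycentric` and `interpolated_coverage` are hypothetical.

```python
def barycentric(p, a, b, c):
    """Return the barycentric weights of point p in triangle (a, b, c)."""
    (px, py), (ax, ay), (bx, by), (cx, cy) = p, a, b, c
    det = (by - cy) * (ax - cx) + (cx - bx) * (ay - cy)
    if det == 0:
        raise ValueError("degenerate triangle")
    w_a = ((by - cy) * (px - cx) + (cx - bx) * (py - cy)) / det
    w_b = ((cy - ay) * (px - cx) + (ax - cx) * (py - cy)) / det
    return w_a, w_b, 1.0 - w_a - w_b


def interpolated_coverage(p, triangle, coverages):
    """Interpolate per-vertex coverage values at point p inside the triangle.

    Assumption: coverage is 1.0 on the inner boundary and 0.0 on the outer
    boundary, so the result ramps linearly across the edge strip.
    """
    weights = barycentric(p, *triangle)
    return sum(w * c for w, c in zip(weights, coverages))


# One triangle of an edge strip: two vertices on the inner boundary (y = 0),
# one vertex on the outer boundary (y = 1).
edge_triangle = ((0.0, 0.0), (4.0, 0.0), (0.0, 1.0))
vertex_coverage = (1.0, 1.0, 0.0)
halfway = interpolated_coverage((1.0, 0.5), edge_triangle, vertex_coverage)
```

In this sketch a sample halfway across the strip receives half coverage, which is what produces the soft, anti-aliased edge when the value is used as alpha.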

Smart optical input/output (I/O) extension for context-dependent workflows
09747269 · 2017-08-29

Systems, methods, and computer program products for smart, automated capture of textual information using optical sensors of a mobile device are disclosed. The capture and provision are context-aware: the techniques determine the context of the optical input and invoke a contextually appropriate workflow based thereon. The techniques also provide the capability to normalize, correct, and/or validate the captured optical input and to provide the corrected, normalized, and/or validated information to the contextually appropriate workflow. Other information required by the workflow and available to the mobile device optical sensors may also be captured and provided in a single automatic process. As a result, the overall process of capturing information from optical input using a mobile device, invoking an appropriate workflow, and providing captured information to the workflow is significantly simplified and improved in terms of accuracy of data transfer/entry, speed and efficiency of workflows, and user experience.

Automatically capturing and cropping image of check from video sequence for banking or other computing application

A mobile device with a camera automatically captures an image of a check from a video sequence. A computing application assesses quality metrics of a frame of the video and, if acceptable, initiates capture of the check in that frame without user selection. Metrics include an aspect ratio of the check, image quality of the routing transit symbols that delineate a routing transit number on a MICR line of the check, distances between the routing transit symbols and to an edge of the check, recognition of digits of the routing transit number, checksum of the routing transit number, and image sharpness. Other embodiments include cropping the check from the background of the image, properly orienting the check for viewing, and providing color-coded visual feedback to users about the quality of the image frame of the check, to name a few.
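
Two of the metrics listed above, the routing transit number checksum and the aspect ratio, lend themselves to a short illustration. The checksum below is the standard ABA check (digit weights 3, 7, 1 repeated, sum divisible by 10). Everything else is an assumption made for this sketch: the function names, the aspect-ratio range, the sharpness scale, and the thresholds are hypothetical and are not taken from the abstract.

```python
def routing_checksum_ok(routing_number: str) -> bool:
    """Return True if a 9-digit routing transit number passes the ABA checksum."""
    if len(routing_number) != 9 or any(c not in "0123456789" for c in routing_number):
        return False
    weights = (3, 7, 1, 3, 7, 1, 3, 7, 1)
    total = sum(w * int(d) for w, d in zip(weights, routing_number))
    return total % 10 == 0


def frame_acceptable(aspect_ratio: float, sharpness: float, routing_number: str,
                     ratio_range=(2.0, 2.6), min_sharpness=0.5) -> bool:
    """Decide whether a video frame is good enough to trigger automatic capture.

    Assumptions (hypothetical values): the check width/height ratio should fall
    in ratio_range, and sharpness is a score in [0, 1] produced elsewhere.
    """
    low, high = ratio_range
    return (low <= aspect_ratio <= high
            and sharpness >= min_sharpness
            and routing_checksum_ok(routing_number))


# A frame whose recognized digits pass the checksum is captured;
# a single misread digit causes the frame to be rejected.
capture_now = frame_acceptable(2.18, 0.8, "021000021")
misread = frame_acceptable(2.18, 0.8, "021000022")
```

Using the checksum as a gate means a frame with a misrecognized digit is simply skipped, and a later frame in the video sequence gets another chance.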

Systems and methods for detection and high-quality capture of documents on a cluttered tabletop with an automatically controlled camera
09747499 · 2017-08-29

Described are systems and methods for recognizing paper documents on a tabletop using an overhead camera mounted on pan-tilt servos. The described automated system first finds paper documents on a cluttered desk based on a text probability map, constructed using multiple images acquired at fixed grid positions, and then captures a sequence of high-resolution overlapping frames of the located document(s). These frames are fused together and perspective-rectified, using a computed homography, to reconstruct a high-quality, fronto-parallel document image of sufficient quality for optical character recognition. The extracted textual information may be used, for example, for indexing and search, document repository, and/or language translation applications.
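
The perspective rectification step can be illustrated with a homography estimated from the four corners of a located document. This is a minimal sketch, assuming exactly four corner correspondences and the common normalization in which the bottom-right entry of the matrix is fixed to 1; the abstract does not state how the homography is computed, and the function names and coordinates here are hypothetical.

```python
def solve_linear(a, b):
    """Solve a * x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography_from_corners(src, dst):
    """Estimate the 3x3 homography mapping four src points onto four dst points."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def apply_homography(H, point):
    """Map a point through H, including the perspective divide."""
    x, y = point
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / w)


# Hypothetical document corners seen by the camera (pixels) and the target
# fronto-parallel page rectangle they should map to.
camera_corners = [(120.0, 80.0), (520.0, 110.0), (560.0, 400.0), (90.0, 360.0)]
page_corners = [(0.0, 0.0), (850.0, 0.0), (850.0, 1100.0), (0.0, 1100.0)]
H = homography_from_corners(camera_corners, page_corners)
```

Warping every pixel of the fused image through `H` (or sampling through its inverse) would yield the fronto-parallel view; a production system would more likely rely on an image-processing library for both the estimation and the warp.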

Information input/output device, and medium, using dot patterns with at least two kinds of inks with different reaction characteristic
09740948 · 2017-08-22

The object is to achieve an information output device that reads information from a medium disposed on a stage surface reliably and efficiently, and that offers a strong stage effect and high security. In the information output device, media are disposed on the stage surface with their medium surfaces facing the stage surface, and dot patterns are printed on the medium surfaces on the basis of a predetermined rule. Imaging units disposed in a space below the stage read the dot patterns, the photographed images obtained by the imaging units are converted into code values or coordinate values indicated by the dot patterns, and information corresponding to the code values or the coordinate values is output. Further, light-transmitting reading holes are provided at a plurality of medium-disposing locations on the stage surface, and the imaging units are disposed in the space below the stage corresponding to the respective reading holes so that the imaging units photograph the medium surfaces of the media disposed on the reading holes.

Internet fax message searching and fax content delivery using keyword detection

A system for providing an internet fax service has an engine that processes a raw fax document, received for a user of the internet fax service, using digital character recognition to produce recognized text or an associated formatted text document. A conversion engine may convert the raw fax to a native file format that is one of a) a word processor format, b) a spreadsheet format, c) a slide presentation format, or d) another format that can be read on a user interface. An internet server makes the native file format document available to the user over the internet. A fax content processing unit performs keyword scanning of the recognized text and then allows the user to search for stored faxes by keyword. Other embodiments are also described.
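
The keyword scanning and search described above can be sketched as an inverted index over the recognized text. This is an illustrative assumption about one way to structure it, not the design disclosed in the patent; the function names, the fax identifiers, and the sample text are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split recognized text into lowercase alphanumeric keywords."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_keyword_index(faxes):
    """Map each keyword to the set of fax IDs whose recognized text contains it."""
    index = defaultdict(set)
    for fax_id, recognized_text in faxes.items():
        for word in tokenize(recognized_text):
            index[word].add(fax_id)
    return index


def search_faxes(index, query):
    """Return the IDs of stored faxes that contain every keyword in the query."""
    words = tokenize(query)
    if not words:
        return set()
    matches = set(index.get(words[0], set()))
    for word in words[1:]:
        matches &= index.get(word, set())
    return matches


# Hypothetical recognized text for three stored faxes.
stored = {
    "fax-001": "Invoice 4471 for office supplies, payment due in 30 days.",
    "fax-002": "Purchase order for printer toner and office chairs.",
    "fax-003": "Payment reminder: invoice 4471 is overdue.",
}
index = build_keyword_index(stored)
invoice_hits = search_faxes(index, "invoice 4471")
office_hits = search_faxes(index, "Office")
```

Building the index once when a fax is received keeps each later search to a few set intersections rather than a rescan of every stored document.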

Method and system for the spotting of arbitrary words in handwritten documents
09740925 · 2017-08-22

A method and system for spotting keywords in a handwritten document are described. The method comprises the steps of inputting an image of the handwritten document, performing word segmentation on the image to obtain segmented words, performing word matching, and outputting the spotted keywords. The word matching itself consists of the substeps of performing character segmentation on the segmented words, performing character recognition on the segmented characters, performing distance computations on the recognized characters using a Generalized Hidden Markov Model with ergodic topology to identify words based on character models, and performing non-keyword rejection using a classifier based on a combination of Gaussian Mixture Models, Hidden Markov Models, and Support Vector Machines.

Assisted text input for computing devices

Various approaches provide for detecting and recognizing text to enable a user to perform various functions or tasks. For example, a user could point a camera at an object with text, in order to capture an image of that object. The camera can be integrated with a portable computing device that is capable of taking the image and processing the image (or providing the image for processing) to recognize, identify, and/or isolate the text in order to send the image of the object as well as recognized text to an application, function, or system, such as an electronic marketplace.

Analysis of video game videos for information extraction, content labeling, smart video editing/creation and highlights generation

Methods and systems for analyzing video-game videos in connection with facilitating video editing and creation and performing automated extraction of information of interest to facilitate labeling of and/or highlight generation for such videos are provided. According to one embodiment, a video containing content pertaining to a video game is received by a video-game video analysis system. Information regarding the status of the video game over time is received by retrieving game metadata through an API of the video game or by analyzing audio or visual features within the content. Multiple clips are automatically identified within the video for proposed inclusion within an edited version of the video based on the status of the video game over time. The edited version of the video is then generated by (i) joining the automatically identified clips or (ii) joining multiple user-selected clips, including at least one clip selected from the automatically identified clips.
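
The automatic clip identification can be pictured as turning timestamped game-status events into padded, merged time windows. The sketch below is an assumption made for illustration only: the abstract does not specify the event format, which events count as interesting, or how much context surrounds each one, so the function name `propose_clips`, the event labels, and the padding values are all hypothetical.

```python
def propose_clips(events, interesting, pad_before=5.0, pad_after=3.0):
    """Turn timestamped game events into merged (start, end) clip windows.

    events: iterable of (timestamp_seconds, event_kind) pairs, for example
            obtained from game metadata (format assumed for this sketch).
    interesting: set of event kinds worth including in the edited video.
    """
    windows = sorted(
        (max(0.0, t - pad_before), t + pad_after)
        for t, kind in events
        if kind in interesting
    )
    merged = []
    for start, end in windows:
        if merged and start <= merged[-1][1]:
            # Overlaps the previous window: extend it instead of adding a new clip.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hypothetical status timeline for one recorded match.
timeline = [(10.0, "kill"), (12.0, "kill"), (40.0, "idle"), (60.0, "victory")]
clips = propose_clips(timeline, interesting={"kill", "victory"})
```

Joining the returned windows in order corresponds to option (i) in the abstract; a user could instead pick a subset of them and mix in manually selected clips, as in option (ii).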

Methods and systems for detecting topic transitions in a multimedia content
20170228614 · 2017-08-10

According to embodiments illustrated herein, there is provided a method for detecting one or more topic transitions in a multimedia content. The method includes identifying one or more frames from a plurality of frames of the multimedia content based on a comparison between one or more content items in a first frame of the plurality of frames and the one or more content items in a first set of frames of the plurality of frames. The method further includes determining at least a first score and a second score for each of the one or more frames. Additionally, the method includes determining a likelihood for each of the one or more frames based at least on the first score and the second score, wherein the likelihood is indicative of a topic transition among the one or more frames.