Patent classifications
G06K9/38
Intelligent whiteboard collaboration systems and methods
Systems and methods are provided for capturing time-stamped data from whiteboard video signals and producing high-resolution whiteboard images. Local patches around a multitude of pixels in the whiteboard are used in classifying background white pixels and foreground color pixels for each foreground marker color. Clustering is performed in alternative color spaces globally and locally in defining background white and each foreground marker color. Color normalization is performed for each foreground pixel classified as a foreground marker color and for each image sensor color plane separately utilizing the maximum local background white and the darkest pixel intensities in local patches. Strokes are reconstructed based on spline interpolation of inflection points of cross sections along the length of each stroke for a foreground marker color with a predetermined width. Also provided is an intelligent whiteboard collaboration system including a messaging utility whereby participants based on relevant biometrics information are enabled to access time-lapse whiteboard data and communicate with the system and other participants.
ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. The extracted data includes connected components that overlap and/or are obscured by unique background. Corresponding systems and computer program products are disclosed.
Image Recognition System
Systems and methods for predicting items within content and using improved, fine-grained image classification techniques to produce images used to identify consumer products in the real-world by allowing for the recognition of a product using an image captured under a variety of conditions and environments, such as angles, lighting, camera settings, and the like.
SYSTEM, METHOD AND APPARATUS FOR ASSISTING A DETERMINATION OF MEDICAL IMAGES
A quantification system (700) is described that includes: at least one input (710) configured to provide two input medical images and two locations of interest in said input medical images that correspond to a same anatomical region; and a mapping circuit (725) configured to compute a direct quantification of change of said input medical images from the at least one input (710).
Ultrasonic fingerprint sensor with low-frequency vibration source
An apparatus may include an ultrasonic sensor system, a low-frequency vibration source and a control system. The ultrasonic sensor system may include an ultrasonic receiver and an ultrasonic transmitter configured for transmitting ultrasonic waves in a first frequency range (e.g., 1 MHz to 30 MHz). The low-frequency vibration source may be configured for generating low-frequency vibrations in a second frequency range (e.g., the range of 5 Hz to 2000 Hz). The control system may be configured for synchronizing the generation of the first low-frequency vibrations and the transmission of the first ultrasonic waves.
Handwriting detector, extractor, and language classifier
Disclosed are methods for handwriting recognition. In some aspects, an image representing a page of a sample document is analyzed to identify a region having indications of handwriting. The region is analyzed to determine frequencies of a plurality of geometric features within the region. The frequencies may be compared to profiles or histograms of known language types, to determine if there are similarities between the frequencies in the sample document relative to those of the known language types. In some aspects, machine learning may be used to characterize the document as a particular language type based on the frequencies of the geometric features.
Preprocessing images for OCR using character pixel height estimation and cycle generative adversarial networks for better character recognition
A text extraction computing method that comprises calculating an estimated character pixel height of text from a digital image. The method may scale the digital image using the estimated character pixel height and a preferred character pixel height. The method may binarizes the digital image. The method may remove distortions using a neural network trained by a cycle GAN on a set of source text images and a set of clean text images. The set of source text images and clean text images are unpaired. The source text images may be distorted images of text. Calculating the estimated character pixel height may include summarizing the rows of pixels into a horizontal projection, and determining a line-repetition period from the projection, and quantifying the portion of the line-repetition period that corresponds to the text as the estimated character pixel height. The method may extract characters from the digital image using OCR.
Equalization-Based Image Processing and Spatial Crosstalk Attenuator
The technology disclosed attenuates spatial crosstalk from sequencing images for base calling. In particular, the technology disclosed accesses an image whose pixels depict intensity emissions from a target cluster and intensity emissions from additional adjacent clusters. The pixels include a center pixel that contains a center of the target cluster. Each pixel in the pixels is divisible into a plurality of subpixels. Depending upon a particular subpixel, in a plurality of subpixels of the center pixel, which contains the center of the target cluster, the technology disclosed selects, from a bank of subpixel lookup tables, a subpixel lookup table that corresponds to the particular subpixel. The selected subpixel lookup table contains pixel coefficients that are configured to maximizes a signal-to-noise ratio. The technology disclosed element-wise multiplies the pixel coefficients with the pixels and determines a weighted sum.
Placement detection system
A placement detection system includes: a placement table; an imaging device that images an object placed on the placement table to generate an input image; and a control device. The control device generates a first binarized image for the input image based on a first threshold value and determines whether the object is placed in a predetermined placement orientation. The control device changes a threshold value for a target pixel to a second threshold value higher than the first threshold value when the object is determined to be placed in the predetermined placement orientation, the target pixel being sandwiched between pixels having pixel levels lower than or equal to the first threshold value and has a higher pixel level than the first threshold value in a first region, generates a second binarized image for the target pixel based on the second threshold value, and detects a shape of the object.
RECOGNIZING OBJECTS IN A PASSABLE WORLD MODEL IN AUGMENTED OR VIRTUAL REALITY SYSTEMS
One embodiment is directed to a system for enabling two or more users to interact within a virtual world comprising virtual world data, comprising a computer network comprising one or more computing devices, the one or more computing devices comprising memory, processing circuitry, and software stored at least in part in the memory and executable by the processing circuitry to process at least a portion of the virtual world data; wherein at least a first portion of the virtual world data originates from a first user virtual world local to a first user, and wherein the computer network is operable to transmit the first portion to a user device for presentation to a second user, such that the second user may experience the first portion from the location of the second user, such that aspects of the first user virtual world are effectively passed to the second user.