Patent classifications
G06K9/38
METHOD FOR GENERATING TRAINING DATA, IMAGE SEMANTIC SEGMENTATION METHOD AND ELECTRONIC DEVICE
A method for generating training data includes: defining a corresponding category tag for an object model in a three-dimensional scenario; acquiring a plurality of corresponding scenario images by modifying scenario parameters of the three-dimensional scenario; rendering the object model to a monochromic material corresponding to the category tag of the object model; acquiring a semantic segmentation image corresponding to each scenario image based on the rendered object model; and storing each scenario image and the semantic segmentation image corresponding to the scenario image as a set of training data.
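As an illustration only, the step of turning a monochrome render into a semantic segmentation image can be sketched in Python. The category tags, the colour values, and the function names below are hypothetical and are not taken from the patent. The sketch assumes each object model has already been rendered in one flat colour per category tag.

```python
# Hypothetical category tags and the flat colour assigned to each one.
CATEGORY_COLORS = {
    "road": (128, 64, 128),
    "car": (0, 0, 142),
    "person": (220, 20, 60),
}


def segmentation_labels(rendered, category_colors=CATEGORY_COLORS, unknown=None):
    """Map every pixel of a flat-colour render to its category tag.

    `rendered` is a list of rows, each row a list of (R, G, B) tuples.
    Pixels whose colour matches no category are labelled `unknown`.
    """
    color_to_tag = {color: tag for tag, color in category_colors.items()}
    return [[color_to_tag.get(pixel, unknown) for pixel in row] for row in rendered]


def make_training_pair(scene_image, rendered):
    """Bundle one scenario image with its label map as a training sample."""
    labels = segmentation_labels(rendered)
    same_size = len(labels) == len(scene_image) and all(
        len(a) == len(b) for a, b in zip(labels, scene_image)
    )
    if not same_size:
        raise ValueError("scene image and render must have the same size")
    return {"image": scene_image, "labels": labels}
```

Because each category is rendered in a single colour, the label map follows from an exact colour lookup and needs no manual annotation.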
CONVOLUTIONAL NEUTRAL NETWORK IDENTIFICATION EFFICIENCY INCREASING METHOD AND RELATED CONVOLUTIONAL NEUTRAL NETWORK IDENTIFICATION EFFICIENCY INCREASING DEVICE
A convolutional neural network identification efficiency increasing method is applied to a related device. The method includes analyzing an input image to acquire foreground information, utilizing the foreground information to generate a foreground mask, and transforming the input image into an output image via the foreground mask. The output image is then used as the input to convolutional neural network identification to improve object identification efficiency.
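A minimal sketch of the masking step follows. The abstract does not say how the foreground information is acquired, so the sketch assumes simple background subtraction on grayscale images. The function names and the threshold value are hypothetical.

```python
def foreground_mask(image, background, threshold=30):
    """Return a 0/1 mask that is 1 where a pixel differs from the background.

    Background subtraction is an assumption made for this sketch; `image`
    and `background` are equal-sized lists of rows of grayscale values.
    """
    return [
        [1 if abs(p - b) > threshold else 0 for p, b in zip(row, bg_row)]
        for row, bg_row in zip(image, background)
    ]


def apply_mask(image, mask, fill=0):
    """Keep foreground pixels and replace background pixels with `fill`."""
    return [
        [p if m else fill for p, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]
```

The masked output would then be passed to the network in place of the original image.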
SYSTEMS AND METHODS FOR MOBILE IMAGE CAPTURE AND PROCESSING OF DOCUMENTS
Techniques for processing images of documents captured using a mobile device are provided. The images can include different sides of a document from a mobile device for an authenticated transaction. In an example implementation, a method includes inspecting the images to detect a feature associated with a first side of the document. In response to determining an image is the first side of the document, a type of content is selected to be analyzed on the image of the first side and one or more regions of interest (ROIs) are identified on the image of the first side that are known to include the selected type of content. A process can include receiving a sub-image of the image of the first side from the preprocessing unit, and performing a content detection test on the sub-image.
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, AND NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM
An image processing apparatus comprises a specific object detection unit configured to detect a specific object from a moving image, a setting unit configured to set, in the moving image, based on a position of the specific object, a region of interest which is a region for performing an encoding process that produces a relatively higher image quality than in another region other than the region of interest, a moving object detection unit configured to detect a moving object from the moving image, and an encoding unit configured to perform an encoding process using, in the region of interest, an encoding parameter that can produce a relatively higher image quality than an encoding parameter used in the other region, when the moving object is detected in the region of interest.
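The parameter selection described above can be sketched as a choice of quantization parameter (QP) for each block. This is only an illustration: the rectangle format `(x, y, width, height)`, the function names, and the QP values are assumptions, with a lower QP standing in for the parameter that yields higher image quality.

```python
def rects_overlap(a, b):
    """True when two (x, y, width, height) rectangles share any area."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def choose_qp(block, roi, moving_objects, qp_default=36, qp_roi=26):
    """Pick a QP for one block (hypothetical values; lower means higher quality).

    The higher-quality QP is used only for blocks inside the region of
    interest, and only while a moving object is detected in that region.
    """
    in_roi = rects_overlap(block, roi)
    motion_in_roi = any(rects_overlap(obj, roi) for obj in moving_objects)
    return qp_roi if in_roi and motion_in_roi else qp_default
```

When no moving object is inside the region of interest, every block falls back to the default parameter.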
Image processing apparatus, and image processing method
An apparatus includes a first brightness value acquisition unit configured to obtain a first absolute brightness value by converting a representative brightness value of an image obtained using an image pickup unit, into absolute brightness, a first determination unit configured to determine an output value based on an input-output characteristic of an output device, the output value corresponding to the first absolute brightness value obtained by the first brightness value acquisition unit, and a second determination unit configured to determine information about exposure corresponding to the output value.
Method of encoding an image including a privacy mask
A method, system, and computer program product of encoding a digital image comprising a privacy mask. Information representative of pixels in the digital image is received. The pixels are grouped into encoding units. Information representative of a privacy mask area in which a privacy mask is to be applied on the image is also received. All encoding units that at least partially are located within the privacy mask area are identified, and the privacy mask area is extended to be aligned with the identified encoding units. For each encoding unit, a respective quantization parameter to be used for encoding the image is determined. The privacy mask is applied in the extended privacy mask area of the image, and the image with the applied privacy mask is encoded using the determined quantization parameters. The digital image encoding system may be included in a camera.
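The alignment step can be illustrated with a short Python sketch. It assumes square encoding units on a regular grid and a rectangular mask given as half-open pixel bounds `(x0, y0, x1, y1)`. The function names and the unit size of 16 are hypothetical.

```python
def extend_mask_to_units(mask, unit=16):
    """Grow a mask rectangle outward until it lines up with the unit grid.

    `mask` is (x0, y0, x1, y1) with x1 and y1 exclusive. The top-left
    corner is rounded down and the bottom-right corner is rounded up.
    """
    x0, y0, x1, y1 = mask
    aligned_x0 = (x0 // unit) * unit
    aligned_y0 = (y0 // unit) * unit
    aligned_x1 = -(-x1 // unit) * unit  # ceiling division
    aligned_y1 = -(-y1 // unit) * unit
    return aligned_x0, aligned_y0, aligned_x1, aligned_y1


def units_in_mask(mask, unit=16):
    """List the top-left corners of all units covered by the extended mask."""
    x0, y0, x1, y1 = extend_mask_to_units(mask, unit)
    return [(x, y) for y in range(y0, y1, unit) for x in range(x0, x1, unit)]
```

For example, `extend_mask_to_units((10, 20, 40, 50))` returns `(0, 16, 48, 64)`, which covers nine 16x16 units. A real encoder would also clamp the result to the image bounds.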
Systems and methods for mobile image capture and processing
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The exemplary image processing techniques are coupled with inbound and outbound communications protocols and workflows configured to facilitate closed-loop processing, such that a method includes initiating a workflow; providing one or more of case information and raw data to the workflow; processing one or more of the case information and the raw data to generate a processing result; storing at least some of the case information in association with the processing result, wherein the associated case information acts as an identifier of the processing result; transmitting at least the processing result and the identifier; receiving, in response to the transmitting, a reply comprising the identifier; and retrieving at least the processing result using the identifier.
IMAGE DEPTH DECODER AND COMPUTING DEVICE
An image depth decoder includes an NIR image buffer, a reference image ring buffer and a pattern matching engine. The NIR image buffer stores an NIR image inputted by a stream. The reference image ring buffer stores a reference image inputted by a stream. The pattern matching engine is coupled to the NIR image buffer and the reference image ring buffer, and performs a depth computation according to the NIR image and the reference image to output at least one depth value.
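The abstract does not specify the matching algorithm. As a hedged illustration, the sketch below assumes one-dimensional block matching with a sum of absolute differences (SAD) cost, followed by the standard triangulation relation depth = focal length × baseline / disparity. All names and parameters are hypothetical.

```python
def best_disparity(nir_row, ref_row, x, window=3, max_disp=8):
    """Find the shift of the reference row that best matches a NIR window.

    Compares the window centred at `x` in `nir_row` with windows shifted
    left by d pixels in `ref_row` and returns the d with the lowest SAD.
    """
    half = window // 2
    if x - half < 0 or x + half >= len(nir_row):
        raise ValueError("window does not fit inside the row")
    best, best_cost = 0, float("inf")
    for d in range(max_disp + 1):
        if x - half - d < 0:
            break  # shifted window would leave the reference row
        cost = sum(
            abs(nir_row[x + k] - ref_row[x + k - d])
            for k in range(-half, half + 1)
        )
        if cost < best_cost:
            best, best_cost = d, cost
    return best


def depth_from_disparity(disparity, focal_px, baseline):
    """Triangulate depth; returns None when the disparity is zero."""
    if disparity == 0:
        return None
    return focal_px * baseline / disparity
```

A hardware pattern matching engine would run a comparison of this kind for many pixels in parallel as the two images stream in.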
System and method for augmented and virtual reality
One embodiment is directed to a system for enabling two or more users to interact within a virtual world comprising virtual world data, comprising a computer network comprising one or more computing devices, the one or more computing devices comprising memory, processing circuitry, and software stored at least in part in the memory and executable by the processing circuitry to process at least a portion of the virtual world data; wherein at least a first portion of the virtual world data originates from a first user virtual world local to a first user, and wherein the computer network is operable to transmit the first portion to a user device for presentation to a second user, such that the second user may experience the first portion from the location of the second user, such that aspects of the first user virtual world are effectively passed to the second user.
METHODS FOR MOBILE IMAGE CAPTURE OF VEHICLE IDENTIFICATION NUMBERS IN A NON-DOCUMENT
Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.