Patent classifications
G06T7/168
Generating class-agnostic object masks in digital images
The present disclosure relates to a class-agnostic object segmentation system that automatically detects, segments, and selects objects within digital images irrespective of object semantic classifications. For example, the object segmentation system utilizes a class-agnostic object segmentation neural network to segment each pixel in a digital image into an object mask. Further, in response to detecting a selection request of a target object, the object segmentation system utilizes a corresponding object mask to automatically select the target object within the digital image. In some implementations, the object segmentation system utilizes a class-agnostic object segmentation neural network to detect and automatically select a partial object in the digital image in response to a target object selection request.
Generating class-agnostic object masks in digital images
The present disclosure relates to a class-agnostic object segmentation system that automatically detects, segments, and selects objects within digital images irrespective of object semantic classifications. For example, the object segmentation system utilizes a class-agnostic object segmentation neural network to segment each pixel in a digital image into an object mask. Further, in response to detecting a selection request of a target object, the object segmentation system utilizes a corresponding object mask to automatically select the target object within the digital image. In some implementations, the object segmentation system utilizes a class-agnostic object segmentation neural network to detect and automatically select a partial object in the digital image in response to a target object selection request.
Moving image analysis apparatus, system, and method
A moving image analysis apparatus includes at least one of a processor and a circuitry configured to perform operations including acquiring first data and second data used in processing, in which a moving image is compressed and encoded, for a first frame and a second frame, respectively, included in the moving image, detecting first feature data indicating a first feature of the moving image on the basis of the first frame and the first data and detecting second feature data indicating a second feature of the moving image on the basis of the second frame and the second data, and detecting an object included in the first frame on the basis of the first feature data and the second feature data.
Systems and methods for visually guided audio separation
A system for separating audio based on sound producing objects includes a processor configured to receive video data and audio data. The processor is also configured to perform object detection using the video data to identify a number of sound producing objects in the video data and predict a separation for each sound producing object detected in the video data. The processor is also configured to generate separated audio data for each sound producing object using the separation and the audio data.
METHOD AND APPARATUS FOR PROCESSING AN IMAGE OF A ROAD TO IDENTIFY A REGION OF THE IMAGE WHICH REPRESENTS AN UNOCCUPIED AREA OF THE ROAD
A method of processing an image of a scene including a road acquired by a vehicle-mounted camera to generate boundary data indicative of a boundary of an image region which represents an unoccupied area of the road, comprising: generating (S10) an LL sub-band image of an N.sup.th level of an (N+1)-level discrete wavelet transform, DWT, decomposition of the image by iteratively low-pass filtering and down-sampling the image N times, where N is an integer equal to or greater than one; generating (S20) a sub-band image of an (N+1).sup.th level by high-pass filtering the LL sub-band image of the N.sup.th level, and down-sampling a result of the high-pass filtering, such that the sub-band image of the (N+1).sup.th level has a pixel region having substantially equal pixel values representing the unoccupied area of the road in the image; and generating (S30) the boundary data by determining a boundary of the pixel region.
METHOD AND APPARATUS FOR PROCESSING AN IMAGE OF A ROAD HAVING A ROAD MARKER TO IDENTIFY A REGION OF THE IMAGE WHICH REPRESENTS THE ROAD MARKER
A method of processing an image of a road having a road marker acquired by a vehicle-mounted camera to generate boundary data indicating a boundary of the road marker region of the image which represents the road marker, comprising: generating an LL sub-band image of an M.sup.th level of an (M+1)-level discrete wavelet transform, DWT, decomposition of the image by iteratively low-pass filtering and down-sampling the image M times; generating a sub-band image of an (M+1).sup.th level of the (M+1) level DWT decomposition by high-pass filtering the LL sub-band image and down-sampling a result of the high-pass filtering; and determining a boundary of a region of pixels of the sub-band image of the (M+1).sup.th level, the region being surrounded by pixels having pixel values substantially different to the pixel values of the pixels in the region, the determined boundary indicating the boundary of the road marker region.
METHOD AND APPARATUS FOR DETERMINING THE SPEED OF A VEHICLE TRAVELLING ALONG A ROAD BY PROCESSING IMAGES OF THE ROAD
An apparatus for determining a speed of a vehicle along a road by processing a first image and a second image of the road captured by a camera on the vehicle and comprising respective road marker images of a road marker, the apparatus arranged to: determine a location of the road marker in the first image; predict a location of the road marker in the second image based on the determined location, an estimate of the vehicle speed, and a time period between capture of the images; detect the road marker in a portion of the second image at the predicted location; estimate a distance moved by the vehicle during the time period based on the determined location, and a location of the detected road marker in the portion of the second image; and calculate the speed based on the estimated distance and the time period.
SYSTEM AND METHOD FOR DIFFERENTIATING A TISSUE OF INTEREST FROM ANOTHER PART OF A MEDICAL SCANNER IMAGE
One or more example embodiments provides a system and a method for differentiating a tissue of interest from another part of a medical scanner image, in particular pectoral muscle tissue from breast tissue in an X-ray mammography image. The method comprises providing a medical scanner image; inputting input data into a trained artificial neural network, the input data being based on the provided medical scanner image; generating, by the trained artificial neural network, output data based on the input data, the output data indicating a one-dimensional borderline between at least a part of the tissue of interest and the at least one other part of the medical scanner image; and outputting an output signal comprising or based on the generated output data.
SYSTEM AND METHOD FOR DIFFERENTIATING A TISSUE OF INTEREST FROM ANOTHER PART OF A MEDICAL SCANNER IMAGE
One or more example embodiments provides a system and a method for differentiating a tissue of interest from another part of a medical scanner image, in particular pectoral muscle tissue from breast tissue in an X-ray mammography image. The method comprises providing a medical scanner image; inputting input data into a trained artificial neural network, the input data being based on the provided medical scanner image; generating, by the trained artificial neural network, output data based on the input data, the output data indicating a one-dimensional borderline between at least a part of the tissue of interest and the at least one other part of the medical scanner image; and outputting an output signal comprising or based on the generated output data.
Image processing apparatus and control method thereof
Disclosed herein is an image processing apparatus and a control method thereof. The image processing apparatus includes communication circuitry, a storage, and a controller configured to control the image processing apparatus to: perform object recognition for recognizing a plurality of objects in first image data stored in the storage, obtain a score inferred through operation processing through a neural network for the recognized plurality of objects, generate second image data based on the obtained score and proximity of the plurality of objects, and perform image processing based on the second image data.