G06T3/4046

APPARATUS AND METHOD FOR CLASSIFYING CLOTHING ATTRIBUTES BASED ON DEEP LEARNING

Disclosed herein are an apparatus and method for classifying clothing attributes based on deep learning. The apparatus includes memory for storing at least one program and a processor for executing the program, wherein the program includes a first classification unit for outputting a first classification result for one or more attributes of clothing worn by a person included in an input image, a mask generation unit for outputting a mask tensor in which multiple mask layers respectively corresponding to principal part regions obtained by segmenting a body of the person included in the input image are stacked, a second classification unit for outputting a second classification result for the one or more attributes of the clothing by applying the mask tensor, and a final classification unit for determining and outputting a final classification result for the input image based on the first classification result and the second classification result.

CODING SCHEME FOR VIDEO DATA USING DOWN-SAMPLING/UP-SAMPLING AND NON-LINEAR FILTER FOR DEPTH MAP

Methods of encoding and decoding video data are provided. In an encoding method, source video data comprising one or more source views is encoded into a video bitstream. Depth data of at least one of the source views is nonlinearly filtered and downsampled prior to encoding. After decoding, the decoded depth data is up-sampled and nonlinearly filtered.

SUPER RESOLUTION USING CONVOLUTIONAL NEURAL NETWORK
20230052483 · 2023-02-16 ·

An apparatus for super resolution imaging includes a convolutional neural network (104) to receive a low resolution frame (102) and generate a high resolution illuminance component frame. The apparatus also includes a hardware scaler (106) to receive the low resolution frame (102) and generate a second high resolution chrominance component frame. The apparatus further includes a combiner (108) to combine the high resolution illuminance component frame and the high resolution chrominance component frame to generate a high resolution frame (110).

METHOD OF FUSING IMAGE, AND METHOD OF TRAINING IMAGE FUSION MODEL

A method of fusing an image, a method of training an image fusion model, an electronic device, and a storage medium. The method of fusing the image includes: encoding a stitched image obtained by stitching a foreground image and a background image, so as to obtain a feature map; and decoding the feature map to obtain a fused image, wherein the feature map is decoded by: performing a weighting on the feature map by using an attention mechanism, so as to obtain a weighted feature map; performing a fusion on the feature map according to feature statistical data of the weighted feature map, so as to obtain a fused feature; and decoding the fused feature to obtain the fused image.

IMAGE PROVIDING METHOD AND APPARATUS USING ARTIFICIAL INTELLIGENCE, AND DISPLAY METHOD AND APPARATUS USING ARTIFICIAL INTELLIGENCE

Provided is an electronic apparatus configured to provide an image based on artificial intelligence (AI), the electronic apparatus including a processor configured to execute one or more instructions stored in the electronic apparatus to obtain a first image by AI-downscaling an original image by a downscaling neural network, obtain first image data by encoding the first image, based on a display apparatus not supporting an AI upscaling function, obtain a second image by decoding the first image data, obtain a third image by AI-upscaling the second image by an upscaling neural network, and provide, to the display apparatus, second image data obtained by encoding the third image.

DEEP LEARNING-BASED IMAGE QUALITY ENHANCEMENT OF THREE-DIMENSIONAL ANATOMY SCAN IMAGES

Techniques are described for enhancing the quality of three-dimensional (3D) anatomy scan images using deep learning. According to an embodiment, a system is provided that comprises a memory that stores computer executable components, and a processor that executes the computer executable components stored in the memory. The computer executable components comprise a reception component that receives a scan image generated from 3D scan data relative to a first axis of a 3D volume, and an enhancement component that applies an enhancement model to the scan image to generate an enhanced scan image having a higher resolution relative to the scan image. The enhancement model comprises a deep learning neural network model trained on training image pairs respectively comprising a low-resolution scan image and a corresponding high-resolution scan image respectively generated relative to a second axis of the 3D volume.

DEFECT INSPECTION SYSTEM AND METHOD OF USING THE SAME

A method includes patterning a hard mask over a target layer, capturing a low resolution image of the hard mask, and enhancing the low resolution image of the hard mask with a first machine learning model to produce an enhanced image of the hard mask. The method further includes analyzing the enhanced image of the hard mask with a second machine learning model to determine whether the target layer has defects.

Image positioning system and image positioning method based on upsampling

An image positioning system based on upsampling and a method thereof are provided. The image positioning method based on upsampling is to fetch a region image covering a target from a wide region image, determine a rough position of the target, execute an upsampling process on the region image based on neural network data model for obtaining a super-resolution region image, map the rough position onto the super-resolution region image, and analyze the super-resolution region image for determining a precise position of the target. The present disclosed example can significantly improve the efficiency of positioning and effectively reduce the required cost of hardware.

IMAGING SYSTEM, DRIVING ASSISTANCE SYSTEM, AND PROGRAM
20230044180 · 2023-02-09 ·

The driving assistance system includes an imaging device capable of capturing a first monochrome image in a vehicle traveling direction, a first neural network for segmentation processing, a second neural network for depth estimation processing, a determination portion determining a center of a portion cut off from the first monochrome image on the basis of the segmentation processing and the depth estimation processing, a third neural network for colorization processing of only a second cut-off monochrome image, and a display device for enlargement of the second monochrome image subjected to the colorization processing.

UNSUPERVISED LEARNING-BASED SCALE-INDEPENDENT BLUR KERNEL ESTIMATION FOR SUPER-RESOLUTION
20230041888 · 2023-02-09 ·

One embodiment provides a method generating a first image crop and a second image crop randomly extracted from a low-quality image and a high-quality image, respectively. The method further comprises comparing the first image crop and the second image crop using a plurality of loss functions including pixel-wise loss to calculate losses, and optimizing a model trained to estimate a realistic scale-independent blur kernel of a low-resolution (LR) blurred image by minimizing the losses.