G06T2207/20076

APPARATUS AND METHOD FOR CLASSIFYING CLOTHING ATTRIBUTES BASED ON DEEP LEARNING

Disclosed herein are an apparatus and method for classifying clothing attributes based on deep learning. The apparatus includes memory for storing at least one program and a processor for executing the program, wherein the program includes a first classification unit for outputting a first classification result for one or more attributes of clothing worn by a person included in an input image, a mask generation unit for outputting a mask tensor in which multiple mask layers respectively corresponding to principal part regions obtained by segmenting a body of the person included in the input image are stacked, a second classification unit for outputting a second classification result for the one or more attributes of the clothing by applying the mask tensor, and a final classification unit for determining and outputting a final classification result for the input image based on the first classification result and the second classification result.

INFORMATION PROCESSING APPARATUS, SENSING APPARATUS, MOBILE OBJECT, METHOD FOR PROCESSING INFORMATION, AND INFORMATION PROCESSING SYSTEM
20230048222 · 2023-02-16 · ·

An information processing apparatus includes an input interface, a processor, and an output interface. The input interface obtains observation data obtained from an observation space. The processor detects a subject image of a detection target from the observation data, calculates a plurality of individual indices indicating degrees of reliability, each of which relates to at least identification information or measurement information regarding the detection target, and also calculates an integrated index, which is obtained by integrating a plurality of calculated individual indices. The output interface outputs the integrated index.

LOOP CLOSURE DETECTION METHOD AND SYSTEM, MULTI-SENSOR FUSION SLAM SYSTEM, ROBOT, AND MEDIUM
20230045796 · 2023-02-16 ·

The present invention provides a loop closure detection method and system, a multi-sensor fusion SLAM system, a robot, and a medium. Said system runs on a mobile robot, and comprises a similarity detection unit, a visual pose solving unit, and a laser pose solving unit. According to the loop closure detection system, the multi-sensor fusion SLAM system and the robot provided in the present invention, the speed and accuracy of loop closure detection in cases of a change in a viewing angle of the robot, a change in the environmental brightness, a weak texture, etc. can be significantly improved.

AUTOMATED ASSESSMENT OF ENDOSCOPIC DISEASE

The application relates to devices and methods for analysing a colonoscopy video or a portion thereof, and for assessing the severity of ulcerative colitis in a subject by analysing a colonoscopy video obtained from the subject. Analysing a colonoscopy video comprises using a first deep neural network classifier to classify image data from the subject colonoscopy video or portion thereof into at least a first severity class (more severe endoscopic lesions) and a second severity class (less severe endoscopic lesions), wherein the first deep neural network has been trained at least in part in a weakly supervised manner using training image data from a plurality of training colonoscopy videos, the training image data comprising multiple sets of consecutive frames from the plurality of training colonoscopy videos, wherein frames in a set have the same severity class label. Devices and methods for providing a tool for analysing colonoscopy videos are also described.

DEVICE AND COMPUTER-IMPLEMENTED METHOD FOR OBJECT TRACKING
20230051014 · 2023-02-16 ·

A device and computer-implemented method for object tracking. The method comprises providing a sequence of digital images, determining a sequence of relational graph embeddings, wherein a first relational graph embedding of the sequence comprises a first object embedding representing a first object in a first digital image of the sequence of digital images, wherein the first relational graph embedding comprises a first relation embedding of a relation for the first object embedding, wherein the first relation embedding relates the first object embedding to embeddings representing other objects of the first digital image in the first relational graph embedding and to embeddings in a second relational graph embedding of the sequence that represent objects of a second digital image of the sequence of digital images.

MAP INFORMATION UPDATE METHOD, LANDMARK GENERATION METHOD, AND FEATURE POINT DISTRIBUTION ADJUSTMENT METHOD
20230046001 · 2023-02-16 ·

A map information update method includes: (a) obtaining map information; (b) obtaining landmark observed positions indicating positions of one or more landmarks in a captured image; (c) adding that includes (i) generating added map information by adding information pertaining to the landmark observed positions to the map information, and (ii) updating the map information obtained in (a) to the added map information; (d) predicting that includes (i) calculating predicted map information based on the map information updated in (c), by using a neural network inference engine that has been trained, and (ii) updating the map information to the predicted map information; and updating information that includes (i) calculating updated map information based on the map information updated in (d), by using a gradient method, and (ii) updating the map information to the updated map information.

SYSTEM AND METHOD FOR ADDITIVE MANUFACTURING CONTROL

An additive manufacturing apparatus, a computing system, and a method for operating an additive manufacturing apparatus are provided. The method includes obtaining two or more images corresponding to respective build layers at a build plate, wherein each image comprises a plurality of data points comprising a feature and corresponding location at the build plate; removing variation between the features of the plurality of data points; and normalizing each feature to remove location dependence in the plurality of data points.

AIRCRAFT DOOR CAMERA SYSTEM FOR DOCKING ALIGNMENT MONITORING
20230052176 · 2023-02-16 ·

A camera with a field of view toward an external environment of an aircraft is disposed within an aircraft door such that a ground surface is within the field of view of the camera during taxiing of the aircraft. A display device is disposed within an interior of the aircraft. A processor is operatively coupled to the camera and to the display device. The processor analyzes image data captured by the camera for docking guidance by identifying, within the captured image data, a region on the ground surface corresponding to an alignment fiducial indicating a parking location for the aircraft, determining, based on the region of the captured image data corresponding to the alignment fiducial indicating the parking location, a relative location of the aircraft with respect to the alignment fiducial, and outputting an indication of the relative location of the aircraft to the alignment fiducial.

NON-TRANSITORY COMPUTER READABLE MEDIUM AND METHOD FOR STYLE TRANSFER

According to one or more embodiments, a non-transitory computer readable medium storing a program which, when executed, causes a computer to perform processing comprising acquiring image data, applying style transfer to the image data a plurality of times based on one or more style images, and outputting data after the style transfer is applied.

DIGITAL TISSUE SEGMENTATION AND MAPPING WITH CONCURRENT SUBTYPING
20230050168 · 2023-02-16 ·

Accurate tissue segmentation is performed without a priori knowledge of tissue type or other extrinsic information not found within the subject image, and may be combined with classification analysis so that diseased tissue is not only delineated within an image but also characterized in terms of disease type. In various embodiments, a source image is decomposed into smaller overlapping subimages such as square or rectangular tiles. A predictor such as a convolutional neural network produces tile-level classifications that are aggregated to produce a tissue segmentation and, in some embodiments, to classify the source image or a subregion thereof.