G06V10/7715

BOUNDARY ESTIMATION
20230215190 · 2023-07-06 ·

Certain aspects are directed to an apparatus for lane estimation. The apparatus generally includes: at least one memory; and at least one processor coupled to the at least one memory and configured to receive a first input associated with a three-dimensional (3D) space, extract, from the first input, a first set of points associated with a ground plane of the 3D space, map each of the first set of points to a region of a plurality of regions of a two-dimensional (2D) frame, determine one or more attributes associated with each region of the plurality of regions based on one or more of the first set of points mapped to the region, and identify one or more road lanes based on the one or more attributes.

EFFICIENT NEURAL-NETWORK-BASED PROCESSING OF VISUAL CONTENT
20230215157 · 2023-07-06 ·

Certain aspects of the present disclosure provide techniques and apparatus for efficient processing of visual content using machine learning models. An example method generally includes generating, from an input, an embedding tensor for the input. The embedding tensor for the input is projected into a reduced-dimensional space projection of the embedding tensor based on a projection matrix. An attention value for the input is derived based on the reduced-dimensional space projection of the embedding tensor and a non-linear attention function. A match, in the reduced-dimensional space, is identified between a portion of the input and a corresponding portion of a target against which the input is evaluated based on the attention value for the input. One or more actions are taken based on identifying the match.

Method, device, and computer program product for error evaluation

Embodiments of the present disclosure provide a method, device, and computer program product for error evaluation. A method for error evaluation comprises in accordance with a determination that an error occurs in a data protection system, obtaining context information related to an operation of the data protection system; determining, based on the context information and using a trained deep learning model, a type of the error in the data protection system from a plurality of predetermined types, the deep learning model being trained based on training context information and a label on a ground-truth type of an error associated with the training context information; and providing the determined type of the error in the data protection system. In this way, it is possible to achieve automatic classification of errors in the data protection system, thereby improving the efficiency in error classification and saving the operation costs. Therefore, more rapid and more accurate measures can be taken to handle the errors.

Lane detection and tracking techniques for imaging systems

A method for tracking a lane on a road is presented. The method comprises receiving, by one or more processors from an imaging system, a set of pixels associated with lane markings. The method further includes generating, by the one or more processors, a predicted spline comprising (i) a first spline and (ii) a predicted extension of the first spline in a direction in which the imaging system is moving. The first spline describes a boundary of a lane and is generated based on the set of pixels. The predicted extension of the first spline is generated based at least in part on a curvature of at least a portion of the first spline.

Systems and methods for real-time complex character animations and interactivity

Systems, methods, and non-transitory computer-readable media can identify a virtual character being presented to a user within a real-time immersive environment. A first animation to be applied to the virtual character is determined. A nonverbal communication animation to be applied to the virtual character simultaneously with the first animation is determined. The virtual character is animated in real-time based on the first animation and the nonverbal communication animation.

Signal translation system and signal translation method

A signal translating method may include, according to one aspect of the present application, receiving a source signal of a first domain; identifying erroneous features and effective features from the source signal; translating the source signal of the first domain into a first virtual signal of a second domain, the first virtual signal is that in which erroneous features included in the source signal has been removed; and outputting the first virtual signal. Therefore, the virtual signal of the second domain in which the erroneous features removed may be output.

Systems and methods for skyline prediction for cyber-physical photovoltaic array control

Various embodiments of a cyber-physical system for providing cloud prediction for photovoltaic array control are disclosed herein.

Dividing pattern determination device capable of reducing amount of computation, dividing pattern determination method, learning device, learning method, and storage medium
11695928 · 2023-07-04 · ·

A dividing pattern determination device capable of reducing the amount of computation performed when determining a dividing pattern of an image. An image for which a dividing pattern is expressed by a hierarchical structure for each predetermined area is input to a feature extraction section, and the feature extraction section generates, based on the input image, for the predetermined area, a hierarchy map in which a value indicative of a block size is associated with each of a plurality of blocks in the predetermined area. A determination section determines a dividing pattern of the image based on the generated hierarchy map.

Advanced driver assist system and method of detecting object in the same

ADAS includes a processing circuit and a memory which stores instructions executable by the processing circuit. The processing circuit executes the instructions to cause the ADAS to receive, from a vehicle that is in motion, a video sequence, generate a position image including at least one object included in the stereo image, generate a second position information associated with the at least one object based on reflected signals received from the vehicle, determine regions each including at least a portion of the at least one object as candidate bounding boxes based on the stereo image and the position image, and selectively adjusting class scores of respective ones of the candidate bounding boxes associated with the at least one object based on whether a respective first position information of the respective ones of the candidate bounding boxes matches the second position information.

Learning model architecture for image data semantic segmentation

A learning model may provide a hierarchy of convolutional layers configured to perform convolutions upon image features, each layer other than a topmost layer convoluting the image features at a lower resolution to a higher layer, and each layer other than a bottommost layer returning the image features to a lower layer. Each layer fuses the lower resolution image features received from a higher layer with same resolution image features convoluted at the layer, so as to combine large-scale and small-scale features of images. Layers of the hierarchy may be substantially equal to a number of lateral convolutions at a bottommost convolutional layer. The bottommost convolutional layer ultimately passes the fused features to an attention mapping module, which utilizes two attention mapping pathways in combination to detect non-local dependencies and interactions between large-scale and small-scale features of images without de-emphasizing local interactions.