Patent classifications
G06V10/00
Image coding method, image decoding method, image coding apparatus, image decoding apparatus, and image coding and decoding apparatus
A dependency indication is signaled at the beginning of a packet, that is, in the vicinity of a slice header to be parsed or of a parameter set. This is achieved, for example, by including the dependency indication at the beginning of the slice header, preferably after a syntax element identifying the parameter set and before the slice address; by including the dependency indication before the slice address; by providing the dependency indication in a NALU header using a separate message; or by using a special NALU type for NALUs carrying dependent slices.
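To illustrate why the position of the indication matters, here is a minimal sketch of a parser that reads a dependency flag placed right after the parameter-set identifier and before the slice address. The field widths (6, 1, and 9 bits) and the names `BitReader`, `peek_dependency`, and `parse_slice_header` are assumptions made for this example; the abstract does not give the actual syntax, and real codecs use variable-length codes for some of these fields.

```python
class BitReader:
    """Reads big-endian bit fields from a bytes object."""

    def __init__(self, data):
        self.data = data
        self.pos = 0  # current position, counted in bits

    def read(self, nbits):
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


# Hypothetical fixed-width layout, chosen only for illustration.
PPS_ID_BITS = 6
SLICE_ADDRESS_BITS = 9


def peek_dependency(header):
    """Return (pps_id, dependent) after reading only the first 7 bits."""
    reader = BitReader(header)
    pps_id = reader.read(PPS_ID_BITS)
    dependent = bool(reader.read(1))
    return pps_id, dependent


def parse_slice_header(header):
    """Parse the full hypothetical header, including the slice address."""
    reader = BitReader(header)
    pps_id = reader.read(PPS_ID_BITS)
    dependent = bool(reader.read(1))
    slice_address = reader.read(SLICE_ADDRESS_BITS)
    return {"pps_id": pps_id, "dependent": dependent,
            "slice_address": slice_address}


# pps_id = 5, dependent = 1, slice_address = 300 packed into two bytes.
example_header = bytes([0x17, 0x2C])
print(peek_dependency(example_header))
print(parse_slice_header(example_header))
```

Because the flag sits near the start of the header, a network element or decoder can decide how to treat the slice from `peek_dependency` alone, without parsing the remaining fields.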
Detection and classification of audio events in gaming systems
A system that incorporates the subject disclosure may include, for example, a gaming system that cooperates with a graphical user interface to enable user modification and enhancement of one or more audio streams associated with the gaming system. In embodiments, the audio streams may include a game audio stream, a chat audio stream of conversation among players of a video game, and a microphone audio stream of a player of the video game. Additional embodiments are disclosed.
Method and system for document classification and text information extraction
Variation in received document types, and in the templates used for each document type, poses a challenge in developing a generic background noise removal approach for automatic text information extraction. Embodiments herein provide a method and a system for document classification and text information extraction. A time-efficient and accurate text detection engine-based Region of Interest (ROI) technique is provided to identify the text region, followed by a multi-layered neural network based architecture for enhanced classification accuracy to identify the type of document. A multistage image pre-processing approach is provided for efficient and accurate background noise removal from the classified document, which includes unsupervised clustering, identification, segmentation, masking, contour approximation, selective subtraction, and dynamic thresholding. Further, text information extraction is applied to the pre-processed image to determine the correlation between text information blocks and to associate values with the labels of the attributes to be extracted, yielding the final text.
Method and device for supporting an advanced driver assistance system in a motor vehicle
The invention relates to a method for supporting an advanced driver assistance system in a motor vehicle, comprising the following steps: providing a map, wherein categorized objects are stored in associated positions in the map, capturing environment data using at least one environment sensor system of the advanced driver assistance system, analyzing the captured environment data using an analysis apparatus of the advanced driver assistance system, wherein the captured environment data are analyzed for object recognition according to the categorized objects stored in the map. Furthermore, the invention relates to an associated device.
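One way to picture map-supported object recognition is sketched below: categories stored in the map near the vehicle position are looked up, and classifier scores for those categories are weighted up. The map layout, the `boost` factor, and the function names `expected_categories` and `classify_with_map` are hypothetical; the abstract does not specify how the stored categories enter the analysis.

```python
import math

# Hypothetical map: each entry is (x, y, category) in a shared map frame.
MAP_OBJECTS = [
    (10.0, 5.0, "traffic_sign"),
    (12.0, -3.0, "guard_rail"),
    (250.0, 40.0, "traffic_light"),
]


def expected_categories(position, radius, map_objects):
    """Collect the categories stored in the map within `radius` of `position`."""
    px, py = position
    return {
        category
        for x, y, category in map_objects
        if math.hypot(x - px, y - py) <= radius
    }


def classify_with_map(scores, expected, boost=2.0):
    """Pick the best category after up-weighting those the map expects nearby.

    `scores` maps category names to raw classifier scores; `boost` is an
    illustrative weighting factor, not a value taken from the source.
    """
    weighted = {
        category: score * (boost if category in expected else 1.0)
        for category, score in scores.items()
    }
    return max(weighted, key=weighted.get)


nearby = expected_categories((0.0, 0.0), 50.0, MAP_OBJECTS)
raw_scores = {"traffic_sign": 0.4, "traffic_light": 0.5}
print(classify_with_map(raw_scores, nearby))
```

In this sketch the raw classifier slightly prefers `traffic_light`, but the map lists only a traffic sign within range, so the weighted decision favors `traffic_sign`.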
Information processing apparatus and method
The present disclosure relates to an information processing apparatus and a method that allow for suppression of a decrease in encoding efficiency. When comparing voxel data resulting from quantization of point cloud data, a correlation degree of a distribution pattern between the voxel data to be compared is calculated on the basis of a subjective feature of the distribution pattern of values of the voxel data. The present disclosure is applicable, for example, to an information processing apparatus, an image processing apparatus, an electronic device, an information processing method, a program, or the like.
Method and apparatus for extracting information, device and storage medium
Embodiments of the present disclosure disclose a method and apparatus for extracting information, a device and a storage medium, and relate to the field of image processing technology. The method may include: acquiring a location template corresponding to a category of a target document image; determining key point locations on the target document image; generating a transformation matrix based on the key point locations on the target document image and key point locations on the location template; determining locations of information corresponding to the target document image, based on locations of information on the location template and the transformation matrix; and extracting information at the locations of information corresponding to the target document image to obtain information in the target document image.
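The transformation step can be sketched as follows, assuming an affine transform estimated from exactly three non-collinear key point pairs and mapping template coordinates to target image coordinates. The abstract does not say which kind of transformation matrix is used, so the affine model and the names `affine_from_keypoints` and `apply_transform` are assumptions for illustration only.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve3(a, b):
    """Solve the 3x3 linear system a * x = b with Cramer's rule."""
    d = det3(a)
    if abs(d) < 1e-12:
        raise ValueError("key points are collinear; transform is undefined")
    solution = []
    for col in range(3):
        m = [row[:] for row in a]
        for i in range(3):
            m[i][col] = b[i]
        solution.append(det3(m) / d)
    return solution


def affine_from_keypoints(template_pts, target_pts):
    """Estimate the 3x3 affine matrix mapping template points to target points."""
    a = [[x, y, 1.0] for x, y in template_pts]
    row_x = solve3(a, [p[0] for p in target_pts])
    row_y = solve3(a, [p[1] for p in target_pts])
    return [row_x, row_y, [0.0, 0.0, 1.0]]


def apply_transform(t, point):
    """Map a template location into the target document image."""
    x, y = point
    return (t[0][0] * x + t[0][1] * y + t[0][2],
            t[1][0] * x + t[1][1] * y + t[1][2])


# Target image is the template scaled by 2 and shifted by (10, 20).
template_keypoints = [(0.0, 0.0), (100.0, 0.0), (0.0, 50.0)]
target_keypoints = [(10.0, 20.0), (210.0, 20.0), (10.0, 120.0)]
matrix = affine_from_keypoints(template_keypoints, target_keypoints)

# Hypothetical location of a field on the template.
print(apply_transform(matrix, (30.0, 40.0)))
```

With more than three key point pairs, or with perspective distortion, a least-squares fit or a homography would be the more usual choice; the three-point affine case is used here only because it keeps the arithmetic visible.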
Identifying an attribute of an electromagnetic source configuration by matching simulated and patient data
Systems are provided for generating data representing electromagnetic states of a heart for medical, scientific, research, and/or engineering purposes. The systems generate the data based on source configurations such as dimensions of, and scar or fibrosis or pro-arrhythmic substrate location within, a heart and a computational model of the electromagnetic output of the heart. The systems may dynamically generate the source configurations to provide representative source configurations that may be found in a population. For each source configuration of the electromagnetic source, the systems run a simulation of the functioning of the heart to generate modeled electromagnetic output (e.g., an electromagnetic mesh for each simulation step with a voltage at each point of the electromagnetic mesh) for that source configuration. The systems may generate a cardiogram for each source configuration from the modeled electromagnetic output of that source configuration for use in predicting the source location of an arrhythmia.
Spatio-temporal graph for video captioning with knowledge distillation
A method for scene perception using video captioning based on a spatio-temporal graph model is described. The method includes decomposing the spatio-temporal graph model of a scene in an input video into a spatial graph and a temporal graph. The method also includes modeling a two-branch framework having an object branch and a scene branch according to the spatial graph and the temporal graph to learn object interactions between the object branch and the scene branch. The method further includes transferring the learned object interactions from the object branch to the scene branch as privileged information. The method also includes captioning the scene by aligning language logits from the object branch and the scene branch according to the learned object interactions.
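A common way to align logits in knowledge distillation is to minimize the Kullback-Leibler divergence between the softmax distributions of the two branches. The sketch below assumes that formulation, with the object branch acting as teacher and the scene branch as student; the abstract does not state which loss is used, and the function names and the `temperature` parameter are illustrative.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, subtracting the max for stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def alignment_loss(object_logits, scene_logits, temperature=1.0):
    """Distillation-style loss pulling scene-branch logits toward the object branch."""
    teacher = softmax(object_logits, temperature)
    student = softmax(scene_logits, temperature)
    return kl_divergence(teacher, student)


# Toy logits over a four-word vocabulary for a single caption position.
object_branch = [2.0, 0.5, -1.0, 0.1]
scene_branch = [1.0, 1.0, -0.5, 0.3]
print(alignment_loss(object_branch, scene_branch))
```

The loss is zero when both branches produce identical logits and grows as their predicted word distributions diverge, which is the behavior a training objective for alignment needs.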