G06F16/784

NFT INVENTORY PRODUCTION

Methods and processes for manufacture of an image product from a digital image. An object in the digital image is detected and recognized. Object metadata is assigned to the object, the object metadata linking sound to the object in the digital image which produced the sound. At least one cryptographic hash of the object metadata is generated, and the hash is written to a node of a transaction processing network.

Cognitive video and audio search aggregation

A method, computer program product, and a system where a processor(s) obtains a video from a user, via a client, and segments the video into temporal shots that comprise a timeline of the video. The processor(s) cognitively analyze the video, by applying an image recognition algorithm to identify image entities in each temporal shot of the video and by applying a data structure comprising a user profile of the user to the temporal shots, to identity personal entities in each temporal shot of the video. The program code generates a search index for the video, utilizing the user entities (image entities and personal entities), where each entry of the search index is a given user entity and a linkage to a given temporal shot and the linkage indicates a location of the given user entity in the timeline of the video.

METHOD AND SYSTEM FOR CHARACTERISTIC-BASED VIDEO PROCESSING

A method and apparatus for characteristic-based video processing include: in response to receiving a region of a picture of a video sequence, determining a characteristic in the region, the region being independent of other regions of the picture for video coding; determining a class associated with the region based on the characteristic, the class being selected from a plurality of classes; and encoding the region using a parameter set associated with the class, the parameter set being selected from a plurality of parameter sets for video coding at different quality levels.

Automatically detecting contents expressing emotions from a video and enriching an image index

The present disclosure provides method, apparatus and system for detecting contents expressing emotions from a video. The method may comprise: dividing the video into a plurality of clips; extracting, from a first clip and at least one second clip of the plurality of clips, features associated with the first clip; determining whether the first clip expresses emotions based on the features associated with the first clip; and building an index containing the first clip based on the features associated with the first clip if the first clip expresses emotions.

VIDEO RANKING METHOD, AND APPARATUS
20230259555 · 2023-08-17 ·

A method and apparatus for ranking video for ease of redaction is provided herein. During operation, a video ranking apparatus will determine a plurality of faces and/or objects that need redaction from the plurality of videos and analyze the plurality of videos to determine unique identifiers within the plurality of videos. The video ranking apparatus will then determine a number of unique identifiers for each video and rank the plurality of videos based on the number of unique identifiers for each video.

AUTOMATIC LOCALIZATION OF ACCELERATION IN EDGE COMPUTING ENVIRONMENTS

Methods, apparatus, systems and machine-readable storage media of an edge computing device which is enabled to access and select the use of local or remote acceleration resources for edge computing processing is disclosed. In an example, an edge computing device obtains first telemetry information that indicates availability of local acceleration circuitry to execute a function, and obtains second telemetry that indicates availability of a remote acceleration function to execute the function. An estimated time (and cost or other identifiable or estimateable considerations) to execute the function at the respective location is identified. The use of the local acceleration circuitry or the remote acceleration resource is selected based on the estimated time and other appropriate factors in relation to a service level agreement.

Processing content based on natural language queries

Disclosed are systems and methods for summarizing content or preparing missed portions of content based on natural language queries. A natural language query can be received. One or more portions of summarized or missed content can be determined based on the natural language query, and transmitted to a user device.

System and method for algorithmic editing of video content

A computer implemented method for algorithmically editing digital video content is disclosed. A video file containing source video is processed to extract metadata. Label taxonomies are applied to extracted metadata. The labelled metadata is processed to identify higher-level labels. Identified higher-level labels are stored as additional metadata associated with the video file. A clip generating algorithm applies the stored metadata for selectively editing the source video to generate a plurality of different candidate video clips. Responsive to determining a clip presentation trigger on a viewer device, a clip selection algorithm is implemented that applies engagement data and metadata for the candidate video clips to select one of the stored candidate video clips. The engagement data is representative of one or more engagement metrics recorded for at least one of the stored candidate video clips. The selected video clip is presented to one or more viewers via corresponding viewer devices.

Processing content based on natural language queries

Disclosed are systems and methods for summarizing content or preparing missed portions of content based on natural language queries. A natural language query can be received. One or more portions of summarized or missed content can be determined based on the natural language query, and transmitted to a user device.

Content entity recognition within digital video data for dynamic content generation

Techniques for selectively associating frames with content entities and using such associations to dynamically generate web content related to the content entities. One embodiment performs a facial recognition analysis on frames of one or more instances of video content to identify a plurality of frames that each depict a first content entity. A measure of quality and a measure of confidence that the frame contains the depiction of the first content entity are determined for each of the identified plurality of frames. Embodiments select one or more frames from the identified plurality of frames, based on the measures of quality and the measures of confidence. The selected one or more frames are associated with the first content entity and web content associated with the first content entity is generated that includes a depiction of the selected one or more frames in association with an instance of video content.