Patent classifications
G06V20/43
METHOD AND APPARATUS FOR GENERATING COMMENTARY
Embodiments of the present disclosure provide a method and apparatus for generating a commentary. The method may include: acquiring at least one news cluster composed of pieces of news generated within a first preset time length, the pieces of news in the news cluster direct to a given news event; determining a target news cluster based on the at least one news cluster; determining, for each piece of news in the target news cluster, a score of being suitable for generating a commentary for the piece of news; and generating, based on a piece of target news, a commentary for the target news cluster, where the piece of target news is a piece of news having a highest score of being suitable for generating a commentary in the target news cluster.
METHOD AND APPARATUS FOR GENERATING INFORMATION
Embodiments of the present disclosure provide a method and apparatus for generating information. The method may include: determining at least one video segment obtained by semantically segmenting videos included in a target news cluster as a target video set, where respective pieces of news in the target news cluster directs to a given news event; determining a commentary for the target news cluster; determining, based on the target video set and a target image set, a candidate material resource set corresponding to the commentary, where the target image set is composed of respective images included in the target news cluster; and for each paragraph in the commentary, determining degrees of matching between the paragraph and candidate material resources in the candidate material resource set.
METHOD AND APPARATUS FOR GENERATING VIDEO
Embodiments of the present disclosure provide a method and apparatus for generating a video. The method may include: determining a commentary of a target news cluster, each piece of news in the target news cluster being specific to a given news event; generating a voice corresponding to each paragraph in the commentary using a speech synthesis technology; determining a candidate material resource set corresponding to the commentary based on a video and an image included in the target news cluster, the candidate material resource being a video or image; determining a candidate material resource sequence corresponding to the each paragraph in the commentary; and generating a video corresponding to the commentary based on the voice corresponding to the each paragraph in the commentary and the candidate material resource sequence.
Method and apparatus for adjusting parameter
A method and apparatus for adjusting a parameter are provided. The method may include: acquiring a current value of at least one parameter which is in a process of generating a video corresponding to a commentary of the news cluster based on a news cluster; determining a video evaluation score of the video which is generated based on the news cluster and according to the current value of the at least one parameter; performing feature extraction on the current value of the at least one parameter to obtain a feature representation; inputting the feature representation and the determined video evaluation score into a pre-trained evaluation network to obtain a predicted video evaluation score; inputting the feature representation and the predicted video evaluation score into a pre-trained operation network, to obtain current operation information; and adjusting the current value of the at least one parameter based on the current operation information.
LEARNING REPRESENTATIONS OF GENERALIZED CROSS-MODAL ENTAILMENT TASKS
A method is provided for determining entailment between an input premise and an input hypothesis of different modalities. The method includes extracting features from the input hypothesis and an entirety of and regions of interest in the input premise. The method further includes deriving intra-modal relevant information while suppressing intra-modal irrelevant information, based on intra-modal interactions between elementary ones of the features of the input hypothesis and between elementary ones of the features of the input premise. The method also includes attaching cross-modal relevant information to the features from the input premise to the features from the input hypothesis to form a cross-modal representation, based on cross-modal interactions between pairs of different elementary features from different modalities. The method additionally includes classifying a relationship between the input premise and the input hypothesis using a label selected from the group consisting of entailment, neutral, and contradiction based on the cross-modal representation.
ELECTRONIC APPARATUS AND METHOD FOR CONTROLLING THE ELECTRONIC APPARATUS
An electronic apparatus and a method for controlling the same are disclosed. The method for controlling an electronic apparatus includes acquiring multimedia content including a plurality of image frames, acquiring information related to the multimedia content, selecting at least one image frame including an object related to the acquired information among objects included in the plurality of image frames, generating description information for the at least one selected image frame based on the acquired information, and acquiring description information for the multimedia content based on the generated description information. Thus, the electronic apparatus may generate description information for more elaborate scene analysis regarding multimedia content.
Event argument extraction method, event argument extraction apparatus and electronic device
An event argument extraction (EAE) method, an EAE apparatus and an electronic device, relates to the technical field of knowledge graphs. A specific implementation scheme includes acquiring a to-be-extracted event content; and performing argument extraction on the to-be-extracted event content based on a trained EAE model, to obtain a target argument of the to-be-extracted event content; where the trained EAE model is obtained by training a pre-trained model with event news annotation data and a weight of each argument annotated in the event news annotation data.
Training machine learning models with training data
In one embodiment, a method is provided. The method includes storing a set of training data. One or more machine learning models are trained based on the set of training data. The method also includes receiving, from a computing device, a request to access the set of training data. The method further includes determining whether the computing device is allowed to access the set of training data. In response to determining that the computing device is allowed to access the set of training data, the method includes transmitting a training token to the computing device. The training token grants a training environment with access to the set of training data for a period of time.
IMAGE DISPLAY DEVICE AND OPERATING METHOD OF THE SAME
An image display device including a display configured to display a first image is provided. The image display device includes a controller configured to generate a second image by enlarging a part of the first image displayed in a first region of the display and to control the display to display a part of the second image in the first region, and a sensor configured to sense a user input for moving the second image. In response to the user input, the controller is configured to control the display to move and display the second image, within the first region.