Patent classifications
G11B20/00
Artificial intelligence-based cross-language speech transcription method and apparatus, device and readable medium using Fbank40 acoustic feature format
An artificial intelligence-based cross-language speech transcription method and apparatus, a device and a readable medium. The method includes pre-processing to-be-transcribed speech data to obtain multiple acoustic features, the to-be-transcribed speech data being represented in a first language; predicting a corresponding translation text after transcription of the speech data according to the multiple acoustic features and a pre-trained cross-language transcription model; wherein the translation text is represented in a second language which is different from the first language. According to the technical solution, cross-language speech transcription does not need to perform speech recognition first and then machine translation; instead, cross-language transcription is performed directly according to the pre-trained cross-language transcription model. The technical solution can overcome the problem of error accumulation in the two-step cross-language transcription manner in the prior art, and can effectively improve accuracy and efficiency of the cross-language speech transcription as compared with the prior art.
CONCURRENT SECURE COMMUNICATION GENERATION
A recording of an audio stream is initiated. The audio stream is a part of a communication between two or more participants. A first indication related to the audio stream is received. The first indication is that the audio stream should start being altered. A second indication related to the audio stream is received. The second indication is that the audio stream should stop being altered. A portion of the recorded audio stream between the first indication and the second indication is altered.
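The altering step described above can be sketched as follows. This is a minimal illustration, assuming the recording is a list of integer samples, the two indications have already been mapped to sample indices, and "altering" means muting; the function name `alter_between` and all of these choices are hypothetical and not taken from the abstract.

```python
def alter_between(samples, start, stop, alter=lambda sample: 0):
    """Return a copy of `samples` with indices in [start, stop) altered.

    `start` and `stop` stand in for the first and second indications;
    the default `alter` mutes each sample (an assumption for illustration).
    """
    if not 0 <= start <= stop <= len(samples):
        raise ValueError("indications must fall within the recording")
    altered = [alter(sample) for sample in samples[start:stop]]
    return samples[:start] + altered + samples[stop:]


recording = [3, 5, -2, 7, 1, 4]
# First indication at sample 2, second indication at sample 4.
redacted = alter_between(recording, 2, 4)
```

Only the samples between the two indications change; the rest of the recording is returned untouched.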
Authenticating and presenting video evidence
A method for automatically authenticating unknown video data based on known video data stored at a client server is provided, wherein, unknown and known video data each are made up of segments and include metadata, a hash message digest, and a serial code. The method involves selecting a first segment of the unknown video and locating the serial code within the first segment of the unknown video data. The serial code is used to locate a corresponding first segment in the known video data. The first segment may include a known hash message digest. A new hash message digest for the first segment of the unknown video data is generated and compared with the known hash message digest. If they match, the segment of unknown video data is authentic.
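The per-segment check described above can be sketched as follows. This is a minimal illustration, assuming SHA-256 as the hash function, a segment represented as a dictionary with `serial` and `data` keys, and the known digests held in a dictionary keyed by serial code; none of these details are specified in the abstract.

```python
import hashlib
import hmac


def digest(data: bytes) -> str:
    """Hash message digest of one segment (SHA-256 is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def authenticate_segment(unknown_segment: dict, known_digests: dict) -> bool:
    """Look up the known digest by serial code and compare it with a fresh one."""
    known = known_digests.get(unknown_segment["serial"])
    if known is None:
        return False  # no corresponding segment in the known video data
    fresh = digest(unknown_segment["data"])
    return hmac.compare_digest(fresh, known)


# Hypothetical known data: serial code -> known hash message digest.
known_digests = {"SEG-001": digest(b"original frame bytes")}

authentic = authenticate_segment(
    {"serial": "SEG-001", "data": b"original frame bytes"}, known_digests)
tampered = authenticate_segment(
    {"serial": "SEG-001", "data": b"edited frame bytes"}, known_digests)
```

A matching digest marks the segment as authentic; any change to the segment bytes produces a different digest and fails the comparison.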
Enhanced content tracking system and method
The invention, as shown by the system in FIG. 2, relates to a client-side content tracking system of media files, e.g. digital music files. Audio tracking (or indeed multimedia tracking) is shifted to a client-side perspective, with the client tasked with establishing use of a selected source audio track by tracking, and then reporting uplink to the server, at least one of: entry and exit points associated with playing of at least one of said musical sections in the identified source audio track, and how the identified source audio track was used, performed or manipulated at the client device. Server functionality is designed, having regard to the reported tracking data and its link to a unique identifier to permit the media file (e.g. source audio track) to be selected and/or identified, to store or relay (possibly in the context of a subscription service and billing regime for content use) tracking data related to use of at least a portion of the source audio track at or by the client device. In the context of audio, reporting of use at a client device can, in turn, cause streaming of related multi-media content from a third-party database to the client device. For music, reporting of entry and end points into and out of sections of complete audio tracks can coincide with musically seamless audio transitions between sections.
Selective sharing of body data
Information from a position and/or gesture detection system can be transmitted to various devices in order to enable users to interact with and/or view other users. In some embodiments, video is captured that includes a current view of the body of a user. In order to prevent an unauthorized, unintended, or undesired transmission of at least part of the body image data, one or more settings or policies can be specified that can control which portions are transmitted, received, and/or displayed. For example, a user can be prompted before body image or position data is transmitted, which enables a user to control the type of data that is sent. A recipient or intermediate entity or component can also specify one or more settings or policies to control the type of data that is transmitted and/or received. In some embodiments, an external service can be utilized to manage the transmission of data.
Multimedia Distribution System for Multimedia Files with Interleaved Media Chunks of Varying Types
A multimedia file and methods of generating, distributing and using the multimedia file are described. Multimedia files in accordance with embodiments of the present invention can contain multiple video tracks, multiple audio tracks, multiple subtitle tracks, data that can be used to generate a menu interface to access the contents of the file and meta data concerning the contents of the file. Multimedia files in accordance with several embodiments of the present invention also include references to video tracks, audio tracks, subtitle tracks and meta data external to the file. One embodiment of a multimedia file in accordance with the present invention includes a series of encoded video frames and encoded menu information.
Detecting media watermarks in magnetic field data
Methods, apparatus, systems and articles of manufacture (e.g., physical storage media) to detect media watermarks in magnetic field data are disclosed herein. Example media monitoring apparatus disclosed herein include means for transforming magnitude values of magnetic field data to a frequency domain to determine transformed magnetic field data, the magnetic field data associated with a first sampling rate, the magnetic field data obtained from a sensor. Disclosed example media monitoring apparatus also include means for detecting an audio watermark in a portion of the transformed magnetic field data associated with a first frequency, the audio watermark encoded in an audio signal, the audio watermark to have a frequency component associated with a second frequency different from the first frequency, the first frequency to be aliased relative to the second frequency based on the first sampling rate.
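The aliasing relationship between the two frequencies can be illustrated with a short calculation. This is a sketch of the standard frequency-folding rule for sampled signals, not the detection method of the abstract; the function name and the example numbers are hypothetical.

```python
def aliased_frequency(signal_hz, sample_rate_hz):
    """Frequency at which a tone appears after sampling at `sample_rate_hz`.

    A component above half the sampling rate folds back into the range
    [0, sample_rate_hz / 2].
    """
    folded = signal_hz % sample_rate_hz
    return min(folded, sample_rate_hz - folded)


# Hypothetical numbers: a watermark component at 1300 Hz (the "second
# frequency") observed through a sensor sampled at 1000 Hz.
first_frequency = aliased_frequency(1300, 1000)
```

With these example values the component would be looked for at 300 Hz in the transformed data, since 1300 Hz lies above the 500 Hz limit of a 1000 Hz sampling rate.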
Content individualization
Content individualization, including: encrypting a first part of a source data set using a first key creating a first encrypted data set; encrypting a second part of the source data set using a second key creating a second encrypted data set; encrypting the second part of the source data set using a third key creating a third encrypted data set; and combining the first encrypted data set, the second encrypted data set, and the third encrypted data set to form a final encrypted data set. Key words include watermarking and content individualization.
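The four steps above can be sketched as follows. This is a minimal illustration only: `toy_encrypt` is a placeholder XOR stream built from SHA-256 and is not a secure cipher, and the split point, key values, and concatenation used for "combining" are all assumptions that the abstract does not specify.

```python
import hashlib


def toy_encrypt(data: bytes, key: bytes) -> bytes:
    """Placeholder cipher (NOT secure): XOR with a SHA-256 derived stream.

    Applying it twice with the same key recovers the input.
    """
    stream = bytearray()
    counter = 0
    while len(stream) < len(data):
        stream.extend(hashlib.sha256(key + counter.to_bytes(8, "big")).digest())
        counter += 1
    return bytes(b ^ s for b, s in zip(data, stream))


def individualize(source: bytes, split: int, key1: bytes, key2: bytes, key3: bytes) -> bytes:
    """Encrypt part one once and part two twice, then combine the results."""
    first, second = source[:split], source[split:]
    first_encrypted = toy_encrypt(first, key1)
    second_encrypted = toy_encrypt(second, key2)
    third_encrypted = toy_encrypt(second, key3)  # same part, different key
    return first_encrypted + second_encrypted + third_encrypted


source = b"common-part|unique-part"
split = len(b"common-part|")
final = individualize(source, split, b"k1", b"k2", b"k3")
```

Because the second part is carried twice under different keys, a holder of the first key plus either the second or the third key can recover the full source, which is one way the same final data set could serve differently keyed recipients.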
Encryption Method, Decryption Method, Encryption System and Decryption System
An encryption method includes an operation method of an encryption system and is a method of encrypting encryption target information.
Audio encoding using video information
Various audio encoders and methods of using the same are disclosed. In one aspect, an apparatus is provided that includes an audio encoder and an audio encoder mode selector. The audio encoder mode selector is operable to analyze video data and adjust an encoding mode of the audio encoder based on the analyzed video data.
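One way such a mode selector might work is sketched below. The abstract does not say what is analyzed in the video data or which modes exist, so everything here is an assumption: frames are lists of pixel intensities, the analysis is the mean absolute difference between consecutive frames, and the mode names and threshold are hypothetical.

```python
def motion_score(frames):
    """Mean absolute pixel difference between consecutive frames."""
    total, count = 0, 0
    for previous, current in zip(frames, frames[1:]):
        for a, b in zip(previous, current):
            total += abs(a - b)
            count += 1
    return total / count if count else 0.0


def select_audio_mode(frames, threshold=20.0):
    """Pick a (hypothetical) audio encoding mode from the analyzed video."""
    return "high_bitrate" if motion_score(frames) > threshold else "low_bitrate"


static_scene = [[10, 10], [10, 10], [10, 10]]
busy_scene = [[0, 0], [100, 100], [0, 0]]

static_mode = select_audio_mode(static_scene)
busy_mode = select_audio_mode(busy_scene)
```

In this sketch a static scene selects the lower-cost mode and a high-motion scene selects the higher-quality mode; a real selector could use any video-derived signal in place of the motion score.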