G10L19/018

Methods and apparatus for supplementing partially readable and/or inaccurate codes in media

Methods and apparatus are disclosed for supplementing partially readable and/or inaccurate codes. An example apparatus includes a watermark analyzer to select a first watermark and a second watermark decoded from media; a comparator to compare a first decoded timestamp of the first watermark to a second decoded timestamp of the second watermark; and a timestamp adjuster to adjust the second decoded timestamp based on the first decoded timestamp of the first watermark when at least a threshold number of symbols of the second decoded timestamp match corresponding symbols of the first decoded timestamp.
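The threshold-match rule in this abstract can be illustrated with a minimal sketch. It assumes that a decoded timestamp is a fixed-length sequence of integer symbols, that an unreadable symbol is represented as `None`, and that the adjustment simply copies symbols from the first timestamp; the abstract does not specify any of these details, and the function names are hypothetical.

```python
from typing import List, Optional, Sequence

# Assumption: None marks a symbol that could not be decoded.
Symbol = Optional[int]


def count_matching_symbols(first: Sequence[Symbol], second: Sequence[Symbol]) -> int:
    """Count positions where both timestamps carry the same readable symbol."""
    return sum(1 for a, b in zip(first, second) if a is not None and a == b)


def adjust_timestamp(first: Sequence[Symbol],
                     second: Sequence[Symbol],
                     threshold: int) -> List[Symbol]:
    """Return the second timestamp, adjusted from the first when enough symbols agree."""
    if len(first) != len(second):
        raise ValueError("timestamps must have the same number of symbols")
    if count_matching_symbols(first, second) >= threshold:
        # Simplest possible adjustment (an assumption): take the first
        # timestamp's symbol wherever the second one differs or is unreadable.
        return [a if a != b else b for a, b in zip(first, second)]
    # Too few matching symbols: leave the second timestamp untouched.
    return list(second)
```

For example, with a threshold of 3, a second timestamp `[1, 2, None, 4]` would be completed from a first timestamp `[1, 2, 3, 4]`, while `[9, 9, None, 4]` would be left as decoded because only one symbol matches.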

Transmission of promotional information during a promotional campaign to user electronic equipments
11501329 · 2022-11-15

Provided is a system that transmits a notification to user electronic equipment associated with a user, based on user location information. The notification includes information representative of a sound-based promotional campaign for a product at a location known to the user. The system also transmits location-specific promotional information, associated with the sound-based promotional campaign and based on user information, to a merchant electronic apparatus associated with the known location.

Audio fingerprinting
11495238 · 2022-11-08

A machine may be configured to generate one or more audio fingerprints of one or more segments of audio data. The machine may access audio data to be fingerprinted and divide the audio data into segments. For any given segment, the machine may generate a spectral representation from the segment; generate a vector from the spectral representation; generate an ordered set of permutations of the vector; generate an ordered set of numbers from the permutations of the vector; and generate a fingerprint of the segment of the audio data, which may be considered a sub-fingerprint of the audio data. In addition, the machine or a separate device may be configured to determine a likelihood that candidate audio data matches reference audio data.
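The per-segment steps listed in this abstract can be sketched roughly as follows. The concrete choices are assumptions made for illustration, not details from the abstract: the spectral representation is a naive DFT magnitude spectrum, the vector is a binary vector marking bins above the median magnitude, each number is the index of the first 1 in a permuted vector (a min-hash style reading), and the match likelihood is approximated by the fraction of agreeing numbers. All function names are hypothetical.

```python
import cmath
import random
from statistics import median


def magnitude_spectrum(segment):
    """Spectral representation (assumption): naive DFT magnitudes of the segment."""
    n = len(segment)
    return [abs(sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, x in enumerate(segment)))
            for k in range(n // 2)]


def binary_vector(spectrum):
    """Vector (assumption): 1 where a bin exceeds the median magnitude, else 0."""
    cutoff = median(spectrum)
    return [1 if m > cutoff else 0 for m in spectrum]


def make_permutations(length, count, seed=0):
    """Ordered set of index permutations, fixed by a seed so fingerprints are comparable."""
    rng = random.Random(seed)
    perms = []
    for _ in range(count):
        p = list(range(length))
        rng.shuffle(p)
        perms.append(p)
    return perms


def sub_fingerprint(segment, perms):
    """One number per permutation: position of the first 1 in the permuted vector."""
    vec = binary_vector(magnitude_spectrum(segment))
    numbers = []
    for p in perms:
        permuted = [vec[i] for i in p]
        numbers.append(permuted.index(1) if 1 in permuted else len(permuted))
    return tuple(numbers)


def fingerprint(audio, segment_len, perms):
    """Divide the audio into non-overlapping segments and fingerprint each one."""
    return [sub_fingerprint(audio[s:s + segment_len], perms)
            for s in range(0, len(audio) - segment_len + 1, segment_len)]


def match_likelihood(candidate, reference):
    """Crude likelihood proxy (assumption): fraction of agreeing numbers."""
    pairs = list(zip(candidate, reference))
    total = sum(len(c) for c, _ in pairs)
    agree = sum(a == b for c, r in pairs for a, b in zip(c, r))
    return agree / total if total else 0.0
```

Each permutation must have as many entries as the binary vector, which is half the segment length in this sketch, so 16-sample segments would pair with `make_permutations(8, count)`.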

TEXT-TO-SPEECH PROCESSING USING INPUT VOICE CHARACTERISTIC DATA

During text-to-speech processing, a speech model creates synthesized speech that corresponds to input data. The speech model may include an encoder for encoding the input data into a context vector and a decoder for decoding the context vector into spectrogram data. The speech model may further include a voice decoder that receives vocal characteristic data representing a desired vocal characteristic of synthesized speech. The voice decoder may process the vocal characteristic data to determine configuration data, such as weights, for use by the speech decoder.
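The data flow described here (encoder to context vector, voice decoder to configuration data, speech decoder to spectrogram data) can be shown with a toy sketch. Everything below is an illustrative assumption rather than the disclosed model: the encoder is a character hash, the voice decoder maps the vocal characteristic data to a small weight matrix, and the speech decoder repeatedly applies those weights to produce frames. The function names are hypothetical.

```python
from typing import List


def encode(text: str, dim: int = 4) -> List[float]:
    """Toy encoder: hash characters into a fixed-size context vector."""
    context = [0.0] * dim
    for pos, ch in enumerate(text):
        context[(ord(ch) + pos) % dim] += 1.0
    n = max(len(text), 1)
    return [c / n for c in context]


def voice_decoder(vocal: List[float], dim: int = 4) -> List[List[float]]:
    """Toy voice decoder: turn vocal characteristic data into weights.

    The weights are an identity matrix perturbed by the vocal data, so an
    all-zero vocal vector leaves the speech decoder unchanged.
    """
    return [[(1.0 if r == c else 0.0) + 0.1 * vocal[(r + c) % len(vocal)]
             for c in range(dim)]
            for r in range(dim)]


def speech_decoder(context: List[float],
                   weights: List[List[float]],
                   frames: int = 3) -> List[List[float]]:
    """Toy speech decoder: emit frames by repeatedly applying the weights."""
    spectrogram = []
    state = list(context)
    for _ in range(frames):
        state = [sum(w * s for w, s in zip(row, state)) for row in weights]
        spectrogram.append(state)
    return spectrogram


def synthesize(text: str, vocal: List[float], frames: int = 3) -> List[List[float]]:
    """Encoder -> context vector; voice decoder -> weights; speech decoder -> frames."""
    return speech_decoder(encode(text), voice_decoder(vocal), frames)
```

The point of the sketch is only that the same input text yields different output frames when the vocal characteristic data changes, because that data determines the weights the speech decoder uses.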

Methods and apparatus for audio signature generation and matching

A method for generating a signature is disclosed. As part of the method, a first number of bits that are determined to be the strongest bits in the row are identified in respective rows of an audio signature matrix; bits of the audio signature matrix are replaced with respective cells having values depending on whether the respective bits are included in the first number of bits; a set of uniformly distributed numbers is generated within a range of numbered locations corresponding to cells of the audio signature matrix; and numerical distances are determined, from respective numbers in the set of uniformly distributed numbers, to the numbered locations of the matrix associated with nearest occurrences of a first value. A set of integers is generated based on the distances.
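One possible reading of these steps is sketched below. The assumptions are loud ones: bit strength is taken to be the absolute value of an underlying real number, the replacement cells hold 1 for a strongest bit and 0 otherwise, cells are numbered row by row starting at 0, the "uniformly distributed numbers" are evenly spaced locations (a seeded random draw would be another reading), and the first value is 1. The function names are hypothetical.

```python
def mark_strongest(strengths, k):
    """Replace each row with cells: 1 for its k strongest entries, 0 elsewhere."""
    marked = []
    for row in strengths:
        ranked = sorted(range(len(row)), key=lambda i: abs(row[i]), reverse=True)
        keep = set(ranked[:k])
        marked.append([1 if i in keep else 0 for i in range(len(row))])
    return marked


def evenly_spaced_locations(total, count):
    """Evenly spaced cell numbers in the range 0 .. total - 1 (assumption)."""
    return [(i * total) // count for i in range(count)]


def distances_to_nearest_one(cells, probes):
    """Distance from each probe location to the nearest cell holding a 1."""
    ones = [i for i, v in enumerate(cells) if v == 1]
    if not ones:
        raise ValueError("no cell holds the first value")
    return [min(abs(p - o) for o in ones) for p in probes]


def signature(strengths, k, count):
    """Set of integers derived from the distances described in the abstract."""
    marked = mark_strongest(strengths, k)
    # Number the cells row by row by flattening the matrix.
    cells = [v for row in marked for v in row]
    probes = evenly_spaced_locations(len(cells), count)
    return distances_to_nearest_one(cells, probes)


example = signature([[0.1, -0.9, 0.3, 0.2],
                     [0.8, 0.05, -0.1, 0.7]], k=1, count=4)
# example == [1, 1, 0, 2]
```

In the example, the marked cells are `[0, 1, 0, 0, 1, 0, 0, 0]`, the probe locations are 0, 2, 4 and 6, and the distances to the nearest 1 are 1, 1, 0 and 2.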

INTEGRATING OVERLAID DIGITAL CONTENT INTO DATA VIA PROCESSING CIRCUITRY USING AN AUDIO BUFFER

The present disclosure is related to a method including: receiving, by processing circuitry, data transmitted over a communication network during a voice call; accessing an audio buffer of the processing circuitry; analyzing, in the audio buffer of the processing circuitry, audio data associated with the transmitted data; based on the analyzed audio data in the audio buffer, identifying an audio reference patch that includes a unique identifier associated with an available area in which secondary digital content located at a remote device is insertable in displayed data that is being displayed by the processing circuitry during the voice call; after identifying the audio reference patch, retrieving the secondary digital content from the remote device based on the unique identifier; and after retrieving the secondary digital content from the remote device, overlaying the secondary digital content into the displayed data during the voice call.
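The sequence of steps in this abstract can be outlined in a short sketch. The abstract does not say how the audio reference patch is encoded, so the byte marker, the 4-byte identifier, the dictionary standing in for the remote device, and the dictionary standing in for the displayed data are all hypothetical placeholders chosen only to make the flow concrete.

```python
from typing import Dict, Optional, Tuple

# Hypothetical encoding: a marker followed by a 4-byte unique identifier.
MARKER = b"\xA5PATCH"
ID_LENGTH = 4


def find_reference_patch(audio_buffer: bytes) -> Optional[str]:
    """Analyze the audio buffer and return the patch identifier, if one is present."""
    start = audio_buffer.find(MARKER)
    if start < 0:
        return None
    id_start = start + len(MARKER)
    ident = audio_buffer[id_start:id_start + ID_LENGTH]
    if len(ident) < ID_LENGTH:
        return None  # buffer ended before the full identifier arrived
    return ident.hex()


def overlay(displayed: Dict[str, str], area: str, content: str) -> Dict[str, str]:
    """Return a copy of the displayed data with the content placed in the area."""
    updated = dict(displayed)
    updated[area] = content
    return updated


def process_call_audio(audio_buffer: bytes,
                       displayed: Dict[str, str],
                       remote_store: Dict[str, Tuple[str, str]]) -> Dict[str, str]:
    """Identify the patch, retrieve the secondary content, then overlay it."""
    ident = find_reference_patch(audio_buffer)
    if ident is None or ident not in remote_store:
        return displayed
    # remote_store stands in for the remote device: id -> (area, content).
    area, content = remote_store[ident]
    return overlay(displayed, area, content)
```

The ordering mirrors the abstract: retrieval happens only after a patch is identified, and the overlay happens only after retrieval succeeds.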
