G10L19/018

Identifier

A computer device (100), configured to encode identifiers by providing audio identifiers therefrom, is described. The computer device (100) is configured to provide a set of audio signals as respective bitstreams. Each audio signal of the set of audio signals is defined based, at least in part, on audio signal information including at least one of a type, a fundamental frequency, a time signature and a time. Each audio signal comprises a set of audio segments. Each audio segment of the set of audio segments is defined based, at least in part, on audio segment information including at least one of a frequency, an amplitude, a transform, a time duration and an envelope. The computer device (100) is configured to receive an identifier and select a subset of audio signals from the set of audio signals according to the received identifier based, at least in part, on the audio signal information and/or the audio segment information. The computer device (100) is configured to process the audio selected subset of audio signals by combining the selected subset of audio signals to provide an audio identifier. The computer device (100) is configured to output the audio identifier in an output audio signal as an output bitstream, wherein the audio identifier encodes the identifier. Also described is a method of encoding identifiers by providing audio identifiers therefrom.

Identifier

A computer device (100), configured to encode identifiers by providing audio identifiers therefrom, is described. The computer device (100) is configured to provide a set of audio signals as respective bitstreams. Each audio signal of the set of audio signals is defined based, at least in part, on audio signal information including at least one of a type, a fundamental frequency, a time signature and a time. Each audio signal comprises a set of audio segments. Each audio segment of the set of audio segments is defined based, at least in part, on audio segment information including at least one of a frequency, an amplitude, a transform, a time duration and an envelope. The computer device (100) is configured to receive an identifier and select a subset of audio signals from the set of audio signals according to the received identifier based, at least in part, on the audio signal information and/or the audio segment information. The computer device (100) is configured to process the audio selected subset of audio signals by combining the selected subset of audio signals to provide an audio identifier. The computer device (100) is configured to output the audio identifier in an output audio signal as an output bitstream, wherein the audio identifier encodes the identifier. Also described is a method of encoding identifiers by providing audio identifiers therefrom.

User authentication over an audio channel using a mobile device

A system is described for authenticating a user on a client device using the user's mobile device and utilizing the audio channel. An authentication server receives a request from the client to initiate a session for the user, creates the session, and sends a session token back to the client along with a request for authentication. The client broadcasts an audio transmission containing the token to the mobile device over an audio channel using data-over-sound transmission. The mobile device receives the transmission via a microphone, obtains the token and the server identity from the transmission, and sends user credentials that are stored on the mobile device along with the token identifying the session directly to the authentication server. The server verifies the received credentials, confirms the token, and logs the user into the session.

INFORMATION EXCHANGE ON MOBILE DEVICES USING AUDIO
20230005491 · 2023-01-05 ·

In some implementations, a user device may receive input that triggers transmission of information via sound. The user device may select an audio clip based on a setting associated with the device, and may modify a digital representation of the selected audio clip using an encoding algorithm and based on data associated with a user of the device. The user device may transmit, to a remote server, an indication of the selected audio clip, an indication of the encoding algorithm, and the data associated with the user. The user device may use a speaker to play audio, based on the modified digital representation, for recording by other devices. Accordingly, the user device may receive, from the remote server and based on the speaker playing the audio, a confirmation that users associated with the other devices have performed an action based on the data associated with the user of the device.

Systems and methods for embedding data in media content

An electronic device modifies a first media content item by superimposing a first set of data over a first accented musical event. The first accented musical event has a first audio profile. The first set of data has a second audio profile configured to be masked by the first audio profile during playback of the first media content item. The electronic device transmits, to a second electronic device, the modified first media content item.

USING AUDIO WATERMARKS TO IDENTIFY CO-LOCATED TERMINALS IN A MULTI-TERMINAL SESSION

An audio playing method is performed by a first terminal participating in a group communication session. The method includes obtaining first audio data of the group communication session, and adding an audio watermark to the first audio data to obtain second audio data. The audio watermark includes on a session identifier of the group communication session and a device identifier of the first terminal. The method also includes playing the second audio data.

USING AUDIO WATERMARKS TO IDENTIFY CO-LOCATED TERMINALS IN A MULTI-TERMINAL SESSION

An audio playing method is performed by a first terminal participating in a group communication session. The method includes obtaining first audio data of the group communication session, and adding an audio watermark to the first audio data to obtain second audio data. The audio watermark includes on a session identifier of the group communication session and a device identifier of the first terminal. The method also includes playing the second audio data.

METHODS AND APPARATUS FOR PANELIST-BASED LOGINS USING VOICE COMMANDS
20220415331 · 2022-12-29 ·

Methods and apparatus for panelist-based logins using voice commands are disclosed herein. A disclosed example apparatus for identifying a user as a member of an audience includes a memory and at least one processor to execute machine readable instructions to at least access audio emitted by a speech recognition system, the audio generated based on a request spoken by the user, identify at least one of a watermark or a fingerprint included in the audio, the watermark or fingerprint including identifying information to identify a user and an indication of the presence of the user, and record the indication of the presence of the user in an audience.

METHODS AND APPARATUS FOR PANELIST-BASED LOGINS USING VOICE COMMANDS
20220415331 · 2022-12-29 ·

Methods and apparatus for panelist-based logins using voice commands are disclosed herein. A disclosed example apparatus for identifying a user as a member of an audience includes a memory and at least one processor to execute machine readable instructions to at least access audio emitted by a speech recognition system, the audio generated based on a request spoken by the user, identify at least one of a watermark or a fingerprint included in the audio, the watermark or fingerprint including identifying information to identify a user and an indication of the presence of the user, and record the indication of the presence of the user in an audience.

Generation and detection of watermark for real-time voice conversion
11538485 · 2022-12-27 · ·

A method watermarks speech data by using a generator to generate speech data including a watermark. The generator is trained to generate the speech data including the watermark. The training process generates first speech from the generator. The first speech data is configured to represent speech. The first speech data includes a candidate watermark. The training also produces an inconsistency message as a function of at least one difference between the first speech data and at least authentic speech data. The training further includes transforming the first speech data, including the candidate watermark, using a watermark robustness module to produce transformed speech data including a transformed candidate watermark. The transformed speech data includes a transformed candidate watermark. The training further produces a watermark-detectability message, using a watermark detection machine learning system, relating to one or more desirable watermark features of the transformed candidate watermark.