Patent classifications
G10L19/018
Frictionless handoff of audio content playing using overlaid ultrasonic codes
In a frictionless handoff of audio content playing, a client device listens for ultrasonic audio. The client hears a playing of a modified audio content by another client device, which includes audio content and an ultrasonic audio quick response (QR) code overlaid on the audio content. The ultrasonic audio QR code includes location information corresponding to a location in the audio content. The client device extracts the ultrasonic audio QR code from the modified audio content. After determining that the playing of the modified audio content has stopped, the client device receives a command to resume playing of the audio content on the client device. In, response to the command, the client device retrieves location information in a last extracted ultrasonic audio QR code and plays the audio content starting at a location in the audio content corresponding to the location information in the last extracted ultrasonic audio QR code.
Frictionless handoff of audio content playing using overlaid ultrasonic codes
In a frictionless handoff of audio content playing, a client device listens for ultrasonic audio. The client hears a playing of a modified audio content by another client device, which includes audio content and an ultrasonic audio quick response (QR) code overlaid on the audio content. The ultrasonic audio QR code includes location information corresponding to a location in the audio content. The client device extracts the ultrasonic audio QR code from the modified audio content. After determining that the playing of the modified audio content has stopped, the client device receives a command to resume playing of the audio content on the client device. In, response to the command, the client device retrieves location information in a last extracted ultrasonic audio QR code and plays the audio content starting at a location in the audio content corresponding to the location information in the last extracted ultrasonic audio QR code.
SPEECH SYNTHESIZER, AUDIO WATERMARKING INFORMATION DETECTION APPARATUS, SPEECH SYNTHESIZING METHOD, AUDIO WATERMARKING INFORMATION DETECTION METHOD, AND COMPUTER PROGRAM PRODUCT
According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
SPEECH SYNTHESIZER, AUDIO WATERMARKING INFORMATION DETECTION APPARATUS, SPEECH SYNTHESIZING METHOD, AUDIO WATERMARKING INFORMATION DETECTION METHOD, AND COMPUTER PROGRAM PRODUCT
According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
AUDIO ENCODER AND DECODER WITH DYNAMIC RANGE COMPRESSION METADATA
An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
AUDIO ENCODER AND DECODER WITH DYNAMIC RANGE COMPRESSION METADATA
An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
ACOUSTIC SECURITY CODE TRANSMISSION
A system and methods of computer security are provided wherein a first mobile computing device drives an acoustic transducer to emit an acoustic signal encoding a time-based, one time password (TOTP) code, and a second mobile computing device measures output of a MEMS gyroscope that senses the emitted acoustic signal. The second mobile computing device decodes the TOTP code from the gyroscope output, validates the TOTP code and responsively permits a user to access a secure application.
ACOUSTIC SECURITY CODE TRANSMISSION
A system and methods of computer security are provided wherein a first mobile computing device drives an acoustic transducer to emit an acoustic signal encoding a time-based, one time password (TOTP) code, and a second mobile computing device measures output of a MEMS gyroscope that senses the emitted acoustic signal. The second mobile computing device decodes the TOTP code from the gyroscope output, validates the TOTP code and responsively permits a user to access a secure application.
Electronic device and control method therefor
An electronic apparatus is provided. The electronic apparatus includes a communication interface and a processor configured to, based on receiving a first audio signal from a first sensor device from among a plurality of sensor devices through the communication interface, perform an operation corresponding to the first audio signal, and control the first sensor device to output a second audio signal indicating a result of performing the operation corresponding to the first audio signal, and based on receiving a third audio signal from a second sensor device from among the plurality of sensor devices within a threshold time from a time of controlling the first sensor device through the communication interface, determine whether to process the third audio signal based on the second audio signal and the third audio signal. In particular, at least part of a method of determining whether to process the third audio signal may use an artificial intelligence model that is trained according to at least one of a neural network or a deep learning algorithm.
Electronic device and control method therefor
An electronic apparatus is provided. The electronic apparatus includes a communication interface and a processor configured to, based on receiving a first audio signal from a first sensor device from among a plurality of sensor devices through the communication interface, perform an operation corresponding to the first audio signal, and control the first sensor device to output a second audio signal indicating a result of performing the operation corresponding to the first audio signal, and based on receiving a third audio signal from a second sensor device from among the plurality of sensor devices within a threshold time from a time of controlling the first sensor device through the communication interface, determine whether to process the third audio signal based on the second audio signal and the third audio signal. In particular, at least part of a method of determining whether to process the third audio signal may use an artificial intelligence model that is trained according to at least one of a neural network or a deep learning algorithm.