G10L21/055

System and method for the creation and playback of soundtrack-enhanced audiobooks

A synchronised soundtrack for an audiobook. The soundtrack has a soundtrack timeline having one or more audio regions that are configured for synchronised playback with corresponding narration regions in the audiobook playback timeline. Each audio region having a position along the soundtrack timeline that is dynamically adjustable to maintain synchronization of the audio regions of the soundtrack with their respective narration regions in the audiobook based on a narration speed variable indicative of the playback narration speed of the audiobook.

System and method for the creation and playback of soundtrack-enhanced audiobooks

A synchronised soundtrack for an audiobook. The soundtrack has a soundtrack timeline having one or more audio regions that are configured for synchronised playback with corresponding narration regions in the audiobook playback timeline. Each audio region having a position along the soundtrack timeline that is dynamically adjustable to maintain synchronization of the audio regions of the soundtrack with their respective narration regions in the audiobook based on a narration speed variable indicative of the playback narration speed of the audiobook.

SYSTEMS AND METHODS FOR INTELLIGENT PLAYBACK

Systems and methods for intelligent playback of media content may include an intelligent media playback system that, in response to determining the speech tempo in audio content by measuring syllable density of speech in the audio content, automatically adjusts a playback speed of the audio content as the audio content is being played based on the determined speech tempo. In some embodiments, the system may automatically and dynamically adjust the playback speed to result in a desired target speech tempo. In addition, the system may determine whether to automatically adjust playback speed of the audio content, as the media is being played, based on the detected speech tempo of the speech in the audio content and the determined type of content of media. Such automatic adjustments in playback speed result in more efficient playback of the audio content.

SYSTEMS AND METHODS FOR INTELLIGENT PLAYBACK

Systems and methods for intelligent playback of media content may include an intelligent media playback system that, in response to determining the speech tempo in audio content by measuring syllable density of speech in the audio content, automatically adjusts a playback speed of the audio content as the audio content is being played based on the determined speech tempo. In some embodiments, the system may automatically and dynamically adjust the playback speed to result in a desired target speech tempo. In addition, the system may determine whether to automatically adjust playback speed of the audio content, as the media is being played, based on the detected speech tempo of the speech in the audio content and the determined type of content of media. Such automatic adjustments in playback speed result in more efficient playback of the audio content.

Efficient audio description systems and methods

A computer system configured to generate an audio description of a media file is provided. The system includes a display, a memory, and a processor coupled to the display and the memory. The memory stores a media file, including video data that is accessible via a time index and audio data synchronized with the video data by the time index and a transcript of the audio data, including transcription data synchronized with the video data via the time index. The processor is configured to render, via the display, images from portions of the video data; render text from portions of the transcription data in synchrony with the images; receive input identifying a point within the time index; receive input specifying audio description data to associate with the point; store, in the memory, the audio description data; and store an association between the audio description data and the point.

Efficient audio description systems and methods

A computer system configured to generate an audio description of a media file is provided. The system includes a display, a memory, and a processor coupled to the display and the memory. The memory stores a media file, including video data that is accessible via a time index and audio data synchronized with the video data by the time index and a transcript of the audio data, including transcription data synchronized with the video data via the time index. The processor is configured to render, via the display, images from portions of the video data; render text from portions of the transcription data in synchrony with the images; receive input identifying a point within the time index; receive input specifying audio description data to associate with the point; store, in the memory, the audio description data; and store an association between the audio description data and the point.

Correcting clock drift via embedded sine waves

Features are disclosed for measuring and correcting clock drift and propagation delay in an audio system through one or more waveforms embedded in an audio signal. A first device in communication with a speaker may be configured to obtain an audio signal and insert one or more waveforms into the audio signal. For example, the waveforms may be inserted during an interval of time. A second device in communication with a microphone may be configured to detect sound as an audio input signal. The second device may obtain a spectral representation of the audio input signal and determine a rotation based on the spectral representation at the frequency of at least one of the inserted waveforms. Clock drift may be determined based on the rotation.

Correcting clock drift via embedded sine waves

Features are disclosed for measuring and correcting clock drift and propagation delay in an audio system through one or more waveforms embedded in an audio signal. A first device in communication with a speaker may be configured to obtain an audio signal and insert one or more waveforms into the audio signal. For example, the waveforms may be inserted during an interval of time. A second device in communication with a microphone may be configured to detect sound as an audio input signal. The second device may obtain a spectral representation of the audio input signal and determine a rotation based on the spectral representation at the frequency of at least one of the inserted waveforms. Clock drift may be determined based on the rotation.

Robust spectral encoding and decoding methods
09773504 · 2017-09-26 · ·

Spectral encoding methods are more robust when used with improved weak signal detection and synchronizations methods. Further robustness gains are achieved by using informed embedding, error correction and embedding protocols that enable signal to noise enhancements by folding and pre-filtering the received signal.

Robust spectral encoding and decoding methods
09773504 · 2017-09-26 · ·

Spectral encoding methods are more robust when used with improved weak signal detection and synchronizations methods. Further robustness gains are achieved by using informed embedding, error correction and embedding protocols that enable signal to noise enhancements by folding and pre-filtering the received signal.