G10L21/043

AUDIO DRIVEN ACCELERATED BINGE WATCH
20170309296 · 2017-10-26 ·

Example embodiments provide systems and methods for accelerating digital content playback based on speech. A content acceleration system electronically accesses digital content. The system analyzes the digital content to detect at least one audio portion within the digital content, each of the at least one audio portion comprising speech. The system creates at least one digital content segment from the digital content based on the at least one audio portion, whereby a beginning of each digital content segment of the at least one digital content segment coincides with a beginning of a corresponding audio portion of the at least one audio portion. The system then accelerates playback of the digital content by fast forwarding through parts of the at least one digital content segment where speech is absent.
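
The segmentation and fast-forward steps described above can be illustrated with a minimal sketch. It assumes speech detection has already produced sorted, non-overlapping (start, end) intervals in seconds; the names `Segment`, `make_segments`, `playback_time`, and the `skip_rate` parameter are hypothetical and do not come from the abstract. Content before the first speech portion is simply left out of the segments, which is also an assumption.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    start: float       # coincides with the start of a speech portion
    speech_end: float  # end of the speech inside this segment
    end: float         # start of the next speech portion, or end of content


def make_segments(speech: List[Tuple[float, float]], duration: float) -> List[Segment]:
    """Create one segment per speech portion; each segment begins where speech begins."""
    segments = []
    for i, (start, speech_end) in enumerate(speech):
        # A segment runs until the next speech portion begins (or the content ends).
        end = speech[i + 1][0] if i + 1 < len(speech) else duration
        segments.append(Segment(start, speech_end, end))
    return segments


def playback_time(segments: List[Segment], skip_rate: float = 8.0) -> float:
    """Wall-clock time when speech plays at 1x and speech-free parts at skip_rate."""
    total = 0.0
    for seg in segments:
        total += seg.speech_end - seg.start                # speech at normal speed
        total += (seg.end - seg.speech_end) / skip_rate    # fast forward the rest
    return total


segments = make_segments([(10.0, 20.0), (50.0, 60.0)], duration=100.0)
total_seconds = playback_time(segments, skip_rate=8.0)
```

With these example values, 90 seconds of segmented content plays back in under 30 seconds, because only the 20 seconds of speech run at normal speed.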

EVALUATING SCREEN CONTENT FOR ACCESSIBILITY

In one example, a method for evaluating screen content for accessibility with a screen reader device is disclosed. The method provides a baseline document including a script of expected screen content that conforms to accessibility requirements. The method may generate an audio file based on screen content elements. For some implementations, the method uses a machine learning model to transcribe the audio file into an output transcription file. The method may determine whether the output transcription file matches the baseline document and generate a corresponding output report.
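
The final comparison step might look like the following sketch. It assumes the baseline script and the transcription are already available as plain-text strings; the audio generation and machine learning transcription stages are out of scope here. The helper names `normalize` and `compare_to_baseline`, the case-insensitive line-by-line matching rule, and the report layout are all illustrative assumptions, not details from the abstract.

```python
import difflib
from typing import Dict, List


def normalize(text: str) -> List[str]:
    """Lowercase and strip each non-empty line so trivial differences are ignored."""
    return [line.strip().lower() for line in text.splitlines() if line.strip()]


def compare_to_baseline(baseline: str, transcription: str) -> Dict[str, object]:
    """Return a small report: whether the texts match, plus a unified diff."""
    expected = normalize(baseline)
    actual = normalize(transcription)
    diff = list(
        difflib.unified_diff(expected, actual, "baseline", "transcription", lineterm="")
    )
    return {"matches": expected == actual, "diff": diff}


baseline = "Submit button\nEmail address, edit text\n"
good = "submit button\nemail address, edit text\n"
bad = "submit button\nunlabeled, edit text\n"

report_ok = compare_to_baseline(baseline, good)
report_bad = compare_to_baseline(baseline, bad)
```

A real evaluation would likely need a more tolerant matching rule than exact line equality, since transcription models introduce small wording and punctuation differences.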

Techniques for decreasing echo and transmission periods for audio communication sessions

A computer-implemented technique can include establishing an audio communication session between first and second computing devices and obtaining, by the first computing device, an audio input signal using audio data captured by a microphone. The first computing device can analyze the audio input signal to detect a speech input by its first user and can determine a duration of a detection period from when the audio input signal was obtained until the analyzing has completed. The first computing device can then transmit, to the second computing device, (i) a portion of the audio input signal beginning at a start of the speech input and (ii) the detection period duration, wherein receipt of the portion of the audio input signal and the detection period duration causes the second computing device to accelerate playback of the portion of the audio input signal to compensate for the detection period duration.
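
The abstract does not say how the receiving device chooses its accelerated rate. One simple model, offered here only as an assumption, is that the receiver starts out lagging by the detection period and plays at a constant rate above 1x, so the lag shrinks by (rate - 1) seconds per second of wall-clock time. The function names below are hypothetical.

```python
def catch_up_time(detection_delay_s: float, rate: float) -> float:
    """Wall-clock seconds needed to absorb the delay when playing at `rate` (> 1)."""
    if rate <= 1.0:
        raise ValueError("rate must exceed 1.0 to make up any delay")
    # Each second of accelerated playback recovers (rate - 1) seconds of lag.
    return detection_delay_s / (rate - 1.0)


def rate_to_catch_up(detection_delay_s: float, window_s: float) -> float:
    """Constant playback rate that absorbs the delay within `window_s` seconds."""
    if window_s <= 0:
        raise ValueError("window_s must be positive")
    return 1.0 + detection_delay_s / window_s


seconds_needed = catch_up_time(0.3, rate=1.25)       # 300 ms detection period
rate_needed = rate_to_catch_up(0.5, window_s=2.0)    # absorb 500 ms in 2 s
```

Under this model a modest speed-up, which listeners are less likely to notice, trades off against a longer catch-up window.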

Determining a Playback Rate of Media for a Requester
20170238026 · 2017-08-17 ·

A method, a system, and a computer program product provide media to a requester at a particular playback rate associated with the requester. The method includes receiving a request from a requester for a playback session of media that includes time-varying content. In response to receiving the request, a profile associated with the requester is accessed to determine a playback rate of the media for the requester. Once the playback rate has been determined, the media is provided to the requester at that rate. The method further includes monitoring the playback session for playback changes by the requester and dynamically adapting the playback rate associated with the requester based on the type and frequency of those changes.
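
A minimal sketch of the profile-based adaptation follows. The event names (`speed_up`, `skip_forward`, `slow_down`, `rewind`), the threshold, the step size, and the rate limits are all assumptions chosen for illustration; the abstract only states that the type and frequency of playback changes drive the adaptation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Profile:
    rate: float = 1.0                                 # stored playback rate for the requester
    events: List[str] = field(default_factory=list)   # playback changes seen so far


def record_change(profile: Profile, kind: str) -> None:
    """Log one playback change made by the requester during a session."""
    profile.events.append(kind)


def adapt_rate(profile: Profile, threshold: int = 3, step: float = 0.25,
               lo: float = 0.5, hi: float = 3.0) -> float:
    """Nudge the stored rate when one type of change clearly dominates."""
    faster = profile.events.count("speed_up") + profile.events.count("skip_forward")
    slower = profile.events.count("slow_down") + profile.events.count("rewind")
    if faster - slower >= threshold:
        profile.rate = min(hi, profile.rate + step)
    elif slower - faster >= threshold:
        profile.rate = max(lo, profile.rate - step)
    else:
        return profile.rate        # not enough evidence yet; keep collecting events
    profile.events.clear()         # start a fresh observation window
    return profile.rate


profile = Profile()
for _ in range(3):
    record_change(profile, "speed_up")
new_rate = adapt_rate(profile)
```

The next playback request from the same requester would then start at the stored `profile.rate` instead of the default.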

Playback apparatus, setting apparatus, playback method, and program
09728201 · 2017-08-08 ·

A playback apparatus includes: an acquiring unit that acquires auditory language data including data to be played back as a spoken voice; an analyzing unit that analyzes the auditory language data to output an analysis result; a setting unit that sets at least a portion of the auditory language data to a control portion to be played back at a set playback speed, based on the analysis result; and a voice playback unit that plays back the control portion as a spoken voice at the set playback speed.
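
The analyzing and setting units can be pictured with a small sketch. It assumes the auditory language data is plain text destined for speech synthesis, and it uses a stand-in analysis rule (sentences containing digits are marked as control portions and slowed down). That rule, the speed values, and the names `Portion` and `set_control_portions` are purely illustrative; the abstract does not specify what the analysis looks for.

```python
import re
from typing import List, NamedTuple


class Portion(NamedTuple):
    text: str
    speed: float  # playback speed to use when this portion is spoken


def set_control_portions(text: str, slow_speed: float = 0.8,
                         normal_speed: float = 1.0) -> List[Portion]:
    """Split text into sentences and assign a slower speed to those with digits."""
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    portions = []
    for sentence in sentences:
        is_control = re.search(r"\d", sentence) is not None  # stand-in analysis result
        portions.append(Portion(sentence, slow_speed if is_control else normal_speed))
    return portions


portions = set_control_portions("Hello there. Call 555 0199 now.")
```

A voice playback unit would then iterate over `portions` and synthesize each one at its assigned speed.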

SYSTEMS AND METHODS FOR INTELLIGENT PLAYBACK

Systems and methods for intelligent playback of media content may include an intelligent media playback system that, in response to determining the speech tempo in audio content by measuring syllable density of speech in the audio content, automatically adjusts a playback speed of the audio content as the audio content is being played based on the determined speech tempo. In some embodiments, the system may automatically and dynamically adjust the playback speed to result in a desired target speech tempo. In addition, the system may determine whether to automatically adjust playback speed of the audio content, as the media is being played, based on the detected speech tempo of the speech in the audio content and the determined type of content of media. Such automatic adjustments in playback speed result in more efficient playback of the audio content.
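
As a rough sketch of the tempo-based adjustment, the playback speed can be taken as the ratio of a target tempo to the measured tempo, where tempo is syllables per second. The syllable count is assumed to come from some upstream detector. The target tempo, the clamping limits, and the set of content types eligible for adjustment are assumptions made for illustration, as is the function name.

```python
def playback_speed(syllables: int, seconds: float, content_type: str = "podcast",
                   target_tempo: float = 5.0, lo: float = 0.75, hi: float = 2.0,
                   adjustable_types: tuple = ("podcast", "lecture")) -> float:
    """Return the speed that brings the measured speech tempo to the target tempo."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Leave some content types (e.g. music) and speech-free windows untouched.
    if content_type not in adjustable_types or syllables == 0:
        return 1.0
    tempo = syllables / seconds             # syllable density, in syllables per second
    speed = target_tempo / tempo            # >1 speeds up slow talkers, <1 slows fast ones
    return max(lo, min(hi, speed))          # clamp to keep speech intelligible


slow_talker = playback_speed(40, 10.0)                       # 4 syllables/s
fast_talker = playback_speed(100, 10.0)                      # 10 syllables/s
music = playback_speed(40, 10.0, content_type="music")
```

Calling this on a sliding window of recent audio would give the dynamic, during-playback adjustment the abstract describes.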
