G10L21/0356

SYSTEMS AND METHODS FOR PROCESSING AUDIO BASED ON CHANGES IN ACTIVE SPEAKER
20210225391 · 2021-07-22 · ·

System and methods for processing audio signals are disclosed. In one implementation, a system may comprise a wearable camera configured to capture images from an environment of a user; a microphone; and a processor. The processor may be configured to receive an audio signal representative of sounds captured by the microphone during a time period; and receive the images captured by the wearable camera. The processor may process the audio signal in a first mode based on audio data accumulated in a buffer prior to the time period; detect a change in the active speaker from the first individual to a second individual; and cease processing in the first mode and process the audio signal in a second mode that differs from the first mode.

SYSTEMS AND METHODS FOR PROCESSING AUDIO BASED ON CHANGES IN ACTIVE SPEAKER
20210225391 · 2021-07-22 · ·

System and methods for processing audio signals are disclosed. In one implementation, a system may comprise a wearable camera configured to capture images from an environment of a user; a microphone; and a processor. The processor may be configured to receive an audio signal representative of sounds captured by the microphone during a time period; and receive the images captured by the wearable camera. The processor may process the audio signal in a first mode based on audio data accumulated in a buffer prior to the time period; detect a change in the active speaker from the first individual to a second individual; and cease processing in the first mode and process the audio signal in a second mode that differs from the first mode.

CONTINUOUS PITCH-CORRECTED VOCAL CAPTURE DEVICE COOPERATIVE WITH CONTENT SERVER FOR BACKING TRACK MIX

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices and as well as desktops, workstations, gaming stations, even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

CONTINUOUS PITCH-CORRECTED VOCAL CAPTURE DEVICE COOPERATIVE WITH CONTENT SERVER FOR BACKING TRACK MIX

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices and as well as desktops, workstations, gaming stations, even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

Method and Application for Synchronizing Audio Across a Plurality of Devices
20210125626 · 2021-04-29 ·

A method and associated software application (app) for synchronizing audio across a plurality of mobile devices such as smart phones. In some implementations, the method syncs all the smart phones together allowing users to use the headsets on the smart phones instead of having to use speakers. In some implementations, the application syncs the audio by first downloading the audio onto the smart phones and then syncing it across the smart phones by using in conjunction, the clock on the smart phone, the clock on a server and/or the time obtained from GPS satellites.

Method and Application for Synchronizing Audio Across a Plurality of Devices
20210125626 · 2021-04-29 ·

A method and associated software application (app) for synchronizing audio across a plurality of mobile devices such as smart phones. In some implementations, the method syncs all the smart phones together allowing users to use the headsets on the smart phones instead of having to use speakers. In some implementations, the application syncs the audio by first downloading the audio onto the smart phones and then syncing it across the smart phones by using in conjunction, the clock on the smart phone, the clock on a server and/or the time obtained from GPS satellites.

AUDIOVISUAL CONTENT RENDERING WITH DISPLAY ANIMATION SUGGESTIVE OF GEOLOCATION AT WHICH CONTENT WAS PREVIOUSLY RENDERED

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices and as well as desktops, workstations, gaming stations, even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

AUDIOVISUAL CONTENT RENDERING WITH DISPLAY ANIMATION SUGGESTIVE OF GEOLOCATION AT WHICH CONTENT WAS PREVIOUSLY RENDERED

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices and as well as desktops, workstations, gaming stations, even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

Device and method of performing automatic audio focusing on multiple objects
10917721 · 2021-02-09 · ·

The present disclosure relates to a device and method of providing automatic audio focusing, the method includes: registering objects of interest; capturing a video; displaying the video on a display; recognizing at least one object included in the video; inferring at least one object of interest included in the video from the recognized at least one object; identifying distribution of the at least one object of interest in the video; and performing audio focusing on the at least one object of interest by adjusting activity of each of multiple microphones included in a microphone array on the basis of the distribution of the at least one object of interest in the video, whereby it is possible to emphasize voice of the object of interest during the video capturing of the electronic device, thereby improving the satisfaction with the video capturing result.

AUDIO METADATA SMOOTHING

The disclosed computer-implemented method for smoothing audio gaps using adaptive metadata identifies an initial audio segment and a subsequent audio segment that follows the initial audio segment. The method accesses a first set of metadata that corresponds to a last audio frame of the initial audio segment and accesses a second set of metadata that corresponds to the first audio frame of the subsequent audio segment. The first and second sets of metadata include audio characteristic information for the two audio segments. The method then generates a new set of metadata that is based on both sets of audio characteristics. The method further inserts a new audio frame between the last audio frame of the initial audio segment and the first audio frame of the subsequent audio segment and applies the new set of metadata to the new audio frame. Various other methods, systems, and computer-readable media are also disclosed.