G10L25/84

System and method for real-time synchronization of media content via multiple devices and speaker systems
11553236 · 2023-01-10 ·

A method and system for real-time customizing and synchronizing media by a client device in communication with a server device. A client device customizes stock media content based on user preferences, and synchronizes the customized content for playback with a server-side playback of the stock media content.

System and method for real-time synchronization of media content via multiple devices and speaker systems
11553236 · 2023-01-10 ·

A method and system for real-time customizing and synchronizing media by a client device in communication with a server device. A client device customizes stock media content based on user preferences, and synchronizes the customized content for playback with a server-side playback of the stock media content.

ELECTRONIC DEVICE FOR CONTROLLING BEAMFORMING AND OPERATING METHOD THEREOF

An electronic device is provided. The electronic device includes, for the purpose of determining a customized beamformer filter, an input module including a plurality of microphones configured to receive an external sound signal, a memory configured to store computer-executable instructions and an initial value of a voice parameter used to perform beamforming on the external sound signal, and a processor configured to execute the instructions by accessing the memory. The instructions may be configured to estimate a feature value of the external sound signal, calculate the initial value of the voice parameter used to perform beamforming based on the external sound signal received by the plurality of microphones, determine whether to store the calculated initial value according to the feature value, determine which one of the calculated initial value or an initial value stored in the memory used according to the feature value, and obtain a target voice parameter.

ELECTRONIC DEVICE FOR CONTROLLING BEAMFORMING AND OPERATING METHOD THEREOF

An electronic device is provided. The electronic device includes, for the purpose of determining a customized beamformer filter, an input module including a plurality of microphones configured to receive an external sound signal, a memory configured to store computer-executable instructions and an initial value of a voice parameter used to perform beamforming on the external sound signal, and a processor configured to execute the instructions by accessing the memory. The instructions may be configured to estimate a feature value of the external sound signal, calculate the initial value of the voice parameter used to perform beamforming based on the external sound signal received by the plurality of microphones, determine whether to store the calculated initial value according to the feature value, determine which one of the calculated initial value or an initial value stored in the memory used according to the feature value, and obtain a target voice parameter.

Systems and methods for generating a cleaned version of ambient sound
11551678 · 2023-01-10 · ·

A first electronic device is provided. While a media content item provided by a media-providing service is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives, from the media-providing service, data that includes an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound, which includes: using the data received from the media-providing service to align the audio stream with the ambient sound; and performing a subtraction operation to subtract the audio stream from the ambient sound. The first electronic device detects a voice command in the cleaned version of the ambient sound.

Systems and methods for generating a cleaned version of ambient sound
11551678 · 2023-01-10 · ·

A first electronic device is provided. While a media content item provided by a media-providing service is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives, from the media-providing service, data that includes an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound, which includes: using the data received from the media-providing service to align the audio stream with the ambient sound; and performing a subtraction operation to subtract the audio stream from the ambient sound. The first electronic device detects a voice command in the cleaned version of the ambient sound.

Method of performing function of electronic device and electronic device using same

An electronic device includes: a camera; a microphone; a display; a memory; and a processor configured to receive an input for activating an intelligent agent service from a user while at least one application is executed, identify context information of the electronic device, control to acquire image information of the user through the camera, based on the identified context information, detect movement of a user's lips included in the acquired image information to recognize a speech of the user, and perform a function corresponding to the recognized speech.

Method of performing function of electronic device and electronic device using same

An electronic device includes: a camera; a microphone; a display; a memory; and a processor configured to receive an input for activating an intelligent agent service from a user while at least one application is executed, identify context information of the electronic device, control to acquire image information of the user through the camera, based on the identified context information, detect movement of a user's lips included in the acquired image information to recognize a speech of the user, and perform a function corresponding to the recognized speech.

Electronic device and method for speech recognition of the same

An electronic device for recognizing a user's speech and a speech recognition method therefor are provided. The electronic device includes a microphone configured to receive a user's speech, a memory for storing speech recognition models, and at least one processor configured to select a speech recognition model from among the speech recognition models stored in the memory based on an operation state of the electronic device, and recognize the user's speech received by the microphone based on the selected speech recognition model.

Electronic device and method for speech recognition of the same

An electronic device for recognizing a user's speech and a speech recognition method therefor are provided. The electronic device includes a microphone configured to receive a user's speech, a memory for storing speech recognition models, and at least one processor configured to select a speech recognition model from among the speech recognition models stored in the memory based on an operation state of the electronic device, and recognize the user's speech received by the microphone based on the selected speech recognition model.