G10L21/00

Systems and methods for automatic program recommendations based on user interactions

Methods and systems are provided for generating automatic program recommendations based on user interactions. In some embodiments, control circuitry processes verbal data received during an interaction between a user of a user device and a person with whom the user is interacting. The control circuitry analyzes the verbal data to automatically identify a media asset referred to during the interaction by at least one of the user and the person with whom the user is interacting. The control circuitry adds the identified media asset to a list of media assets associated with the user of the user device. The list of media assets is transmitted to a second user device of the user.

Device control method, display control method, and purchase settlement method

A device control method includes acquiring voice information, obtaining a spoken command indicating a control instruction as to a device based on the acquired voice information, identifying speaker information relating to a speaker which has uttered the acquired voice information, based on the acquired voice information, identifying, out of a plurality of devices, a device to be controlled, based on the spoken command and the speaker information, and controlling the identified device to be controlled.

Digital audio workstation with audio processing recommendations
11687314 · 2023-06-27 · ·

Presentation of a recommendation to a user for individual processing of audio tracks in a digital audio workstation. Training audio tracks are provided to a human sound mixer and responsive to the training audio tracks individually processed training audio tracks are received from the human sound mixer. The training audio tracks and the individually processed training audio tracks are input to a machine to train the machine. Audio processing operations are output from the trained machine and stored in a record of a database.

Method and apparatus for adaptively encoding and decoding high frequency band

Provided are a method and apparatus for encoding and decoding an audio signal. According to the present application, a signal of a high frequency band above a preset frequency band is adaptively encoded or decoded in the time domain or in the frequency domain by using a signal of a low frequency band below the preset frequency band. As such, the sound quality of a high frequency signal is not deteriorate even when an audio signal is encoded or decoded by using a small number of bits and thus coding efficiency may be maximized.

Method and apparatus for adaptively encoding and decoding high frequency band

Provided are a method and apparatus for encoding and decoding an audio signal. According to the present application, a signal of a high frequency band above a preset frequency band is adaptively encoded or decoded in the time domain or in the frequency domain by using a signal of a low frequency band below the preset frequency band. As such, the sound quality of a high frequency signal is not deteriorate even when an audio signal is encoded or decoded by using a small number of bits and thus coding efficiency may be maximized.

Voice processing device, voice processing method, and non-transitory computer readable recording medium having therein program for voice processing
09847094 · 2017-12-19 · ·

A voice processing device includes a first sound collecting unit for generating a first voice signal; a human-body vibration obtaining unit for generating a human-body vibration signal; a first distance calculating unit for calculating a ratio in power or amplitude between the human-body vibration signal and the first voice signal and for calculating a first distance from the first sound collecting unit to a sound source in accordance with the ratio and distance estimation information; a second distance calculating unit for calculating, for each of a plurality of frequencies, a second distance from the first sound collecting unit to a sound source which produces a component of a frequency of a first frequency signal; a gain determining unit for determining, for each of the plurality of frequencies, a gain based on a comparison result between the first distance and the second distance.

Adaptive processing with multiple media processing nodes

Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.

Adaptive processing with multiple media processing nodes

Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.

Voice processing apparatus and voice processing method
09842599 · 2017-12-12 · ·

A voice processing apparatus calculates a phase difference between first and second frequency signals obtained by transforming first and second voice signals generated by two voice input units for each frequency, calculates, for each extension range set outside or inside a reference range, a presence ratio based on the number of frequencies with the phase difference between the first and second frequency signals falling within the extension range, the reference range representing a range of the phase difference between the first and second voice signals for each frequency and corresponding to a direction in which a target sound source is assumed to be located, and sets, as a non-suppression range, a first extension range having the presence ratio higher than a predetermined value and a second extension range closer to the phase difference at the center of the reference range than the first extension range is within the reference range.

Voice processing apparatus and voice processing method
09842599 · 2017-12-12 · ·

A voice processing apparatus calculates a phase difference between first and second frequency signals obtained by transforming first and second voice signals generated by two voice input units for each frequency, calculates, for each extension range set outside or inside a reference range, a presence ratio based on the number of frequencies with the phase difference between the first and second frequency signals falling within the extension range, the reference range representing a range of the phase difference between the first and second voice signals for each frequency and corresponding to a direction in which a target sound source is assumed to be located, and sets, as a non-suppression range, a first extension range having the presence ratio higher than a predetermined value and a second extension range closer to the phase difference at the center of the reference range than the first extension range is within the reference range.