G10H2240/036

Computationally efficient language based user interface event sound selection

A computer user interface (UI) can generate a sound when a predetermined event occurs. The sound may possess at least some characteristics of a predominant natural language used by a user and/or at a location of a computer implementing the UI. This enables the user to quickly assimilate the sound and therefore to respond rapidly to the predetermined event, at times using the computer UI, which reduces undesirable memory use, processor use, and/or battery drain on the computing device that implements the UI.
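The selection step described in this abstract can be illustrated with a minimal Python sketch. It assumes a simple lookup table keyed by event and language code, with a fallback from the user's language to the location's language to a default. The table contents, file names, and the function name select_event_sound are all hypothetical and are not taken from the patent.

```python
# Hypothetical table mapping (event, language code) to a sound asset.
# The file names are placeholders, not assets named in the source.
EVENT_SOUNDS = {
    ("new_message", "en"): "new_message_en.wav",
    ("new_message", "ja"): "new_message_ja.wav",
    ("low_battery", "en"): "low_battery_en.wav",
}


def select_event_sound(event, user_language=None, location_language=None,
                       default_language="en", table=EVENT_SOUNDS):
    """Pick a sound for an event, preferring the user's language.

    Assumed fallback order: user language, then the language associated
    with the device location, then a default language.
    Returns None when no sound is registered for the event.
    """
    for language in (user_language, location_language, default_language):
        if language and (event, language) in table:
            return table[(event, language)]
    return None


if __name__ == "__main__":
    print(select_event_sound("new_message", user_language="ja"))
    print(select_event_sound("low_battery", user_language="ja",
                             location_language="en"))
```

A table lookup is used here only because it is cheap at event time; the abstract does not say how the sound is actually chosen or produced.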

Dual sound source audio data processing method and apparatus

A dual sound source audio data processing method and apparatus are provided. The method includes obtaining audio data of a song pair including a first song and a second song, the first and second songs having the same accompaniment audio but different voice audio. The audio data is decoded to obtain first mono audio data corresponding to the first song and second mono audio data corresponding to the second song. The first and second mono audio data are combined into one piece of two-channel audio data including a left audio channel and a right audio channel. A play time of the two-channel audio data is divided into play periods, and energy suppression is selectively performed on the left audio channel and the right audio channel in different play periods.
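The combining and suppression steps can be sketched as follows in Python. The sketch assumes the two songs are already decoded into equal-rate lists of mono samples, that play periods have a fixed length, and that suppression strictly alternates between channels. The abstract only says suppression is applied selectively in different play periods, so the alternation rule, the gain parameter, and the function name are illustrative assumptions.

```python
def combine_and_alternate(first_mono, second_mono, sample_rate,
                          period_seconds, suppression_gain=0.0):
    """Merge two mono tracks into stereo frames and suppress one channel per period.

    first_mono goes to the left channel and second_mono to the right.
    In even-numbered play periods the right channel is scaled by
    suppression_gain; in odd-numbered periods the left channel is.
    (Strict alternation is an assumption made for this sketch.)
    """
    if sample_rate <= 0 or period_seconds <= 0:
        raise ValueError("sample_rate and period_seconds must be positive")

    frame_count = min(len(first_mono), len(second_mono))
    period_length = max(1, int(sample_rate * period_seconds))

    stereo = []
    for i in range(frame_count):
        left, right = first_mono[i], second_mono[i]
        if (i // period_length) % 2 == 0:
            right *= suppression_gain  # even period: suppress right channel
        else:
            left *= suppression_gain   # odd period: suppress left channel
        stereo.append((left, right))
    return stereo


if __name__ == "__main__":
    frames = combine_and_alternate([1.0] * 4, [2.0] * 4,
                                   sample_rate=2, period_seconds=1)
    print(frames)
```

With a gain of 0.0 the listener hears only one voice per period over the shared accompaniment; a gain between 0 and 1 would attenuate rather than mute the other voice.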

Methods, computer server systems and media devices for media streaming
11889165 · 2024-01-30

A computer server system associates one or more media items with a first segment of a first media item, the one or more media items selected based on current location information of a media device. The computer server system receives, from a media device, a request for a media item associated with the first media item, wherein the request includes a media segment identifier for the first segment of the first media item. In response to the request, the computer server system identifies the one or more media items associated with the first segment and provides the one or more media items to the media device.
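The request flow in this abstract can be illustrated with a small in-memory Python sketch. It assumes the server holds a catalog of media items tagged with a region code and selects items whose region equals the device's current location. The class name, the catalog structure, the region codes, and the segment identifier format are all hypothetical; the abstract does not describe how items are stored or matched.

```python
class SegmentMediaServer:
    """Hypothetical in-memory stand-in for the computer server system."""

    def __init__(self, catalog):
        # catalog: list of (media_item, region) pairs; a made-up structure.
        self._catalog = list(catalog)
        self._by_segment = {}

    def associate(self, segment_id, device_location):
        """Attach catalog items matching the device location to a segment."""
        selected = [item for item, region in self._catalog
                    if region == device_location]
        self._by_segment[segment_id] = selected
        return selected

    def handle_request(self, segment_id):
        """Return the items associated with the requested segment.

        An unknown segment identifier yields an empty list.
        """
        return list(self._by_segment.get(segment_id, []))


if __name__ == "__main__":
    server = SegmentMediaServer([
        ("poster_stockholm.jpg", "SE"),
        ("poster_berlin.jpg", "DE"),
    ])
    server.associate("track1-seg0", device_location="SE")
    print(server.handle_request("track1-seg0"))
```

Keying the lookup on the media segment identifier means the device only needs to send that identifier in its request; the location-based selection happens on the server beforehand.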

Identifying language in music
11955110 · 2024-04-09

The present disclosure describes techniques for identifying languages associated with music. Training data may be received, wherein the training data comprises information indicative of audio data representative of a plurality of music samples and metadata associated with the plurality of music samples. The training data further comprises information indicating a language corresponding to each of the plurality of music samples. A machine learning model may be trained to identify a language associated with a piece of music by applying the training data to the machine learning model until the model reaches a predetermined recognition accuracy. A language associated with the piece of music may be determined using the trained machine learning model.
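The "train until a predetermined recognition accuracy is reached" loop can be sketched in Python as below. A two-class perceptron stands in for the machine learning model, and each music sample is assumed to be already reduced to a short numeric feature vector derived from its audio data and metadata. The model choice, the feature vectors, the language codes, and every name in the code are illustrative assumptions; the abstract does not specify the model or the features.

```python
def predict(weights, bias, features, positive="en", negative="es"):
    """Classify a feature vector with a linear decision rule."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return positive if score > 0 else negative


def accuracy(weights, bias, samples):
    """Fraction of labeled samples the current model classifies correctly."""
    correct = sum(1 for features, language in samples
                  if predict(weights, bias, features) == language)
    return correct / len(samples)


def train_until_accuracy(samples, target_accuracy, max_epochs=100):
    """Run perceptron updates until the target accuracy is reached.

    samples: list of (feature_vector, language) pairs, language in {"en", "es"}.
    Training stops early once accuracy on the samples meets the target,
    or after max_epochs passes, whichever comes first.
    """
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    reached = accuracy(weights, bias, samples)
    for _ in range(max_epochs):
        if reached >= target_accuracy:
            break
        for features, language in samples:
            target = 1 if language == "en" else -1
            guess = 1 if predict(weights, bias, features) == "en" else -1
            if guess != target:
                # Standard perceptron update on a misclassified sample.
                weights = [w + target * x for w, x in zip(weights, features)]
                bias += target
        reached = accuracy(weights, bias, samples)
    return weights, bias, reached


if __name__ == "__main__":
    # Toy, hand-made feature vectors; not real audio features.
    data = [([1.0, 0.2], "en"), ([0.9, 0.1], "en"),
            ([0.1, 1.0], "es"), ([0.2, 0.8], "es")]
    w, b, acc = train_until_accuracy(data, target_accuracy=1.0)
    print(acc, predict(w, b, [1.0, 0.0]))
```

Accuracy is measured on the training samples here only to keep the sketch short; a held-out validation set would be the usual way to check the stopping criterion.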

METHODS, COMPUTER SERVER SYSTEMS AND MEDIA DEVICES FOR MEDIA STREAMING
20190182561 · 2019-06-13

In general, this disclosure concerns media streaming. Among other things, the present disclosure presents a first media item for streaming from a computer server system to a media device. The first media item has an audio format and comprises a number of media segments, each of which is identifiable by a media segment identifier. One or several of the media segments are each associated with a respective second media item corresponding to the respective media segment identifier. The second media items typically have a media format other than audio.
