G10H2220/011

Enhanced graphical user interface for voice communications
11574633 · 2023-02-07

Enhanced graphical user interfaces for transcription of audio and video messages are disclosed. Audio data may be transcribed, and the transcription may include emphasized words and/or punctuation corresponding to emphasis in the user's speech. Additionally, the transcription may be translated into a second language. A message spoken by a user depicted in one or more images of video data may also be transcribed and provided to one or more devices.

SOUND CONTROL DEVICE, SOUND CONTROL METHOD, AND SOUND CONTROL PROGRAM
20180005617 · 2018-01-04

A sound control device includes: a reception unit that receives a start instruction indicating a start of output of a sound; a reading unit that reads a control parameter that determines an output mode of the sound, in response to the start instruction being received; and a control unit that causes the sound to be output in a mode according to the read control parameter.
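The three units in this abstract can be sketched as a small class. This is only an illustrative sketch, not the patent's implementation: the class name, the `ControlParameter` fields (`volume`, `pitch_shift_semitones`), and the lookup table keyed by a sound identifier are all assumptions introduced here, and the "output" is a returned string rather than real audio.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ControlParameter:
    # Hypothetical fields; the abstract only says the parameter
    # "determines an output mode of the sound".
    volume: float
    pitch_shift_semitones: int


class SoundControlDevice:
    def __init__(self, parameter_table: Dict[str, ControlParameter]) -> None:
        self._parameter_table = parameter_table

    def receive_start_instruction(self, sound_id: str) -> str:
        """Reception unit: a start instruction triggers the read, then the output."""
        parameter = self._read_control_parameter(sound_id)
        return self._output_sound(sound_id, parameter)

    def _read_control_parameter(self, sound_id: str) -> ControlParameter:
        """Reading unit: look up the parameter only once output is requested."""
        return self._parameter_table[sound_id]

    def _output_sound(self, sound_id: str, parameter: ControlParameter) -> str:
        """Control unit: stand-in for real audio output; returns a description."""
        return (f"play {sound_id} at volume {parameter.volume} "
                f"shifted {parameter.pitch_shift_semitones} semitones")
```

For example, `SoundControlDevice({"la": ControlParameter(0.8, 2)}).receive_start_instruction("la")` returns `"play la at volume 0.8 shifted 2 semitones"`.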

Information processing method, terminal device and computer storage medium
20180005618 · 2018-01-04

A method for processing information, a terminal device, and a computer storage medium are disclosed. The method includes: acquiring a first control instruction and switching a first application to a preset mode according to the first control instruction; acquiring a first triggering operation based on the preset mode, selecting at least two pieces of multimedia data based on the first triggering operation, and generating a first playing interface; when a second control instruction is acquired, sequentially playing the at least two pieces of multimedia data in the first playing interface; while playing first multimedia data among the at least two pieces of multimedia data, acquiring first audio data; and synthesizing the first multimedia data and the first audio data into second multimedia data.

LYRIC PAGE GENERATION METHOD AND LYRIC PAGE GENERATION APPARATUS

The present disclosure provides a lyrics page generation method and a lyrics page generation apparatus, belonging to the field of network technologies. The method includes: receiving a lyrics page generation instruction, which instructs that a lyrics page be displayed for a selected multimedia file list; obtaining a lyrics file of at least one multimedia file in the multimedia file list; and generating a lyrics page according to the lyrics file of the at least one multimedia file, the lyrics page including lyrics information of the at least one multimedia file. The disclosure provides a new lyrics display manner with an effect similar to a lyrics book, so that a user can collect favorite lyrics for later appreciation, the application becomes more user-friendly, and the amount of information that a lyrics page can provide is greatly increased.

AUDIOVISUAL COLLABORATION SYSTEM AND METHOD WITH SEED/JOIN MECHANIC

User interface techniques provide user vocalists with mechanisms for seeding subsequent performances by other users (e.g., joiners). A seed may be a full-length seed spanning much or all of a pre-existing audio (or audiovisual) work and mixing, to seed further contributions of one or more joiners, a user's captured media content for at least some portions of the audio (or audiovisual) work. A short seed may span less than all (and in some cases, much less than all) of the audio (or audiovisual) work. For example, a verse, chorus, refrain, hook or other limited “chunk” of an audio (or audiovisual) work may constitute a seed. A seeding user's call invites other users to join the full-length or short-form seed by singing along, singing a particular vocal part or musical section, singing harmony or other duet part, rapping, talking, clapping, recording video, adding a video clip from camera roll, etc. The resulting group performance, whether full-length or just a chunk, may be posted, livestreamed, or otherwise disseminated in a social network.

Sound source file structure, recording medium recording the same, and method of producing sound source file

The present disclosure relates to a sound source file structure that outputs lyrics as audible sounds right before the melodies corresponding to those lyrics start. This helps a user recall the lyrics after the accompaniment for a song starts, and helps the user sing the correct lyrics for each melody. The sound source file structure may include one or more backing sound source layers in which backing sounds based on beats and rhythms are placed, a melody sound source layer in which melody notes corresponding to lyrics based on beats and rhythms and a rest section corresponding to a rest are placed, and a lyric voice source layer in which a lyric voice is placed at a position corresponding to a rest section.
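One way to picture the layered structure is as a melody layer made of notes and rests, from which the lyric voice layer is derived. The sketch below is a minimal illustration under stated assumptions: the type names, the beat-based timing fields, and the rule "speak the phrase that follows each rest" are hypothetical choices made here, not details taken from the patent, and the backing layers are left out.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MelodyEvent:
    start_beat: float
    length_beats: float
    lyric: Optional[str] = None  # None marks a rest section

    @property
    def is_rest(self) -> bool:
        return self.lyric is None


@dataclass
class LyricVoiceCue:
    start_beat: float  # position of the rest in which the lyric is spoken
    text: str


def place_lyric_voice(melody: List[MelodyEvent]) -> List[LyricVoiceCue]:
    """Build the lyric voice layer: one spoken cue per rest section.

    Assumed rule: each cue carries the lyrics of the notes that follow
    the rest, up to the next rest.
    """
    cues: List[LyricVoiceCue] = []
    for index, event in enumerate(melody):
        if not event.is_rest:
            continue
        phrase: List[str] = []
        for following in melody[index + 1:]:
            if following.is_rest:
                break
            phrase.append(following.lyric)
        if phrase:  # a trailing rest with no notes after it gets no cue
            cues.append(LyricVoiceCue(event.start_beat, " ".join(phrase)))
    return cues
```

With a melody of a rest at beat 0, notes "Twinkle" and "twinkle", a rest at beat 4, then notes "little" and "star", the function yields two cues: "Twinkle twinkle" at beat 0 and "little star" at beat 4.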

Short segment generation for user engagement in vocal capture applications

User interface techniques provide user vocalists with mechanisms for solo audiovisual capture and for seeding subsequent performances by other users (e.g., joiners). Audiovisual capture may be against a full-length work or seed spanning much or all of a pre-existing audio (or audiovisual) work and in some cases may mix, to seed further contributions of one or more joiners, a user's captured media content for at least some portions of the audio (or audiovisual) work. A short seed or short segment may span less than all (and in some cases, much less than all) of the audio (or audiovisual) work. For example, a verse, chorus, refrain, hook or other limited “chunk” of an audio (or audiovisual) work may constitute a short seed or short segment. Computational techniques are described that allow a system to automatically identify suitable short seeds or short segments. After audiovisual capture against the short seed or short segment, a resulting, solo or group, full-length or short-form performance may be posted, livestreamed, or otherwise disseminated in a social network.

Audiovisual content rendering with display animation suggestive of geolocation at which content was previously rendered

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices as well as desktops, workstations, gaming stations, and even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

Electronic musical instrument, electronic musical instrument control method, and storage medium

An electronic musical instrument includes at least one processor that, in accordance with a user operation on an operation unit, obtains lyric data and waveform data corresponding to a first tone color; inputs the obtained lyric data to a trained model so as to cause the trained model to output acoustic feature data in response thereto; generates waveform data corresponding to a singing voice of a singer and corresponding to a second tone color that is different from the first tone color, based on the acoustic feature data outputted from the trained model and the obtained waveform data corresponding to the first tone color; and outputs a singing voice based on the generated waveform data corresponding to the second tone color.

DISPLAY DEVICE AND DISPLAY SYSTEM

The present disclosure relates to a display device and a display system for providing lyrics when reproducing music from an external device, regardless of the connection state of the external device. The display device includes: a display; a controller configured to receive a music reproduction command through the external device; and an audio output interface configured to output music received from the external device, wherein, when the controller receives the music reproduction command, the controller is configured to request lyric information from the external device, and when the controller receives the lyric information from the external device, the controller is configured to display lyrics through the display while outputting the music.
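The controller behaviour in this abstract amounts to a short request/response flow. The following is a rough sketch only: the class name, the callback standing in for the external device, and the lists used in place of a real display and audio interface are assumptions made for illustration.

```python
from typing import Callable, List, Optional


class DisplayDevice:
    def __init__(self, request_lyrics: Callable[[str], Optional[str]]) -> None:
        # request_lyrics is a hypothetical stand-in for asking the
        # external device for lyric information about a track.
        self._request_lyrics = request_lyrics
        self.screen: List[str] = []      # what the display shows
        self.audio_out: List[str] = []   # what the audio interface outputs

    def on_reproduction_command(self, track_id: str) -> None:
        """Controller: on a music reproduction command, request lyrics,
        then output the music and show the lyrics if any were received."""
        lyrics = self._request_lyrics(track_id)
        self.audio_out.append(track_id)
        if lyrics is not None:
            self.screen.append(lyrics)
```

A dictionary lookup can play the role of the external device, for example `DisplayDevice({"song-1": "la la la"}.get)`. Music is still output when no lyric information comes back; only the display step is skipped.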