G10H1/365

SYSTEM AND METHOD FOR PROVIDING A VIDEO WITH LYRICS OVERLAY FOR USE IN A SOCIAL MESSAGING ENVIRONMENT
20200184937 · 2020-06-11 ·

In accordance with an embodiment, described herein is a system and method for providing a live lyrics overlay in a social messaging environment. The system can utilize advances in three-dimensional mapping technology that allow social messaging services, to offer real time video lenses or overlays to their users, and extends this three-dimensional mapping technology to support for lyrics. During creation of a video with lyrics lens overlay, the lyrics corresponding to a selected song are retrieved from a lyrics source, and are displayed within the video. For example, with the lyrics lens, a user can record an image of themselves on live video, singing along to a song clip, with the lyrics of the song displayed as if they appear to be coming from their mouths. The created live lyrics content can also be shared with other users of a social messaging environment.

ELECTRONIC DEVICE FOR OUTPUTTING SOUND AND OPERATING METHOD THEREOF
20200176017 · 2020-06-04 ·

An electronic device includes a microphone, a speaker, a processor operatively connected to the microphone and the speaker, and a memory electrically connected to the processor and storing instructions that, when executed by the processor, cause the processor to receive a sound through the microphone, analyze the received sound, determine a song associated with the sound, and a kind of output sound based on at least in part on a result of the analyzing, generate an output sound based on the kind of output sound, and output the generated output sound through the speaker.

DIGITAL JUKEBOX DEVICE WITH KARAOKE AND/OR PHOTO BOOTH FEATURES, AND ASSOCIATED METHODS

Certain exemplary embodiments relate to entertainment systems and, more particularly, certain exemplary embodiments relate to jukebox systems that incorporate digital downloading jukebox features along with karaoke jukebox and/or photo booth features. A combined karaoke/photo booth/jukebox may enable more integrated performance-like experiences in an in-home or out-of-home location or venue. By leveraging vast audio media libraries, trusted rights-respecting network infrastructure, and on-site image/video capturing from integrated recorders and/or remote portable devices, a more sociable experience may be created for karaoke jukebox patrons, e.g., where custom content can be generated and shared in a safe and legally appropriate manner.

Song playback system

A KTV player in communication with at least one song server is configured to upload a song list of a song database to the song server, obtain a song playback list from the song server, obtain corresponding music scores and pitch curves from the song server according to the song playback list, obtain corresponding songs according to the song playback list, display the songs in sequence on a display, and display the corresponding music scores and pitch curves on the display synchronously with the songs. The song playback list is sent by a mobile terminal in communication with the song server.

Systems, Methods and Applications For Modulating Audible Performances
20200135237 · 2020-04-30 ·

Embodiments involve harmonising one or more geographically or temporally distributed renditions with at least one backing clip, comprising a calibration module for selecting a parameter of one or more aural or visual characteristics of the first rendition, a backing clip selector in communication with a backing clip database, a reference selector for selecting reference clip for modification of the first rendition, a modification module for applying a computational process to the first rendition or the backing clip to modify an aural characteristic of the first rendition or the backing clip to reduce the difference between the first rendition or the backing clip and the reference clip in the aural characteristic, and a mixing module to combine one or multiple renditions with the backing clip after modification.

Digital jukebox device with karaoke and/or photo booth features, and associated methods

Certain exemplary embodiments relate to entertainment systems and, more particularly, certain exemplary embodiments relate to jukebox systems that incorporate digital downloading jukebox features along with karaoke jukebox and/or photo booth features. A combined karaoke/photo booth/jukebox may enable more integrated performance-like experiences in an in-home or out-of-home location or venue. By leveraging vast audio media libraries, trusted rights-respecting network infrastructure, and on-site image/video capturing from integrated recorders and/or remote portable devices, a more sociable experience may be created for karaoke jukebox patrons, e.g., where custom content can be generated and shared in a safe and legally appropriate manner.

Digital jukebox device with karaoke and/or photo booth features, and associated methods

Certain exemplary embodiments relate to entertainment systems and, more particularly, certain exemplary embodiments relate to jukebox systems that incorporate digital downloading jukebox features along with karaoke jukebox and/or photo booth features. A combined karaoke/photo booth/jukebox may enable more integrated performance-like experiences in an in-home or out-of-home location or venue. By leveraging vast audio media libraries, trusted rights-respecting network infrastructure, and on-site image/video capturing from integrated recorders and/or remote portable devices, a more sociable experience may be created for karaoke jukebox patrons, e.g., where custom content can be generated and shared in a safe and legally appropriate manner.

Method, client and computer storage medium for processing information

The present disclosure discloses a method, a client and a computer storage medium for processing information, wherein the method includes: triggering a first operation; downloading a first audio file and a first text file matching the first audio file in response to the first operation; partly truncating the first audio file to obtain a first audio clip according to first indication information for identifying a truncating start position and second indication information for identifying a truncating end position; triggering a second operation; playing the first audio clip and dynamically displaying a text information part in the first text file corresponding to the first audio clip synchronously in response to the second operation; acquiring voice information of a user while playing the first audio chip; and synthesizing the first audio clip and the voice information into a first acquisition result.

Audiovisual media application platform with wireless handheld audiovisual input

Embodiments described herein relate generally to systems comprising a display device, a display device-coupled computing platform, a mobile device in communication with the computing platform, and a content server in which methods and techniques of capture and/or processing of audiovisual performances are described and, in particular, description of techniques suitable for use in connection with display device connected computing platforms for rendering vocal performance captured by a handheld computing device.

SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS
20200043517 · 2020-02-06 · ·

A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.