Patent classifications
G10H1/365
Singing voice separation with deep U-Net convolutional networks
A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.
Instructional method and system of an electronic keyboard, instructional electronic keyboard, and a storage medium
An instructional method of an electronic keyboard includes the following steps: obtaining a playing script, wherein the playing script is generated by a recording electronic keyboard; the recording electronic keyboard generates the playing script according to a pressed second key and the time of pressing the second key, and the playing script is used for indicating a corresponding relationship between a to-be-pressed key and the time of pressing the to-be-pressed key; controlling an indicator light on the first key of the instructional electronic keyboard to be turned on/off according to the playing script. In this implementation mode, a user can be prompted for the key by an indicator light on the electronic keyboard, which facilitates the user to learn the electronic keyboard.
ELECTRONIC MUSICAL INSTRUMENT, OPERATION STATUS NOTIFICATION METHOD OF ELECTRONIC MUSICAL INSTRUMENT, AND NON-TRANSITORY RECORDING MEDIUM
An electronic musical instrument includes an input device that receives an input operation, a display, and a processor. The processor performs a first process upon duration of a detected continuation of the input operation being a reference time or longer. The processor performs a second process upon duration of a detected continuation of the input operation being less than the reference time. The processor causes the display to display a content from which it is determinable whether or not duration of a detected continuation of the input operation being the reference time or longer.
DIGITAL JUKEBOX DEVICE WITH KARAOKE AND/OR PHOTO BOOTH FEATURES, AND ASSOCIATED METHODS
Certain exemplary embodiments relate to entertainment systems and, more particularly, certain exemplary embodiments relate to jukebox systems that incorporate digital downloading jukebox features along with karaoke jukebox and/or photo booth features. A combined karaoke/photo booth/jukebox may enable more integrated performance-like experiences in an in-home or out-of-home location or venue. By leveraging vast audio media libraries, trusted rights-respecting network infrastructure, and on-site image/video capturing from integrated recorders and/or remote portable devices, a more sociable experience may be created for karaoke jukebox patrons, e.g., where custom content can be generated and shared in a safe and legally appropriate manner.
System and method for providing a video with lyrics overlay for use in a social messaging environment
In accordance with an embodiment, described herein is a system and method for providing a live lyrics overlay in a social messaging environment. The system can utilize advances in three-dimensional mapping technology that allow social messaging services, to offer real time video lenses or overlays to their users, and extends this three-dimensional mapping technology to support for lyrics. During creation of a video with lyrics lens overlay, the lyrics corresponding to a selected song are retrieved from a lyrics source, and are displayed within the video. For example, with the lyrics lens, a user can record an image of themselves on live video, singing along to a song clip, with the lyrics of the song displayed as if they appear to be coming from their mouths. The created live lyrics content can also be shared with other users of a social messaging environment.
Self-produced music apparatus and method
An application for operating on a smart phone that records a musician's performance, either voice or instrumental, in combination with pre-recorded music. The combination allows for the auto tuning of the recording, the compression of the recording, the equalization of the recording, adding in reverb, correcting latency and the audio quantization of the rhythm, in addition to music enhancement features such as vocal spread, DeEsser, vocal doubler, vocal harmonizer, tape saturation, pitch correcdtion, flanger, phaser, auto pan, vibrato, tremolo, rotary, ring modulator, metalizer, expander, noise gate, wah, vocal leveling, tape stop, half speed, LoFi, and stutter. Once combined, the song is transmitted to social media and/or to an online store for sale. The user can also make a video with the song. Additional marketing such as song competitions or music reviews and ratings are also provided.
SINGING SCORING METHOD AND SINGING SCORING SYSTEM BASED ON STREAMING MEDIA
A singing scoring method and a singing scoring system based on streaming media are provided. In the singing scoring method and the singing scoring system, a first time difference between a moment at which a streaming video player starts to play a song and a moment at which an electronic device starts an audio recording program and a scoring engine is calculated. In addition, in the singing scoring method and the singing scoring system, a playing time difference of the streaming video player within every fixed period of a system time of the electronic device is continuously calculated, and the playing time difference is transferred to the scoring engine for accumulation to form a second time difference. The scoring engine then adjusts a singing time of each note in an entire musical score according to the first time difference and the second time difference.
SYSTEM AND METHOD FOR INTERACTIVE MICROPHONE
A system and method for an interactive three piece microphone with customizable features that may be quickly accessed by a user whereby the users' settings are automatically uploaded to the microphone providing a more personalized experience. The microphone may be used in a variety of situations where users may challenge one another as well as collect monetary rewards based on the location and type of performance they have.
DIGITAL JUKEBOX DEVICE WITH KARAOKE AND/OR PHOTO BOOTH FEATURES, AND ASSOCIATED METHODS
Certain exemplary embodiments relate to entertainment systems and, more particularly, certain exemplary embodiments relate to jukebox systems that incorporate digital downloading jukebox features along with karaoke jukebox and/or photo booth features. A combined karaoke/photo booth/jukebox may enable more integrated performance-like experiences in an in-home or out-of-home location or venue. By leveraging vast audio media libraries, trusted rights-respecting network infrastructure, and on-site image/video capturing from integrated recorders and/or remote portable devices, a more sociable experience may be created for karaoke jukebox patrons, e.g., where custom content can be generated and shared in a safe and legally appropriate manner.
Textual display of aural information broadcast via frequency modulated signals
An electronic device includes a display screen and circuitry. The circuitry receives a first frequency modulated (FM) signal from a first FM radio transmitter, via a first FM radio channel. The first FM signal comprises a broadcast data signal that includes an audio segment of aural information of a performer-of-interest at of a live event, text information associated with the audio segment, and synchronization information. The synchronization information is associated with the text information and the audio segment. The circuitry extracts the synchronization information from a plurality of data packets of the broadcast data signal. The circuitry extracts a portion of the text information from the extracted plurality of data packets of the broadcast data signal based on the extracted synchronization information. The circuitry controls display of the extracted portion of the text information on the display screen.