Patent classifications
G10H1/366
Methods and systems for overlaying and playback of audio data received from distinct sources
A system that is distinct from a remote server and distinct from a client device receives, over a first communications channel, a first data stream for a first media item. The system receives, over a second communications channel, from an application at the client device, a second data stream for audio data that includes vocals provided by a user as the first media item plays. The system measures a latency of the second communications channel and overlays, with the first media item, the vocals provided by the user as the first media item plays to generate a composite data stream. The overlaying comprises offsetting the first data stream from the second data stream in accordance with the measured latency of the second communications channel and combining the first and second data streams in accordance with the offset of the data streams.
LOW COMPLEXITY HOWLING SUPPRESSION FOR PORTABLE KARAOKE
A low complexity howling suppression system and method for portable karaoke system are provided. In the howling suppression, at least one infinite impulse response (IIR) filters are introduced for estimating the acoustic feedback picked up by the microphone from the real environment, and thereby to cancel out the acoustic feedback from the microphone input signal.
AUDIO-VISUAL EFFECTS SYSTEM FOR AUGMENTATION OF CAPTURED PERFORMANCE BASED ON CONTENT THEREOF
Visual effects schedules are applied to audiovisual performances with differing visual effects applied in correspondence with differing elements of musical structure. Segmentation techniques applied to one or more audio tracks (e.g., vocal or backing tracks) are used to compute some of the components of the musical structure. In some cases, applied visual effects schedules are mood-denominated and may be selected by a performer as a component of his or her visual expression or determined from an audiovisual performance using machine learning techniques.
SOUND PROCESSING METHOD, SOUND PROCESSING SYSTEM, ELECTRONIC MUSICAL INSTRUMENT, AND RECORDING MEDIUM
A computer-implemented sound processing method includes: outputting singing sound data based on a sound signal representing singing sound; and outputting sound data representing musical instrument sound that correlates with musical elements of the singing sound, by inputting input data that includes the singing sound data to a trained model that has learned, by machine learning, a relationship between singing sound for training and musical instrument sound for training.
Automated generation of coordinated audiovisual work based on content captured from geographically distributed performers
Vocal audio of a user together with performance synchronized video is captured and coordinated with audiovisual contributions of other users to form composite duet-style or glee club-style or window-paned music video-style audiovisual performances. In some cases, the vocal performances of individual users are captured (together with performance synchronized video) on mobile devices, television-type display and/or set-top box equipment in the context of karaoke-style presentations of lyrics in correspondence with audible renderings of a backing track. Contributions of multiple vocalists are coordinated and mixed in a manner that selects for presentation, at any given time along a given performance timeline, performance synchronized video of one or more of the contributors. Selections are in accord with a visual progression that codes a sequence of visual layouts in correspondence with other coded aspects of a performance score such as pitch tracks, backing audio, lyrics, sections and/or vocal parts.
Operation panel for audio processing apparatus
An audio mixer includes an operation panel having a plurality of faders, a display device, and a first surface, a second surface, and a third surface, the second surface extending from the first surface and the third surface extending from the second surface, in a front-back direction. The plurality of faders are disposed along the first surface and arranged in a left-right direction. The display device is disposed along the third surface. A first reference distance from a common reference point to a first point of the first surface is equal to greater than a second reference distance from the common reference point to a second point of the third surface. The first point is disposed at a frontmost portion of the operation panel. The second point is disposed higher than the first point, at an uppermost portion of the third surface. The common reference position is disposed higher than the second point.
Audiovisual collaboration method with latency management for wide-area broadcast
Techniques have been developed to facilitate the livestreaming of group audiovisual performances. Audiovisual performances including vocal music are captured and coordinated with performances of other users in ways that can create compelling user and listener experiences. For example, in some cases or embodiments, duets with a host performer may be supported in a sing-with-the-artist style audiovisual livestream in which aspiring vocalists request or queue particular songs for a live radio show entertainment format. The developed techniques provide a communications latency-tolerant mechanism for synchronizing vocal performances captured at geographically-separated devices (e.g., at globally-distributed, but network-connected mobile phones or tablets or at audiovisual capture devices geographically separated from a live studio).
AUDIOVISUAL COLLABORATION METHOD WITH LATENCY MANAGEMENT FOR WIDE-AREA BROADCAST
Techniques have been developed to facilitate the livestreaming of group audiovisual performances. Audiovisual performances including vocal music are captured and coordinated with performances of other users in ways that can create compelling user and listener experiences. For example, in some cases or embodiments, duets with a host performer may be supported in a sing-with-the-artist style audiovisual livestream in which aspiring vocalists request or queue particular songs for a live radio show entertainment format. The developed techniques provide a communications latency-tolerant mechanism for synchronizing vocal performances captured at geographically-separated devices (e.g., at globally-distributed, but network-connected mobile phones or tablets or at audiovisual capture devices geographically separated from a live studio).
Mutating spectral resynthesizer system and methods
A method of and system for generating audio having pitch attributes of an incoming audio stream. The method comprises receiving a digital audio input. The audio spectrum is analyzed and integrated over segments of digital audio data upon receiving analysis triggers which can be synced with the audio tempo. The integrated spectrum is processed to find peak frequencies in the spectrum and their associated gain stored in a peaks array. The peak frequencies are used to program the oscillators controllable attributes and characteristics. The synthesis is performed upon receiving an analysis clock. A number of digital oscillators are configured with the associated frequency parameters and gain parameters from a peaks array. The oscillators are configured according to the audio pitch analysis and generate an oscillator output at the frequency and gain specified in the peaks array. These oscillator outputs are summed together generating synthesized audio.
System and method for generating harmonious color sets from musical interval data
Systems and methods are disclosed for generating color sets based on musical concepts of pitch intervals and harmony. Color sets are derived via a music-to-hue process which analyzes musical pitch data associated with musical input to determine pitch intervals included in the music. Pitch interval angles associated with the pitch intervals are applied to a tuned hue index to identify hue note ordered within the index which are separated by a hue interval angle similar to the pitch angle associated with the analyzed pitch data. The systems and methods provide for the creation of color sets which are analogous to musical chords in that they include multiple hue notes selected based on hue interval angles derived from musical interval angles associated with the received musical input.