G10H2240/175

NON-TRANSITORY COMPUTER-READABLE MEDIUM HAVING COMPUTER-READABLE INSTRUCTIONS AND SYSTEM
20230239650 · 2023-07-27

A sound controlling system including a user terminal having a sound source, a wireless communication device, a digital-to-analog converter (DAC) and first processing electronics. The first processing electronics are configured to: provide data of a backing sound to the sound source; control the sound source to generate a sound signal based on the data; receive a first input instruction, which is either a first instruction to transmit the sound signal or a second instruction to play back the backing sound; provide the sound signal to the wireless communication device when the first input instruction is the first instruction, and provide the sound signal to the DAC when the first input instruction is the second instruction; control the wireless communication device to convert the sound signal to a wireless signal and transmit the wireless signal; and convert the sound signal from a digital signal to an analog signal for playback of the backing sound.
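The routing step in this abstract can be illustrated with a minimal sketch. It assumes the two instructions are modeled as an enum and that the wireless device and the DAC are represented by caller-supplied callables; `Instruction`, `route_sound_signal`, `wireless_send` and `dac_write` are hypothetical names, not taken from the application.

```python
from enum import Enum, auto


class Instruction(Enum):
    TRANSMIT = auto()   # stands in for the abstract's "first instruction"
    PLAY_BACK = auto()  # stands in for the abstract's "second instruction"


def route_sound_signal(instruction, samples, wireless_send, dac_write):
    """Hand the digital sound signal to exactly one sink, chosen by the instruction.

    wireless_send and dac_write are hypothetical callables standing in for the
    wireless communication device and the DAC.
    """
    if instruction is Instruction.TRANSMIT:
        wireless_send(samples)   # device converts to a wireless signal and transmits
        return "wireless"
    if instruction is Instruction.PLAY_BACK:
        dac_write(samples)       # DAC converts digital to analog for playback
        return "dac"
    raise ValueError(f"unsupported instruction: {instruction!r}")
```

In this sketch the same generated signal reaches one sink or the other, never both, which is one plausible reading of the abstract.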

CROWD-SOURCED TECHNIQUE FOR PITCH TRACK GENERATION
20230005463 · 2023-01-05

Digital signal processing and machine learning techniques can be employed in a vocal capture and performance social network to computationally generate vocal pitch tracks from a collection of vocal performances captured against a common temporal baseline such as a backing track or an original performance by a popularizing artist. In this way, crowd-sourced pitch tracks may be generated and distributed for use in subsequent karaoke-style vocal audio captures or other applications. Large numbers of performances of a song can be used to generate a pitch track. Computationally determined pitch trackings from individual audio signal encodings of the crowd-sourced vocal performance set are aggregated and processed as an observation sequence of a trained Hidden Markov Model (HMM) or other statistical model to produce an output pitch track.
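As a rough illustration of the aggregation idea, the sketch below treats per-frame pitch estimates from many performances as votes and decodes the most likely note sequence with the Viterbi algorithm. It is a simplification under stated assumptions: the transition probabilities are hand-set rather than trained, pitches are MIDI note numbers, unvoiced frames (`None`) are ignored rather than modeled, and `crowd_pitch_track`, `stay_prob` and `smoothing` are hypothetical names.

```python
import math
from collections import Counter


def crowd_pitch_track(tracks, stay_prob=0.9, smoothing=0.5):
    """Decode one pitch track from several time-aligned tracks.

    tracks: non-empty list of equal-length lists of MIDI notes (None = unvoiced).
    Assumes 0 < stay_prob < 1. Transition weights are hand-set, not trained.
    """
    n_frames = len(tracks[0])
    states = sorted({p for t in tracks for p in t if p is not None})
    if not states:
        return [None] * n_frames
    n = len(states)
    log_stay = math.log(stay_prob)
    log_move = math.log((1.0 - stay_prob) / max(n - 1, 1))

    def log_emissions(frame):
        # Vote histogram for this frame, with additive smoothing.
        votes = Counter(t[frame] for t in tracks if t[frame] is not None)
        total = sum(votes.values()) + smoothing * n
        return [math.log((votes[s] + smoothing) / total) for s in states]

    score = log_emissions(0)
    backptrs = []
    for frame in range(1, n_frames):
        emit = log_emissions(frame)
        new_score, ptrs = [], []
        for j in range(n):
            cands = [score[i] + (log_stay if i == j else log_move) for i in range(n)]
            best_i = max(range(n), key=cands.__getitem__)
            ptrs.append(best_i)
            new_score.append(cands[best_i] + emit[j])
        score = new_score
        backptrs.append(ptrs)

    # Backtrack from the best final state.
    j = max(range(n), key=score.__getitem__)
    path = [j]
    for ptrs in reversed(backptrs):
        j = ptrs[j]
        path.append(j)
    path.reverse()
    return [states[j] for j in path]
```

For example, `crowd_pitch_track([[60, 60, 62, 62], [60, 60, 62, 62], [60, 67, 62, 62]])` suppresses the single outlier vote for note 67 and follows the majority.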

AUDIOVISUAL COLLABORATION SYSTEM AND METHOD WITH SEED/JOIN MECHANIC

User interface techniques provide user vocalists with mechanisms for seeding subsequent performances by other users (e.g., joiners). A seed may be a full-length seed spanning much or all of a pre-existing audio (or audiovisual) work and mixing, to seed further contributions of one or more joiners, a user's captured media content for at least some portions of the audio (or audiovisual) work. A short seed may span less than all (and in some cases, much less than all) of the audio (or audiovisual) work. For example, a verse, chorus, refrain, hook or other limited “chunk” of an audio (or audiovisual) work may constitute a seed. A seeding user's call invites other users to join the full-length or short-form seed by singing along, singing a particular vocal part or musical section, singing harmony or other duet part, rapping, talking, clapping, recording video, adding a video clip from camera roll, etc. The resulting group performance, whether full-length or just a chunk, may be posted, livestreamed, or otherwise disseminated in a social network.

DIGITAL AUDIO SYSTEM
20230230492 · 2023-07-20

A portable digital audio system for a musician. The digital audio system includes an amplifier for processing an audio signal from a musical instrument or microphone electronically connected to the digital audio system and a speaker for playing a sound associated with the audio signal processed by the amplifier. The portable digital audio system also includes an audio control system providing operational control of the digital audio system and a primary housing for supporting the amplifier, the audio control system, and the speaker. Further, the digital audio system has a touch screen display in electronic communication with the audio control system and supported by the primary housing.

Short segment generation for user engagement in vocal capture applications

User interface techniques provide user vocalists with mechanisms for solo audiovisual capture and for seeding subsequent performances by other users (e.g., joiners). Audiovisual capture may be against a full-length work or seed spanning much or all of a pre-existing audio (or audiovisual) work and in some cases may mix, to seed further contributions of one or more joiners, a user's captured media content for at least some portions of the audio (or audiovisual) work. A short seed or short segment may span less than all (and in some cases, much less than all) of the audio (or audiovisual) work. For example, a verse, chorus, refrain, hook or other limited “chunk” of an audio (or audiovisual) work may constitute a short seed or short segment. Computational techniques are described that allow a system to automatically identify suitable short seeds or short segments. After audiovisual capture against the short seed or short segment, a resulting, solo or group, full-length or short-form performance may be posted, livestreamed, or otherwise disseminated in a social network.

Audiovisual content rendering with display animation suggestive of geolocation at which content was previously rendered

Techniques have been developed to facilitate (1) the capture and pitch correction of vocal performances on handheld or other portable computing devices and (2) the mixing of such pitch-corrected vocal performances with backing tracks for audible rendering on targets that include such portable computing devices as well as desktops, workstations, gaming stations, and even telephony targets. Implementations of the described techniques employ signal processing techniques and allocations of system functionality that are suitable given the generally limited capabilities of such handheld or portable computing devices and that facilitate efficient encoding and communication of the pitch-corrected vocal performances (or precursors or derivatives thereof) via wireless and/or wired bandwidth-limited networks for rendering on portable computing devices or other targets.

METHODS, SYSTEMS, AND DEVICES FOR ASSEMBLY OF LIVE AND RECORDED AUDIO CONTENT

Aspects of the subject disclosure may include, for example, receiving first audio content from a first communication device and receiving second audio content from a second communication device, and adjusting the first audio content and the second audio content. The adjusting of the first audio content and the second audio content can comprise: detecting a gap in the first audio content; analyzing the first audio content resulting in an audio analysis; generating filler audio content based on the audio analysis; and inserting the filler audio content into the gap of the first audio content. Further embodiments can include aggregating the first adjusted audio content with the second adjusted audio content resulting in aggregated audio content, and providing the aggregated audio content to a third communication device for playback. The third communication device plays the aggregated audio content. Other embodiments are disclosed.
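The detect, analyze, fill and aggregate steps can be sketched as follows. This is an illustration only: a gap is assumed to be a run of missing samples marked `None`, the "audio analysis" is reduced to reading the samples on either side of the gap, the filler is a linear ramp between them, and aggregation is a sample-wise sum of equal-length streams. The function names are hypothetical.

```python
def find_gaps(samples):
    """Return (start, end) pairs, end exclusive, for runs of missing samples (None)."""
    gaps, start = [], None
    for i, s in enumerate(samples):
        if s is None and start is None:
            start = i
        elif s is not None and start is not None:
            gaps.append((start, i))
            start = None
    if start is not None:
        gaps.append((start, len(samples)))
    return gaps


def fill_gaps(samples):
    """Insert filler into each gap by ramping linearly between its neighbors."""
    out = list(samples)
    for start, end in find_gaps(samples):
        left = samples[start - 1] if start > 0 else None
        right = samples[end] if end < len(samples) else None
        if left is None and right is None:
            left = right = 0.0          # whole stream missing: fill with silence
        elif left is None:
            left = right                # leading gap: hold the first known value
        elif right is None:
            right = left                # trailing gap: hold the last known value
        span = end - start + 1
        for k in range(start, end):
            t = (k - start + 1) / span
            out[k] = left + (right - left) * t
    return out


def aggregate(first, second):
    """Mix two adjusted streams sample by sample (assumes equal length)."""
    return [a + b for a, b in zip(first, second)]
```

A real system would presumably generate filler from a richer analysis (spectrum, tempo, voice characteristics); the ramp only shows where such filler would be inserted.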

Method and apparatus for using a test audio pattern to generate an audio signal transform for use in performing acoustic echo cancellation
11521636 · 2022-12-06

A test audio pattern is sent to the speaker of the participant computer for outputting by the speaker. A computer receives a microphone input signal from the participant computer that includes the test audio pattern outputted by the speaker of the participant computer, and any ambient noise picked up by the microphone of the participant computer. Ambient noise suppression is performed to cancel out any ambient noise in the microphone input signal picked up by the microphone of the participant computer. The test audio pattern sent to the speaker of the participant computer is compared with the noise-suppressed microphone input signal, which includes the test audio pattern outputted by the speaker of the participant computer. An audio signal transform is generated from the comparison. The generated audio signal transform is subsequently used for performing acoustic echo cancellation of streaming audio received from the microphone input signal when the participant computer receives streaming audio and the participants engage in remote audio communications with each other.
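To make the compare-then-cancel flow concrete, the sketch below models the "audio signal transform" as nothing more than a delay and a gain, estimated by cross-correlating the test pattern with the microphone signal. That model is an assumption made for illustration; the patent does not state the form of the transform, a practical canceller would typically use a multi-tap adaptive filter, and the ambient noise suppression step is omitted here. `estimate_transform`, `cancel_echo` and `max_delay` are hypothetical names.

```python
def estimate_transform(test_pattern, mic, max_delay):
    """Return the (delay, gain) that best maps test_pattern onto mic.

    Tries each delay up to max_delay and keeps the one with the largest
    absolute correlation; the gain is the least-squares scale at that delay.
    """
    energy = sum(x * x for x in test_pattern)
    if energy == 0:
        raise ValueError("test pattern must contain non-zero samples")
    best, best_corr = (0, 0.0), -1.0
    for d in range(max_delay + 1):
        corr = sum(x * mic[i + d] for i, x in enumerate(test_pattern) if i + d < len(mic))
        if abs(corr) > best_corr:
            best_corr = abs(corr)
            best = (d, corr / energy)
    return best


def cancel_echo(mic, reference, delay, gain):
    """Subtract the delayed, scaled reference (far-end audio) from the mic signal."""
    out = list(mic)
    for i, x in enumerate(reference):
        if i + delay < len(out):
            out[i + delay] -= gain * x
    return out
```

Usage would follow the abstract's order: estimate the transform once from the test pattern, then apply `cancel_echo` to later microphone input with the streaming audio as the reference.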

TECHNOLOGICAL SUPPORT FOR COLLABORATIVE SONGWRITING AND RIGHTS REGISTRATION THEREFOR
20220383258 · 2022-12-01

A facility providing technological support for the initiation, performance, and/or registration of a collaborative songwriting project is described.

Template-Based Excerpting and Rendering of Multimedia Performance

Disclosed herein are computer-implemented method, system, and computer-readable storage-medium embodiments for implementing technologies for template-based excerpting and rendering of multimedia performances. An embodiment includes at least one computer processor configured to retrieve a first content instance and corresponding first metadata. The first content instance may include a first plurality of structural elements, with at least one structural element corresponding to at least part of the first metadata. The first content instance may be transformed by a rendering engine running on the at least one computer processor and/or transmitted to a content-playback device.