IPIQ

G10H2210/051

ELECTRONIC DEVICE, METHOD AND COMPUTER PROGRAM

20220076687 · 2022-03-10 ·

Sony Group Corporation

An electronic device comprising circuitry configured to perform (402; 702; 1204) source separation (201) based on a received audio input to obtain a separated source, to perform onset detection (202) on the separated source to obtain an onset detection signal and to mix (405; 706; 1207) the audio signal with the separated source based on the onset detection signal to obtain an enhanced separated source.
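
As a rough illustration of the mixing step described in this abstract, the sketch below blends the original audio back into the separated source wherever the onset detection signal is high. The function name `mix_on_onsets`, the `gain` parameter, and the linear crossfade rule are assumptions made for illustration; the abstract does not specify how the mix is computed.

```python
def mix_on_onsets(mixture, separated, onset_signal, gain=1.0):
    """Blend the original mixture into the separated source around onsets.

    All three inputs are equal-length sequences of floats. The crossfade
    rule used here is an illustrative assumption, not the patented method.
    """
    if not (len(mixture) == len(separated) == len(onset_signal)):
        raise ValueError("all signals must have the same length")

    enhanced = []
    for x, s, o in zip(mixture, separated, onset_signal):
        # Per-sample mixing weight, clamped to [0, 1].
        w = min(max(gain * o, 0.0), 1.0)
        # w = 0 keeps the separated source; w = 1 uses the original mixture.
        enhanced.append((1.0 - w) * s + w * x)
    return enhanced
```

With a weight of zero the output is the separated source unchanged, so the original signal only contributes near detected onsets.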

AUDIO ANALYSIS METHOD, AUDIO ANALYSIS SYSTEM AND PROGRAM

20230395052 · 2023-12-07 ·

Kazuhiko Yamamoto

An audio analysis method that is realized by a computer system includes estimating a plurality of beat points of a musical piece by analyzing an audio signal representing a performance sound of the musical piece, receiving an instruction from a user to change a location of at least one beat point of the plurality of beat points, and updating a plurality of locations of the plurality of beat points in response to the instruction from the user.
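
One way to picture the update step in this abstract is sketched below: the user moves a single beat point, and the beat points after it are shifted by the same offset. The function `move_beat`, its `propagate` flag, and the constant-shift rule are hypothetical; the abstract only states that several beat locations are updated in response to the instruction.

```python
def move_beat(beats, index, new_time, propagate=True):
    """Return a new list of beat times (seconds) after a user edit.

    The beat at `index` is moved to `new_time`. If `propagate` is true,
    every later beat is shifted by the same amount (an assumed rule).
    """
    if not 0 <= index < len(beats):
        raise IndexError("beat index out of range")

    shift = new_time - beats[index]
    updated = list(beats)  # leave the caller's list untouched
    updated[index] = new_time
    if propagate:
        for i in range(index + 1, len(beats)):
            updated[i] = beats[i] + shift
    return updated


# Example: drag the second beat 0.25 s later.
print(move_beat([0.0, 0.5, 1.0, 1.5], 1, 0.75))
```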

COMPUTING ORDERS OF MODELED EXPECTATION ACROSS FEATURES OF MEDIA

20210335333 · 2021-10-28 ·

Secret Chord Laboratories, Inc.

A method implemented by a determination engine is provided. The determination engine receives a media dataset comprising target piece music information, target piece audience information, corpus music information, corpus audience information, and corpus preference data. The determination engine determines a subset of the corpus music and preference information and determines at least one surprise factor of the subset of the corpus music and preference information across features at one of a plurality of orders. The determination engine learns a model that estimates a likelihood that time-varying surprise trends across the features achieve a preference level. The determination engine determines at least one surprise factor of the target piece music information across the features at the one of the plurality of orders and predicts, using the model, preference information using the time-varying surprise trends for the target piece music information across the features.

Media content identification on mobile devices

11140439 · 2021-10-05 ·

Roku, Inc.

A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

Sound signal processor and sound signal processing method

11140506 · 2021-10-05 ·

Yamaha Corporation

A sound signal processor includes a memory storing instructions and a processor configured to implement the stored instructions to execute a plurality of tasks, the tasks including a sound signal input task configured to obtain a sound signal, a beat detection task configured to detect a beat in the sound signal, and a processing task configured to perform an effect processing on the sound signal in accordance with a timing of the detected beat.

SYSTEMS AND METHODS FOR GENERATING A PLAYBACK-INFORMATION DISPLAY DURING TIME COMPRESSION OR EXPANSION OF AN AUDIO SIGNAL

20210405958 · 2021-12-30 ·

inMusic Brands, Inc.

Systems and methods for generating a playback-information display during time compression or expansion of an audio signal are provided. The system includes a processor that performs a method including displaying a first remaining playback-time associated with an audio file; adjusting the playback speed of the audio file during playback of the audio file; and, in response to the playback speed being adjusted, automatically displaying a second remaining playback-time associated with the audio file during playback of the audio file.
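
The second remaining playback-time can be understood with simple arithmetic: the media time still to be played, divided by the playback speed. The helper below is a minimal sketch under that assumption; the name `remaining_playback_time` and its parameters are illustrative and do not come from the patent.

```python
def remaining_playback_time(duration_s, position_s, speed=1.0):
    """Wall-clock seconds left, given media duration, position and speed.

    `speed` is a playback-rate multiplier (1.0 = normal, 2.0 = double).
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    remaining_media = max(duration_s - position_s, 0.0)
    return remaining_media / speed


# A 240 s track at the 60 s mark: first at normal speed, then at 1.5x.
print(remaining_playback_time(240.0, 60.0))       # 180.0
print(remaining_playback_time(240.0, 60.0, 1.5))  # 120.0
```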

Automatic conversion of speech into song, rap or other audible expression having target meter or rhythm

11127407 · 2021-09-21 ·

Smule, Inc.

Captured vocals may be automatically transformed using advanced digital signal processing techniques that provide captivating applications, and even purpose-built devices, in which mere novice user-musicians may generate, audibly render and share musical performances. In some cases, the automated transformations allow spoken vocals to be segmented, arranged, temporally aligned with a target rhythm, meter or accompanying backing tracks and pitch corrected in accord with a score or note sequence. Speech-to-song music applications are one such example. In some cases, spoken vocals may be transformed in accord with musical genres such as rap using automated segmentation and temporal alignment techniques, often without pitch correction. Such applications, which may employ different signal processing and different automated transformations, may nonetheless be understood as speech-to-rap variations on the theme.

APPARATUS AND METHOD FOR DECOMPOSING AN AUDIO SIGNAL USING A VARIABLE THRESHOLD

20210295854 · 2021-09-23 ·

An apparatus for decomposing an audio signal into a background component signal and a foreground component signal, has: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a characteristic of a current block of the audio signal and for determining a variability of the characteristic within a group of blocks having at least two blocks of the sequence of blocks; and a separator for separating the current block into a background portion and a foreground portion wherein the separator is configured to determine a separation threshold based on the variability and to separate the current block into the background component signal and the foreground component signal, when the characteristic of the current block is in a predetermined relation to the separation threshold.
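
The variable-threshold idea can be sketched as follows, with several simplifying assumptions: the block characteristic is taken to be mean energy, variability is the standard deviation over a trailing group of blocks, the threshold is the group mean plus `k` standard deviations, and each block is assigned whole to one component. None of these choices, nor the name `label_blocks`, is stated in the abstract.

```python
from statistics import mean, pstdev


def label_blocks(blocks, group_size=4, k=1.0):
    """Label each non-empty block "foreground" or "background".

    The threshold adapts to how much block energy varies within the
    trailing group of up to `group_size` blocks (current block included).
    """
    energies = [sum(v * v for v in b) / len(b) for b in blocks]
    labels = []
    for i, energy in enumerate(energies):
        group = energies[max(0, i - group_size + 1): i + 1]
        # Assumed rule: threshold rises with the variability of the group.
        threshold = mean(group) + k * pstdev(group)
        labels.append("foreground" if energy > threshold else "background")
    return labels


# Three quiet blocks followed by a loud, transient-like block.
print(label_blocks([[1.0, 1.0]] * 3 + [[10.0, 10.0]]))
```

Because the threshold is recomputed per block, a steady signal stays in the background regardless of its absolute level, and only blocks that stand out from their recent neighbors are marked as foreground.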

Media-media augmentation system and method of composing a media product

11114074 · 2021-09-07 ·

MASHTRAXX LIMITED

Joseph Michael William LYSKE

A media-content augmentation system includes a processing system that receives input data in the form of temporally-varying events data. The processing system resolves the input into one or more categorized contextual themes, correlates the themes with metadata associated with at least one reference media file, and then splices or fades together selected parts of the media file, thus generating as an output, a media product in which transitions between its contextual themes are aligned with selected temporal events in the input data. The temporally-varying events take the form of a beginning and an end in the case of a sustained feature, or a specific point in time for a hit point. A method aligns sections in digital media files with temporally-varying events data to compose a media product. The system augments a sensory experience of a user by dynamically changing and then playing selected media files within the context of the categorized themes input to the processing system.

SYSTEMS AND METHODS FOR DISPLAYING GRAPHICS ABOUT A CONTROL WHEEL'S CENTER

20210279029 · 2021-09-09 ·

inMusic Brands, Inc.

A DJ media player is provided. The DJ media player has a control wheel used to control audio playback and a customizable first electronic display located about the center of the platter for displaying a graphic. The DJ media player has a second electronic display to show audio playback information. The graphic is dynamically updated. The graphic corresponds to at least one of a logo, an artist, an album, a song playback information, or a selection made by a user.
