Intelligent system for matching audio with video
20230015498 ยท 2023-01-19
Inventors
Cpc classification
G10H2210/056
PHYSICS
G10H2240/131
PHYSICS
G10H2220/441
PHYSICS
G10H2240/085
PHYSICS
G10H1/368
PHYSICS
G10H2250/311
PHYSICS
G10H2210/031
PHYSICS
International classification
Abstract
An intelligent system for matching audio with video of the present invention provides a video analysis module targeting color tone, storyboard pace, video dialogue, length and category and director's special requirement, actors expression, movement, weather, scene, buildings, spacial and temporal, things and a music analysis module targeting recorded music form, sectional turn, style, melody and emotional tension, and then uses an AI matching module to adequately match video of the video analysis module with musical characteristics of the music analysis module, so as to quickly complete a creative composition selection function with respect to matching audio with a video.
Claims
1. An intelligent system for matching audio with video, comprising: a video analysis module for making an analysis according to color tone, storyboard pace, video dialogue, length and category, director's special requirement, and characteristic, actors expression, movement, weather, scene, buildings, spatial and temporal factors, things, creature, character, character personality; a music analysis module for making an analysis according to recorded music form, sectional turn, style, genre, melody, tempo, instrument, chord accompaniment, voice type, rhythm, volume and emotional tension, wherein said music analysis and content comprise a music property analysis, an emotion analysis and music characteristic information; an AI matching module for connecting to the video analysis module and the music analysis module so as to adequately match a video with a musical characteristic; and a music editing module connected to the AI matching module, so as to impeccably match a time axis with an impact point between a music file and a video file by means of clip cutting and editing, music editing, music volume adjustment and sound field simulation.
2. The intelligent system for matching audio with video according to claim 1, wherein the video analysis module comprises an analysis of a color function and a color value in a movie, a color analysis of a structure of color analysis categories, a content analysis of a scene, a person, an item and lighting for distinguishing who, how, when, where and what in a video, and a character expression analysis for determining an emotion, a plot and a likely conversation of characters in a video according to an expression.
3. The intelligent system for matching audio with video according to claim 1, wherein the video analysis module has a storyboard file analysis for processing a storyboard pace according to a time point of the storyboard pace, and then a mode is input to serve as a reference for time point recording, music and sound effect insertion points between scene switches.
4. The intelligent system for matching audio with video according to claim 1, wherein the video analysis module has a character-based analysis handling a video dialogue according to a video dialogue and plot analysis, and processes the video dialogue to look for a storyline or delete a word of turn in speech, so as to clearly present a keyword and arrange the same according to dependency (or influence), and proportionally locate a corresponding emotional parameter on average.
5. The intelligent system for matching audio with video according to claim 1, wherein the music analysis module has a music property analysis for analyzing musical tone property, instrumental arrangement structure, rhythm, chord, chord progression, rhythm pitch, scale progression, style, music form, section, phrase, lyrical phrase, genre and other music file information.
6. The intelligent system for matching audio with video according to claim 1, wherein the music analysis module has an emotion analysis for recording an emotion parameter (x, y) at different time points of each song by means of machine training and intelligent learning according to musical content, wherein an x axis (Valence) of the emotional parameter shows a value of a positive emotion and a y axis (Arousal) of the emotional parameter shows an excitation level of a negative emotion.
7. The intelligent system for matching audio with video according to claim 1, wherein the music analysis module has music characteristic information derived from a singer, a music professional, album production personnel, single track production personnel, a record company, a media company, OP, SP, a regional organization, a copyright collective management organization, a copyright, a contractual relationship, a recorded music length, a style, a file location, an open region, a streaming link, a download link, a video link, a midi file, a way file and a mp3 file.
8. The intelligent system for matching audio with video according to claim 1, wherein an algorithm of the AI matching module includes: a filtering and selecting mode and a scoring mode and a editing mode.
9. The intelligent system for matching audio with video according to claim 8, wherein the filtering and selecting mode is within a range of standard deviation for normal distribution, so as to provide a criterion for whether to select or not, a value within a 68% confidence interval (within the error range of one standard deviation) is allowed, and a category of said filtering and selecting comprises a genre or an emotional parameter and the like.
10. The intelligent system for matching audio with video according to claim 8, wherein the scoring mode quantifies categories such as rhythm, instrument arrangement, chord, musical emotion (x, y), keyword emotion (x, y), director-input information, main video color tone, video content and the like, so as to calculate a score for each item for performing weighting and averaging.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0006]
[0007]
[0008]
[0009]
[0010]
[0011]
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0012] Referring to
[0013] The video analysis module 10 makes an analysis according to color tone, storyboard pace, video dialogue (such as a plot, a word of turn in speech and the like), length and category, director's special requirement and characteristic, actors expression, movement, weather, scene, buildings, spatial and temporal factors, things, creature, character, character personality;
[0014] video content analysis of the video analysis module 10 includes: a color analysis, a content analysis and a character expression analysis. Referring to
[0015] The music analysis module 20 makes an analysis according to recorded music form, sectional turn, style, genre, melody, tempo, instrument, chord accompaniment, voice type, rhythm, volume and emotional tension; a music analysis and content of the music analysis module 20 includes: a music property analysis, an emotion analysis and music characteristic information, wherein the music property analysis is related to an analysis of musical tone property, instrumental arrangement, music structure, rhythm, chord, chord progression, rhythm notes, pitch, scale progression, style, music form, section, phrase, lyrical phrase, genre and other music file information.
[0016] Referring to
[0017] Referring to
[0018] The present invention of the intelligent system for matching audio with video is characterized in: an AI matching module 30 for connecting to the video analysis module 10 and the music analysis module 20, so as to perform adequate matching between a video and a musical characteristic and recommend five songs for matching in practice; if the recommended songs are not satisfactory, new recommendations of other songs can be made for matching. The music editing module 40 is connected to the AI matching module 30, and the present invention can be used to impeccably match a time axis with an impact point between a music file and a video file by means of clip cutting and editing, music editing, music volume adjustment and sound field simulation. With regard to point-to-point matching of sound effects between the music editing module 40 and the music analysis module 20, in video data referred thereby, there can be more sound effects, so that an insertion point for a sound effect can be obtained by analyzing a waveform. The video data referred to by the AI matching module 30 trained by the present invention includes: YouTube-Movie, YouTube-movie clips and the like.
[0019] Referring to
[0020] A search for related keywords in a database page includes: a title, a genre, a style, a tempo, an instrument, a related keyword, an artist, an emotion, a cover photo and the like; an unique function of an audio signal is related to formats such as a mp3, a way format or mp3 format and the like; related authorization and an order are related to commercial behaviors such as an estimated order amount based on Loop, midi and music authorization, making an order, updating an order, downloading purchased music and the like.
[0021] An algorithm of the AI matching module 30 of the present invention includes:
[0022] a filtering and selecting mode and a scoring mode, wherein the filtering and selecting mode is within a range of standard deviation for normal distribution, so as to provide a criterion for whether to select or not, a value within a 68% confidence interval (within the error range of one standard deviation) is allowed, and a category of said filtering and selecting comprises a genre or an emotional parameter and the like. The scoring mode quantifies categories such as rhythm, instrument arrangement, chord, musical emotion (x, y), keyword emotion (x, y), director-input information, main video color tone, video content and the like, so as to calculate a score for each item for performing weighting and averaging.
[0023] In conclusion, the intelligent system for matching audio with video of the present invention, the AI matching module is mainly used to connect to the video analysis module and the music analysis module, so as to adequately match a video with a musical characteristic; after diverse logging in by a video company, selecting a video and reviewing by a director, as long as an API end point blockchain smart contract is established on the platform, a music professional, a video company and a media company are enabled to quickly complete matching audio with video.
[0024] It is of course to be understood that the embodiments described herein are merely illustrative of the principles of the invention and that a wide variety of modifications thereto may be effected by persons skilled in the art without departing from the spirit and scope of the invention as set forth in the following claims.