AUTOMATICALLY CONVERTING AND STORING OF INPUT AUDIO STREAM INTO AN INDEXED COLLECTION OF RHYTHMIC NODAL STRUCTURE, USING THE SAME FORMAT FOR MATCHING AND EFFECTIVE RETRIEVAL
20220019618 · 2022-01-20
Inventors
Cpc classification
G06F16/685
PHYSICS
G06F16/955
PHYSICS
International classification
G06F16/955
PHYSICS
Abstract
The present invention relates to a method of representing wave oscillations uniquely into machine readable data structure, and search technique using Symphonic quality of audio content as compared to lexicality of the audio content. An automatic computer processing acoustic search method for converting an input audio encoding of an utterance into an output that rhythmically harmonizes with a target song is disclosed.
Claims
1. An automatic computer processing acoustic search method for converting an input audio encoding of an utterance into an output that rhythmically harmonizes with a target song.
2. The automatic computer processing acoustic search method as claimed in claim 1, wherein the acoustic search method comprises a system in which content publisher or content producer or content owner can convert original content into Acoustic Node list and stores into Acoustic Node Map.
3. The automatic computer processing acoustic search method as claimed in claim 1, wherein the acoustic search method comprises a system in which content subscriber or content search client can search content which is converted into Acoustic Node list and searched within Acoustic Node Map to return matching source URL's redirecting client to the original content.
4. The automatic computer processing acoustic search method as claimed in claim 1, wherein the acoustic search method comprises acoustic nodes along with the content's “source URL” is stored as a HashMap object in the Acoustic Node Map.
5. The automatic computer processing acoustic search method claimed in claim 1, wherein the acoustic search method comprises a much effective version of creating a more reliable storage structure that slices the Audio source into regular rhythmic cycles representing one full wave oscillation in the spectrogram.
6. The automatic computer processing acoustic search method as claimed in claim 1, wherein a content search client can search the target song by melody as an input for searching for Audio content on internet.
7. The automatic computer processing acoustic search method as claimed in claim 1, wherein the wherein the acoustic search method converts the audio Wave oscillation as an “Acoustic Node”, which is a special representation of the node.
8. The automatic computer processing acoustic search method as claimed in claim 1, wherein the wherein the acoustic search method generates a Node Value equivalence using an algorithm that takes Node attributes as inputs assigning a value to each oscillation.
9. The automatic computer processing acoustic search method as claimed in claim 1, wherein the acoustic search method uses the effective indexing on Node Attributes and tinyurl properties such as source content locale for faster search.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0028] Other objects and advantages of the present invention will become apparent to those skilled in the art upon reading the following detailed description of the preferred embodiments, in conjunction with the accompanying drawings, wherein like reference numerals have been used to designate like elements, and wherein:
[0029]
[0030]
[0031] The drawings described herein are for illustration purposes only and are not intended to limit the scope of the present invention in any way.
DETAILED DESCRIPTION OF THE INVENTION
[0032] It is to be understood that the present disclosure is not limited in its application to the details of composition set forth in the following description. The present disclosure is capable of other embodiments and of being practiced or of being carried out in various ways. Also, it is to be understood that the phraseology and terminology used herein is for the purpose of description and should not be regarded as limiting.
[0033] The use of “including”, “comprising” or “having” and variations thereof herein is meant to encompass the items listed thereafter and equivalents thereof as well as additional items. The terms “a” and “an” herein do not denote a limitation of quantity, but rather denote the presence of at least one of the referenced items.
[0034] Each statement of an embodiment is to be considered independent of any other statement of an embodiment despite any use of similar or identical language characterizing each embodiment. The wording “one embodiment”, or the like, does not appear at the beginning of every sentence in the specification, is merely a convenience for the reader's clarity. However, it is the intention of this application to incorporate by reference the phrasing “an embodiment,” and the like, at the beginning of every sentence herein where logically possible and appropriate.
[0035]
[0036] According to one embodiment of the invention, the present invention discloses a search for audio song by melody as an input. No words or lexical equivalent input required for searching for Audio content on internet.
[0037] According to one embodiment of the invention, the present invention discloses a method to convert audio Wave oscillation as an “Acoustic Node”. “Acoustic Node” is a special representation.
[0038] According to one embodiment of the invention, the present invention discloses a method to generate Node Value equivalence using an algorithm that takes Node attributes as inputs assigning a value to each oscillation.
[0039] According to one embodiment of the invention, the only link between the repository of the present invention and content source owner who stores the information with the repository of the present invention is the url link. The url link is stored into the repository.
[0040] According to one embodiment of the invention, whenever there is match of the audio content of the content searcher with the content publisher, the url link is returned to the content searcher.
[0041]
[0042] According to one embodiment of the invention, one or more audio publishers upload the information on an audio source site, for example on a cloud server. The audio publisher uses the acoustic node service of the present invention to convert the audio content into a set of nodes, referred to as acoustic nodes. For example, in an audio song, the musical notes repeat for several times in the particular song. The entire musical note is converted into a node structure.
[0043] According to one embodiment of the invention, unique nodes within the audio song are captured and stored in the repository of the present invention. All the acoustic nodes are collected and are called as acoustic node collection. The acoustic node collection is mapped to an attribute called as “tinyurl”. The “tinyurl” of the present invention is a url to the original audio file uploaded by the audio publisher.
[0044] According to one embodiment of the invention, the audio file is an Indian song. In general an Indian song comprises 8 musical nodes. Any sound produced will be mapped with one of the 8 musical nodes. A song may have nodes repeated at various intervals of the song. However, only one unique musical node of the audio song is stored in the repository.
[0045] According to one embodiment of the invention, the sequence of the musical node and length of the complete wave of the musical node is stored as an attribute of that particular musical node.
[0046] According to one embodiment of the invention, the content owner of the audio files uses the search method of the present invention and converts the audio files into acoustic nodes. The acoustic nodes converted are saved on the repository system of the present invention. The repository system includes a link to the original audio file uploaded.
[0047] According to one embodiment of the invention, the audio search subscriber who wants to retrieve an audio file inputs the same musical structure of the original file by humming or with the help of any musical instrument. The input provided by the audio search subscriber is consider by the same node builder service to convert the tune of the audio search subscriber into the musical node structure. If there is a complete match between the audio file input with anyone of the musical node stored in the repository of the present invention, a tinyurl of the original audio file is provided to the audio search subscriber.
[0048] According to one embodiment of the invention, one or more tinyurl(s) is/are provided to the audio search subscriber even when there is reasonable amount of match of the audio file input with the musical node stored in the repository.
[0049] According to one embodiment of the invention, the audio stream converted into the “Acoustic Node” structure proposed by this invention, can then be sent to matching engine that returns the tinyurl's depending on relevant matching score. A matching score here would be the percentage conformance in Node equivalence value between input search string and various nodes indexed in the “Acoustic Node Map” associated with their respective tinyurls.
[0050] The table provided below provides a use case with detailed steps according to an embodiment of the invention
TABLE-US-00001 Scenario Persona(s) Description Use Case - Steps CONTENT_PUBLISHER Content Owner converts 1. Owner Registers with Acoustic Search service CONTENT_PRODUCER original content into 2. Owner provides source content as audio file CONTENT_OWNER Acoustic Node list and 3. Wave patterns are identified within the audio stores into Acoustic 4. Each unique wave is converted into Acoustic Node having Node Map. below node attributes. a. Acoustic node Value b. Note Sequence String c. Node length d. Node Recurrence count e. Custom Attributes (for future enhancements) 5. These Acoustic nodes along with the content's “source URL” are stored as a HashMap object in the Acoustic Node Map. CONTENT_SUBSCRIBER Search content is 1. Client registers with Acoustic Search service CONTENT_SEARCH_CLIENT converted into Acoustic 2. Client submits search content as an audio file along with Node list and searched expected matching percentage. within Acoustic Node 3. Wave patterns are identified within submitted audio Map to return matching 4. Each wave is converted into Acoustic nodes having source URL's attributes as shown in above scenario. redirecting client to the 5. This node list is sent to matching. original content. 6. Match logic searches within “Acoustic Node Map” and returns content Source URLs matching with submitted search audio, having match score better than requested percentage. 7. Client navigates to the returned URLs and downloads audio content from the original producer.
[0051] According to one embodiment of the invention, the present invention discloses a Search content which is converted into Acoustic Node list and searched within Acoustic Node Map to return matching source URL's redirecting client to the original content.
[0052] According to one embodiment of the invention, the present invention discloses a system in which a content publisher or content producer or content owners converts the original content into Acoustic Node list and stores into the Acoustic nodes along with the content's “source URL” and the “source URL” are stored as a HashMap object in the Acoustic Node Map.
[0053] According to one embodiment of the invention, the present invention discloses a system in which a content publisher or content producer or content owners Registers with Acoustic Search service can provides source content as audio file and the Wave patterns are identified within the audio file, Each unique wave is converted into Acoustic Node having below node attributes. [0054] a. Acoustic node Value [0055] b. Note Sequence String [0056] c. Node length [0057] d. Node Recurrence count [0058] e. Custom Attributes (for future enhancements)
[0059] These Acoustic nodes along with the content's “source URL” are stored as a HashMap object in the Acoustic Node Map.
[0060] According to one embodiment of the invention, the present invention discloses a system in which a content subscriber or content search client can search the original contents within Acoustic Node list and Acoustic Node Map via matching source URL's which redirecting client to the original content.
[0061] According to one embodiment of the invention, the present invention discloses a system in which a content subscriber or content search client registers with Acoustic Search service can easily submits search content as an audio file along with expected matching percentage the wave patterns are identified within submitted audio and the each wave is converted into Acoustic nodes having attributes as shown in above scenario this node list is sent to matching and the Match logic searches within “Acoustic Node Map” and returns content Source URLs matching with submitted search audio, having match score better than requested percentage and the Client navigates to the returned URLs and downloads audio content from the original producer.
[0062] According to one embodiment of the invention, the present invention discloses use of effective indexing on Node Attributes and tinyurl properties such as source content locale for faster search.
[0063] According to one embodiment of the invention, the present invention identifies matching audio patterns more effectively and accurately. The present invention adds value in music industry and entertainment software as well as forensic and defense departments, to identify matching audio patterns more effectively and accurately.
[0064] According to one embodiment of the invention, the present invention can also be used to store and match the sounds produced in nature such as seismographs, cosmic vibrations, Meteorological audio recordings with higher accuracy and build Machine learning intelligence on top to predict the actual input samples to the historical events stored in the database generating useful observations.
[0065] It should be noted that in some embodiments, the user may select and reselect from a library of phrase templates for different target songs, performances, performers, styles, etc.
[0066] According to one embodiment of the invention, the fundamental frequency or pitch of a speech changes continuously, but generally does not sound like a musical melody. Typically, the change is too small, fast or infrequent to sound like a musical melody. Pitch changes occur for a variety of reasons, including sound generation methods and speaker emotional states, and indicate phrase endings or questions and unique parts of the tone language.
[0067] According to one embodiment of the invention, speech encoding of speech segments) is pitch corrected according to a timbre sequence or melody score.
[0068] According to one embodiment of the invention, a desirable attribute of the implemented speech-melody (S2M) transformation is that the speech sounds clearly like a musical melody but remains clearly understandable.
[0069] According to one embodiment of the invention, a rhythm pattern is defined, generated, or searched. It should be noted that in some embodiments, the user may select and reselect from a library of rhythm skeletons for different target raps, performances, performers, styles, etc. In some embodiments, the rhythm pattern is represented as a series of impulses at a particular time position.
[0070] According to one embodiment of the invention, more complex patterns of audio inputs can also be defined. Some embodiments in accordance with the present invention (s) can be executed one after the other in a computer system (such as an iPhone handheld, mobile device or portable computing device) to implement the methods described herein. In the form of a computer program product encoded in a machine-readable medium as a sequence of software instructions and other functional configurations tangibly embodied in a temporary medium and/or provided as a computer program product
[0071] Although the invention (s) has been described in connection with various embodiments, these embodiments are illustrative and the scope of the invention is limited thereto. Many variations, modifications, additions and improvements are possible.
[0072] It will be recognized that the above described subject matter may be embodied in other specific forms without departing from the spirit or essential characteristics of the disclosure.