APPARATUS TO DETECT, ANALYZE, RECORD, AND DISPLAY AUDIO DATA, AND METHOD THEREOF
20180082606 · 2018-03-22
Inventors
Cpc classification
G10H2220/015
PHYSICS
G10H2230/155
PHYSICS
G10H2220/126
PHYSICS
G06F16/685
PHYSICS
G10H2210/066
PHYSICS
G10H1/0575
PHYSICS
G09B15/008
PHYSICS
G10H2210/385
PHYSICS
G09B15/006
PHYSICS
G10H2240/281
PHYSICS
G10H2220/051
PHYSICS
G09B15/003
PHYSICS
International classification
Abstract
An apparatus to detect, analyze, record, and display audio data, including an input unit to allow a user to input musical notes corresponding to the audio data, a processor to analyze the musical notes and to save the musical notes into a file, and a display unit to display notes corresponding to the musical notes on a virtual instrument.
Claims
1. An apparatus to detect, analyze, record, and display audio data, comprising: an input unit to allow a user to input musical notes corresponding to the audio data; a processor to analyze the musical notes and to save the musical notes into a file; and a display unit to display notes corresponding to the musical notes on a virtual instrument.
2. The apparatus of claim 1, wherein the display unit displays a virtual piano roll to allow the user to input the musical notes using the input unit.
3. The apparatus of claim 1, wherein the file is at least one of a midi file and a musicXML file.
4. An apparatus to allow a user to transcribe music from an audio file, the apparatus comprising: a processor to play music corresponding to the audio file; a display unit to display a visualization of the music as a playable visual audio track; and an input unit to allow a user to input musical notes to be displayed on the display unit corresponding to the music as a playable visual transcription track.
5. The apparatus of claim 4, wherein the processor changes the speed of the played music based on a preference of the user.
6. The apparatus of claim 5, wherein the processor plays the playable visual audio track and the playable visual transcription track simultaneously at an original speed.
7. The apparatus of claim 4, wherein musical notes may be input at a slower speed and a synchronization of the music played with respect to the playable audio track at a normal speed is maintained.
8. A method of detecting, analyzing, recording, and displaying audio data, the method comprising: analyzing data input into an input unit using a processor; saving the analyzed data; and displaying musical notes corresponding to the analyzed data on a virtual instrument.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0032] These and/or other features and utilities of the present general inventive concept will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0048] Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The embodiments are described below in order to explain the present general inventive concept while referring to the figures.
[0049]
[0050] The apparatus 100 may interact with a peripheral device 10, which may include at least a guitar, an electronic keyboard, a violin, any other type of electrically connectable or non-electrically-connectable instrument, an MP3 player, a CD player, a television, a mobile device, and a computer, but is not limited thereto. In other words, the peripheral device 10 may include any type of device that may be connected to the apparatus 100 via electrical, wired, wireless, or aural connection.
[0051] Referring to
[0052] The display unit 110 may include a screen to display pictures, videos, and programs thereon, and may include any type of visual displaying technology, including Cathode ray tube display (CRT), Light-emitting diode display (LED), Electroluminescent display (ELD), Electronic paper, E Ink, Plasma display panel (PDP), Liquid crystal display (LCD), High-Performance Addressing display (HPA), Thin-film transistor display (TFT), Organic light-emitting diode display (OLED), Surface-conduction electron-emitter display (SED) (experimental), Field emission display (FED) (experimental), Laser TV (forthcoming), Carbon nanotubes (experimental), Quantum dot display (experimental), Interferometric modulator display (IMOD), Digital microshutter display (DMS), and hologram, but is not limited thereto.
[0053] The input unit 120 may allow a user to input commands into the apparatus 100, and may include any type of inputting technology or combination thereof, including a Keyboard, an Image scanner, a Microphone, a Pointing device, a Graphics tablet, a Joystick, a Light pen, a Mouse, a Pointing stick, a Touchpad, a Touchscreen, a Trackball, a Midi player, and a webcam, but is not limited thereto.
[0054] The user may use the input unit 120 to open a software application to allow an audio file to be opened, converted, and displayed.
[0055] The input unit 120 may also be used by the user when the user is using the software application, to manually input notes corresponding to an audio file being played, in order to transcribe the notes. More specifically, the software application may allow the user to slow down the audio file being played, so that the user may use the input unit 120 to manually transcribe the notes being played.
[0056] The storage unit 130 may include various types of storage devices to store programs, files, and other data, including magnetic storage devices such as a Floppy diskette, a Hard drive, a Magnetic strip, a SuperDisk, a Tape cassette, and a Zip diskette, optical storage devices such as a Blu-Ray disc, a CD-ROM disc, a CD-R and CD-RW disc, and a DVD-R, DVD+R, DVD-RW, and DVD+RW disc, Flash memory devices such as a Jump drive or flash drive, a Memory card, a Memory stick, or an SSD, or Online storage such as cloud storage and network media, but is not limited thereto.
[0057] The processor 140 may perform various processing functions, including downloading files, software, and programs, running programs and software, opening files, playing files, storing data into the storage unit 130, interpreting information from other hardware in the apparatus 100, making appropriate services available to other parts of the apparatus 100, displaying user interfaces on the display 110, and interpreting the user input from the input unit 120.
[0058] The user may run a program or software that allows a media file such as an MP3 or MP4 to play on the apparatus 100. Furthermore, the processor 140 may run a program or software that allows the media file to be captured, interpreted, analyzed, converted, played back as a different type of file, and displayed as a different type of file on the display 110.
[0059] The microphone 150 may allow music and sounds to be input via sound-waves and/or audible sounds emanating from the peripheral device 10, such that the music and sounds may be sensed by the microphone 150 to allow software running on the processor 140 to interpret and/or record the music and sounds. More specifically, when the peripheral device 10 is a guitar, for example, the microphone 150 may pick up sounds of the strings as they are played, such that the processor 140 may record the sounds to be saved in an audio file. Also, the user may speak into the microphone 150 to allow the processor to record the user's voice.
[0060] The audio input/output unit connection unit 160 may allow the user to connect an audio jack into the apparatus 100 to allow for either capture or playback of audio. The audio input/output unit connection unit 160 may also allow the user to connect the peripheral device 10 to the apparatus 100.
[0061] The speaker 170 may allow the user to hear audio files or any other sounds playing on the apparatus 100. The speaker 170 may be provided singularly or in plurality, in order to output monaural or stereo sound.
[0062] The wireless connection unit 180 may support wired or wireless communications with peripheral devices using various wired or wireless technologies, including Universal Serial Bus (USB), Firewire, Bluetooth, ZigBee, Infrared Transmission, Radio Frequency Identification (RFID), Wireless LAN (WLAN) (IEEE 802.11, branded as Wi-Fi, and HiperLAN), Wireless Metropolitan Area Networks (WMAN) (LMDS, WiMAX, and HiperMAN), Wi-Fi Direct, Hotspots, Microwave, Satellite, cellular communication (including 3G, 4G, and beyond 4G), IrDA, TransferJet, Wireless USB, DSRC (Dedicated Short Range Communications), EnOcean, Near Field Communication (NFC), Personal area networks, and Ultra-wideband (UWB from WiMedia Alliance), but is not limited thereto.
[0063] For example, the apparatus 100 may use peer-to-peer technology, such as Bluetooth pairing, to connect the apparatus 100 wirelessly with the external peripheral device 10, such as an MP3 player, computer, mobile device, etc., such that the software running on the processor 140 may capture and detect music playing on the external peripheral device.
[0064] The processor 140 may analyze an audio file or audio captured from the peripheral device 10 by using a software application that may be downloaded and installed on the apparatus 100, to be run by the processor 140. The analysis may include extracting various components of an audio sample, converting the audio sample or the various components of the audio sample into a midi file or various midi files, and then displaying the elements of the midi file as notes on the display 110. More specifically, after the audio sample is converted to the midi file, the processor 140 uses the software application to display notes corresponding to various pitches output by the converted midi file. As a result, the notes output by the midi file may be displayed visually on the display 110 on a treble clef, a bass clef, a virtual guitar fret, a virtual piano keyboard, a virtual saxophone, a virtual flute, a virtual drum, and/or any other instrument supported by the software application.
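By way of a non-limiting illustration, the step of turning the pitches of a converted midi file into displayable notes may be sketched as follows. The function names `midi_to_name` and `clef_for` are hypothetical and do not appear in the disclosure. The sketch assumes the common convention in which MIDI note 60 is middle C (C4), and the choice to split the treble and bass clefs at middle C is likewise an assumption made only for this example.

```python
# Pitch classes in ascending semitone order, spelled with sharps.
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def midi_to_name(note: int) -> str:
    """Return a note name with octave for a MIDI note number (60 -> 'C4')."""
    if not 0 <= note <= 127:
        raise ValueError("MIDI note numbers range from 0 to 127")
    octave = note // 12 - 1  # assumes the convention that note 60 is C4
    return f"{NOTE_NAMES[note % 12]}{octave}"


def clef_for(note: int) -> str:
    """Pick a staff for display; splitting at middle C is an illustrative choice."""
    return "treble" if note >= 60 else "bass"
```

A display routine could call `midi_to_name` for the label of each note and `clef_for` to decide on which staff the note is drawn.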
[0065] Therefore, if notes are depicted on the virtual guitar fret displayed on the display unit 110, for example, the user may follow the notes as they are displayed or lit up, in order to learn how to play a guitar. In other words, the user can follow along with the notes displayed on the display 110, and simultaneously strum the guitar held by the user with proper chord fingering.
[0066]
[0067] Referring to
[0068] A unique feature in the transcription track 240 is that it provides a transcribed track that represents a portion of music, e.g., a guitar solo, which has been manually transcribed or recorded by the user, such that it may be heard as an overlay to the original track. Thus, as the song is being worked out and transcribed by the user, the user can record the transcription of the solo or different parts of the solo and begin from where he or she left off, if some time is needed between working out and completing the entire solo.
[0069] The transcription track 240 can process the particular instrument that the user wants to transcribe with, using either internal effects processing or a VST interface for commercial effects tools. Moreover, an equalizer (EQ) effect may be provided to process the original audio track so as to emphasize the frequency range of the instrument being transcribed (e.g., boosting the frequency range of a saxophone during a sax solo). In effect, the saxophone, for example, will be played back louder and more clearly distinguishable than the non-desired instruments in the background of the audio track 220. Moreover, Transcriber Track can slow down or speed up the audio track (and the transcribed track simultaneously and synchronously) so that the user can record the solo overlaid against the original track at a slowed-down speed, yet play it back at the original speed of the track. This allows a guitarist to transcribe solos of instruments that may be inherently difficult for a guitar to keep up with in real time (e.g., saxophone). The guitarist can play at a slower speed but have the result played back overlaid against the original track at the original speed. Using the transcribed track, the user can work out one or two bars of the solo at a time, come back to it a week later, and begin from where he or she left off. Once the transcribed track (or particular phrases of interest from the audio track) is completed, the user can use a pitch detection algorithm to convert the transcribed track to midi or musicXML. A tap tempo feature provides the reference timing from which the note timing is derived for the midi and/or musicXML conversion. The midi/musicXML conversion tool comes with correction editing tools to fine-tune the detected pitches. Once the transcribed track is converted to midi or musicXML, it can be imported into music notation software or played back using a guitar fretboard or piano display, so that the notes can be illustrated to the user in real time or at a slowed-down or accelerated speed using a musicXML player. Transcriber Track may also contain a musicXML player which displays the notes being played on a guitar fretboard, keyboard, or other instrument displays (saxophone display, flute display, etc.).
[0070] To reiterate, Transcriber Track is a transcription tool that provides both an audio track to be transcribed (meaning that it contains a solo or chords to be copied and/or written in notation) and a transcription track which contains the resulting transcription. Transcriber Track allows the user to work on the transcription incrementally, as the progress of the transcription is retained on the transcription track (which may be recorded corresponding to different regions or bars of the audio track at different sessions). As such, the user does not need to memorize the complete transcription, since the progress is retained in the recording. The transcription track can be recorded while playing against the audio track at the original speed (speed of 1) or at a slower speed (e.g., half speed, or 0.5). When the transcription track and the audio track are sped up to the original speed (1), the two tracks will remain synchronized, so the whole solo or individual solo phrases can be recorded at a slower speed. This allows users to work out solos from instruments (or artists) that are inherently faster than the user is able to play, but hear the solo played back at the original speed. For example, a guitarist can transcribe a sax solo from John Coltrane at half speed but hear it played back at the original track speed.
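One way the synchronization described above might be maintained is to store every recorded note in the time base of the original audio track rather than in wall-clock time. The following is a minimal sketch under that assumption; `RecordedNote`, `to_track_time`, and `region_start` are hypothetical names introduced only for illustration and are not taken from the disclosure.

```python
from dataclasses import dataclass


@dataclass
class RecordedNote:
    wall_start: float   # seconds after recording began, as experienced by the user
    wall_length: float  # seconds the note was held while recording


def to_track_time(note: RecordedNote, speed: float, region_start: float = 0.0):
    """Map a note recorded at `speed` onto the original track's timeline.

    While the audio track plays at `speed`, one wall-clock second covers
    `speed` seconds of the original track, so both the onset and the length
    of the note are scaled by the same factor.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    start = region_start + note.wall_start * speed
    return (start, note.wall_length * speed)
```

For example, a note played 4 seconds into a half-speed session that began at the 30-second mark of the song would be stored at 32 seconds with half of its recorded length, so that it lines up with the audio track when both are played back at a speed of 1.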
[0071] The audio track can be processed with an equalizer (EQ) to bring out the instrument (e.g., saxophone, guitar, etc.) or instruments that are being transcribed. More specifically, in order to obtain a proper sound quality of an instrument to be transcribed, an equalizer can change the pitch and sound quality of the output sound to emulate a particular instrument.
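As a non-limiting example of an equalizer that brings out one instrument, a single peaking band may be used to boost a chosen frequency range while leaving the rest of the audio track unchanged. The sketch below uses the widely published biquad peaking-filter formulas from Robert Bristow-Johnson's Audio EQ Cookbook; the disclosure does not specify a filter design, so this choice, the function names, and the example values (a 6 dB boost centered at 500 Hz) are all assumptions.

```python
import cmath
import math


def peaking_eq(center_hz, gain_db, q, sample_rate):
    """Return biquad coefficients (b, a) for a peaking band, with a[0] == 1."""
    amp = 10 ** (gain_db / 40)
    w0 = 2 * math.pi * center_hz / sample_rate
    alpha = math.sin(w0) / (2 * q)
    b = [1 + alpha * amp, -2 * math.cos(w0), 1 - alpha * amp]
    a = [1 + alpha / amp, -2 * math.cos(w0), 1 - alpha / amp]
    return [x / a[0] for x in b], [x / a[0] for x in a]


def gain_db_at(freq_hz, b, a, sample_rate):
    """Magnitude response of the filter at one frequency, in decibels."""
    z1 = cmath.exp(-1j * 2 * math.pi * freq_hz / sample_rate)  # this is z^-1
    h = (b[0] + b[1] * z1 + b[2] * z1 * z1) / (a[0] + a[1] * z1 + a[2] * z1 * z1)
    return 20 * math.log10(abs(h))


def apply_biquad(samples, b, a):
    """Filter a list of samples with the direct form I difference equation."""
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out
```

With these formulas the response at the center frequency equals the requested gain, and the response far below the band stays at 0 dB, which is the behavior wanted when emphasizing a solo instrument over the background.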
[0072] A set of effects (e.g., EQ, compression, reverb, damping, etc.) may be provided for the instrument recorded on the transcription track 240.
[0073] The transcribed track may be converted to midi and/or MusicXML using pitch detection algorithms.
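The disclosure does not name a particular pitch detection algorithm. As one possible illustration, the sketch below estimates the fundamental frequency of a short, monophonic frame by locating the peak of its autocorrelation, and then converts that frequency to the nearest MIDI note number using equal temperament with A4 = 440 Hz. The function names and the default search range of 80 Hz to 1000 Hz are assumptions chosen for this example.

```python
import math


def detect_pitch(samples, sample_rate, fmin=80.0, fmax=1000.0):
    """Estimate the fundamental frequency (Hz) of one frame, or None if silent.

    The lag with the largest autocorrelation inside the search range is
    taken as the period of the signal.
    """
    min_lag = max(1, int(sample_rate / fmax))
    max_lag = min(int(sample_rate / fmin), len(samples) - 1)
    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Correlate the frame with a copy of itself delayed by `lag` samples.
        score = sum(x * y for x, y in zip(samples, samples[lag:]))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else None


def freq_to_midi(freq_hz):
    """Nearest MIDI note number for a frequency, with A4 = 440 Hz = note 69."""
    return round(69 + 12 * math.log2(freq_hz / 440.0))
```

Because the period is resolved only to a whole number of samples, the estimated frequency is approximate; rounding to the nearest MIDI note absorbs small errors, and any remaining mistakes would be handled by the correction editing tools described above.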
[0074] A tap tempo feature is used to provide the reference timing from which the note timing is extracted. Tap tempo timing may be used since the audio track timing may vary over time or may vary with live recordings of music. Different sections or bars may be tapped at quarter-note intervals (or at a different timing) to provide the reference timing over that region. The musicXML file or midi file of the transcribed track can be displayed on a guitar fretboard, a piano, or any other instrument. The musicXML guitar fretboard (or other instrument) display can be used to retain the note sequences for those players who are learning to read music notation or would like to have that music documented in a tablature display.
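To illustrate how tapped reference timing might be turned into note timing, the sketch below treats each tap as one quarter-note beat and interpolates linearly between neighboring taps, so that a tempo that drifts during a live recording is followed tap by tap. The names `onset_to_beat` and `quantize`, and the sixteenth-note grid, are assumptions made for this example rather than features recited in the disclosure.

```python
from bisect import bisect_right


def onset_to_beat(onset, taps):
    """Convert a note onset (seconds) to a fractional beat index.

    `taps` is an ascending list of times (seconds) at which the user tapped
    successive quarter notes; beat 0 is the first tap.
    """
    if len(taps) < 2:
        raise ValueError("at least two taps are needed")
    # Find the tap interval containing the onset, clamped to the tapped region.
    i = min(max(bisect_right(taps, onset) - 1, 0), len(taps) - 2)
    span = taps[i + 1] - taps[i]
    return i + (onset - taps[i]) / span


def quantize(beat, grid=0.25):
    """Snap a beat position to the nearest grid step (0.25 = sixteenth note)."""
    return round(beat / grid) * grid
```

For instance, with taps at 0.0, 0.5, 1.0, 1.75, and 2.5 seconds (a tempo that slows after the third tap), an onset at 1.375 seconds falls halfway through the third beat interval and is therefore placed at beat 2.5.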
[0075] The user can market and sell solo transcriptions, using Transcriber Track to provide verification that the transcription is accurate. A forum can be established which provides both free and commercial solos that can be demonstrated with Transcriber Track.
[0076] As such, the present general inventive concept includes an apparatus providing an improved manual music transcription tool.
[0077]
[0078] Referring to
[0079] The user may choose to have the notes played at a slower speed or faster speed, based on the user's preference. More specifically, the notes may be played at a speed dictated by a tempo setting (in beats per minute). If an audio recording is played back, the source audio may not have a consistent beat per minute value, and in this case it may be played back as a fraction of the speed of the original recording, e.g. 70% of the recorded audio speed, or 50% of the recorded audio speed, or 200% of the recorded audio speed, etc.
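The two ways of choosing a playback speed described above may be reduced to a single rate value, as in the following sketch. The function names and keyword parameters are hypothetical; the sketch assumes that a tempo-based request is expressed as a ratio of a target tempo to a known source tempo, and that an audio recording without a consistent tempo is requested as a percentage of its recorded speed.

```python
def playback_rate(*, percent=None, target_bpm=None, source_bpm=None):
    """Return the playback rate, where 1.0 is the original speed."""
    if percent is not None:
        # Audio with no steady tempo: use a fraction of the recorded speed.
        rate = percent / 100
    elif target_bpm is not None and source_bpm is not None:
        # Notes with a known tempo: scale by the ratio of the two tempos.
        rate = target_bpm / source_bpm
    else:
        raise ValueError("give either percent or both tempo values")
    if rate <= 0:
        raise ValueError("playback rate must be positive")
    return rate


def stretched_duration(seconds, rate):
    """Wall-clock time needed to play `seconds` of material at `rate`."""
    return seconds / rate
```

Under these assumptions, a three-minute recording played at 50% of its recorded speed takes six minutes, and the same rate results from playing a 120 beats-per-minute part at a tempo setting of 60.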
[0080]
[0081]
[0082]
[0083]
[0084] Referring to
[0085]
[0086] Referring to
[0087]
[0088] Referring to
[0089]
[0090]
[0091] Referring to
[0092] Referring to
[0093] More specifically, as illustrated in
[0094]
[0095] Referring to
[0096] Each of the pitches in the piano roll 800 may be detected by a pitch detection algorithm after running the algorithm using the software including the transcription track 240, which may contain notes played by the user (that attempt to match the notes of a soloist playing music on the captured audio track). Referring to
[0097] Also, the transcription track 240 can now produce the virtual guitar fret, using musicXML for example, to allow the user to learn how to play the music that was manually transcribed by the user, on a guitar. More specifically, the virtual guitar fret may accept the musicXML produced by the transcription track 240 via an import function, in order to display the virtual guitar fret with the proper notes highlighted corresponding to the music transcribed on the transcription track 240 from the audio track 220.
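As a non-limiting sketch of such an import function, the pitches in a musicXML document may be read with Python's standard `xml.etree.ElementTree` module and mapped to the positions to be highlighted on a six-string guitar in standard tuning. The function names are hypothetical. The sketch assumes an un-namespaced musicXML file, treats the written pitch as the sounding pitch (ignoring any transposition), and limits the display to the first twelve frets; none of these choices is specified in the disclosure.

```python
import xml.etree.ElementTree as ET

# Semitone offsets of the natural notes above C.
STEP_TO_SEMITONE = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}

# Open-string MIDI notes in standard tuning, from string 6 (low E) to string 1.
OPEN_STRINGS = {6: 40, 5: 45, 4: 50, 3: 55, 2: 59, 1: 64}


def musicxml_pitches(xml_text):
    """Return MIDI note numbers for every <pitch> element, in document order."""
    root = ET.fromstring(xml_text)
    notes = []
    for pitch in root.iter("pitch"):
        step = pitch.findtext("step")
        alter = int(float(pitch.findtext("alter", default="0")))  # sharps/flats
        octave = int(pitch.findtext("octave"))
        notes.append(12 * (octave + 1) + STEP_TO_SEMITONE[step] + alter)
    return notes


def fret_positions(midi_note, max_fret=12):
    """List every (string, fret) pair that produces the note within max_fret."""
    return [
        (string, midi_note - open_note)
        for string, open_note in OPEN_STRINGS.items()
        if 0 <= midi_note - open_note <= max_fret
    ]
```

Since most notes can be played at more than one position, a display might highlight all candidate positions or apply a further rule (for example, preferring the position nearest the previous note) to select one.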
[0098]
[0099] Referring to
[0100] More specifically, the virtual saxophone 900 may be displayed with filled dots representing notes to be played by the user. The dots may be filled in real-time, or may be slowed down to allow the user to learn the music at a desired tempo.
[0101] Although
[0102] A Virtual Saxophone application may display a fingering chart of every note displayed from a musicXML or midi file as they occur in the corresponding music notation (which may also be extracted from the musicXML or midi file). In the example illustrated in
[0103]
[0104] Referring to
[0105] More specifically, the virtual drum 1000 may be displayed with various types of percussive instruments, including a snare drum, a kick drum, a cymbal, a large tom, a tom tom 1, and a tom tom 2, representing percussion to be played by the user. The various percussive instruments may be highlighted in real-time, or may be slowed down to allow the user to learn the music at a desired tempo.
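By way of illustration only, the highlighting of percussive instruments may be driven by a lookup from percussion note numbers to the drums shown on the display. The sketch below assumes that the percussion part is available as General MIDI percussion key numbers (for example, 36 for Bass Drum 1 and 38 for Acoustic Snare); the assignment of those keys to the drums named above, the function name `pads_at`, and the timing tolerance are assumptions made for this example.

```python
# General MIDI percussion keys grouped onto the drums named in the text.
# The grouping itself is an illustrative assumption.
GM_TO_PAD = {
    35: "kick drum", 36: "kick drum",    # Acoustic Bass Drum, Bass Drum 1
    38: "snare drum", 40: "snare drum",  # Acoustic Snare, Electric Snare
    41: "large tom", 43: "large tom",    # Low Floor Tom, High Floor Tom
    45: "tom tom 2", 47: "tom tom 2",    # Low Tom, Low-Mid Tom
    48: "tom tom 1", 50: "tom tom 1",    # Hi-Mid Tom, High Tom
    49: "cymbal", 51: "cymbal",          # Crash Cymbal 1, Ride Cymbal 1
}


def pads_at(events, pointer_beat, tolerance=0.05):
    """Return the set of drums to highlight at the given beat position.

    `events` is a list of (beat, key_number) pairs; keys without an entry
    in GM_TO_PAD are ignored.
    """
    return {
        GM_TO_PAD[key]
        for beat, key in events
        if key in GM_TO_PAD and abs(beat - pointer_beat) <= tolerance
    }
```

Calling `pads_at` with a pointer that advances at the chosen tempo yields the drums to highlight at each moment, whether the music is shown in real time or slowed down.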
[0106] The number and types of percussive instruments are not limited to
[0107] A Virtual Drum application may include the specific drums being struck in direct correspondence to the drum notation (derived from the musicXML or midi file). Highlighted drums may change in direct correspondence to the drum chart notation as a read pointer traverses the music notation. In the example illustrated in
[0108] Referring to
[0109]
[0110] Referring to
[0111] The mobile device 101 may be any type of device including a camera having still-recording and video-recording capabilities. For example, the mobile device 101 may be a camera, a video camera, a digital camera, a web camera, a mobile telephone, and a smartphone, but is not limited thereto.
[0112] The mobile device 101 may be connected to the apparatus 100 wired or wirelessly. For example, the mobile device 101 may be connected to the apparatus 100 via wired connections, including, but not limited to, USB, FIREWIRE, ETHERNET cable, etc.
[0113] Alternatively, the mobile device 101 may be connected to the apparatus 100 via wireless connections, including, but not limited to, ZIGBEE, Z-WAVE, BLUETOOTH, GSM, UMTS, LTE, WLAN, 802.11ac, IoT, RADAR, satellite, WAVEGUIDE, RFID, infrared (IR) wireless communication, Near-Field Communications (NFC), WIFI, WIFI-DIRECT, proximity communications, etc.
[0114] A user may use the mobile device 101 to video-record a person while the person is playing an instrument, such as a guitar. As such, the video-recording may be of a guitar player and an actual fingering of fingers of the guitar player, while the guitar player is playing a song on the guitar.
[0115] The video-recording may be stored directly onto a storage medium connected to the mobile device 101 as a video file of any type. The storage medium may include, but is not limited to, hard drives, cloud storage, memory cards, RAM, floppy disks, USB flash drives, memory sticks, tape cassettes, zip cassettes, computer hard drives, CDs and DVDs.
[0116] The video file may be transferred from the mobile device 101 to the apparatus 100 wired or wirelessly. Alternatively, the mobile device 101 may stream the video-recording to the apparatus 100 in real-time, such that the video-recording may be stored directly onto the storage unit 130 of the apparatus 100.
[0117] When the apparatus 100 receives the video file or the video-recording from the mobile device 101, the apparatus 100 may subsequently be used to manipulate the video file or the video-recording, hereinafter known as the video.
[0118] Referring to
[0119] The display unit 110 may also display the video in real-time along with the transcription track 240, when the transcription track 240 is played back with the corresponding transcribed notes. As such, the user may watch the video showing the actual fingering of the fingers of the guitar player playing the song on the guitar, while the transcription track 240 moves through the corresponding notes, thereby allowing the user to more easily learn how to play the song. Additionally, the user may benefit from watching specific fingering techniques of the guitar player, which are not visible without the video of the guitar player.
[0120] Furthermore, the video may be slowed down along with the transcription track 240 during playback, without affecting a pitch of the song, to make it easier for the user to learn how to properly play the song.
[0121] Also, the user may import other previously-recorded videos of musicians playing instruments, such that the apparatus 100 automatically captures corresponding notes played by the musicians, in order to allow for playback of the transcription track 240 and the previously-recorded videos of the musicians on the display unit 110 of the apparatus 100. In other words, the user may simply download music videos of their favorite bands from sites such as YOUTUBE, etc., and learn their favorite songs using the apparatus 100 and the corresponding transcription application running thereupon.
[0122] The present general inventive concept can also be embodied as computer-readable codes on a non-transitory computer-readable medium. The computer-readable medium can include a computer-readable recording medium and a computer-readable transmission medium. The computer-readable recording medium is any data storage device that can store data that can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices. The computer-readable recording medium can also be distributed over network coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion. The computer-readable transmission medium can transmit carrier waves or signals (e.g., wired or wireless data transmission through the Internet). Also, functional programs, codes, and code segments to accomplish the present general inventive concept can be easily construed by programmers skilled in the art to which the present general inventive concept pertains.
[0123] Although a few embodiments of the present general inventive concept have been shown and described, it will be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the general inventive concept, the scope of which is defined in the appended claims and their equivalents.