Multi-channel data sonification system with partitioned timbre spaces including periodic modulation techniques
10365890 ยท 2019-07-30
Assignee
Inventors
Cpc classification
G10H2250/471
PHYSICS
G10H2220/351
PHYSICS
G06F3/167
PHYSICS
G10H1/06
PHYSICS
International classification
Abstract
A data sonification system for representing a plurality of channels of numerical information is described wherein the rate of the periodic modulation is itself used as a vehicle for carrying information of at least one component of the multidimensional numerical data. The data sonification system includes a plurality of audio waveform generator elements. Each of the audio waveform generator elements generates an associated audio frequency waveform. Each audio frequency waveform has an audio frequency parameter and at least one timbre modulation parameter having a settable value. The timbre modulation parameter affects the timbre of the audio waveform. The data sonification system includes a mapping element for associating aspects of multidimensional numerical data with the timbre modulation parameter of each audio frequency waveform. The mapping element sets the value of the timbre modulation parameter in response to multidimensional numerical data and/or periodic modulation.
Claims
1. A data sonification system for representing a plurality of channels of numerical information via a plurality of corresponding discernable variations of at least one of a plurality of audio-frequency waveforms, the at least one of the plurality of the audio-frequency waveforms being perceivable by a user as comprising a plurality of audio tones having at least one corresponding discernable timbre attribute, the data sonification system comprising: a plurality of audio-frequency waveform generators, each of the plurality of audio-frequency waveform generators generating an associated one of the plurality of the audio-frequency waveforms, wherein each of the associated ones of the plurality of the audio-frequency waveforms comprises an audio-frequency and at least one adjustable timbre control parameter, each of the at least one of the adjustable timbre control parameter having an associated adjustable value which can be discernibly varied within a timbre space occupied by the plurality of the audio-frequency waveforms; and a mapping element for mapping aspects of multidimensional numerical data with the at least one of the adjustable timbre control parameters of each of the associated ones of the plurality of audio-frequency waveforms, wherein for the associated ones of the plurality of the audio-frequency waveforms the mapping element adjusts a value of a periodic modulation of the at least one of the adjustable timbre control parameters responsive to values of the multidimensional numerical data, wherein the rate of the periodic modulation is itself used as a vehicle for carrying information of at least one component of the multidimensional numerical data, wherein the timbre of each of the other the audio-frequency waveform carries information responsive to values of multidimensional numerical data for presentation to a user, and wherein the associated ones of the plurality of the audio-frequency waveforms and their respective adjustable timbres are arranged to provide a plurality of data-modulated sound timbre classes within the timbre space.
2. The data sonification system of claim 1 wherein the at least one of the plurality of the audio-frequency waveforms comprises a plurality of timbre control parameters, each of the plurality of the timbre control parameters affecting the timbre of the at least one of the plurality of the audio-frequency waveforms; and wherein the mapping element adjusts the value of each of the plurality of the timbre control parameters responsive to values of the multidimensional numerical data.
3. The data sonification system of claim 1, wherein the sound timbre classes can be arranged to convey a sonic metaphor.
4. The data sonification system of claim 1, wherein the values of the multidimensional numerical data are sequentially selected by the user from the multidimensional numerical data, the values sequentially selected over an interval of time.
5. The data sonification system of claim 1, wherein the values of the multidimensional numerical data are sequentially selected according to a trajectory selected through a subset of the multidimensional numerical data.
6. The data sonification system of claim 1, wherein the values of the multidimensional numerical data are sequentially selected responsive to a position on a trajectory specified by the user through a subset of the multidimensional numerical data, the trajectory rendered in a data visualization.
7. The data sonification system of claim 1, wherein the values of the multidimensional numerical data are sequentially selected responsive to a position of a cursor within a rendered data visualization.
8. The data sonification system of claim 1, wherein the multidimensional numerical data is retrieved from storage.
9. The data sonification system of claim 1, wherein the multidimensional numerical data is provided by a live real-time data stream.
10. The data sonification system of claim 1, wherein the multidimensional numerical data is created from live sensor data.
11. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving the imposing of indexing on selected data.
12. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving data suppression.
13. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving data value normalization.
14. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving linear transformations.
15. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving affine transformations.
16. The data sonification system of claim 1, wherein the mapping comprises pre-sonification signal processing filtering operations on selected data.
17. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving nonlinear transformations.
18. The data sonification system of claim 1, wherein the mapping comprises pre-sonification operations involving linear predictor operations.
19. The data sonification system of claim 1, wherein the at least one timbre control parameter controls a width of a pulse waveform.
20. A data sonification system for representing a plurality of channels of numerical information via a plurality of corresponding discernable variations of at least one of a plurality of audio-frequency waveforms, the at least one of the plurality of the audio-frequency waveforms being perceivable by a user as comprising a plurality of audio tones having at least one corresponding discernable timbre attribute, the data sonification system comprising: a plurality of audio-frequency waveform generators, each of the plurality of audio-frequency waveform generators generating an associated one of the plurality of the audio-frequency waveforms, wherein each of the associated ones of the plurality of the audio-frequency waveforms comprises an audio-frequency and at least one adjustable timbre control parameter, each of the at least one of the adjustable timbre control parameter having an associated adjustable value which can be discernibly varied within a timbre space occupied by the plurality of the audio-frequency waveforms; and a mapping element for mapping aspects of multidimensional numerical data with the at least one of the adjustable timbre control parameters of each of the associated ones of the plurality of the audio-frequency waveforms, wherein for the associated ones of the plurality of the audio-frequency waveforms the mapping element adjusts a value of the at least one of the adjustable timbre control parameters responsive to values of the multidimensional numerical data, wherein for the plurality of the audio-frequency waveforms the timbre of the associated ones of the audio-frequency waveform carries information responsive to values of underlying data for presentation to a user, wherein the associated ones of the plurality of the audio-frequency waveforms and their respective adjustable timbres are arranged to provide a plurality of data-modulated sound timbre classes within the timbre space, wherein the pitch of at least one of the audio-frequency waveforms is varied periodically, and wherein the rate of the periodic variation in pitch is itself used as a vehicle for carrying information of at least one component of the multidimensional numerical data.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The above and other aspects, features, and advantages of the present invention will become more apparent upon consideration of the following description of preferred embodiments, taken in conjunction with the accompanying drawing figures.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
DETAILED DESCRIPTION
(24) In the following detailed description, reference is made to the accompanying drawing figures which form a part hereof, and which show by way of illustration specific embodiments of the invention. It is to be understood by those of ordinary skill in this technological field that other embodiments can be utilized, and structural, electrical, as well as procedural changes can be made without departing from the scope of the present invention. Wherever possible, the same element reference numbers will be used throughout the drawings to refer to the same or similar parts.
(25) Computer-generated data visualization has been actively used to study complex data for decades. Data visualization uses parameterize visual primitives, spatial geometry, time, and other elements to convey numerical or logical data to a user. In the mid 1980's data visualization, particularly as used in scientific computing, became extremely sophisticated. Remnants of this are manifest in medical imaging system, instrument control consoles, Geographic Information Systems (GIS), and more recently in newly emergent applications for representing business data (for example employing BIRT).
(26) Data sonification has an interesting but not so successful history. As described earlier, the present invention reopens the possibilities for data sonification with controlled sound rendering approaches allowing for multiple simultaneous channels of information-carrying utilizing at least the timbre of one or more parameterized audio waveforms.
(27) Before delving into the details of the latter, attention is first directed to a framework construction that sets aside many historic notions of and approaches to data sonification (for example, using piano tones of various pitches to sound out stock and bond market data, computer-event alerts, etc.) and instead treats data sonification as a peer to data visualization.
(28) Comparing Data Visualization and Data Sonification
(29)
(30) In listening to a sound that carries information, time must be taken to ascertain the sound's characteristics and what they are configured to convey. However, this is equally true for looking at graphic that carries information: hereto time must be taken to ascertain the graphic's characteristics and what they are configured to convey.
(31) Similarly,
(32)
(33)
(34) More specifically as to the last item,
(35) There are other examples, some equally important as those cited here. However non-ideal and intermingled these may seem, there are comparable limitations in the visual channel. However, well-defined rules and metrics have been devised long ago to avoid visual channel overloading, distortion, etc., and these well-defined rules and metrics are commonly accepted in the practice of fine-art, graphic arts, photography, and other fields including data visualization. In a similar manner, well-defined rules and metrics can be assembled, devised, and refined to facilitate useful multi-channel data sonification. Several aspects of the present invention relate to this in one way or another as will be seen.
(36) Although data sonification can be used by itself, data sonification can also be used in conjunction with data visualization. Referring again to
(37) In accordance with the above and other possibilities and opportunities,
(38) Information Carrying Vehicles for Data Sonification
(39) Attention is now directed toward modulating parameterized periodic waveforms with low bandwidth signals (most of the spectral energy below the lowest pitch that can be heard). First to be considered is pulse width modulation, and in particular the creating of pulse width modulated waveforms from various source periodic waveforms.
(40)
(41)
(42)
(43) As taught in U.S. patent application Ser. No. 12/144,480 entitled Variable Pulse-Width Modulation with Zero Constant DC Component in Each Period, the pulse-modulated waveforms of
(44)
(45) In
(46)
The pulse width is:
(47)
and the duty cycle is:
(48)
(49) As the reference signal Rc is raised (c approaches 1 or Rc approaches R), the region of the pulse wave with the amplitude of A.sub.max gets narrower, and the regions of the pulse wave with the amplitude 0 get wider. Similarly, as the reference signal Rc is lowered (c approaches 0), the region of the pulse wave with the amplitude of A.sub.ma gets wider, and the regions of the pulse with the amplitude 0 get narrower. Also, because of the symmetry of the sine wave is similar to that in the triangle wave of
(50) Should pulse-width modulation be used, it can be advantageous to use zero-DC pulse-width modulation as taught in U.S. patent application Ser. No. 12/144,480 entitled Variable Pulse-Width Modulation with Zero Constant DC Component in Each Period, particularly if many such pulse waveforms are summed together.
(51) Sonification Sound-Field Audio Rendering
(52)
(53)
(54) Timbre Spaces and Sonic Design
(55) The timbre of sounds is often mentioned in sonification, but aside from leveraging varieties of timbral qualities of traditional musical instruments, the above work provides little in the way of systemic use of differentiated perception of the timbre of sounds as a tool for sonification in general (and, as discussed later, multi-channel sonification in particular). Notable exceptions to this include [1] based on the notion of Grey's timbre space abstraction [2]. Timbre space has been further formalized in a way that characterizes it close to that of a conventional linear vector space, including notions of distance [3-4] and its use as a control structure [5]. This work typically again expresses varieties of timbral qualities in terms of those of traditional musical instruments, although [4] and (with regards to timbre as an abstraction) [6] include and describe the synthesis of sound timbre.
(56)
(57) The features described thus far can readily be extended to clusters of two or more separately perceived sonification tones, each tone carrying its own set of information.
(58) Other collections of audio signals also occupy well-separated partitions within an associated timbre space. A more sophisticated example of a partitioned timbre space technique also providing a partitioned spectral space is the system and method of U.S. Pat. No. 6,849,795 entitled Controllable Frequency-Reducing Cross-Product Chain. The harmonic spectral partition of the multiple cross-product outputs do not overlap.
(59) Through proper sonic design, each timbre space coordinate may support several partition boundaries, as suggested in
(60) Exemplary Pre-Sonification Operations
(61) Attention is now directed to consideration of pre-sonification operations.
(62) Data sonification (and data visualization) can be made far more powerful if one or more mappings of data can be shifted, scaled, or warped with nonlinearities (such as logarithmic, exponential, or power-law functions). Such functionality can be combined with indexing and sorting functions, as well as provisions for updating underlying datasets with new measurements, trial synthesized data, and/or live sensor feeds.
(63)
(64)
(65) Two or more of these functions may occur in various orders as may be advantageous or required for an application and produce modified data. Aspects of these functions and/or order of operations may be controlled by a user interface or other source, including an automated data formatting element or an analytic model. The invention further provides that updates are provided to a native data set.
(66) The invention also provides for other types of pre-sonification operations. Statistical operations and statistical processing functions can be used as pre-sonification operations, and for linking to external programs to perform other types of pre-sonification operations. External programs can be added to the collection of available pre-sonification operations.
(67)
(68) Sonification Time-Index Handling
(69) In some situations, the data in a dataset to be sonified is defined against an intrinsic or inherent time-line. The sonification rendering in some cases may be performed at the natural time scale, at a speeded or slowed time-scale, in real-time, or in artificially controlled time (as with a shuttle wheel control, animation loop, etc.). In other circumstances the time-line may be artificially created from components of the data (for example, time may signify travel in distance, increase in temperature, etc.). Additional variations on these capabilities include the creation and use of artificial trajectories, such as the path through the Belmont urban wetland slough depicted in
(70) Rates of change of sound parameters can easily be even more of a concern in multi-channel and multiple-perceived-tone sonification. Due to the intertwined 20 Hz lowest perceived frequency and 50 msec time correlation window [64] of auditory perception, temporal variation of timbre, pitch, and amplitude attributes at periods/rates notably less than 50 msec/20 Hz are perceived as a change in these attributes, while temporal variation of timbre, pitch, and amplitude attributes at rates notably more than 50 msec/20 Hz are perceived as quality of timbre of the tone as was illustrated in
(71) As an example, the three dimensions of the timbre space may still represent salinity, temperature, and turbidity and each of the (two or four) separate sources represent different water depths or differing transverse locations across the water surface. Although the same three-parameter tone described in conjunction with
(72) The invention provides for each of these considerations, as well as far more sophisticated and varied tone classes than the one described in the examples of
(73) Use of Metaphors
(74) Accordingly, the invention additionally provides for the inclusion and use of visual metaphors to simplify sonification setup and user interaction for data exploration. As an example,
(75) Use of Multidimensional User Interfaces
(76) As mentioned above, the invention provides for the support, inclusion, and use of multidimensional user interface devices for providing extra control parameters, 3D-geometry control and metaphors, 6D-geometry control and metaphors, etc. Such multidimensional user interface devices can include a High-Definition Touchpad that taught in U.S. Pat. No. 6,570,078, and U.S. patent application Ser. Nos. 11/761,978 and 12/418,605, advanced computer mice taught in U.S. Pat. No. 7,557,797 and U.S. patent application Ser. No. 10/806,694, video cameras taught in U.S. Pat. No. 6,570,078, or other types of touch, control-based, or visually operated user interfaces. The invention provides for the incorporation of and use of multidimensional user interface devices in interacting with data visualization and/or data sonification environments, either stand alone or in collaborative environments.
(77) Use of Data Flow Paths to Implement Arbitrary Interconnection Topologies
(78) The invention provides for the use of data flow paths to link arbitrary data sources with arbitrary data destinations via arbitrary topologies. This allows the selection and/or fusion of data sources, their interconnection with selected signal processing, statistical processing, pre-sonification operations, and sonification parameters.
(79)
(80) The invention provides for some or all of the data flow paths (such as a graphical diagrammic depiction of the arrangement of
(81) Shared Data Sonification and Data Visualization Environments
(82)
(83) The example in the section to follow shows how this arrangement can be useful in an application.
(84) In situations where there is a natural or artificial timeline, the invention provides for synchronization between data sonification rendering and presentation and the data visualization rendering and presentation.
(85) Use of GIS and Data Visualizations as Interactive User Interface for Data Sonification
(86)
(87)
(88) The visual plot or sonification can render representations of one or more data values associated with a selected point selected by a cursor a cursor (shown as a small black box on the curved arrow line) on a flow path (curved arrow line), or as a function of time as a cursor (shown as a small black box on the curved arrow line) moves along the flow path at a specified rate.
(89) The system can visually display this data or use the data to produce a sonification.
(90) The sonification may render sounds according to a selected point on the flow path, or as a function of time as a cursor moves along the flow path at a specified rate. For example, the system can produce a trajectory in sonification parameter (timbre) space such as that depicted in
(91) An embodiment of the invention can overlay visual plot items or portions of data, geometrically position the display of items or portions of data, and/or use data to produce one or more sonification renderings. For example, a sonification environment may render sounds according to a selected point on the flow path, or as a function of time as a cursor moves along the surface water flow path at a specified rate.
(92) Use of Auditory Perception Eigenfunctions
(93) The invention provides for sonifications employing auditory perception eigenfunctions in the production of the data-manipulated sound. As taught in that provisional patent application, these auditory perception eigenfunctions are eigenfunctions (within a Hilbert space) for an operator equation defined by three of the most fundamental empirical attributes of human hearing: the approximate 20 Hz-20 KHz frequency range of auditory perception [7]; the approximate 50 msec temporal-correlation window of auditory perception (for example time constant in [8]; the approximate wide-range linearity (modulo post-summing logarithmic amplitude perception, nonlinearity explanations of beat frequencies, etc.) when several signals are superimposed [7,8].
(94) The audio perception eigenfunctions can be related to the integral equation whose eigenfunctions are the Prolate Spheroidal Wave Functions (PSWFs, also known more recently as Slepian functions) [9]. The integral equation for the audio eigenfunctions stems from a (typically smoothed) time-domain gate function and a (typically smoothed) frequency-domain bandpass function; in comparison, the integral equation whose eigenfunctions are the PSWFs stems from an (abrupt) time-domain gate function and an (abrupt) frequency-domain lowpass function. As the auditory perception eigenfunctions are, by its very nature, defined by the interplay of time limiting and band-pass phenomena, it is possible the Hilbert space model eigensystem may provide important new information regarding the boundaries of temporal variation and perceived frequency (for example as may occur in rapidly spoken languages, tonal languages, vowel guide [10-12], auditory roughness [8], etc.), as well as empirical formulations (such as critical band theory, phantom fundamental, pitch/loudness curves, etc.) [7,8].
(95) While the invention has been described in detail with reference to disclosed embodiments, various modifications within the scope of the invention will be apparent to those of ordinary skill in this technological field. It is to be appreciated that features described with respect to one embodiment typically can be applied to other embodiments. Audio eigenfunctions are taught in the inventor's copending U.S. patent application Ser. No. 12/849,013.
(96) The invention can be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Therefore, the invention properly is to be construed with reference to the claims.
REFERENCES
(97) [1] Barrass, S.; A perceptual framework for the auditory display of scientific data, ACM Transactions on Applied Perception, Vol. 2 No. 4, October 2005, pp. 389-402. [2] Grey, J. M., An exploration of musical timbre, Ph.D. dissertation, Dept. of Music Report No. STAN-M-2, Stanford University, Calif., 1975. [3] Terasawa, H.; Slaney, M.; Berger, J., Perceptual Distance in Timbre Space, ICADO5 Proceedings, 2005. [4] Drioli, C.; Polotti.; Delle Monache, S.; Rocchesso, D.; Adiloglu, K.; Annies, R.; Obermayer, K.; Auditory Representations as Landmarks in the Sound Design Space, Proc. SMC09, Porto, July 2009. [5] Wessel, D., Timbre Space as a Musical Control Structure, Rapport Ircam 12/78, 1978 http://articles.ircam.fr/textes/Wessel78a/ (visited Nov. 27, 2009). [6] Scaletti, C., Sound synthesis algorithms for auditory data representations, Auditory Display: Sonification, Audification, and Auditory Interfaces. G. Kramer, (Ed.), Santa Fe Institute Studies in the Sciences of Complexity, Proc. Vol. XVIII. Addison-Wesley, Reading, Mass. [7] Winckel, F., Music, Sound and Sensation: A Modern Exposition, Dover Publications, 1967. [8] Zwicker, E.; Fastl, H., Psychoacoustics: Facts and Models, Springer, 2006. [9] Slepian, D.; Pollak, H., Prolate Spheroidal Wave Functions, Fourier Analysis and UncertaintyI, The Bell Systems Technical Journal, pp. 43-63, January 1960. [10] Rosenthall, S., Vowel/Glide Alternation in a Theory of Constraint Interaction (Outstanding Dissertations in Linguistics), Routledge, 1997. [11] Zhang, J., The Effects of Duration and Sonority on Contour Tone Distribution: A Typological Survey and Formal Analysis (Outstanding Dissertations in Linguistics), Routledge, 2002. [12] Rosner, B.; Pickering, J., Vowel Perception and Production (Oxford Psychology Series), Oxford University Press, 1994.