SYSTEMS AND METHODS FOR ENHANCING AUDIO IN VARIED ENVIRONMENTS
20230169989 · 2023-06-01
Assignee
Inventors
- Daniel Paul Darcy (San Francisco, CA, US)
- Xuemei YU (San Jose, CA, US)
- Clara Y-Phuong VO (San Jose, CA, US)
- Stewart MURRIE (San Francisco, CA, US)
- Libin LUO (Beijing, CN)
Cpc classification
G06F3/04847
PHYSICS
G10L21/02
PHYSICS
International classification
Abstract
Novel methods and systems for creating and using user profiles for dialog boost and sound equalizer adjustment to compensate for various ambient sound situations. For creating profiles when not at the ambient sound, a synthesized/pre-recorded ambient noise can be mixed with the media to simulate the noise conditions.
Claims
1. A method of configuring a mobile device for use with ambient noise for a user of the mobile device, the method comprising: receiving, from the user, a case identification of the ambient noise; receiving, from the user, a noise level for the ambient noise; receiving, from the user, a dialog boost level for the ambient noise at the noise level; receiving, from the user, a graphic equalizer setting for the ambient noise at the noise level; playing, from the mobile device, a sample audio for the user while the user sets the dialog boost level and the graphic equalizer setting; and storing on the mobile device the dialog boost level and graphic equalizer setting for the case identification at the noise level in a profile, wherein the device is configured to play audio media using the dialog boost level and graphic equalizer setting when the profile is selected by the user.
2. The method of claim 1, further comprising: simulating the ambient noise at the noise level; and mixing the simulated ambient noise with the sample audio prior to the playing the sample audio.
3. The method of claim 2, wherein the simulating includes retrieving a stored pre-recorded ambient noise from a memory.
4. The method of claim 3, wherein the stored pre-recorded ambient noise is in binaural format.
5. The method of claim 1, further comprising: presenting, on the mobile device, a graphical user interface control for setting the case identification, the noise level, the dialog boost level, and the graphic equalizer setting.
6. The method of claim 1, wherein the profile corresponds to both the case identification and the noise level.
7. A method of adjusting audio for a mobile device for a user, the method comprising: receiving a profile selection from the user wherein the profile selection relates at least to an ambient noise condition; receiving a noise level of the ambient noise condition from the user; retrieving a dialog boost level and a graphic equalizer setting from memory on the mobile device; and adjusting levels of the audio using the dialog boost level and the graphic equalizer setting.
8. The method of claim 7, further comprising presenting, on the mobile device, a graphical user interface control for selecting a profile corresponding to an ambient noise condition.
9. A device configured to run in software or firmware the method of claim 1.
10. A non-transient computer-readable medium that, when read by a computer, instructs the computer to perform the method of claim 1.
11. The device of claim 9, wherein the device is a phone.
12. The device of claim 9, wherein the device is at least one of: a cell phone, a laptop computer, a tablet computer, a mobile game system, a wearable device, and a small media player.
13. The device of claim 9, wherein the software or firmware is part of the operating system of the device.
14. The device of claim 9, wherein the software or firmware runs a stand-alone program on the device.
15. The method of claim 1, wherein the method is performed by an operating system of mobile device.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
DETAILED DESCRIPTION
[0019] A solution to the problem of providing intelligible speech in media playback (audio or audio/visual) in a noisy environment (ambient noise) is described herein by creating and using dialog boost and equalizer settings in a profile for a particular user in a particular noise level and type (environment type).
[0020] The term “mobile device” as used herein refers to a device capable of audio playback and capable to be used in multiple locations by being carried on the person of the user. Examples include cell phones, laptop computers, tablet computers, mobile game systems, wearable devices, small media players, and the like.
[0021] The term “ambient condition” or “case” or “case identification” as used herein refers to a category of noisy location/environment that may or may not interfere with the enjoyment of listening to audio media on a mobile device. Examples include the home (e.g. “default”), outside in a populated area (e.g. walking), on public transportation, in a noisy indoor environment (e.g. airport), and others.
[0022] The term “dialog boost” refers to the application of a general sound amplification of speech components of audio with negligible amplification of the non-speech components. For example, dialog boost can be performed as an algorithm that continuously monitors the audio being played, detects the presence of dialog, and dynamically applies processing to improve the intelligibility of the spoken portion of audio content. In some embodiments, dialog boost analyzes features from the audio signal and applies a pattern-recognition system to detect the presence of dialogue from moment to moment. When speech is detected, the speech spectrum is altered where necessary to accentuate the speech content in a way that allows the listener to hear it more succinctly.
[0023] The term “equalization” or “graphic equalizer” or “GED” refers to frequency-based amplitude adjustment of audio. In a true GED, the amplitude settings would be set by sliders whose positions correspond to the frequency ranges they control, but herein GED also refers to a particular setting a graphic equalizer might have, giving a specific frequency response curve.
[0024] The term “media” or “content” as used herein refers to anything with audio content. This can be music, movies, videos, video games, telephone conversations, alerts, etc. In particular, the systems and methods herein are most useful for media that has a combination of speech and non-speech components, but the systems and methods can be applied to any media.
[0025]
[0026] In some embodiments, the dialog boost level and/or the GEQ setting are each one value from a short list of possible settings. For example, “3” from a range of 0-5. In some embodiments, the settings are a real value related to the setting, such as +10 dB (e.g. at a specific frequency range).
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033] The system can be provided as a service driven from a remote server, as a standalone program on the device, integrated into a media player application, or included as part of the operating system as part of its sound settings.
[0034] A number of embodiments of the disclosure have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the present disclosure. Accordingly, other embodiments are within the scope of the following claims.
[0035] As described herein, an embodiment of the present invention may thus relate to one or more of the example embodiments, which are enumerated below. Accordingly, the invention may be embodied in any of the forms described herein, including, but not limited to the following Enumerated Example Embodiments (EEEs) which described structure, features, and functionality of some portions of the present invention:
[0036] EEE1: A method of configuring a mobile device for use with ambient noise for a user of the mobile device, the method comprising: receiving, from the user, a case identification of the ambient noise; receiving, from the user, a noise level for the ambient noise; receiving, from the user, a dialog boost level for the ambient noise at the noise level; receiving, from the user, a graphic equalizer setting for the ambient noise at the noise level; playing, from the mobile device, a sample audio for the user while the user sets the dialog boost level and the graphic equalizer setting; and storing on the mobile device the dialog boost level and graphic equalizer setting for the case identification at the noise level in a profile, wherein the device is configured to play audio media using the dialog boost level and graphic equalizer setting when the profile is selected by the user.
[0037] EEE2. The method of EEE1, further comprising: simulating the ambient noise at the noise level; and mixing the simulated ambient noise with the sample audio prior to the playing the sample audio.
[0038] EEE3. The method of EEE2, wherein the simulating includes retrieving a stored pre-recorded ambient noise from a memory.
[0039] EEE4. The method of EEE3, wherein the stored pre-recorded ambient noise is in binaural format.
[0040] EEE5. The method of any of EEE1 to EEE4, further comprising: presenting, on the mobile device, a graphical user interface control for setting the case identification, the noise level, the dialog boost level, and the graphic equalizer setting.
[0041] EEE6. The method of any of EEE1 to EEE5, wherein the profile corresponds to both the case identification and the noise level.
[0042] EEE7. A method of adjusting audio for a mobile device for a user, the method comprising: receiving a profile selection from the user wherein the profile selection relates at least to an ambient noise condition; receiving a noise level of the ambient noise condition from the user; retrieving a dialog boost level and a graphic equalizer setting from memory on the mobile device; adjusting levels of the audio using the dialog boost level and the graphic equalizer setting.
[0043] EEE8. The method of EEE7, further comprising presenting, on the mobile device, a graphical user interface control for selecting a profile corresponding to an ambient noise condition.
[0044] EEE9. A device configured to run in software or firmware at least one of the methods of EEE1 to EEE8.
[0045] EEE10. A non-transient computer-readable medium that, when read by a computer, instructs the computer to perform at least one of the methods of EEE1 to EEE8.
[0046] EEE11. The device of EEE9, wherein the device is a phone.
[0047] EEE12. The device of EEE9, wherein the device is at least one of: a cell phone, a laptop computer, a tablet computer, a mobile game system, a wearable device, and a small media player.
[0048] EEE13. The device of any of EEE9, EEE11, or EEE12, wherein the software or firmware is part of the operating system of the device.
[0049] EEE14. The device of any of EEE9, EEE11, or EEE12, wherein the software or firmware runs a stand-alone program on the device.
[0050] EEE15. The method of any of EEE1 to EEE8, wherein the method is performed by an operating system of mobile device.
[0051] The present disclosure is directed to certain implementations for the purposes of describing some innovative aspects described herein, as well as examples of contexts in which these innovative aspects may be implemented. However, the teachings herein can be applied in various different ways. Moreover, the described embodiments may be implemented in a variety of hardware, software, firmware, etc. For example, aspects of the present application may be embodied, at least in part, in an apparatus, a system that includes more than one device, a method, a computer program product, etc. Accordingly, aspects of the present application may take the form of a hardware embodiment, a software embodiment (including firmware, resident software, microcodes, etc.) and/or an embodiment combining both software and hardware aspects. Such embodiments may be referred to herein as a “circuit,” a “module”, a “device”, an “apparatus” or “engine.” Some aspects of the present application may take the form of a computer program product embodied in one or more non-transitory media having computer readable program code embodied thereon. Such non-transitory media may, for example, include a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. Accordingly, the teachings of this disclosure are not intended to be limited to the implementations shown in the figures and/or described herein, but instead have wide applicability.