AUTOMATIC VOLUME LEVELING
20250364963 ยท 2025-11-27
Assignee
Inventors
- Brian Ellison (Holladay, UT, US)
- Gabriel Olochwoszcz (Hillsborough, NJ, US)
- Michael W. Stark (Acton, MA, US)
Cpc classification
H03G3/32
ELECTRICITY
H03G7/002
ELECTRICITY
International classification
H03G3/32
ELECTRICITY
H03G9/00
ELECTRICITY
Abstract
An audio device is provided. The audio device comprises a controller and an acoustic transducer. The controller is configured to receive an input audio signal. The controller is further configured to generate, via a loudness detector, a loudness level of a first portion of the input audio signal. The controller is further configured to identify a compression curve of a plurality of compression curves corresponding to a user volume setting. The controller is further configured to adjust, via an audio compressor, a second portion of the input audio signal based on the loudness level and the compression curve to generate an output audio signal. The loudness level is determined according to LUFS model.
Claims
1. An audio device comprising a controller configured to: receive an input audio signal, wherein the input audio signal comprises a first portion and a second portion, wherein the first portion of the input audio signal begins at a first time, wherein the second portion of the input audio signal begins at a second time following the first time, and wherein a difference between the first time and the second time is no greater than an adjustment timing period; generate, via a loudness detector, a loudness level of the first portion of the input audio signal; identify a compression curve of a plurality of compression curves corresponding to a user volume setting; adjust, via an audio compressor, the second portion of the input audio signal based on the loudness level and the compression curve to generate an output audio signal.
2. The audio device of claim 1, wherein the adjustment timing period is no greater than approximately six seconds.
3. The audio device of claim 1, further comprising an acoustic transducer configured to generate audio corresponding to the output audio signal.
4. The audio device of claim 1, wherein a volume level of the output audio signal is adjusted according to the user volume setting.
5. The audio device of claim 1, wherein the compression curve comprises a pivot point, a downward compression portion, and an upward compression portion.
6. The audio device of claim 5, wherein the downward compression portion corresponds to a first input power range greater than the pivot point, and wherein the upward compression portion corresponds to a second input power range less than the pivot point.
7. The audio device of claim 1, wherein the loudness level is an integrated loudness of the input audio signal over an integration period.
8. The audio device of claim 7, wherein the integration period is approximately three seconds.
9. The audio device of claim 1, wherein the loudness level is determined according to a Loudness Unit Full Scale (LUFS) model.
10. The audio device of claim 1, wherein the controller is configured to disable the adjustment of the second portion of the input audio signal based on a receiving an adjustment disabling signal.
11. The audio device of claim 10, further comprising a disable switch configured to generate the adjustment disabling signal.
12. The audio device of claim 1, wherein the audio device is a soundbar or a speaker.
13. The audio device of claim 1, wherein the input audio signal corresponds to a High-Definition Multimedia Interface (HDMI) signal or an optical audio signal.
14. The audio device of claim 1, wherein the controller is further configured to determine a content type of the input audio signal, and wherein the adjustment of the second portion of the input audio signal is disabled if the content type is music content.
15. The audio device of claim 1, wherein the compression curve has a frequency range of 20 Hz to 20 kHz.
16. A method for adjusting an input audio signal, comprising: receiving, via a controller, the input audio signal, wherein the input audio signal comprises a first portion and a second portion, wherein the first portion of the input audio signal begins at a first time, wherein the second portion of the input audio signal begins at a second time following the first time, and wherein a difference between the first time and the second time is no greater than an adjustment timing period; generating, via a loudness detector of the controller, a loudness level of the first portion of the input audio signal; identifying, via the controller, a compression curve of a plurality of compression curves corresponding to a user volume setting; adjusting, via an audio compressor of the controller, the second portion of the input audio signal based on the loudness level and the compression curve to generate an output audio signal.
17. The method of claim 16, further comprising generating, via an acoustic transducer, audio corresponding to the output audio signal.
18. The method of claim 16, further comprising adjusting a volume level of the output audio signal according to the user volume setting.
19. The method of claim 16, wherein the compression curve comprises a pivot point, a downward compression portion, and an upward compression portion.
20. The method of claim 19, wherein the downward compression portion corresponds to a first input power range greater than the pivot point, and wherein the upward compression portion corresponds to a second input power range less than the pivot point.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0034] In the drawings, like reference characters generally refer to the same parts throughout the different views. Also, the drawings are not necessarily to scale, emphasis instead generally being placed upon illustrating the principles of the various embodiments.
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
DETAILED DESCRIPTION
[0045] The present disclosure is generally directed to systems and methods for automatic volume leveling. In particular, the present disclosure describes near-real time adjustment of audio signals based on detected loudness. These systems and methods are implemented by an audio device, such as a soundbar or speaker. The audio device includes a controller configured to determine a loudness level of a first portion of an input audio signal. The audio device then identifies a compression curve based on a user volume setting. The identified compression curve is then applied to a second portion of the input audio signal to generate an output audio signal. The second portion of the input audio signal occurs within seconds of the first portion, enabling the controller to adjust the input audio signal in near-real time. The user volume setting is then applied to the output audio signal, and the volume-adjusted signal is then provided to an acoustic transducer to generate audio for a user to hear.
[0046] The following description should be read in view of
[0047]
[0048] As shown in
[0049] The input audio signal 102 is provided to the loudness detector 101. As opposed to a power level detector of a typical dynamic range compression (DRC) system, the loudness detector 101 is configured to generate a loudness level 104 representative of the perceived loudness of the input audio signal 102 as would be experienced by a user prior to the implementation of any volume adjustment due to the user volume setting 108. In some examples, the loudness level 104 is measured according to a Loudness Unit Full Scale (LUFS) model as defined by the International Telecommunication Union (ITU) BS.1770 loudness specification. As will be described in further detail with respect to
[0050] As shown in
[0051] The audio compressor 103 chooses one of the compression curves 106 to apply to the input audio signal 102 based on the user volume setting 108. In some examples, the intensity of the amplification and attenuation of the compression curves 106 increase as the user volume setting 108 decreases, thereby reducing the loudness of loud portions of the input audio signal 102 (such as loud commercials) and increasing the loudness of quiet portions (such as quiet dialogue) when the user has turned down the volume of the audio device 10. In other examples, the amplification and attenuation of the compression curves 106 will be minimal at high user volume settings 108. Further, the frequency response of the compression curves 106 is typically flat over the audible frequency ranges of 20 Hz to 20 kHz.
[0052] Accordingly, the audio compressor 103 generates an output audio signal 110 by applying the identified compression curve 106 to the input audio signal 102. As the compression curve 106 is applied before any volume adjustment, applying the compression curve 106 to the input audio signal 102 is considered a pre-mastering processing step. The output audio signal 110 is then provided to the volume adjustor 105. The volume adjustor 105 applies the user volume setting 108 to the output audio signal 110 to adjust a volume level of the output audio signal 110 and to generate a volume-adjusted signal 124. The volume-adjusted signal 124 is then provided to the acoustic transducer 400 to generate audio to be heard by the user.
[0053]
[0054] Each compression curve 106 is defined by a pivot point 112, a downward compression portion 114, and an upward compression portion 116. The pivot point 112 corresponds to the transition from the downward compression portion 114 to the upward compression portion 116. In the example of
[0055] In some examples, each of the compression curves 106a-106f of the plurality of compression curves 106 shares the same pivot point 112. In the example of
[0056] The downward compression portion 114 represents the portion of the compression curve 106 which attenuates the input signal, while the upward compression portion 116 represents the portion of the compression curve 106 which amplifies the input signal. For example, if the input signal has a loudness level of 40 LUFS and the user volume setting 108 is very high, the first compression curve 106a increases the loudness of the input signal to approximately 23 LUFS. Similarly, if the loudness level of the input signal then increases to 10 LUFS while the user volume setting 108 remains very high, the loudness level of then input signal the decreases to 18 LUFS. Amplifying input signals with low loudness levels allows for a user who has selected a low user volume setting 108 to better hear, for example, quiet dialogue in a television program. Similarly, attenuating input signals with high loudness levels reduces potential annoyance or discomfort from a loud commercial following the quiet dialogue.
[0057] As shown in
[0058]
[0059] Generally, the loudness detector 101 (as shown in
[0060] Accordingly, in the examples of
[0061] The loudness level 104 of the input signal 102 is continuously determined during the duration of the input signal 102. In some examples, the loudness level 104 is measured at least every three seconds. Continuously measuring the loudness of the signal over the integration periods 118 allows for near-real time adjustment of the input audio signal 102 while also allowing short-term loudness bursts to be provided at a reasonable volume to provide their desired effect (such as explosions in a motion picture).
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068] The method 900 further includes, in step 904 generating, via a loudness detector 101 of the controller 100, a loudness level 104 of the first portion 102a of the input audio signal 102.
[0069] The method 900 further includes, in step 906, identifying, via the controller 100, a compression curve 106 of a plurality of compression curves 106 corresponding to a user volume setting 108.
[0070] The method 900 further includes, in step 908, adjusting, via an audio compressor 103 of the controller 100, the second portion 102b of the input audio signal 102 based on the loudness level 104 and the compression curve 106 to generate an output audio signal 110.
[0071] According to an example, the method 900 further includes, in optional step 910, generating, via an acoustic transducer 400, audio corresponding to the output audio signal 110.
[0072] According to an example, the method 900 further includes, in optional step 912, adjusting a volume level of the output audio signal 110 according to the user volume setting 108.
[0073] All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
[0074] The indefinite articles a and an, as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean at least one.
[0075] The phrase and/or, as used herein in the specification and in the claims, should be understood to mean either or both of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with and/or should be construed in the same fashion, i.e., one or more of the elements so conjoined. Other elements can optionally be present other than the elements specifically identified by the and/or clause, whether related or unrelated to those elements specifically identified.
[0076] As used herein in the specification and in the claims, or should be understood to have the same meaning as and/or as defined above. For example, when separating items in a list, or or and/or shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as only one of or exactly one of, or, when used in the claims, consisting of, will refer to the inclusion of exactly one element of a number or list of elements. In general, the term or as used herein shall only be interpreted as indicating exclusive alternatives (i.e. one or the other but not both) when preceded by terms of exclusivity, such as either, one of, only one of, or exactly one of.
[0077] As used herein in the specification and in the claims, the phrase at least one, in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements can optionally be present other than the elements specifically identified within the list of elements to which the phrase at least one refers, whether related or unrelated to those elements specifically identified.
[0078] It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
[0079] In the claims, as well as in the specification above, all transitional phrases such as comprising, including, carrying, having, containing, involving, holding, composed of, and the like are to be understood to be open-ended, i.e., to mean including but not limited to. Only the transitional phrases consisting of and consisting essentially of shall be closed or semi-closed transitional phrases, respectively.
[0080] The above-described examples of the described subject matter can be implemented in any of numerous ways. For example, some aspects can be implemented using hardware, software or a combination thereof. When any aspect is implemented at least in part in software, the software code can be executed on any suitable processor or collection of processors, whether provided in a single device or computer or distributed among multiple devices/computers.
[0081] The present disclosure can be implemented as a system, a method, and/or a computer program product at any possible technical detail level of integration. The computer program product can include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
[0082] The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium can be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
[0083] Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network can comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
[0084] Computer readable program instructions for carrying out operations of the present disclosure can be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the C programming language or similar programming languages. The computer readable program instructions can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer can be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection can be made to an external computer (for example, through the Internet using an Internet Service Provider). In some examples, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) can execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
[0085] Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to examples of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
[0086] The computer readable program instructions can be provided to a processor of a, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions can also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram or blocks.
[0087] The computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
[0088] The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various examples of the present disclosure. In this regard, each block in the flowchart or block diagrams can represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the blocks can occur out of the order noted in the Figures. For example, two blocks shown in succession can, in fact, be executed substantially concurrently, or the blocks can sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
[0089] Other implementations are within the scope of the following claims and other claims to which the applicant can be entitled.
[0090] While various examples have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the examples described herein. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the teachings is/are used. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific examples described herein. It is, therefore, to be understood that the foregoing examples are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, examples can be practiced otherwise than as specifically described and claimed. Examples of the present disclosure are directed to each individual feature, system, article, material, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, and/or methods, if such features, systems, articles, materials, and/or methods are not mutually inconsistent, is included within the scope of the present disclosure.