Patent classifications
G10L21/00
Speech recognition for avionic systems
Voice-operable avionic systems and methods supporting utilization of speech recognition to facilitate control of avionic systems are disclosed. Utilizing speech recognition to control avionic systems may help reduce the head-down time of the flight crew. Safety features may also be implemented to ensure safety-critical commands are carried out as intended when commands are received through speech recognition. In addition, voice-operable avionic systems configured in accordance with embodiments of the inventive concepts disclosed herein may be implemented in manners that can help reduce the complexity and cost associated with obtaining certifications from aviation authorities.
AUDIO DEVICE AND COMPUTER-READABLE PROGRAM
[Problem]To provide an audio device with which it is possible to efficiently utilize the storage capacity of a storage device for storing audio data. [Solution] In this audio device 1, in accordance with user profile information stored in a profile information storage unit 102, audio data inputted to an audio data input unit 103 is subjected by a filter unit 104 to a process of cutting a frequency band higher than the highest frequency audible to the user) and/or a frequency band lower than the lowest frequency audible to the user, and the data is then encoded by a codec unit 105, and stored to an audio storage device 100. Audio data can thereby be stored in the audio storage device 100 in compressed form with no noticeable decline in sound quality to the user, making it possible to store more audio data, such as music, in the audio storage device 100.
Apparatus for encoding a speech signal employing ACELP in the autocorrelation domain
An apparatus for encoding a speech signal by determining a codebook vector of a speech coding algorithm is provided. The apparatus includes a matrix determiner for determining an autocorrelation matrix R, and a codebook vector determiner for determining the codebook vector depending on the autocorrelation matrix R. The matrix determiner is configured to determine the autocorrelation matrix R by determining vector coefficients of a vector r, wherein the autocorrelation matrix R includes a plurality of rows and a plurality of columns, wherein the vector r indicates one of the columns or one of the rows of the autocorrelation matrix R, wherein R(i, j)=r(|i−j|), wherein R(i, j) indicates the coefficients of the autocorrelation matrix R, wherein i is a first index indicating one of a plurality of rows of the autocorrelation matrix R, and wherein j is a second index indicating one of the plurality of columns of the autocorrelation matrix R.
Indicator for voice-based communications
Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
Connected device voice command support
Systems and techniques for connected device voice command support are described herein. A voice command may be received from a user. A set of connected devices proximate to the user may be identified. The voice command may be transformed into a command for the set of connected devices. The command may be communicated to the set of connected devices.
Generating communicative behaviors for anthropomorphic virtual agents based on user's affect
Systems and methods for automatically generating at least one of facial expressions, body gestures, vocal expressions, or verbal expressions for a virtual agent based on emotion, mood and/or personality of a user and/or the virtual agent are provided. Systems and method for determining a user's emotion, mood and/or personality are also provided.
Digital Filterbank for Spectral Envelope Adjustment
An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
Digital Filterbank for Spectral Envelope Adjustment
An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
ELECTRONIC ROUTE CREATION
A method of creating an electronic travel route includes providing a user interface that includes electronic geographic maps. Routes created by users using the user interface are stored. The routes include mapped electronic routes along which the users have traveled. The method includes receiving geographic information and a request from a requester to create a new route. A set of the routes created are provided to the requester based on the geographic information. Selections of portions of the set of the routes are accepted to create the new route. Based on the new route, a processor of a computer is used to create a set of geographic markers and instructions for traversing the new route. The new route and instructions for traversing the new route are provided based on a search.
Formant dependent speech signal enhancement
An arrangement is described for speech signal processing. An input microphone signal is received that includes a speech signal component and a noise component. The microphone signal is transformed into a frequency domain set of short-term spectra signals. Then speech formant components within the spectra signals are estimated based on detecting regions of high energy density in the spectra signals. One or more dynamically adjusted gain factors are applied to the spectra signals to enhance the speech formant components.