Patent classifications
G10L21/00
Method and apparatus for connecting service between user devices using voice
A method of connecting a service between a device and at least one other device is provided. The method includes recording, by the device, a user voice input in a state where a voice command button has been input, outputting first information based on the recorded user voice when an input of the voice command button is cancelled, receiving, by the device, second information corresponding to the first information, recognizing a service type according to the first information and the second information, connecting the device to a subject device in an operation mode of the device determined according to the recognized service type, and performing a service with the connected subject device.
Method and apparatus for connecting service between user devices using voice
A method of connecting a service between a device and at least one other device is provided. The method includes recording, by the device, a user voice input in a state where a voice command button has been input, outputting first information based on the recorded user voice when an input of the voice command button is cancelled, receiving, by the device, second information corresponding to the first information, recognizing a service type according to the first information and the second information, connecting the device to a subject device in an operation mode of the device determined according to the recognized service type, and performing a service with the connected subject device.
Anaphora resolution for semantic tagging
A semantic tagging method may add context to a sentence in order to increase search efficiency. Regardless of an author's writing style, translating semantic concepts into tags may increase search efficiency. Automatic semantic tagging of documents may allow semantic search and reasoning. Text for semantic tagging may include an email, a website chat room, an internet forum, or a text message. Additional texts may include aggregating general consensus of an emailed topic across multiple emails, whether in the same email chain or separate emails. To increase search efficiency, the analysis of prior communications within the body of text may comprise analyzing structured contextual information to facilitate with homophora resolution. The structured contextual information may include at least one of a sender email address, one or more recipient email addresses, a subject field, a message date and time stamp, and an attachment title.
Audio signal encoding and decoding method, and audio signal encoding and decoding apparatus
An audio signal encoding and decoding method, an audio signal encoding and decoding apparatus, a transmitter, a receiver, and a communications system, which can improve encoding and/or decoding performance. The audio signal encoding method includes dividing a to-be-encoded time domain signal into a low band signal and a high band signal; encoding the low band signal to obtain a low frequency encoding parameter; calculating a voiced degree factor, and predicting a high band excitation signal; weighting the high band excitation signal and random noise using the voiced degree factor, so as to obtain a synthesized excitation signal; and obtaining a high frequency encoding parameter based on the synthesized excitation signal and the high band signal. Technical solutions in the embodiments of the present invention can improve an encoding or decoding effect.
Frequency envelope vector quantization method and apparatus
Embodiments of the present application proposes a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2.sup.B1 portions; determining, according to the code word corresponding to the quantized first vector; determining a second codebook according to the codebook of the i.sup.th portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
Systems and methods for providing a virtual assistant
A system comprising at least one processor configured to perform: receiving a first request to access a first user profile of a first user from a first device configured to execute a first virtual assistant to interact with the first user; in response to receiving the first request, providing the first device with access to information in the first user profile so that the first virtual assistant is able to customize, based on the accessed information, its behavior when interacting with the first user; receiving a second request to access the first user profile from a second device configured to execute a second virtual assistant to interact with the first user; and in response to receiving the second request, providing the second device with access to the information so that the second virtual assistant is able to customize, based on the accessed information, its behavior when interacting with the first user.
Method and apparatus for voice recording and playback
Methods and apparatuses are provided for controlling an electronic device that includes a plurality of microphones configured to receive voice input, a storage unit configured to store a sound recording file, and a display unit configured to visually display speaker areas of individual speakers when recording a sound or playing a sound recording file. The electronic device also includes a control unit configured to provide a user interface relating a speaker direction to a speaker by identifying the speaker direction while recording the sound or performing playback of the sound recording file, and to update at least one of speaker information, direction information of a speaker, and distance information of the speaker through the user interface.
Information processing apparatus, information processing method and computer program product
According to an embodiment, an information processing apparatus includes a storage unit, a detector, an acquisition unit, and a search unit. The storage unit configured to store therein voice indices, each of which associates a character string included in voice text data obtained from a voice recognition process with voice positional information, the voice positional information indicating a temporal position in the voice data and corresponding to the character string. The acquisition unit acquires reading information being at least a part of a character string representing a reading of a phrase to be transcribed from the voice data played back. The search unit specifies, as search targets, character strings whose associated voice positional information is included in the played-back section information among the character strings included in the voice indices, and retrieves a character string including the reading represented by the reading information from among the specified character strings.
Audio data transmitting method and data transmitting system
An audio data transmitting method applied to an audio data transmitting device. The audio data transmitting method comprises: (a) receiving first audio data from at least one audio data source, wherein the first audio data follows a first audio format; and (b) outputting the first audio data from the audio data transmitting device without encoding or decoding the first audio data.
Enhancing a message by providing supplemental content in the message
The present technology relates to enhancing a message with supplemental content. The system may enhance a message based on topics identified in past correspondence messages or topics anticipated based on an intended recipient of a correspondence message being drafted. The system can operate in combination or conjunction with a language prediction system, an optimizing language model, and a text input method. The systems and methods provide users with supplemental content at a time and in a specific situation, which allows for effective targeting of content.