Patent classifications
G10L15/083
Presentation Assistance Device for Calling Attention to Words that are Forbidden to Speak
To provide a presentation assistance device that can display keywords related to presentation materials and call attention by displaying an alert when words that are forbidden to speak are spoken, A presentation assistance device 1 comprises: a presentation material storage means 3; a keyword storage means 5 which stores a plurality of keywords related to presentation materials; a related word storage means 7 which stores one or a plurality of related words for each of the plurality of keywords; an NG word storage means 9 which stores one or a plurality of NG words for each of the plurality of keywords; a voice recognition means 11; a term determination means 15 which determines whether a voice recognition term corresponds to a related word or an NG word; and a keyword output means 17 which when the voice recognition is a related word, outputs a keyword related to the related word, and when the voice recognition term is an NG word, outputs an alert and a keyword related to the NG word.
HOME APPLIANCE AND OPERATING METHOD THEREOF
The present disclosure relates to a home appliance and an operating method thereof. The home appliance according to the present disclosure includes: a user input interface; a storage configured to store a database of a response history, and a controller configured to: in response to receiving an input requesting to perform a previous operation through the user input interface, verify whether a first operation, corresponding to the request for performing the previous operation, is present in the operation history; in response to there being the first operation, verify a type of a first command, mapped to the first operation, among commands included in the database; in response to the type of the first command being a first type, determine the first operation to be the previous operation; and in response to the type of the first command being a second type, generate a second operation corresponding to the first command, and determines the second operation to be the previous operation. Various other embodiments are also possible.
Implementing a domain adaptive semantic role labeler
A computer-implemented method according to one embodiment includes identifying features of a plurality of data instances within a target domain; assigning weights to the plurality of data instances within the target domain, based on similarities among the features; selecting a subset of the plurality of data instances within the target domain, based on the weights; associating expert annotations with respective ones of data instances within the subset; and training a machine learning algorithm, utilizing the subset of the plurality of data instances and associated expert annotations.
USER INPUT PROCESSING METHOD AND ELECTRONIC DEVICE SUPPORTING SAME
An apparatus and method are provided for processing a user input in an electronic device. The method includes storing information associated with each of a plurality of users; receiving a user utterance associated with task execution of the electronic device; transmitting, to an external device, first data associated with the user utterance; receiving, from the external device, second data including information about at least one operation of the electronic device associated with the task execution, and at least one parameter for performing the at least one operation; identifying, as a target of the task execution, a first user from among the plurality of users based on the at least one parameter; inferring a location of the target based on information associated with the first user, which is included in the information associated with each of the plurality of users; moving the electronic device to a first location based on the inferred location; searching for the first user at the first location by comparing the information about the first user with information obtained at the first location; and in response to recognizing the first user at the first location, perform the at least one operation of the electronic device associated with the task execution.
Systems and Methods for Detecting Voice Commands to Generate a Peer-to-Peer Communication Link
A voice-based peer-to-peer communication system may be used to detect voice commands from users to provide a wireless communication voice connection that allows the users to directly communicate with each other. The system may include a first computing device of a first user communicatively coupled to a second computing device of a second user over the wireless connection. The system may process the detected voice command having a phrase, contact name, and voice message. The phrase may include a wake, answer, or stop phrase. The contact name may be utilized to determine whether that contact name matches an entry within a predetermined contact list of the first user, where the matched contact name may be associated with the second user. Finally, the system may generate audio data based on the processed voice command that is then transmitted to the second computing device of the second user over the wireless connection.
Vocal triggering of presentation transitions
Various arrangements for triggering transitions within a slide-based presentation are presented. An audio-based trigger system may receive a plurality of trigger words. A database may be created that maps trigger words to slide transitions. A voice-based request may be received to initiate audio control of the slide-based presentation being output by the presentation system. An audio stream may be monitored for trigger words. Based on accessing a database, a slide transition to be performed may be identified based on a recognized trigger word. A slide transition request may be transmitted to a presentation system that indicates a slide to which a transition should occur. The presentation system may then transition to the slide based on the received slide transition request.
VOICE CONTROL METHOD AND APPARATUS, AND COMPUTER STORAGE MEDIUM
A voice control method can be applied to a first terminal, and include: receiving a user's voice operation instruction after the first terminal is activated, the voice operation instruction being used for controlling the first terminal to perform a target operation; sending an instruction execution request to a server after the voice operation instruction is received, the instruction execution request being used for requesting the server to determine whether the first terminal is to respond to the voice operation instruction according to device information of the terminal in a device network, wherein the first terminal is located in the device network; and performing the target operation in a case where a response message is received from the server, the response message indicating that the first terminal is to respond to the voice operation instruction.
Presentation assistance device for calling attention to words that are forbidden to speak
To provide a presentation assistance device that can display keywords related to presentation materials and call attention by displaying an alert when words that are forbidden to speak are spoken. A presentation assistance device 1 comprises: a presentation material storage means 3; a keyword storage means 5 which stores a plurality of keywords related to presentation materials; a related word storage means 7 which stores one or a plurality of related words for each of the plurality of keywords; an NG word storage means 9 which stores one or a plurality of NG words for each of the plurality of keywords; a voice recognition means 11; a term determination means 15 which determines whether a voice recognition term corresponds to a related word or an NG word; and a keyword output means 17 which when the voice recognition is a related word, outputs a keyword related to the related word, and when the voice recognition term is an NG word, outputs an alert and a keyword related to the NG word.
ABSTRACT GENERATION DEVICE, METHOD, PROGRAM, AND RECORDING MEDIUM
A speech recognition unit (12) converts an input utterance sequence into a confusion network sequence constituted by a k-best of candidate words of speech recognition results; a lattice generating unit (14) generates a lattice sequence having the candidate words as internal nodes and a combination of k words among the candidate words for an identical speech as an external node, in which edges are extended between internal nodes other than internal nodes included in an identical external node, from the confusion network sequence; an integer programming problem generating unit (16) generates an integer programming problem for selecting a path that maximizes an objective function including at least a coverage score of an important word, of paths following the internal nodes with the edges extended, in the lattice sequence; and the summary generating unit generates a high-quality summary having less speech recognition errors and low redundancy using candidate words indicated by the internal nodes included in the path selected by solving the integer programming problem, under a constraint on the length of a summary to be generated.
SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND PROGRAM
A signal processing device includes: an input unit to which a microphone signal including a mixed sound in which a target sound and a sound other than the target sound are mixed and a one-dimensional time-series signal acquired by an auxiliary sensor and synchronized with the target sound are input; and a sound source extraction unit that extracts a target sound signal corresponding to the target sound from the microphone signal on the basis of the one-dimensional time-series signal.