G06F16/316

Sound profile generation based on speech recognition results exceeding a threshold

Systems and methods for generating sound profiles of artificial commands detected by multiple voice activated electronic devices is described herein. In some embodiments, numerous voice activated electronic devices may send audio data representing a phrase to a backend system at a substantially same time. Text data representing the phrase, and counts for instances of that text data, may be generated. If the number of counts exceeds a predefined threshold, the backend system may cause any remaining response generation functionality that particular command that is in excess of the predefined threshold to be stopped, and those devices returned to a sleep state. In some embodiments, a sound profile unique to the phrase that caused the excess of the predefined threshold may be generated such that future instances of the same phrase may be recognized prior to text data being generated, conserving the backend system's resources.

SYSTEMATIC ITERATIVE ANALYSIS OF UNSTRUCTURED DATA FILES
20180246956 · 2018-08-30 ·

Mechanisms for the systematic iterative analysis of unstructured data files are disclosed. A first handler is initiated to analyze a plurality of files. Information about each respective file of the plurality of files analyzed by the first handler is stored in a data structure. The information includes a file identifier that identifies the respective file, a handler identifier that identifies the first handler, and results or a reference to results of an analysis of the respective file by the first handler.

Method and apparatus for associating data with time information

An electronic device including a sensor, a first processor, and a second processor and a method for associating data with time information are provided. The method includes including receiving a notification signal corresponding to the data from the first processor, determining time information and first identification information that correspond to the notification signal in response to the reception, receiving the data and second identification information corresponding to the notification signal from the first processor, associating the data with the time information at least based on the first identification information and the second identification information, and providing the data associated with the time information to an application. Other various embodiments may also be possible.

CATEGORIZATION SYSTEM
20180241769 · 2018-08-23 ·

A system for the categorization of interlinked information items, the system comprising: a trust flow module which is configured to receive a seed trust list of one or more first information items, the seed trust list associating the one or more first information items with one or more categories; and a trust flow module configured to: associate a respective trust value with each of the one or more categories for the one or more first information items; and iteratively pass at least part of the or each trust value to one or more further information items to generate, for each of the one or more further information items, at least one accumulated trust value associated with a category of the one or more categories, such that the one or more further information items can be categorized based on the at least one accumulated trust value and associated category.

INDEXING AND MINING CONTENT OF MULTIPLE DATA SOURCES

A system and method are provided for indexing and mining content of multiple data sources. The method includes: providing a database of learned content of multiple data sources learned using text analysis, the learned content identifying one or more concepts to which a data source relates, wherein the concepts are cognitively associated with the data source and include concepts not explicitly referenced in the data source and providing an index of the learned content including associations between concepts with mappings between concepts and the multiple data sources. The method further includes receiving input of a query and using text analysis to analyze the query to determine one or more query concepts to which it relates and mining the indexed concepts in response to the query concepts to return a list of referenced data sources.

Method and system for searching words in documents written in a source language as transcript of words in an origin language
10042843 · 2018-08-07 · ·

The invention relates to a method used by computers for searching words in documents written in a source language, which are not in the vocabulary of said source language, but are transcript of meaningful words in an origin language. The method is comprised of a preparation process and a search process. During the preparation process a database of unrecognized words in the source language is maintained, which contains, among other data, normalized phonetic conversion of the unrecognized word, as well as a corpus of all words of the documents in the search domain and indexes for efficient search. During search, a phonetic conversion and normalization is done for the search word, and the distance to similar phonetics words in the corpus is calculated. The found words in the corpus are arranged in ascending order, and the relevant documents are displayed.

Video object tagging using synthetic images and segmentation hierarchies

There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.

Video object tagging using synthetic images and segmentation hierarchies

There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.

Method and device for sorting Chinese characters, searching Chinese characters and constructing dictionary
12118292 · 2024-10-15 · ·

The invention discloses a method and a device for sorting Chinese characters, searching for Chinese characters and constructing a dictionary, and relates to the technical field of computers. A specific implementation of the method includes: obtaining the first basic character-forming component of a Chinese character according to the stroke order as the First Character, and encoding the First Character to obtain the First Character code, where the First Character includes the first character-forming component and the first main stroke component of a Chinese character; obtaining the number of strokes included in each Chinese character, and obtaining the corresponding stroke string of each Chinese character; using the First Character code as the first and highest priority sorting field, the number of strokes as the second sorting field, and the stroke string as the third and the lowest priority sorting field to sort Chinese characters. This embodiment can solve the problem of difficulty in sorting and searching of Chinese characters caused by the unfixed definition and position of radicals.

Methods and systems for providing a search service application

A system for providing a search service application is disclosed and includes an application builder component that provides a search model for a first object of a plurality of objects. The search model is based at least on an end-user input field corresponding to a first attribute of the first object and a search result output field corresponding to a second attribute of the first object. The search model is also associated with a backend data store that supports a storage structure that stores information relating to the first object. The system also includes a deployment engine that automatically configures a search engine system associated with the backend data store to generate new search indexes based on the search model, and to place a portion of indexed data into a first partition and to place another portion of indexed data into at least another partition based on the search model.