Patent classifications
G06F16/173
SYSTEMS AND METHODS FOR SELECTIVELY STORING SPECIFIC VERSIONS OF MEDIA ASSETS
Systems and methods provided herein for intelligently distinguishing between different versions of a media asset and selectively storing those requested media assets that differ from versions that were previously stored in some pre-defined characteristics. These systems and methods, upon identifying a version of the requested media asset was previously stored, determine a characteristic of the requested media asset. These systems and methods compare the characteristic of the requested media asset with a corresponding characteristic of the version of the requested media asset that was previously stored to determine a difference between the two versions. The systems and methods then determine whether the difference in characteristic renders the requested media asset preferable over the previously stored version. Upon determining that the requested media asset is preferable over the previously stored version, the systems and methods store the requested media asset.
Anomaly detection through header field entropy
An approach for detecting anomalous flows in a network using header field entropy. This can be useful in detecting anomalous or malicious traffic that may attempt to “hide” or inject itself into legitimate flows. A malicious endpoint might attempt to send a control message in underutilized header fields or might try to inject illegitimate data into a legitimate flow. These illegitimate flows will likely demonstrate header field entropy that is higher than legitimate flows. Detecting anomalous flows using header field entropy can help detect malicious endpoints.
Automated language detection for domain names
Methods and systems for automated language detection for domain names are disclosed. In some embodiments, a method for detecting a language of an Internationalized Domain Name (IDN) comprises receiving, by an I/O interface, a string of characters for the IDN; receiving training data, including a plurality of multi-gram analyses for a set of languages; analyzing, by a processor, the string of characters based on the training data, wherein the analyzing includes extracting a set of multi-grams from the string of characters and comparing the extracted set of multi-grams with the training data; detecting the language of the IDN based on results of the analyzing. In some embodiments, the method further comprises comparing the detected language of the IDN with a user selected language and using the IDN to generate a domain name, if the comparing indicates that the detected language of the IDN is consistent with the user selected language.
MULTI-LANGUAGE SUPPORT FOR DYNAMIC ONTOLOGY
A system with methodology for providing multi-language support for dynamic ontology. In one embodiment, for example, a method comprises: storing an ontology for a data store, wherein the ontology comprises a plurality of data types; for each data type of the plurality of data types: storing, in the ontology, one or more display values, wherein each display value is associated with a locale; determining a locale identifier corresponding to a particular locale; selecting a particular display value corresponding to the particular locale; displaying the particular display value.
TECHNOLOGIES FOR ANNOTATING PROCESS AND USER INFORMATION FOR NETWORK FLOWS
Systems, methods, and computer-readable media for annotating process and user information for network flows. In some embodiments, a capturing agent, executing on a first device in a network, can monitor a network flow associated with the first device. The first device can be, for example, a virtual machine, a hypervisor, a server, or a network device. Next, the capturing agent can generate a control flow based on the network flow. The control flow may include metadata that describes the network flow. The capturing agent can then determine which process executing on the first device is associated with the network flow and label the control flow with this information. Finally, the capturing agent can transmit the labeled control flow to a second device, such as a collector, in the network.
PROGRAMMATIC DATA PROCESSING SYSTEM
A programmatic data processing system includes a data acquisition module (DAM) and a data management module (DMGM). The data acquisition module (DAM) is configured to stepwise acquire data of material information (DMI) and data of operation information for processing the materials during a processing period according to a processing step. Defining the operations on materials in the same period as a specified standard step, and defining formation process of a real object or the process of real operation as different steps arranged in chronological order, stepwise entering and updating the information in each standard step. Furthermore, different steps are sequenced and combined in chronological order, and a complete formation process of a real object (operation) is restored in the form of inputable information.
System and method for dialogue based file index
An intelligent assistant device for conversation-based indexing and searching of a file. The intelligent assistant device having a processor configured to detect a conversation having at least one user. The processor, responsive to detecting the conversation having at least one user configured to mine the conversation for a file associated with the conversation. The processor, responsive to identifying the file further configured to mine the conversation for contextual keywords associated with the file. The processor also configured to index the contextual keywords to the file.
SYSTEMS AND METHODS FOR CACHING OF MANAGED CONTENT IN A DISTRIBUTED ENVIRONMENT USING A MULTI-TIERED ARCHITECTURE
Embodiments as disclosed provide a distributed caching solution that improve the performance and functionality of a content management platform for sites that are physically or logically remote from the primary site of the content management platform. In particular, according to embodiments, a remote cache server may be associated with a remote site to store local copies of documents that are managed by the primary content management platform. Periodically, a portion of the remote site's cache may be synchronized with the content management platform's primary site using an extensible architecture to ensure that content at the remote cache server is current.
QUERY-BASED RECOMMENDATION SYSTEMS USING MACHINE LEARNING-TRAINED CLASSIFIER
Systems and methods for query-based recommendation systems using machine learning-trained classifiers are provided. A service provider server receives, from a communication device through an application programming interface, a query in an interaction between the server provider server and the communication device. The service provider server generates a vector of first latent features from a set of first visible features associated with the query using a machine learning-trained classifier. The service provider server generates a likelihood scalar value indicating a likelihood of the query is answered by a candidate user in a set of users using a combination of the vector of first latent features and a vector of second latent features. The service provider server provides, to the communication device through the application programming interface, a recommendation message as a response to the query, where the recommendation message includes the likelihood scalar value and an indication of the candidate user.
Scalable space-time density data fusion
A database system includes a processing sub-system having an electronic hardware controller that receives first formatted data, and an electronic file formatting sub-system having an electronic hardware controller that assigns coordinate values corresponding to a second data format to the first formatted data. The file formatting sub-system generates a dual-format data file that fuses together the first formatted data with the coordinate values corresponding to a second data format. The database system further includes a storage sub-system having a data storage unit configured to store the dual-format data file.