Patent classifications
G06F16/955
METHODS AND APPARATUS TO EXTRACT INFORMATION FROM UNIFORM RESOURCE LOCATORS
Methods, apparatus, systems, and articles of manufacture are disclosed to extract information from uniform resource locators (URLs). An example system includes at least one memory, programmable circuitry, and instructions to cause the programmable circuitry to collect first uniform resource locator (URL) information from a server of an Internet-based media publisher, the first URL information corresponding to first media accessed by first users, determine first URL components in the first URL information, and determine feature-to-user assignment rules based on the first URL components.
Collection apparatus, collection method, and collection program
A collection apparatus that collects a URL of a Web page that leads to user operation and includes a search query generation unit that generates a search query by combining a digital content name and an associated keyword of the digital content. There is a fitness prediction unit that predicts a degree to which a Web page that leads to user operation is output as a search result when a search is performed by using the generated search query, a determination unit that searches for a Web page by using a search query in a search order that is based on the predicted degree, and determines analysis priority of a URL of a Web page on the basis of the degree and search result information. Further, there is a communication unit that outputs the URL of the retrieved Web page and the analysis priority of the URL.
Collection apparatus, collection method, and collection program
A collection apparatus that collects a URL of a Web page that leads to user operation and includes a search query generation unit that generates a search query by combining a digital content name and an associated keyword of the digital content. There is a fitness prediction unit that predicts a degree to which a Web page that leads to user operation is output as a search result when a search is performed by using the generated search query, a determination unit that searches for a Web page by using a search query in a search order that is based on the predicted degree, and determines analysis priority of a URL of a Web page on the basis of the degree and search result information. Further, there is a communication unit that outputs the URL of the retrieved Web page and the analysis priority of the URL.
Distributed database configuration
Replicas are selected in a large distributed network, and the roles for these replicas are identified. In one example, a leader is selected from among candidate computing dusters. To make this selection, an activity monitor predicts or monitors the workload of one or more clients. Different activities of the workload are given corresponding weights. The delay in performing requested activities, modified by these weights is found, and the candidate leader with the lowest weighted delay is selected as the leader.
Distributed database configuration
Replicas are selected in a large distributed network, and the roles for these replicas are identified. In one example, a leader is selected from among candidate computing dusters. To make this selection, an activity monitor predicts or monitors the workload of one or more clients. Different activities of the workload are given corresponding weights. The delay in performing requested activities, modified by these weights is found, and the candidate leader with the lowest weighted delay is selected as the leader.
Embedded web page analytic elements
A web browser plugin or other software can be used to integrate visualization of analytical and/or debugging information related to a web page that is being viewed. Particular elements on the web page that are instrumented for tracking can be visually augmented, allowing a developer to see where and how certain aspects of web page functionality are being tracked and/or implemented. Certain information relating to the web page may be surfaced via a graphical area that is displayed concurrently with the web page, e.g., within the web browser that is being used to view the web page. The graphical area can also include selectable elements that can be used to launch additional queries into back-end services related to the web page. The present techniques allow for not only better and more convenient visualization of web page related data, but can speed up development time, reducing both computing and developer resources.
Embedded web page analytic elements
A web browser plugin or other software can be used to integrate visualization of analytical and/or debugging information related to a web page that is being viewed. Particular elements on the web page that are instrumented for tracking can be visually augmented, allowing a developer to see where and how certain aspects of web page functionality are being tracked and/or implemented. Certain information relating to the web page may be surfaced via a graphical area that is displayed concurrently with the web page, e.g., within the web browser that is being used to view the web page. The graphical area can also include selectable elements that can be used to launch additional queries into back-end services related to the web page. The present techniques allow for not only better and more convenient visualization of web page related data, but can speed up development time, reducing both computing and developer resources.
System and method for content fetching using a selected intermediary device and multiple servers
A method for fetching a content from a web server to a client device is disclosed, using tunnel devices serving as intermediate devices. The tunnel device is selected based on an attribute, such as IP Geolocation. A tunnel bank server stores a list of available tunnels that may be used, associated with values of various attribute types. The tunnel devices initiate communication with the tunnel bank server, and stays connected to it, for allowing a communication session initiated by the tunnel bank server. Upon receiving a request from a client to a content and for specific attribute types and values, a tunnel is selected by the tunnel bank server, and is used as a tunnel for retrieving the required content from the web server, using standard protocol such as SOCKS, WebSocket or HTTP Proxy. The client only communicates with a super proxy server that manages the content fetching scheme.
System and method for content fetching using a selected intermediary device and multiple servers
A method for fetching a content from a web server to a client device is disclosed, using tunnel devices serving as intermediate devices. The tunnel device is selected based on an attribute, such as IP Geolocation. A tunnel bank server stores a list of available tunnels that may be used, associated with values of various attribute types. The tunnel devices initiate communication with the tunnel bank server, and stays connected to it, for allowing a communication session initiated by the tunnel bank server. Upon receiving a request from a client to a content and for specific attribute types and values, a tunnel is selected by the tunnel bank server, and is used as a tunnel for retrieving the required content from the web server, using standard protocol such as SOCKS, WebSocket or HTTP Proxy. The client only communicates with a super proxy server that manages the content fetching scheme.
Dynamic updating of query result displays
Described are methods, systems and computer readable media for dynamic updating of query result displays.