Site rank codex search patterns

Abstract

A Codex system of computers linked into a neural network continuously scans and gathers information from, understands, and interacts with, an environment, an optimizer software executing software instructions based on rules of grammar and semantics searches a encyclopedia of human knowledge to transform input into a search pattern. Then the Codex monetizes and commercializes each transformed input and corresponding optimal output. An artificial intelligence interaction software, hereinafter referred to as virtual maestro, uses the search pattern and optimal output to interact and engage scripted communication with the end user.

Claims

1. An evolving system with a memory and a processor that continuously scans and gathers information, understands, and interacts with, from an Internet search engine supercomputer forming a distributed massive scale neural network knowledge database, storing each recognized English language search pattern using rules of semantics, and each recognized geospatial search pattern used for determining a statistically significant one of geospatial and information indices to parse and filter the Internet, comprising: (i) web-crawlers continuously eliminating from calculation duplicative pages and improbable pages of the Internet using the Site and Page probabilities; wherein upon detecting a probable page assigning a Content probability based upon the semantic analysis of the content of the page; (ii) determining from the probable pages of the Internet using the Site, Page, and Content probabilities statistically significant information; (iii) at least on big data indexing link database that assigns a quality partition to each site from 0 to 10 based on the Site probability; (iv) the system adjusting the quality of a page based on one of the parent site and physical location proximity to the computing device comprising: a. assigning a unique probability value from lowest to highest to each site and page; b. assigning to each site and page a physical location with a valid address, GPS coordinates, and telephone number, wherein for each valid location probabilistically mapping geospatial information indices using country, state, city and zip code keyword combinations given the location; c. determining, with the system, a count of unique requests to each site and page; d. determining, with web crawlers, a count of unique hyperlinks to each site and page; e. determining, with the search engine, an output for each valid end user search pattern; f. adjusting the value for each page of the output based on the known quality of their parent site; g. determining a local scope physical location search of the output; and h. upon a positive determination attenuating any page that is not close to the computing device and adjusting the quality of the parent site of a page using a distance probability given the distance between the computing device and the assigned physical location of the page.

2. The evolving system of claim 1, further comprising: i. mapping a searchable environment given a valid search pattern; j. assigning a quality partition of zero to sites identified as viral content; k. assigning a quality partition of one to sites identified as spam content; l. Web-crawlers continuously performing one of scanning, filtering and distilling a Site and identifying one of duplicate and non-navigational pages and updating the link database; m. determining probabilistically improbable pages using Site and Page probabilities; upon a positive determination identifying the page as non-statistical significant; upon a negative determination identifying the page as a statistically significant and then assigning a content probability value based upon the semantic analysis of the content of the page and updating the link database; and n. mapping an improved environment upon eliminating from calculation web pages from the searchable environment that are identified as one of viral, spam, duplicate, non-navigational and improbable pages of the Internet as statistically non-significant.

3. The evolving system of claim 2, further comprising: i. mapping a searchable environment of the Internet; o. assigning to each statistically significant resource a unique identification number; p. determining from the improved Internet environment, with the link database, a count of unique hyperlinks to each statistically significant resource; q. determining from the improved Internet environment, with the link database, a count of unique hyperlinks to each page using the improved Internet environment; and assigning a ranking score to each page; and r. categorizing and subcategorizes with keywords and clusters each statistically significant resource and identifying, with big data indexing, one of natural variants and additional keywords.

4. The evolving system of claim 2, further comprising: i. mapping a searchable environment of the Internet; s. assigning to each site a unique corporate identification number; and t. determining, with the link database, a count of unique hyperlinks to each Site using the improved Internet environment; and assigning a ranking score to each Site.

5. The system of claim 4, further comprising: u. removing from one of super sites and sites having subordinate sites duplicate pages upon the analysis of all the subordinate sites; and t. determining, with the link database, a count of unique hyperlinks to each site using the improved Internet environment; and assigning a ranking score to each site.

6. The evolving system of claim 2, further comprising: u. determining from the improved Internet environment, with the system, a count of unique requests to each site; v. recalculating in real time the quality partition of each site based on a count of unique hyperlinks using the improved Internet environment; and w. recalculating given a quality partition and the subquality partition of each site based on a count of unique hyperlinks using the improved Internet environment.

7. The system of claim 2, further comprising: x. determining given a search pattern the highest value page rank pages of the improved Internet environment as the output and selecting the statistically significant top (n) responses of the output; and y. furnishing the top (n) results in an order from highest to lowest using the adjusted values for each page.

8. The system claim of 2, further comprising: x. determining given a search pattern the highest value page rank pages of the improved Internet environment as the output and selecting the statistically significant top (n) responses of the output; z. determining a location scope search of the output upon a positive determination performing a distance probability location scope search attenuating any page that is not close to the computing device GPS coordinates; aa. adjusting the quality of the parent site based on the proximity of the computing device to the page using one of filter and geospatial information indices; and bb. furnishing the top (n) results in an order from highest to lowest using the adjusted values using one of filter and geospatial information indices.

9. A method using an evolving system to continuously scans and gathers information, understands, and interacts with, from an Internet search engine supercomputer forming a distributed massive scale neural network knowledge database, storing each recognized English language search pattern using rules of semantics, and each recognized geospatial search pattern used for determining a statistically significant one of geospatial and information indices to parse and filter the Internet, comprising: (i) web-crawlers continuously eliminating from calculation duplicative pages and improbable pages of the Internet using the Site and Page probabilities; wherein upon detecting a probable page assigning a Content probability based upon the semantic analysis of the content of the page; (ii) determining from the probable pages of the Internet using the Site, Page, and Content probabilities statistically significant information; (iii) at least on Big Data Indexing Link database that assigns a quality partition to each site from 0 to 10 based on the Site probability; (iv) the system adjusting the quality of a page based on one of the parent site and physical location proximity to the computing device comprising: a. assigning a unique probability value from lowest to highest to each site and page and revaluating in real time the probability value to each site, and page upon detecting statistically significant page; b. assigning to each site and page a physical location with a valid address, GPS coordinates, and telephone number, wherein for each valid location probabilistically mapping geospatial information indices using country, state, city and zip code keyword combinations given the location; c. determining, with the search engine, an output for each valid end user search pattern; d. determining, for each page of the output the known quality of a site, using the Site probability and adjusting the value for each page of the output based on the known quality of weir parent site; e. determining a local scope physical location search of the output; and f. upon a positive determination attenuating any page that is not close to the computing device and adjusting the quality of the parent site of a page using a distance probability given the distance between the computing device and the assigned physical location of the page.

10. The method of claim 9, further comprising: i. mapping a searchable environment given a valid search pattern; j. assigning a quality partition of zero to sites identified as viral content; k. assigning a quality partition of one to sites identified as spam content; l. Web-crawlers continuously performing one of scanning, filtering and distilling a Site and identifying one of duplicate and non-navigational pages and updating the link database; m. determining probabilistically improbable pages using Site and Page probabilities; upon a positive determination identifying the page as non-statistical significant; upon a negative determination identifying the page as a statistically significant and then assigning a content probability value based upon the semantic analysis of the content of the page and updating the link database; and n. mapping an improved environment upon eliminating from calculation web pages from the searchable environment that are identified as one of viral, spam, duplicate, non-navigational and improbable pages of the Internet as statistically non-significant.

11. The method of claim 10, further comprising: o. assigning to each statistically significant resource a unique identification number; p. determining from the improved Internet environment, with the link database, a count of unique hyperlinks to each statistically significant resource; q. determining from the improved Internet environment, with the link database, a count of unique hyperlinks to each page using the improved Internet environment; and assigning a ranking score to each page; and r. categorizing and subcategorizes with keywords and clusters each statistically significant resource and identifying, with Big Data Indexing, one of Natural Variants and additional keywords.

12. The method of claim 11, further comprising: s. assigning to each site a unique corporate identification number; and t. determining, with the link database, a count of unique hyperlinks to each Site using the improved Internet environment; and assigning a ranking score to each Site.

13. The method of claim 12, further comprising: u. removing from one of Super Sites and Sites having subordinate Sites duplicate pages upon the analysis of all the subordinate Sites; and determining, with the link database, a count of unique hyperlinks to each site using the improved Internet environment; and assigning a ranking score to each site.

14. The method of claim 13, further comprising: v. recalculating in real time the quality partition of each Site based on a count of unique hyperlinks using the improved Internet environment; and w. recalculating given a quality partition and the subquality partition of each Site based on a count of unique hyperlinks using the improved Internet environment.

15. The method of claim 10, further comprising: x. determining given a search pattern the highest value page rank pages of the improved Internet environment as the output and selecting the statistically significant top (n) responses of the output; and y. furnishing the top (n) results in an order from highest to lowest using the adjusted values for each page.

16. The system claim of 15, further comprising: z. determining a location scope search of the output upon a positive determination performing a distance probability location scope search attenuating any page that is not close to the computing device GPS coordinates; aa. adjusting the quality of the parent site based on the proximity of the computing device to the page using one of filter and geospatial information indices; and bb. furnishing the top (n) results in an order from highest to lowest using the adjusted values using one of filter and geospatial information indices.

17. An evolving system with a memory and a processor that continuously scans and gathers information, understands, and interacts with, from an Internet search engine supercomputer forming a distributed massive scale neural network knowledge database, storing each recognized English language search pattern using rules of semantics, and each recognized geospatial search pattern used for determining a statistically significant one of geospatial and information indices to parse and filter the Internet, comprising: (i) web-crawlers continuously eliminating from calculation duplicative pages and improbable pages of the Internet using the Site and Page probabilities; wherein upon detecting a probable page assigning a Content probability based upon the semantic analysis of the content of the page; (ii) determining from the probable pages of the Internet using the Site, Page, and Content probabilities statistical significant information; (iii) at least on Big Data Indexing Link database that assigns a quality partition to each Site from 0 to 10 based on the Site probability; (iv) the system using one of filter and geospatial information indices adjusting the quality of a web page based on one of the parent Site quality and proximity to the computing device comprising: a. assigning a unique probability value from lowest to highest to each site and page and a Content probability based upon the semantic analysis of the content of the page; b. mapping an improved Internet environment upon eliminating one of viral, spam, duplicate, and improbable pages using the Site, Page, and Content probabilities; c. assigning to each site a quality value based on the Site probability; d. adjusting each page probability based on the known quality of their parent site and selecting the best statistically significant pages as the output; and e. determining the top (n) results from the output, with the evolving system, using the Site Quality, Page, and Content probabilities statistically significant information and furnishing the top (n) results from highest to lowest based on the Site Quality, Page, and Content probabilities.

18. The evolving system of claim 17, further comprising: u. determining from the improved Internet environment, with the system, a count of unique requests to each site; v. recalculating in real time the quality partition of each Site based on a count of unique hyperlinks using the improved Internet environment; and w. recalculating given a quality partition and the subquality partition of each Site based on a count of unique hyperlinks using the improved Internet environment.

19. The evolving system of claim 18, further comprising: x. determining given a search pattern the highest value page rank pages of the improved Internet environment as the output and selecting the statistically significant top (n) responses of the output; and y. furnishing the top (n) results in an order from highest to lowest using the adjusted values for each page.

20. The evolving system of claim 17, further comprising: x. determining given a search pattern the highest value page rank pages of the improved Internet environment as the output and selecting the statistically significant top (n) responses of the output; z. determining a location scope search of the output upon a positive determination performing a distance probability location scope search attenuating any page that is not close to the computing device GPS coordinates; aa. adjusting the quality of the parent site based on the proximity of the computing device to the page using one of filter and geospatial information indices; and bb. furnishing the top (n) results in an order from highest to lowest using the adjusted values using one of filter and geospatial information indices.

21. The evolving system of claim 20, further comprising: cc. mapping an optimal Internet environment upon eliminating one of viral, spam, duplicate, and non-statistically significant pages using the Site, Page, and Content probabilities.

Description

BRIEF DESCRIPTION OF THE FIGURES

(1) FIG. 1 Multilingual Search System

(2) FIG. 2 Updating the Multilingual Human Knowledge Encyclopedia

(3) FIG. 3 Updating the Surveillance Human Knowledge Encyclopedia

(4) FIG. 4 Virtual Maestro transforming input and mapping Entity object

(5) FIG. 5 Virtual Maestro proactively dialogues

Second Preferred Embodiment: Site Rank Codex Search Patterns

(6) FIG. 6 Codex continuously replicates the Internet

(7) FIG. 7 Codex updates the link database as web crawlers navigating every webpage.

(8) FIG. 8 Codex updates the link database as web crawlers navigating every website.

(9) FIG. 9 End User and Virtual Maestro Historical Profiles

(10) FIG. 10 Codex updates every End User and Virtual Maestro Profile.

(11) FIG. 11 Codex continuously updates each webpage with the link database

(12) FIG. 12 Codex continuously updates each website with the link database

(13) FIG. 13 Codex continuously updating profiles with the latest trending data

(14) FIG. 14 Codex updates the link database as web crawlers navigating every supersite.

(15) FIG. 15 Codex parses news, financial exchanges, social media interactions and trending data as human monitoring and evaluation indicia to update every supersite.

DETAILED DESCRIPTION

First Preferred Embodiment: Virtual Maestro Codex Search Patterns (U.S. Ser. No. 16/129,784)

(16) FIG. 1 Users 110 having a computer terminal or subscriber device 105, in digital communication with the Internet 140 and the Hive 150, a browser 120, and an artificial intelligence computer program 130 residing in memory executing a set of instructions to transform interactive input 115 into a request 119 using rules of semantics 165 to find missing gaps of information and pattern matching 167 the Codex 160 to find an Entity Object 175.

(17) U.S. Pat. No. 7,809,659 teaches an Entity Object 175 is bound to a simple layer of refinement or Superset (I) upon removing redundancy of the searchable environment, and subordinated associative Entity Object 175 are bound to a hybrid layer of refinement or Set (I, J), and that each subordinated transitive Entity Object 175 are bound to a complex layer of refinement or Subset (I, J, K), where I, J and K are independent variables. The top weighted results become the optimal dataset 189 that becomes the output sent to users 110 computer terminal 105.

(18) U.S. Pat. No. 7,809,659 teaches: W_RANK: Electronic Files, Internet links and the associated HTML content can be standardized, organized and transformed into geospatial information. W_RANK: uses eigenvectors to identify the value of each link and its content, and the system must normalize this information into a compatible Partial Vector CDR. The lion share of the conversion and normalization is done by the specialized programming of the system 150, which gathers, distills and analyzes the virtual environment by mapping, standardizing, organizing and transforming the information into logical groups and sets (knowledge graphs) to make them compatible and can also be personalized when using a historical end user profile.

(19) U.S. Pat. No. 8,977,621 teaches the collection of relevant top pages becomes the optimal dataset 189 that probabilistically maps an optimal environment 180, and each page and portions thereof having relevance to the Entity Object 175, 177 becomes Inventory Content 185 that are offered as ADDITIONAL KEYWORDS (Suggestions) that aid the end user to reach the destination.

(20) FIG. 2 The Web Crawler sub system 200 continuously monitors and scans changes in the Internet 140, coordinating Web Crawlers 205, to identify new webpages 210 and then using an artificial intelligence computer program 130 to parse data 220 using rules of grammar and semantics to process, validate, verify, and cleanse raw data 215, into primed data 219 comprehensible for human monitoring and evaluation and sends the primed data to the HIVE 150.

(21) U.S. Pat. No. 8,386,456 teaches the HIVE 150 continuously updates the Codex 160 with the primed data 219 to determine patterns of behavior or trend data 265 fluctuations that identify changes in the virtual environment and then derives significant portions 269 of the content to update in real time the Encyclopedia 170 and map entity objects 275 and subordinated layer of refinement natural variants 277. For each mapped (feature attributes and alternative suggestions) entity object 275, 277 belonging to a layer of refinement, the human knowledge encyclopedia 170 updates the corresponding optimal environment 180 and super glyph mathematical equation 285 is used to select the output that is sent to the end user.

(22) U.S. Pat. No. 7,809,659 teaches each (initial search query) entity object 275 has a join, simple, hybrid, complex and optimal layers of refinement, wherein the subject layers corresponding to the managerial hierarchical partition (alternative suggestions) sub partition for a given keyword search. And U.S. Pat. No. 7,908,263 teaches how to transform the keyword search into a mathematical lingua franca search pattern, and for each entity object corresponding layer of refinement the top (n) results become the optimal environment 180.

(23) FIG. 3 The Web Crawler sub system 200 continuously monitors and scans changes in the virtual environment or the Internet 140, coordinating a plurality of Web Crawlers 205, to identify protected New Documents 211 and then using an artificial intelligence computer program 130 to parse data 220 using rules of grammar to cleanse the raw data 215, into primed data 219 comprehensible for human monitoring and evaluation and sends the primed data to the Hive 150.

(24) The Hive 150 continuously updates the Codex 160 inventory control system with the primed data 219 to determine patterns of behavior or protected trend data 266 fluctuations to identify changes in the Internet. Then derives significant portions 269 of the content to update in real time the Encyclopedia 170 and map protected entity objects 276 and subordinated layer of refinement protected natural variants 278. For each protected mapped entity object 276, 278 belonging to a layer of refinement the Encyclopedia 170 updates the optimal environment 180 and super glyph map equation 285 used to derive the output that is sent to the end user.

(25) U.S. Pat. No. 7,809,659 teaches for each entity object 275 given a regular expression has a join, simple, hybrid, complex and optimal layers of refinement managerial hierarchical nested partitions for a given keyword search. U.S. Pat. No. 7,908,263 teaches transforms the keyword search into a mathematical lingua franca search pattern, and for each entity object corresponding layer of refinement the top (n) results become the optimal environment 180.

(26) FIG. 4 and FIG. 5 teaches how the Virtual Maestro continuously scans and gathers information from the virtual environment, and engages in a scripted dialogue with the end users, as it understands and is able to interact proactively thanks to the simulation input environmental bitmaps using the three samples approach is able to update an inverse ad hoc query as follows:

(27) TABLE-US-00004 (A) Monitoring Learns, helps, assists and teaches how to find something specific. (B) Reactive Smart input 4 independent variables by removes confounding elements. (C) Proactive Personal input 5 independent variables and makes the user decision. (D) Dialogue Direct input 6 independent variables engages in a dialogue as if alive.

(28) FIG. 4 Virtual Maestro transforming input and mapping Entity objects. The Hive 150 based on the math optimal request 419 trending and monetary values of the probabilistic spatial environment maps commercial Entity Objects 575, and Natural Variants 577, which is how the bills are paid, and 3rd parties compete to displays advertisements, creates a consolidated Inventory Control 585 comprising the Intertwining of ideas and concepts by gain factoring relevancy and attenuating irrelevancy and weighting input or commercial levels of satisfaction (needs) and interest (wants), demographic and geospatial data aiding the end user reach the final destination.

(29) For each single request the Virtual Maestro 700 continues to update the Super Glyph (Mathematical) Equation 285 as the user continues to select Inventory Content 185 command instructions and dynamically measures a plurality of optimal environments. Performing as follows: 1.sup.st [CX]: dynamically correlates ‘Related Objects’ belonging to the Input probabilistic spatial environment 701 and creates a dataset of commercialized Entity objects 575 and Natural Variants 577 offered as ADDITIONAL KEYWORDS that aid the end user to reach the destination.

(30) 2.sup.nd [DX]: performs Hot/Cold algorithm calculations of the related objects to identify Regular, Likely and Lucky probability entity objects variables that significantly improve a search pattern. 3.sup.rd [EX]: Cherry picks the top probable combination from Inventory Content 185 from the Input probabilistic spatial environment 701. 4.sup.th analyzes: each “as if the user has selected a particular” Codex Page 169 to enable data mining discovering. 5th the Scripted Algorithm 630 correlates: each Codex Page 169 and weights the Commercial Inventory Content 185. 6.sup.th: Virtual Maestro 700 continues to simulate input until a reaching combination that yields the destination.

(31) FIG. 5 Virtual Maestro proactively dialogues executing a set of informatics using the Scripted Algorithm computer program 630 to determine the Best Probable Branching responses 730 and picks the Best Probable (Associative) Response 740 to communicate with the user 110 based on the interactive input 115 as follows:

(32) (A) When Interactive Input Offers Natural Variants 750

(33) (B) When Assisted Input Communicates Best Response 760

(34) (C) When Smart Input Communicates Best Response 770

(35) (D) When Personalized Input Communicates Best Response 780

(36) The Virtual Maestro 700 proactively dialogues executing a set of informatics using the Scripted Algorithm computer program 630 to Weight Plausible Responses 785 and Picks Best Plausible (Transitive or Nth) Responses 790 and updates the Output 702 based on its own deductive reasoning check mate decision of how to dialogue with the user 110 and now, based on the Nth or Best Plausible Response 790, the Virtual Maestro 700 knows the final destination (input and output) and can dialogue 799 with the user 110 ‘as if alive’ or sentient!

(37) The scripted algorithm computer program 630 measures the valid collection set of Inventory Content 185, (comprising of the simulation environment input (based on an individual, group of related people or trending data, demographics for advertisement means, or similarly same subject matter requests) entity objects 175 and associative and transitive collection of natural variants 177). For example, once an event occurs many people will ask the same question, or make comments that the virtual maestro 700 will transform into trending and demographic input data. Based on the knowledge of a given event and their interaction about the same, the virtual maestro 700 can probabilistically reverse engineer a trending high frequency response (output) made by the request of plurality set of users into a personalized dialogue to a specific individual.

Second Preferred Embodiment: Site Rank Codex Search Patterns

(38) FIG. 6 Codex 160 continuously replicates, scans, scrubs, filters and distill data from the Internet and then updates 800 the link database with statistics of each resource, web page, website and supersite, and whether they are navigational or searchable. Determining unique source, non-duplicate, spam, viral and cookie trap content, and then parsing using rules of semantics each sentence, paragraph structure, and then verifying the meta keyword tags reflect the structure and semantics of the content and are not useless to the search. As the Codex continuously spawns 207 crawlers to web navigates the Internet, 209 reach each URL of a webpage, 230 determining if each webpage and associated ‘related objects are navigational and store the latest information of each object into 800 the link database. Each ‘related object’ or resource, webpage or page, website or site, and supersite are objects.

(39) Web crawlers 207 count unique incoming hyperlinks based on valid navigational URL (Uniform Resource Locator), and request Codex 160 data warehouses, for historical statistics 245 measuring traffic patterns and unique search clicks to URL belonging to a common denominator Website and Supersite. The Link Database 800 stores unique end user, virtual maestro, resources or ‘related objects’, web pages, websites or sites and super sites to determine SQL unique values when creating a table and SQL distinct values when updating a table. The Codex 260 ranks each supersite, site, and webpage with a probability (0.00 irrelevant to 1.00).

(40) FIG. 7 Codex 160 updates 800 the link database as web crawlers 207 navigate every webpage. U.S. Pat. No. 7,908,263 teaches Artificial Intelligence Spiders or web crawlers 207 “consists of automated programs that are designed to continuously gather, distill and analyze the environment in real time. The program after gathering information identifies new content to the known environment. For each page the program determines if the file has been deleted, moved, updated or is new”, then reads and parses documents to determine “SIGNIFICANT Data” that is deemed “NEW Data” or is identified as a change or “UPDATE Data” or modification of the URL as “MODIFY or MOVE” or the removal of an URL or “DELETE” when compared to the last version a web crawler navigated the webpage into 800 the link database. Codex 160 then 260 Ranks each webpage based on the “SIGNIFICANT DATA”, on 242 the change in the count of distinct hyperlinks to ‘related object’ in the webpage and 247 change in the frequency of search clicks to ‘related objects’ in the webpages. Then requests the 160 Codex, to identify 249 each user searching each resource, webpage, website and super site, and identify navigational 270 user Search Patterns and relevant Natural Variants. Upon detecting “SIGNIFICANT Data” deemed and stored as comprehensible human monitoring and evaluation data into 800 the link database. “SIGNIFICANT Data” or significant portions of the web page is 269 is not included.

(41) FIG. 8 Codex 160 updates 800 the link database as web crawlers 207 navigating every Site. U.S. Pat. No. 7,908,263 teaches the Artificial Intelligence Spiders or web crawlers 207 “consists of automated programs that are designed to continuously gather, distill and analyze the environment in real time. The program after gathering information identifies new content to the known environment. For every Site the program determines if a file has been deleted, moved, updated or is new”, then reads and parses documents to determine “SIGNIFICANT Data” that is deemed “NEW Data” or is identified as a change or “UPDATE Data” or modification of the URL as “MODIFY or MOVE” or the removal of an URL or “DELETE” when compared to the last version a web crawler navigated the webpage into 800 the link database. Codex 160 then 260 Ranks each webpage based on the “SIGNIFICANT DATA”, on 242 the change in the count of distinct hyperlinks to ‘related object’ in the webpage and 247 change in the frequency of search clicks to ‘related objects’ in the webpages. Then requests the 160 Codex, to identify 249 each user searching each resource, webpage, website and super site, and identify navigational 270 user Search Patterns and relevant Natural Variants. Upon detecting “SIGNIFICANT Data” deemed and stored as comprehensible human monitoring and evaluation data into 800 the link database, in order to adjust the values of the indices and thus updating real time responses. Detecting, cleansing and determining, “SIGNIFICANT Data” 269 or significant portions of data in the webpage is the primary objective of web navigating a Site 207, 209.

(42) FIG. 9 End User and Virtual Maestro Historical Profiles. The 160 Codex and 700 Virtual Maestro for each search pattern determine 180 an optimal environment and the 185 inventory content of ‘related objects’ such as people, keywords in the content, products such as audio, video, and shopping cart items, geospatial such as addresses, ANI (or telephones) and events such as news, financial, and sporting trending monitoring and evaluation indicia, and then based on the [DX] Hot/Cold Inventory sample update the historical end user profile for each valid Codex Page hierarchical set of corresponding human monitoring and evaluation indicia, which in turn the virtual maestro stores to be able to track as significant inventory content 185.

(43) FIG. 10 Assigning Quality partition based ranking value of a Webpage, the 160 Codex continuously updates the 800 Link Database upon 830 determining the unique count of incoming hyperlinks to a web page and 831 determining the unique count of search clicks to a web page in order to 832 determining a probabilistic ranking value for every web page and then 833 assign a quality partition from 0 to 10 given the webpage ranking value.

(44) FIG. 11 Assigning Quality partition based ranking value of a Website, the 160 Codex continuously updates the 800 Link Database upon 840 determining the unique count of incoming hyperlinks to a web page and 841 determining the unique count of search clicks to a web page in order to 842 determining a probabilistic ranking value for every web page and then 843 assign a quality partition from 0 to 10 given the website ranking value.

(45) FIG. 12 Codex updating each codex page in real time trending data, The Codex 160 upon updating the Link database performs the following tasks: 1.sup.st: 801 simulating for each codex page the optimal environment in real time and assigning a relative master index. 2.sup.nd: 802 continuously scanning the environment and updating each codex page as each new web page is identified having a higher value than the lowest value stored web pages. 3.sup.rd: 803 associates the new webpage and ‘related objects’ to the codex page and disassociate the lowest valued web page to the codex page and stores and updates changes in real time to the codex pages. 4.sup.th: 804 continuously stores and updates in real time the at least one collection of top (n) web pages, and the top (n) sites geospatial information and 5.sup.th: 805 continuously stores and updates in real time relative master index belonging to each codex page and then updates all the profiles.

(46) FIG. 13 Codex continuously updating profiles with the latest trending data. The Codex 160 upon finding significant difference trending data performs the following: First: 821 After each search updates the profile for each end user and virtual maestro. Second: 822 calibrates historical usage pattern of ‘related objects’ Third: 823 weights historical search pattern hot/cold natural variants analysis. Fourth: 824 updates historical GPS data weighting to modify search patterns. Fifth: 825 determines historical frequency of search patterns. Sixth: 826 performs historical analysis of monitoring and evaluation indicia. Finally: 827 Tracks significant difference news and content based on monitoring to keep all profiles current.

(47) FIG. 14 Assigning Quality partition based ranking value of a Supersite, the 160 Codex continuously updates the 800 Link Database upon 840 determining the unique count of incoming hyperlinks to a supersite and 841 determining the unique count of search clicks to a supersite in order to 842 determining a probabilistic ranking value for every supersite and then 843 assign a quality partition from 0 to 10 given the supersite ranking value.

(48) FIG. 15 Codex parses news, financial exchanges, social media interactions and trending data as human monitoring and evaluation indicia to update every supersite upon performing the following task: 1.sup.st: 806 determining at predefined time intervals the total number of web pages in the codex and for each codex page in its chain of command. 2.sup.nd: 807 determining at predefined time intervals the total number of significant difference changes in the Internet and then revaluing each site that updated one of its top ranked (n) web pages. 3.sup.rd: 808 cleansing or purifying data, mapping and plotting each element of the old master index into the new master index using the content value of the relative master index of the highest vector valued codex page. 4.sup.th: 809 continuously creating, storing, synchronizing and updating in real time the new master index that reflect the latest condition of the environment that is derived from the continuously detected significant changes and adjustments made to the codex. 5.sup.th: 899 Cleansing or purifying data, transforming and updating new master index and in turn the codex and the entire chain of command of codex pages. Once the Codex 160 creates a new master index and has all the relevant codex pages chain of command relative master indices, 800 the link database is able to attenuate using join SQL queries to remove from calculation websites and super site that are below the first threshold, the marks anything that fails the test as irrelevancy. Finally, using join SQL queries to ‘cherry picking’ from the output websites and super site that are above the Nth threshold, the marks anything that passes the test as la crème de la crème or optimal website.

First Preferred Embodiment: Virtual Maestro Codex Search Patterns (U.S. Ser. No. 16/129,784)

(49) Example 1. Virtual Maestro 700 as a Customer Service Representative: U.S. Pat. No. 7,058,601 teaches “the virtual environment optionally includes an internet chat room which provides real time communication among multiple users and between users and a broker”. Ser. No. 09/819,174 teaches “the virtual maestro is a product of artificial intelligence, since it would be impractical to provide a real person to process personal selections for each subscriber. The virtual maestro is represented by a virtual image, either of Beethoven or Mozart, in the virtual concert hall and will play specific song or video requests of an individual subscriber, on a pay per view basis. The profile is assembled from information the subscriber provides to become a subscriber and from a history of selections made by the subscriber through the system, and the profile is in effect how the particular subscriber is clothed in the virtual world.”

(50) The interaction between two humans not speaking the same language is buffered by the Virtual Maestro 700 using the Scripted Algorithm 630 formatted communications. Pierre speaks in French, the input is formatted into a script in the English (business lingua franca) and French, customer service receives the English script and the point of sale is populated by the artificial intelligence using the users profile information, the representative responds in English, the text is sent to Pierre's Virtual Maestro 700 that responds with a texted response in French. The transaction, trouble ticket or request for help is made between two humans not speaking in the same language (nor are they required to understand or speak both) using the Virtual Maestro 700 to be the medium of their communication. It is the object of the present invention to improve the Virtual Maestro 700 to act as a multilingual Customer Service Representative using Big Data.

(51) Example 2 Virtual Maestro 700 Acts as a Optimizer Communication Medium: The user 110 using a computing terminal 105 with a Virtual Maestro 700 in memory that executes informatics to transform the input 115 into a search pattern 329, and searches the Encyclopedia 170 to find the Codex Page 169, with the corresponding optimal dataset. In parallel the Virtual Maestro 700 receives the text information and communicates the highest valued reference source to the user 110. As the user 110 types or speaks “TIGER” the Virtual Maestro 700 uses the Script_Say(TIGER, “en.wikipedia.org”), to speak over the audio devices or as text “The tiger (Panthera tigris) is the largest cat species, most recognizable for their pattern of dark vertical stripes on reddish-orange fur with a lighter underside. The species is classified in the genus Panthera with the lion, leopard, and jaguar”, and offers images and videos of a TIGER, and also Panther, Lion, Leopard, and Jaguar, as additional keyword 175,177.

(52) Example 3: Virtual Maestro 700 multilingual communication: the end user searches for an incomplete search such as “THE” using the GIGO mantra the optimizer improve the quality from (???) to (?!) by matching the request to the English grammar meaning of a definition, which can then be used to respond in an language using similarly same semantics constructs such as Script_Say(“DUDE”, ENGLISH, USA), or Script_Say(“GUEY”, SPANISH, MEX). The Virtual Maestro 700 selects the best content to communicate with the user.

(53) Example 4. Advertisement Surveillance each time the Virtual Maestro 700 determines an user 110 wants to view or listen to licensed protected data such as audio or video, the software runs a script to determine if available credits exists to purchase licensing of the digital files or products, or alternatively using purchasing patterns, demographics and profile and social network characteristics can offer personalized, the system 100 offers the user 110 credits for mass media or automatically embedded advertisements for the purchase of licensed product.

(54) Example 5. Transactional Surveillance each time the Virtual Maestro 700 determines an user 110 has decided to perform a licensed transaction to view or listen protected data such as audio or video, the software runs a script to uses available credits or monies to purchase licensing of the digital files or products, or alternatively using purchasing patterns, demographics and profile and social network characteristics can offer personalized, mass media or automatically embedded advertisement to pay the licensee royalties or purchase product.

(55) Example 6. Virtual Maestro creates the Input Spatial Environment: the user 110 performs a valid search 1. “AMERCAN CIVIL WAR” and Superset(I) and the Virtual Maestro 700 identifies the type of search as assisted input, and maps an input spatial environment using US History, in particular events that occurred between 1861 and 1865, where geospatial data is USA and a list of valid States such as Virginia or Maryland. At this point, the encyclopedia subject events that belong to the historical events such as the Siege of Vicksburg, Battle of Gettysburg, or President Abraham Lincoln are probabilistically mapped as Input that can be offered to the user 110 using the benefit of U.S. Pat. No. 7,809,659 FIG. 80 as additional keywords or Entity Object 175 and Natural Variants 177 that can aid in reaching the final destination.

(56) For each entity object 175,177 associated to the concept and idea “American Civil War” the Virtual Maestro 700 searches the system using probable branching any nested transitive command decision, assuming the entity object will be selected by the end user. Each associative and transitive entity object is probabilistically mapped as the Input Spatial Environment 701.

(57) Example 7. Virtual Maestro creates the Output Spatial Environment: the end user searches 1. “AMERCAN CIVIL WAR” as Superset(I), and the Virtual Maestro 700 identifies the search as assisted input or scripted as Is Assisted. Upon building the Input Spatial Environment 701 the Virtual Maestro 700 executes software instructions using the scripted algorithm 630 and database to determine the best way to communicate with the end user.

(58) Using the basic Script_Say: the system 100 determines the end user 110 search is an assisted input and exists and possesses a preprocessed and precalculated Codex Page 169 and corresponding optimal dataset 189. The Virtual Maestro 700 identifies the content paragraph that maps probabilistically the first independent variable Superset (“AMERICAN CIVIL WAR”) as the response and probable encyclopedia subject subordinates as J the 2.sup.nd independent variables Set (“Siege of Vicksburg, 1863”), (“Battle of Gettysburg”, 1863) and (US President (Abraham Lincoln, “1861-1865) to name a few. Then maps probable branching entity objects 175 best responses for each Set(“American Civil War”, J), as K subordinate probable branching Natural Variants 177 Subset(“American Civil War”, J, K) as the Output Spatial Environment, such as J being Set (“Battle of Gettysburg”, 1863) and K as Subset(“Pickett's Charge”, Jul. 3, 1863).

(59) The output using the benefit of U.S. Pat. No. 8,676,667 that index refines to the Nth has preprocessed and precalculated the probability of each response belonging to the output, and using the benefit of U.S. Pat. No. 8,386,456 incorporates as the output the best response for each entity object 175,177 belonging to the Output Spatial Environment 702. The first best response or personalized dataset 199 is determined by Scripted Algorithm 630 using the end user's profile, and the latest values of the Hot/Cold Super Glyph equation, where Hot denotes relevant and trending ideas that are gained factored, and Cold denotes irrelevant and no longer valid based on the personalized vectors such as relevant GPS coordinates that are attenuated. The Virtual Maestro 700 uses the Output Spatial Environment 702 to communicate with the end user 110.

(60) Example 8. The Virtual Maestro dialogues using the Input Spatial Environment: The end user searches 1. “AMERCAN CIVIL WAR”, then adds by selecting 2. BATTLE OF GETTYSBURG, then adds by selecting “PICKETT'S CHARGE, then add by selecting 4. “HISTORICAL QUOTE”, and the Virtual Maestro 700 identifies the type of session as personal input. Where, the independent variables are as follows: I=“AMERICAN CIVIL WAR”, J=“BATTLE OF GETTYSBURG”, K=“PICKETT'S CHARGE”, and L=“HISTORICAL QUOTE”, and with the valid geospatial data US, PA, Gettysburg, Jul. 3, 1863.

(61) In this, case the end user built using the “AMERICAN CIVIL WAR” and then by selecting additional keywords, to map the different layers of refinement (Simple, Hybrid, Complex and Answer) General Pickett informing his commanding officer “General Lee I have no division”,

(62) Example 9. The Virtual Maestro dialogues using Output Spatial Environment: the end user searches 1. “WALMART”, then the virtual maestro using the GPS coordinates from the subscriber device adding 2. US, Florida, North Miami Beach, 33160, and the Virtual Maestro 700 identifies a smart search and renders a map based on the closest stores (A, B and C).

(63) Example 10. Virtual Maestro helps to eliminate confounding elements of the search: continuing with Example 17. the Virtual Maestro 700 dialogues using the Script Verify Location. First, determines using GPS of the interface device to know the user's present location, home or office, in this example from the user's home. Second, creates a dialogue based on the user's profile and present location the most probable stores selecting A and C. Third, the Virtual Maestro 700 dialogues with the user, asking: Are going to Store A or Store C from your present location? The user says Yes, from here to Store C. Alternatively, No, from my office to Store B. The dialogue with the user's help eliminates the confounding elements of the search!

Second Preferred Embodiment: Site Rank Codex Search Patterns

(64) Example 11: Creating the searchable environment: The end user makes a request that is transformed into a search pattern. The Codex 160 using 800 the link database counts any webpage and resource where the search pattern condition is true. Using simplified numbers, marketing, the searchable environment has 100,000,000 valid resources. The searchable environment is described as a zero significant difference gamma factor equation (n!−(n−6)!)/6!

(65) Example 12: Creating the improve environment: The Codex 160 using 800 the link data base calculating any webpage and resource where the search pattern condition is true and valid and the website value is greater than first threshold. Using simplified numbers, marketing, the improved environment has 10,000,000 valid resources. To those in the art the improved environment is described as a 1.sup.st significant difference gamma factor equation ((n−1!)−(n−6)!)/5!

(66) Example 13: Creating the 1.sup.st intermediate reduction calculation: The Codex 160 using 800 the link data base calculating any webpage and resource where the search pattern condition is true is valid and the website value is greater than first threshold. Then using human knowledge performs the 1st intermediate reduction calculation using subject matter as the second threshold test to the search Using simplified numbers, marketing, the improved environment has 1,000,000 valid resources. To those in the art the 1st intermediate reduction calculation is described as a 2.sup.nd significant difference gamma factor equation ((n−2!)−(n−6)!)/4! It is the object of the present invention to improve over U.S. Pat. No. 7,809,659 teaching the 1st intermediate reduction calculation using subject matter is how to build a Simple Subject Layer of refinement.

(67) Example 14: Creating the 2.sup.nd intermediate reduction calculation: The Codex 160 using 800 the link data base calculating any webpage and resource where the search pattern condition is true is valid and the website value is greater than first threshold. Then using human wisdom performs the 2.sup.nd intermediate reduction calculation using subject matter as the third threshold test to the search using simplified numbers, marketing, the improved environment has 10,000 valid resources. To those in the art the 2.sup.nd intermediate reduction calculation is described as a 3.sup.rd significant difference gamma factor equation ((n−3!)−(n−6)!)/3! It is the object of the present invention to improve over U.S. Pat. No. 7,809,659 teaching the 2.sup.nd intermediate reduction calculation using subject matter how to build a Hybrid Subject Layer of refinement.

(68) Example 15: Creating the 3.sup.rd intermediate reduction calculation: The Codex 160 using 800 the link data base calculating any webpage and resource where the search pattern condition is true is valid and the website value is greater than first threshold. Then using human understanding performs the 3.sup.rd intermediate reduction calculation using subject matter as the fourth threshold test to the search using simplified numbers, marketing, the improved environment has 100 valid resources. To those in the art the 3.sup.rd intermediate reduction calculation is described as a 4.sup.th significant difference gamma factor equation ((n−4!)−(n−6)!)/2! It is the object of the present invention to improve over U.S. Pat. No. 7,809,659 teaching the 3.sup.rd intermediate reduction calculation using subject matter how to build a Complex Subject Layer of refinement.

(69) Example 16: Creating the nth intermediate reduction calculation: The Codex 160 using 800 the link data base calculating any webpage and resource where the search pattern condition is true is valid and the website value is greater than first threshold. Then using human discernment performs the nth intermediate reduction calculation using subject matter as the nth threshold test to the search using simplified numbers, marketing, the optimal environment that has 100 valid resources. To those in the art the nth intermediate reduction calculation is described as a 5.sup.th significant difference gamma factor equation ((n−5!)−(n−6)!)/1! It is the object of the present invention to improve over U.S. Pat. No. 8,676,667 teaching nth intermediate reduction calculation using subject matter how to build an Answer Subject Layer of refinement.

(70) Example 17: Using the nth intermediate reduction calculation to cherry pick la crème de la crème: The Codex 160 using 800 the link database performed a set of intermediate reduction calculating using the interactive input search pattern from the end user. To those in the art the interactive search pattern is the (I) or input, and the subject matter or (S) intermediate reduction calculations are better improvement using human knowledge, wisdom, understanding and discernment. To those in the art (S) subject matter describes (T) topicality scores. Once the Codex 160 performs all the neural network calculations to the nth, the output is sent to the ‘Cherry Picking’ process of the 700 Virtual Maestro weights 185 inventory content of ‘related objects’ such as people, keywords in the content, products such as audio, video, and shopping cart items, geospatial such as addresses and ANI (or telephones) and events such as news, financial, and sporting trending monitoring and evaluation indicia, and then based on the [DX] Hot/Cold Inventory sample update the historical end user profile for each valid Codex Page hierarchical set of the human monitoring and evaluation indicia being tracked selects la crème de la crème. To those in the art la crème de la crème is the destination or optimal response given the REGEX.

(71) Example 18: From input to search pattern to la crème de la crème: Using the benefit of U.S. Pat. No. 7,809,659 subject layers of refinement, U.S. Pat. Nos. 7,908,263, 8,868,535, 8,977,621 gamma factor mathematics, U.S. Pat. No. 8,386,456 Codex and U.S. Pat. No. 9,355,352 personalized results starting from interactive input “AMERICAN CIVIL WAR” to the optimal dataset as follows:

(72) ZSD searchable environment has 100,000,000 valid resources everything is ‘Boolean’ valid.

(73) FSD improved environment has 10,000,000 attenuated spam and duplicates.

(74) SSD improved environment has 1,000,000 attenuated using human knowledge.

(75) TSD improved environment has 10,000 attenuated using human wisdom.

(76) QSD improved environment has 100 attenuate using human understanding.

(77) PSD optimal environment has 10 attenuate using human discernment.

(78) HSD optimal response has 1 or la crème de la crème upon ‘Cherry Picking the output.

(79) Codex Search Patterns 2020

(80) Big Data Indexing: Codex Search Patterns is now updated based on Big Data Indexing as follows “Big data is a field that treats ways to analyze, systematically extract information from, or otherwise deal with data sets that are too large or complex to be dealt with, by traditional data-processing application software. Data with many cases (rows) offer greater statistical power, while data with higher complexity (more attributes or columns) may lead to a higher false discovery rate. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer, visualization, querying, updating, information privacy and data source. Big data was originally associated with three key concepts: volume, variety, and velocity.”

(81) Rule 1: Volume: “The quantity of generated and stored data.”

(82) Rule 2: Velocity: “The speed at which the data is generated and processed to meet the demands and challenges that lie in the path of growth and development.”

(83) Rule 3: Veracity: “It is the extended definition for big data, which refers to the data quality and the data value,”

(84) Rule 4: Value: “The utility that can be extracted from e data.”

(85) Rule 5: Variability: “It refers to data whose value or other characteristics are s in relation to the context they are being generated.” en.wikipedia.org

(86) Rule 6: The volume is massive and complex since it is the Internet.

(87) Virtual Maestro as an Interface Device

(88) Rule 7: The Codex has real time velocity, where 95% of the responses and the lion share of the remaining responses occur under 1 second, and humanlike scripted communications and dialogue interaction execute software instruction with delays in the seconds.

(89) Rule 8: The Codex Encyclopedia and subject layers of index refinement describe veracity making sure that geospatial and semantics consistency exists in the best responses.

(90) Rule 9: Each resource is assigned, a Supersite rank, site rank probability value in an order from highest to lowest, where Supersite rank is used to identify of the quality value of la crème de la crème and Site rank is used to attenuate viral, spam and duplicates as irrelevancy.

(91) Rule 10: Search Patterns measure usage patterns of behavior, trending and live human monitoring and evaluation indicia, which describes to those in the art variability.

(92) Evolving System Equivalencies

(93) Rule 11: Virtual environment using the benefit of U.S. Pat. No. 9,355,352 The evolving fuzzy system can be describes as: (EFS) can be defined as self-developing, self-learning fuzzy rule-based or neuro-fuzzy systems that have both their parameters but also (more importantly) their structure self-adapting on-line. They are usually associated with streaming data and on-line (often real-time) modes of operation. In a narrower sense they can be seen as adaptive or evolving fuzzy systems. The difference is that evolving fuzzy systems assume on-line adaptation of system structure in addition to the parameter adaptation, which is usually associated with the term adaptive or evolving. They also allow for adaptation of the learning mechanism. Therefore, evolving assumes a higher level of adaptation of a virtual environment.

(94) Rule 12: Virtual Metadata can be described as: “is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource. Metadata is often called data about data or information about information.” “An important reason for creating descriptive metadata is to facilitate discovery of relevant information. In addition to resource discovery, metadata can help organize electronic resources, facilitate interoperability and legacy resource integration, provide digital identification, and support archiving and preservation.” Virtual Metadata serves the same functions in resource discovery as good cataloging does by allowing resources to be found by relevant criteria; identifying resources; bringing similar resources together; distinguishing dissimilar resources and giving location information.”

(95) Rule 13: Virtual Jesus: supercomputer command intelligent data warehouses, that transform input into a question and then search based on subject matter to improve the responses. The 2020 version is referred hereinafter as Virtual da Vinci 900 that is ubiquitous and does not makes responses in “red font” as describing the divinity and omnipresence of our Lord Jesus Christ. The magnifico instead transforms input into a search pattern with vector components such as geospatial, human knowledge, wisdom, understanding and discernment to make the search pattern into a high-quality question—answer response, where answers are communicated audibly.

Environment and Statistical Object Definitions

(96) Rule 14: Internet environment: comprises all of the ‘related objects’, webpages, sites and super sites that are navigational in the latest master index.

(97) Rule 15: Searchable environment: comprises all navigational ‘related objects’, webpages, sites and super sites given the search pattern a map a Superset (U) or ZSD.

(98) Rule 16: Improved environment: comprises all navigational ‘related objects’, webpages, sites and super sites given the search pattern a map a Superset (I) or SSD upon removing duplicates and using Site Rank to remove spam, viral content and redirection threats.

(99) Rule 17: Relevant environment: comprises the first sample or the square root of the size of the searchable environment, that is stored as the Superset (I) partial master index

(100) Rule 18: Subject Matter: comprises searching using data warehousing using business intelligence, statistical analysis and big data indexing of each valid Codex Page and hierarchical set of natural variants.

(101) Rule 19: Codex Page is the Superset given the search pattern 180 that comprises the searchable environment, that is attenuated/gain factored by Site ranking probabilities and further adjusted when corporate organization have Super ranking probabilities and real time news and exchange human monitoring and evaluation indicia, or alternatively social media, trending and reference subject matter collections data values.

(102) Rule 20: Super Sites are continuously updated as real time news events and financial exchange information is processed into primed data human monitoring and evaluation indicia

(103) Rule 21: Super Sites data is primed into human monitoring and evaluation indicia by web crawlers and the Virtual Da Vinci 900 upon receiving the primed data compares to social media, trending and reference subject matter collections data values to automatically update virtual maestros 700 that are tracking what is the craving need and of interest to the end user.

(104) Rule 22: Natural Variants 720 are Superset (I.sub.n) given the search pattern that comprises valid subject matter associative attributes using rules of semantics, when big data indexing.

(105) Rule 23: Probable Responses 740 are the Set (I.sub.n, J.sub.o) given the search pattern that comprises probable associative attributes using rules of semantics, when big data indexing.

(106) Rule 24: Plausible Responses 790 are the Subset (I.sub.n, J.sub.o, K.sub.p) given the search pattern that comprises probable associative attributes using rules of semantics, when big data indexing.

(107) 2020 Virtual Maestro and Virtual Da Vinci Expanding Big Data Indexing

(108) Rule 25: ‘Vueno, Vonito y Varato’, Spanglish marketing term, the evolving system must be good (informational certain), real time quality data and cheap or inexpensive to produce.

(109) Rule 26: Vim the virtual maestro is programmed to have vim, is no longer an interface device that monitors interactively input into a search pattern. Now, with vim or full of energy the virtual maestro continuously scans, gathers, cleanse, verifies, validates, distills and primes data and also tracks subject matter patterns, trending, social media, news, sport and entertainment events and financial exchanges data to communicate with the end user. Vim is what the evolving system intelligence ambience emulates to behaves as if a living organism.

(110) Rule 27: Vigor the virtual maestro is programmed to have vigor, as in vitality to grow and learn that monitors interactively changes in the environment, and determines, what is significant in order to highlight subject matter patterns, trending, social media, news, sport and entertainment events and financial exchanges data that could be probabilistically satisficing or of interest to the end user. Thus, the evolving system is with vim and vigor.

(111) Rule 28: Variant the virtual maestro performs for each search pattern a hierarchical dataset after performing subject matter big data indexing of the improved environment to identify the natural variants to the search. Natural variants are forecasted or alternative queries offered to the end user that are considered Superset (I) subordinates of the Superset (U) improved environment given a search pattern after removing redundancy, spam, viral content and low quality sites that fail to pass the (Page*Site probability) Superset (I.sub.n) threshold or top (n).

(112) Rule 29: Variant the virtual maestro performs for each search pattern a hierarchical dataset after performing subject matter big data indexing of the improved environment to identify the probable branching to each natural variant to the search. Probable branching natural variants are forecasts or alternative queries offered to the end user and are considered Set (I.sub.n, J.sub.o) subordinates of the Superset (U) improved environment given a search pattern removing results that fail the (Page*Site probability) Set (I.sub.n, J.sub.o) threshold.

(113) Rule 30: Variant the virtual maestro performs for each search pattern a hierarchical dataset after performing subject matter big data indexing of the improved environment to identify the plausible branching to each natural variant to the search. Plausible branching natural variants are forecasts or alternative queries offered to the end user and are considered Subset (I.sub.n, J.sub.o, K.sub.p) subordinates of the Superset (U) improved environment given a search pattern removing results that fail to pass the (Page*Site probability) Subset (I.sub.n, J.sub.o, K.sub.p) threshold.

(114) Rule 31: Multivariant hierarchical datasets, where the big data indexing stores as the Superset (I.sub.n) threshold the 1.sup.st sample or sqrt of the searchable environment, and Set (I.sub.n, J.sub.o) is threshold the 2.sup.nd sample or 2.sup.nd sqrt of the searchable environment, and Subset (I.sub.n, J.sub.o, K.sub.p) threshold the 3.sup.rd sample or 3.sup.rd sqrt of the searchable environment or optimal responses given the search.

(115) Rule 32: Figures out a dialogue: the system analyzes the multivariant hierarchical datasets and upon identifying a significant difference change in the environment, triggers a communication or dialogue event. The software determines if and how the information should be transmitted to the end user based on personal, craving needs values, and humanlike vim and vigor script guidelines. Avoiding trivial data and sending optimal satisficing data.

(116) Rule 33: Pattern matching thresholds: the virtual maestro 700 interface communicates using interactive input exact pattern matching threshold to respond to valid request. The natural variant communication threshold is 80% to respond given an exact pattern matching after searching thousands of combinations. The scripted reactive probable branching natural variant communications 90% likely threshold after searching millions of combinations. The scripted proactive dialogue 95% likely threshold after searching billions of combinations.

(117) Rule 34: Command and control computers: comprising virtual da Vinci 900 analyzes trillions of pertinent and relevant combination in real time to the end user's personal, social group, and/or demographic satisficing and interest big data indexing historical profile level values applying significant difference 1.sup.st, 2.sup.nd and 3.sup.rd samples variances. 1.sup.st: when applying a demographic dimension uses 1.sup.st sample variances. 2.sup.nd when also including end user social media friends use 2.sup.nd sample variances. 3.sup.rd: when including the ‘vueno, vonito and varato’ algorithm using demographic, social group, news events, trending data and the user personal historical tracking of craving needs, satisficing and interest values uses 3.sup.rd sample variances.

(118) Rule 35: Financial News Events: Upon determining from financial exchange or news sources that a stock has a significant news in view of its market cap value and demographic and social group, the system can notify the information as a natural variant given a personal historical tracking of craving needs, satisficing and interest values hierarchical set.

(119) Rule 36: Automatic notification: the system can notify the information as a probable branching natural variant given a personal historical tracking of craving needs, satisficing and interest values hierarchical set. For example: notify the end user upon determining a craving need, in this case, the Miami Dolphins won their first game of their lackluster 2019 season.

(120) Rule 37: Automatic responses: the system using the ‘Vueno, Vonito y Varato’ personalized mantra can notify the information as a plausible branching natural variant given a personal historical tracking of craving needs, satisficing and interest values hierarchical set. For example: notifying the user upon determining that Cristiano Ronaldo (craving need) scored a goal.

(121) Rule 38: Artificial Intelligence: Virtual Da Vinci, 900 upon sending a response to the virtual maestro to communicate with the end user, this action triggers an update of the virtual maestro and end user's profile, and the reverse engineering of the combination of vector component belonging to the search pattern to match the corresponding big data indexing Codex Page.

Third Preferred Embodiment: Virtual Da Vinci Supercomputer Simplifications

(122) Harmony, Balance and Proportion W_RANK Hierarchical Sets for Small Samples

(123) Rule 39: Zero Clusters the following applies: the searchable environment is set to 210, the improved environment size=100, the optimal environment size=10 and the optimal element size=4. The Superset (I) size=16, Set (I, J) size=4, and the Subset (I, J, K) size=2.

(124) Rule 40: Small sample calculations consider Site Quality Partitions 0 to 2 as irrelevant.

(125) Rule 41: When the searchable environment <=1,000 the following applies: the improved environment size=100, the optimal environment size=10 and the optimal element size=4. The Superset (I) size=20, Set (I, J) size=6, and the Subset (I, J, K) size=3.

(126) Rule 42: When the searchable environment <=10,000 the following applies: the improved environment size=100, the optimal environment size=10 and the optimal element size=4. The Superset (I) size=32, Set (I, J) size=8, and the Subset (I, J, K) size=4.

(127) Rule 43: When the searchable environment <=100,000 the following applies: the improved environment size=128, the optimal environment size=16 and the optimal element size=5 The Superset (I) size=64, Set (I, J) size=10, and the Subset (I, J, K) size=5.

(128) Rule 44: When the searchable environment <=1,000,000 the following applies: the improved environment size=256, the optimal environment size=32 and the optimal element size=6. The Superset (I) size=100, Set (I, J) size=16, and the Subset (I, J, K) size=6.

(129) Harmony, Balance and Proportion W_RANK Hierarchical Sets for Medium Samples

(130) Rule 45: Medium size calculations considering Site Quality Partitions <4 as irrelevant.

(131) Rule 46: When the searchable environment <=10,000,000 the following applies: the improved environment size=316, the optimal environment size=40 and the optimal element size=10. The Superset (I) size=128, Set (I, J) size=20, and the Subset (I, J, K) size=8.

(132) Rule 47: When the searchable environment <=100,000,000 the following applies: the improved environment size=512, the optimal environment size=64 and the optimal element size=12. The Superset (I) size=200, Set (I, J) size=32 and the Subset (I, J, K) size=10.

(133) Rule 48: When the searchable environment <=1 Billion the following applies: the improved environment size=1024, the optimal environment size=128 and the optimal element size=16. The Superset (I) size=256, Set (I, J) size=40 and the Subset (I, J, K) size 14.

(134) Harmony, Balance and Proportion W_RANK Hierarchical Sets for Large Samples

(135) Rule 49: Large sample size consider Site Quality Partitions <5 as irrelevant

(136) Rule 50: The searchable environment <=10 billion the following applies: the improved environment size 2048, the optimal environment size=256 and the optimal element size=32. The Superset (I) size=316, Set (I, J) size=50, and the Subset (I, J, K) size=18.

(137) Rule 51: The searchable environment <=100 billion the following applies: the improved environment size 4,096, the optimal environment size=64 and the optimal element size=24. The Superset (I) size=512, Set (I, J) size=64, and the Subset (I, J, K) size=24.

(138) Rule 52: The searchable environment <=1 trillion has the following applies: the improved environment size=10,000, the optimal environment size=1000 and the optimal element size=100. The Superset (I) size=1,024, Set (I, J) size=128, and the Subset (I, J, K) size 32.

(139) Rule 53: Huge sample size consider Site Quality Partitions <6 as irrelevant

(140) Rule 54: The searchable environment <=100 trillion the following applies: the improved environment size=100,000, the optimal environment size=10,000 and the optimal element size=1000. The Superset (I) size=2,048, Set (I, J) size=256, and the Subset (I, J, K) size 64.

(141) Rule 55: Massive sample size consider Site Quality Partitions <7 as irrelevant.

(142) Rule 56: The searchable environment <=10,000 trillion the following applies: the improved environment=1,000,000, the optimal environment=100,000 and the optimal element=10,000. The Superset (I) size=4,096, Set (I, J) size=512, and the Subset (I, J, K) size=128.

(143) Big Data Indexing Reference Subject Matter Layers of Refinement

(144) Rule 57: Big Data Indexing given the searchable environment performs subject layer of index refinement to remove irrelevancy and identify a Superset (U) given the search pattern.

(145) Rule 58: Big Data Indexing given the searchable environment performs the first subject layer of index refinement to identify a plurality of Natural Variants Superset (I.sub.n).

(146) Rule 59: Big Data Indexing given the optimal environment performs the second subject layer of index refinement to identify a plurality of probable branching Set (I.sub.n, J.sub.o).

(147) Rule 60: Big Data Indexing given the optimal element performing the third subject layer of index refinement to identify a plurality of plausible branching Subset (I.sub.n, J.sub.o, K.sub.p).

(148) Minimum Super Site Quality Partition Given the Market Value in USD (2020)

(149) Rule 61: Super Site with a market value >1 trillion USD are 10.

(150) Rule 62: Super Site with a market value >500 billion USD are 9++.

(151) Rule 63: Super Site with a market value >200 billion USD are 9+.

(152) Rule 64: Super Site with a market value >100 billion USD are 9.

(153) Rule 65: Super Site with a market value >10 billion USD are 8.

(154) Rule 66: Super Site with a market value >1 billion USD are 7+.

(155) Rule 67: Super Site with a market value >500 million USD are 7.

(156) Rule 68: Super Site with a market value >200 million USD are 6+.

(157) Rule 69: Super Site with a market value >100 million USD are 6.

(158) Rule 70: Big Data Indexing given the Super Site 6+ or better are never automatically removed from calculation as irrelevancy.

(159) Rule 71: Big Data Indexing given a searchable environment <=1000 remove from calculation Site Quality <3.

(160) Rule 72: Big Data Indexing given a searchable environment <=1 million remove from calculation Site Quality <4.

(161) Rule 73: Big Data Indexing given a searchable environment <=1 billion remove from calculation Site Quality <5.

(162) Rule 74: Big Data Indexing given a searchable environment <=1 trillion remove from calculation Site Quality <6.

(163) Rule 75: Big Data Indexing given a searchable environment >=10 trillion remove from calculation Site Quality <7. In otherwise, only calculate using high quality Super Sites.

(164) Rule 76: Big Data Indexing given a searchable environment from a valid search pattern and the corresponding mapping of subject matter hierarchical set uses Site and Super Site quality partition values to create the chain of command of entity knowledge objects.

(165) Virtual Da Vinci Valorization of the Hierarchical Set of Entity Knowledge Objects

(166) Rule 77: Superset (U) count distinct Super Site with value >6 from the searchable environment given a search pattern to select the best fit Codex Page when more than 1.

(167) Rule 78: Superset (I.sub.n) count distinct Super Site with value >6 from the improved environment given a search pattern to select the best fit Natural Variants.

(168) Rule 79: Set (I.sub.n, J.sub.o) count distinct Super Site with value >6 from the optimal environment given a search pattern to select the best fit probable branching Natural Variants.

(169) Rule 80: Subset (I.sub.n, J.sub.o, K.sub.p) count distinct Super Site with value >6 from the optimal environment given a search pattern to select the best fit probable branching Natural Variants.

(170) Rule 81: Search pattern environments with a count=0 are deemed irrelevant.

(171) Rule 82: Search pattern environments with a count=1 with a Super Site=10 are always deemed satisfying. To those in the art best fit describes the high probable alternative.

(172) Rule 83: W_RANK for search pattern environments such as Superset (U), Superset (I), Set (I, J) and Subject (I, J, K) objects is the total (Super Site Value) for the respective optimal element size. The highest valued W_RANK object is deemed of greater interest or satisfaction.

(173) Rule 84: W_CHANGE for search pattern environments such as Superset (U), Superset (I), Set (I, J) and Subject (I, J, K) objects at predefined time interval measures total difference (count of unique request to a Super Site webpage response value) for the respective optimal element size. The highest W_CHANGE object is deemed of the greatest usage and trend value.

(174) Rule 85: W_CHANGE=10, when it deemed of significant difference and great importance given the quality of Super Sites and unique end user.

(175) Rule 86: W_CHANGE <5 when the object is probabilistically deemed COLD given the end user's historical profile is considered irrelevant and skipped from further calculation.

(176) Rule 87: W_CHANGE >5 when the object is probabilistically deemed HOT given the end user's historical profile is consider relevant and skipped from further calculation. If W_CHANGE is 9+, 9++, and or 10, the virtual Maestro knows a la crème de la crème quality forecasted, or alternative query recommendation or direct communication was found.

(177) Example 19: Virtual Da Vinci detecting Breaking News: The Codex 160 using 900 Virtual da Vinci that searches link database 800 at predefined time intervals detects a new Master Index. Immediately, determines which Superset (U), Superset (I), Set (I, J) and Subject (I, J, K) objects have W_CHANGE value. The highest values are deemed Breaking News, and the frequency of change is deemed a 10 (from 0 to 10 basis) given the time interval. During the next World Cup 2022, lets assumer the final goes to a penalty kick shootout, Virtual Da Vinci, will immediately detect when the winner is determined. The result is followed by millions of people, and upon occurring will be updated to the end user's that have a craving need for this information.

(178) Example 20 Super Site News events triggers automatic response to the end user: When the W_CHANGE value=10, the virtual Maestro searches the end user's profile and upon pattern matching a tracking object will automatically respond to the end user. Otherwise, probabilistically, will start a script given the set of the W_CHANGE value objects and determine, if the end user want to receive the real time updates via text or audio and video if available. The W_CHANGE value can also be modified by the usage pattern of behavior of the end user profile, and the advertisement monies given the end user versus the advertiser promotional value. The same condition applies for non-trending responses using W_RANK value of the objects.

(179) Why is the Supersite probability value required? The Supersite is required when Corporate identification comprising a plurality of website, in order to replace the count of unique hyperlinks to a website and unique search clicks to resources to the website, with the count of distinct hyperlink to a supersite and distinct search clicks to resources to the website. Thus, when a supersite comprising a spam site over inflates the lion share of as duplicate and dependent hyperlinks to resources to supersite but not to each unique site when analyzed as a resultant vector given the same Corporate identification. The evolving system main objective is to remove redundancy when web master intent is hype the unique count of hyperlinks to a resource.

LIST OF ELEMENTS

(180) 100 Search Engine System 105 Computer Terminal, Subscriber Device or Smart Input Device 110 End User or Subscriber 115 Interactive Input 119 Request 120 Browser 130 Optimizer 135 Personal input 140 Internet 150 The Hive 155 HIVE SMP (Symmetric Multi-Processing) Artificial Intelligence Software 160 Codex Inventory Control System 165 Rules of Semantics 167 Pattern Matching 169 Codex Page 170 Human Knowledge Encyclopedia 175 Entity Object 177 Natural Variants 180 Optimal Environment 185 Inventory Control Content 189 Optimal Dataset 199 Personalized Dataset 200 Web Crawler Sub System 205 Web Crawler 207 Web Crawler navigating every Site 209 Reading each URL of a webpage 210 New Document 215 Raw Data 219 Primed Data (for human monitoring and evaluation) 220 Parse Data (using rules of grammar and semantics) 230 Determining if each webpage and associated ‘related objects’ are navigational 240 Counting unique hyperlinks to ‘related objects’ in the webpage. 242 Change in the count of distinct hyperlinks to ‘related objects’ in the webpage 245 Counting search clicks to ‘related objects’ in the web page 247 Counting the frequency of search clicks to ‘related objects’ in the web page 249 Identifying end users searching each resource, webpage, website and super site. 250 Determining for each resource a ‘related object’ type 260 Ranking each webpage 265 Trend Data (measures pattern of behavior) 266 Protected Trend Data (measures pattern of behavior) 269 Derive Significant Portions of Information 270 Identifying end user search patterns and relevant natural variants. 275 Map Entity Object 276 Protected Entity Object 277 Map Natural Variant 278 Protected Natural Variant 280 Mapping valid search pattern combinations given the ‘related object’ type 285 Update Super Glyph (Mathematical) Equation 630 Scripted Algorithm and Database 700 Virtual Maestro (artificial intelligence computer program product) 701 Input Probabilistic Spatial Environment 702 Output Probabilistic Spatial Environment 710 Weighted Output Natural Variants (feature attributes, or alternatives) 720 Pick Best Natural Variant 730 Best Response Probable Branching 740 Pick Best Probable Branching Response 785 Weighted Plausible Responses 790 Pick Best Plausible Response 799 Dialogue Best Plausible Responses with the End User 800 Link Database 810 End User Historical Profile given a valid Search Pattern 820 Virtual Maestro Profile given a valid Search Pattern 830 Determining the unique count of incoming hyperlinks to a web page 831 Determining the unique count of search clicks to a web page 832 Determining a probabilistic ranking value for every web page 833 Assign a quality partition from 0 to 10 given the web page ranking value 840 Determining the unique count of incoming hyperlinks to a website 841 Determining the unique count of search clicks to a website 842 Determining a probabilistic ranking value for every website 843 Assign a quality partition from 0 to 10 given the website ranking value 900 Virtual Da Vinci supercomputer artificial intelligence program device 910 Simulating for each codex page the optimal environment 911 Updating each codex page upon identifying a higher value webpage 912 Associate the new web page to the codex page storing and updating changes 913 Continuously updating at least one collection of top (n) web pages, and the top (n) sites geospatial information 914 continuously update relative master index belonging to each codex page 915 determining at predefined time intervals the total number of web pages in the codex and for each codex page in its chain of command 916 determining at predefined time intervals the total number of significant difference changes in the Internet and then revaluing each site that updated its top ranked (n) web pages 917 cleansing, mapping and plotting the old master index into the new master index using the content value of the relative master index of the highest vector valued codex page 918 continuously synchronize in real time the new master index that reflect the latest condition of the environment 919 cleansing, mapping and plotting the new master index and the Codex and the entire chain of command of codex pages 930 Determining the unique count of incoming hyperlinks to a Super site 931 Determining the unique count of search clicks to a Super site 932 Determining a probabilistic ranking value for every Super site 933 Assign a quality partition from 0 to 10 given the ranking value

Site rank codex search patterns

Inventors

Cpc classification

Classification Explorer

G06F16/337

PHYSICS

Classification Explorer

G06F16/951

PHYSICS

Classification Explorer

G06F16/3341

PHYSICS

Classification Explorer

G06F16/9535

PHYSICS

Classification Explorer

H04L67/02

ELECTRICITY

Classification Explorer

G06F16/243

PHYSICS

Classification Explorer

H04L67/52

ELECTRICITY

Classification Explorer

G06F16/3328

PHYSICS

International classification

Classification Explorer

G06F7/00

PHYSICS

Classification Explorer

G06F16/242

PHYSICS

Classification Explorer

G06F16/951

PHYSICS

Classification Explorer

H04L67/52

ELECTRICITY

Abstract

Claims

Description