METHOD AND APPARATUS FOR DIRECTING ACQUISITION OF INFORMATION IN A SOCIAL NETWORK
20210150565 · 2021-05-20
Inventors
Cpc classification
International classification
Abstract
Methods and apparatus for directing access to sources of information through an information dissemination network are disclosed. Upon detecting a user's access to a source, a recommendation to access a complementing source is communicated to the user. The recommendation is based on affinity measures for pairs of sources of information. Upon detecting the user's access transition to a new source of information within a predefined time interval, inter-source transition scores are recorded and new affinity levels of the current source to the new source are determined for each affinity measure. Affinity merits corresponding to affinity levels are determined based on tracking a population of users and determining users' compliance with recommendation. A complementing source is selected according to affinity merits corresponding to specific affinity levels of a current source to a set of sources.
Claims
1. A method of directing information access comprising: executing instructions causing a hardware processor to perform processes of: acquiring inter-source affinity levels of a set of predefined affinity measures for each pair of sources of information of a set of sources of information; initializing affinity merits of said each pair to equal respective affinity levels; detecting access of a user of an information dissemination network to a current source of information; recommending to the user a complementing source of information based on affinity merits corresponding to specific affinity levels of the current source to a set of sources of information; detecting access transition of the user to a new source of information; accumulating inter-source transition scores; determining new affinity levels of the current source to the new source for each said affinity measure according to respective transition scores; and updating affinity merits for each said affinity measure according to said respective transition scores.
2. The method of claim 1 further comprising: dividing the inter-source affinity levels corresponding to each affinity measure of the set of affinity measures into a respective number of affinity-level bands; grouping said inter-source transition scores according to respective affinity-level bands; and determining an affinity merit corresponding to each affinity-level band as a ratio of a respective transition score to total transition score.
3. The method of claim 1 further comprising: dividing the inter-source affinity levels corresponding to each affinity measure of the set of affinity measures into a respective number of affinity-level bands; and for each affinity measure and a respective affinity-level band: increasing a recommendation score; increasing a compliance score subject to a determination that the new source is the recommended source; determining an affinity merit as a ratio of a respective compliance score to a respective recommendation score.
4. The method of claim 2 further comprising: determining, for each affinity measure, an interpolating function for determining an affinity merit corresponding to an arbitrary affinity level, said interpolating function based on affinity merits corresponding to affinity-level bands; and determining said affinity merit corresponding to said specific affinity levels according to the interpolating function.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0012] Embodiments of the present invention will be further described with reference to the accompanying exemplary drawings, in which:
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
TERMINOLOGY
[0053] Information tracking system: The term refers to apparatus and means for interaction with an information dissemination system to identify patterns of users' access to information [0054] User: The term denotes a member of a population under consideration for developing a marketing system for specific commodities or for acquisition of data aiming at gaining insight for policy development. The population may include users of social media or respondents to surveys, among many other entities. [0055] Characteristics of a user: The characteristics of a user represent slowly-varying properties (such as age or income), quasi-static properties (such as height of an adult), and/or permanent attributes such as place of birth. The characteristics of a user may comprise numerous attributes represented as a vector. [0056] Traits of a user: The traits of a user represent evolving properties, such as societal views, favourite entertainment or sport, etc. [0057] Cluster: A population under consideration may be segmented into a number of clusters according to values of a predefined set of characteristics for each member (user) of the population. The number of clusters may be predefined or determined automatically under specific constraints. A user may be characterized according to the user's affiliation with one of a predefined number of clusters and, possibly, the user's proximity to the centroid of the cluster. [0058] Community: Members of the population (users) possessing a specific trait form a respective community. The number of communities equals the number of predefined traits of interest. A user belongs to a one cluster but may belong to numerous communities. [0059] Article: The term refers to any computer readable file which may comprise a text, drawings, pictures, an audio signal, or a video signal. The term may refer to any information source, such as a website not just a file posted in a website. [0060] Article access: The act of reading, viewing, or listening to an article is referenced as “article access” or “article inspection”. [0061] Article transition: The act of successively accessing two articles within a predefined interval of time is referenced as “article transition” or simply “transition”. [0062] Article succession: An article succession comprises two articles accessed by a same user [0063] Article's metadata: When access to an article is detected, the system of the invention acquires information relevant to both the article and a network user accessing the article. The information may include an identifier of the article, data characterizing the article, an identifier of the user and data relevant to the user's characteristics and traits. The acquired information is collectively referenced as the article's metadata. [0064] Article's characterization data: The term refers to content of an article and/or metadata of the article such as “author”, “language”, “topic”, “file-size”, etc. The topic may be one of predefined classifications such as art, sports, travel, cooking, politics, philosophy, history, computing, finances, etc. [0065] Reference article: When a user accesses a first article then a second article, the first article is said to be the reference article. [0066] Complementing article: An article perceived to provide information of interest to a user accessing a current article is referenced as a complementing article of the current article. [0067] Usage data: Usage data comprises: [0068] identifiers of accessed articles and succession of articles accessed by a same user; [0069] an overall score of the number of transitions from a reference article to other articles; [0070] a cluster-specific score of the number of transitions from a reference article to other articles; and/or [0071] a trait-specific score of the number of transitions from a reference article to other articles. [0072] Inter-article similarity level: The similarity level of two articles (information sources) is a measure of resemblance between the contents of the two articles or complementariness of the two articles where the two articles are of the same topic. Known techniques may be applied to determine a similarity measure of two files. For a collection of a large number of articles (tens of thousands, for example), content similarity may be assessed for an article pair only after inspecting the articles metadata; for example, there is no point in comparing lengthy contents of an article on cooking and an article on philosophy. Metadata comparison may also be appropriate for determining similarity between a lengthy text file and a video file containing few words; there would a high level of similarity between an article on the benefits of Yoga and a video recording of Yoga poses. Thus, a user who accesses a word article may be directed to an audio article or a video article, and vice versa. [0073] Inter-article gravitation score: The term refers to a count of incidences of successive articles accessed by a same user regardless of the characteristics of the user. [0074] Inter-article gravitation measure: The term refers to a measure based on a count (score) of the number of transitions from one article to another. A gravitation measure is independent of the types of users effecting the transitions. [0075] Inter-article gravitation level: The term refers to a ratio of an inter-article gravitation score from a first article to a second article to the total number of transitions from the first article to all other articles. [0076] Inter-article attraction score: The term refers to a count of transitions (incidences of successive articles accessed by a same user) for a specific cluster of users. The score may take into account proximity of descriptors of a user to descriptors of the centroid of a respective cluster. [0077] Inter-article attraction level: The term refers to a ratio of an inter-article attraction score from a first article to a second article to the total number of transitions from the first article to all other articles effected by users of a same cluster. [0078] Inter-article interest score: The term refers to a count of transitions (incidences of successive articles accessed by a same user) for users of a same community. [0079] Inter-article interest measure: The term refers to a measure based on inter-article interest scores. [0080] Composite affinity level: For a directed article pair and a specific user characteristic, a composite affinity level is determined as a function of the similarity level and the gravitation score of the article pair, as well as the attraction score which depends on the user's characteristics and the interest score which depends on the user's traits. Where the user's characteristics or traits are unknown, the composite affinity level is determined as a function of only the similarity level and the gravitation score of the article pair. [0081] Measure of effective recommendations: The term refers to a value indicating effectiveness of recommendations, such as a proportion of transitions obeying recommendations or a (positive) change in mean value of a composite measure of affinity. [0082] Merit of an affinity measure: The merit of any of the affinity measures (similarity, gravitation, attraction, or interest) is defined as either a proportion of article transitions corresponding to a respective value of an affinity measure, or users' compliance to recommendations corresponding to a respective value of an affinity measure. [0083] Randomly sequenced round robin process: The term refers to selecting items from different sets in a random order. [0084] system age: The time period since the start of tracking users is referenced as “system age”. [0085] Concurrence period: The term “concurrence period” (or “simultaneity period”) refers to a time period during which two articles have been available for user access.
REFERENCE NUMERALS
[0086] 100: Methods of guiding social-network information acquisition [0087] 110: Process of detecting user's access to an article [0088] 120: Process of selecting an article of significant affinity based on values of multiple affinity measures [0089] 140: Process of selecting an article of significant affinity based on merits of multiple affinity measures [0090] 150: Process of determining a merit of an affinity measure based on a proportion of article transitions [0091] 160: Process of determining a merit of an affinity measure based on compliance to recommendation [0092] 170: Process of determining a favorite succeeding article and communicating respective information to user [0093] 200: Overview of a first method of guiding social-network information acquisition [0094] 210: Process of monitoring article transitions [0095] 220: Process of determining article-pair affinity levels, for each affinity measure [0096] 230: Process of selecting, and communicating, a successor article based on article-pair affinity level [0097] 300: Overview of a second method of guiding social-network information acquisition [0098] 330: Process of determining an article-pair merit for each affinity measure [0099] 340: Process of selecting, and communicating, a successor article based on article-pair merits [0100] 400: Details of second method of guiding social-network information acquisition [0101] 410: Processes of initializing an array of tuples for each affinity measure (pre-processing stage) [0102] 420: Process of detecting a current-user's access to an article [0103] 430: Process of acquiring information relevant to article and user [0104] 440: Process of determining article-pair affinity levels for each predefined affinity measure [0105] 450: Process of determining a merit value for each affinity measure [0106] 460: Process of determining, and communicating, a succeeding article based on article-pair merit values [0107] 470: Process of determining response to recommendation [0108] 510: Process of identifying a set of candidate articles of significant affinity to a current article [0109] 520: Process of evaluating each candidate article [0110] 530: Process of determining an affinity merit for each of predefined affinity measures [0111] 540: Process of determining a composite affinity merit [0112] 550: Process of performing a weighted random selection to determine a recommended article [0113] 620: Process of receiving an identifier of a recommended article [0114] 630: Process of monitoring user's action over a prescribed time window [0115] 640: Process of determining whether the user accessed another article within the time window [0116] 650: Process of identifying a selected article [0117] 660: Process of updating a score of article transition [0118] 700: Process of updating article-selection scores (relevant to process 150) [0119] 710: Process of determining affinity levels for different affinity measures corresponding to an article transition [0120] 720: Process of determining a merit value for a selected affinity parameter [0121] 730: Process of identifying a specific tuple corresponding to the selected affinity parameter [0122] 740: Process of increasing current score stored in an element of the specific tuple [0123] 750: Process of computing a merit value corresponding to the affinity parameter as a proportion of all article transitions from a reference article [0124] 760: Process of updating a merit vector corresponding to the selected affinity parameter [0125] 800: Process of updating article-selection scores (relevant to process 160) [0126] 810: Process of determining affinity levels of different affinity measures corresponding to an article transition [0127] 820: Process of determining a merit value for a selected affinity measure [0128] 830: Process of identifying a tuple corresponding to selected affinity measure [0129] 840: Process of ascertaining user's compliance with recommendation [0130] 850: Process of increasing current score stored in a first element of the specific tuple [0131] 860: Process of increasing current score stored in a second element of the specific tuple [0132] 870: Process of computing a merit value corresponding to the affinity parameter as a compliance ratio [0133] 880: Process of updating a merit vector corresponding to the selected affinity parameter [0134] 900: Clusters of users formed according to characteristics of individual users [0135] 920: Universe of tracked users [0136] 1000: Communities of users formed according to traits of individual users [0137] 1020: A community of users corresponding to a single trait [0138] 1100: Superposition of communities onto clusters [0139] 1200: Data required for determining harmonious article succession [0140] 1210: Article characterization data including articles' metadata and content storage address or network address for retrieving articles' contents [0141] 1220: Usage data including identifiers of accessed articles and succession of articles accessed by a same user [0142] 1225: Active-users' registry structured to identify users accessing an article during a moving time window or a window of a predefined number of most recently tracked users' access to articles [0143] 1230: Users' characterization data including identifiers of clusters of users [0144] 1240: Users' traits data including identifiers of communities of users [0145] 1250: Inter-article similarity levels based on comparing articles' contents [0146] 1260: Inter-article gravitation data based on tracking successive articles accessed by a same user regardless of the characteristics of the user [0147] 1270: Inter-article attraction data based on tracking successive articles accessed by a same user taking into account the user's affiliation such as a cluster to which the user belongs and proximity of descriptors of the user to descriptors of the centroid of the cluster [0148] 1280: Inter-article interest data based on tracking successive articles accessed by a same user taking into account the user's communities [0149] 1290: Composite affinity coefficients for directed article pairs [0150] 1300: Exemplary statistics of article-transition compliance ratio [0151] 1320: Collection of article pairs [0152] 1321: First set of article pairs [0153] 1322: Second set of article pairs [0154] 1323: Third set of article pairs [0155] 1324: Fourth set of article pairs [0156] 1340: Mean number of article recommendations per compliance [0157] 1350: Zone of autonomous transitions and/or effective recommendations [0158] 1360: Zone of ineffective recommendations [0159] 1400: Division of levels of each of affinity measures into respective bands [0160] 1410: Normalized affinity measure value (any affinity measure) [0161] 1420: Band of levels of similarity [0162] 1430: Band of levels of gravitation [0163] 1440: Band of levels attraction [0164] 1450: Band of levels of interest [0165] 1500: Captured article-transition scores [0166] 1510: Level of one of the affinity measures [0167] 1520: Transition score [0168] 1530: Cumulative transitions for a hypothetical case where article transition is independent of an affinity measure [0169] 1540: Cumulative transitions [0170] 1550: Interpolated cumulative transitions based on individual transitions [0171] 1560a: Number of transitions during a parameter band [0172] 1560b: Number of transitions during another parameter band [0173] 1600: Cumulative article-transition scores [0174] 1660a: Cumulative transitions during a parameter band [0175] 1660b: Cumulative transitions during another parameter band [0176] 1670: Interpolated cumulative transitions based on transition-count for parameter bands [0177] 1700: Captured transition scores for each affinity measure [0178] 1710: Index of similarity band [0179] 1712: Article-transition scores corresponding to similarity bands [0180] 1714: Proportion of article-transitions corresponding to similarity bands [0181] 1720: Index of gravitation band [0182] 1722: Article-transition scores corresponding to gravitation bands [0183] 1724: Proportion of article-transitions corresponding to gravitation bands [0184] 1730: Index of attraction band [0185] 1732: Article-transition scores corresponding to attraction bands [0186] 1734: Proportion of article-transitions corresponding to attraction bands [0187] 1740: Index of interest band [0188] 1742: Article-transition scores corresponding to interest bands [0189] 1744: Proportion of article-transitions corresponding to interest bands [0190] 1800: Merits of a specific affinity measure based on the article-transition scores [0191] 1810: Similarity values [0192] 1820: Proportion of inter-article transitions [0193] 1830: Similarity merit for a specific similarity band based on the article-transition scores [0194] 1840: Interpolated similarity merits [0195] 1900: Compliance scores for each affinity measure [0196] 1912: Compliance scores corresponding to similarity bands [0197] 1914: Increment of compliance score corresponding to a respective similarity band [0198] 1922: Compliance scores corresponding to gravitation bands [0199] 1924: Increment of compliance score corresponding to a respective gravitation band [0200] 1932: Compliance scores corresponding to attraction bands [0201] 1934: Increment of compliance score corresponding to a respective attraction band [0202] 1942: Compliance scores corresponding to interest bands [0203] 1944: Increment of compliance score corresponding to a respective interest band [0204] 2000: Noncompliance scores for each affinity measure [0205] 2012: Noncompliance scores corresponding to similarity bands [0206] 1914: Increment of compliance score corresponding to a respective similarity band [0207] 2022: Noncompliance scores corresponding to gravitation bands [0208] 2024: Increment of compliance score corresponding to a respective gravitation band [0209] 2032: Noncompliance scores corresponding to attraction bands [0210] 2034: Increment of noncompliance score corresponding to a respective attraction band [0211] 2042: Noncompliance scores corresponding to interest bands [0212] 2044: Increment of noncompliance score corresponding to a respective interest band [0213] 2100: Merits for each affinity measure based on compliance [0214] 2112: Similarity merit corresponding to a similarity band [0215] 2114: Computation of similarity merit. [0216] 2122: Gravitation merit corresponding to a gravitation band [0217] 2114: Computation of gravitation merit. [0218] 2132: Attraction merit corresponding to an attraction band [0219] 2114: Computation of attraction merit. [0220] 2142: Interest merit corresponding to an interest band [0221] 2144: Computation of interest merit. [0222] 2200: Interpolated merits for each affinity measure [0223] 2205: Normalized values of affinity measure [0224] 2210: Similarity-merit vector [0225] 2214: Similarity merit corresponding to a similarity value [0226] 2220: Gravitation-merit vector [0227] 2224: Gravitation merit corresponding to a gravitation value [0228] 2230: Attraction-merit vector [0229] 2234: Attraction merit corresponding to an attraction value [0230] 2240: Interest-merit vector [0231] 2244: Interest merit corresponding to an interest value [0232] 2300: Compliance-ratio (merit) for different bands of similarity levels [0233] 2310: Similarity level [0234] 2320: Score of recommended articles and selected articles [0235] 2330: Compliance ratio [0236] 2340: Score of recommended articles corresponding to a respective similarity band [0237] 2350: Score of selected articles corresponding to a respective similarity band [0238] 2360: Similarity merit corresponding to a respective similarity band [0239] 2370: Interpolated similarity merit for an arbitrary similarity level [0240] 2400: Hypothetical independence of compliance-ratio (merit) from similarity levels [0241] 2440: Score of recommended articles corresponding to a respective similarity band [0242] 2450: Score of selected articles corresponding to a respective similarity band [0243] 2460: Hypothetical similarity merit corresponding to a respective similarity band [0244] 2500: Compliance-ratio (merit) for different bands of gravitation levels [0245] 2510: Gravitation level [0246] 2540: Score of recommended articles corresponding to a respective gravitation band [0247] 2550: Score of selected articles corresponding to a respective gravitation band [0248] 2560: Gravitation merit corresponding to a respective gravitation band [0249] 2570: Interpolated gravitation merit for an arbitrary similarity level [0250] 2600: Compliance-ratio (merit) for different bands of attraction levels [0251] 2610: Attraction levels [0252] 2640: Score of recommended articles corresponding to a respective attraction band [0253] 2650: Score of selected articles corresponding to a respective attraction band [0254] 2660: Attraction merit corresponding to a respective attraction band [0255] 2670: Interpolated attraction merit for an arbitrary attraction level [0256] 2700: Compliance-ratio (merit) for different bands of interest levels [0257] 2710: Interest levels [0258] 2740: Score of recommended articles corresponding to a respective interest band [0259] 2750: Score of selected articles corresponding to a respective interest band [0260] 2760: Attraction merit corresponding to a respective interest band [0261] 2770: Interpolated attraction merit for an arbitrary interest level [0262] 2800: Comparison of compliance ratios corresponding to different affinity measures for different affinity-level bands [0263] 2810: Affinity levels for any of four affinity measures [0264] 2910: Reference article [0265] 2920: Candidate articles for recommendation to a user [0266] 2930: A composite affinity level based on affinity levels of the four affinity measures [0267] 2940: A composite affinity level based on affinity merits of the four affinity measures [0268] 3000: Affinity measures corresponding to a moving window of a predefined number of transitions. [0269] 3010: Affinity measure type [0270] 3020: Index of buffer cell [0271] 3040: Similarity level of successively accessed articles of a particular transition [0272] 3050: Gravitation level of successively accessed articles of a particular transition [0273] 3060: Attraction level of successively accessed articles of a particular transition [0274] 3070: Interest level of successively accessed articles of a particular transition [0275] 3080: Insertion of newest data to overwrite oldest data in a circular buffer [0276] 3100: Pairwise affinity-measures correlation [0277] 3102: Recommendations count [0278] 3104: Correlation coefficient [0279] 3110: Gravitation-interest correlation [0280] 3120: Similarity-gravitation correlation [0281] 3130: gravitation-attraction correlation [0282] 3140: Attraction-interest correlation [0283] 3150: Similarity-attraction correlation [0284] 3160: Similarity-interest correlation [0285] 3200: Cross-correlation of any two variables over a moving window of article-pair transitions [0286] 3210: Array of N Tuples, N>1 [0287] 3220: Tuple corresponding to a respective event [0288] 3230: Expression for determining cross-correlation of two variables u and v [0289] 3240: Expression, derived from expression 3230, for recursively determining cross-correlation of the two variables u and v for values of u and v within a moving window. [0290] 3300: Process of updating window data [0291] 3310: Buffer-cell indices [0292] 3320: Zero-initialized buffer cells [0293] 3321: Tuple elements holding similarity levels [0294] 3322: Tuple elements holding gravitation levels [0295] 3330: Transition indices [0296] 3400: Process of updating window data—continued [0297] 3500: Method for fast computation of cross-correlation of any two affinity measures, following each article transition [0298] 3510: Process of work-arrays initialization [0299] 3520: Process of receiving, retaining, and updating records of tracked data [0300] 3530: Process of defining window boundaries [0301] 3540: Process of updating cumulative data [0302] 3550: Process of computing a correlation coefficient [0303] 3600: Method for fast computation of cross-correlation of any two affinity measures, at spaced article transitions [0304] 3645: Process of spacing correlation-coefficient computations [0305] 3700: Compliance ratio dependence on system's age and article-pair duration [0306] 3710: System age or article-pair coincidence period [0307] 3720: Compliance ratio corresponding to age or coincidence period [0308] 3730: Compliance ratio versus system age [0309] 3740: Compliance ratio versus coincidence period—case-1 [0310] 3750: Compliance ratio versus coincidence period—case-2 [0311] 3760: Compliance ratio versus coincidence period—case-3 [0312] 3800: Apparatus for tracking users and identifying a user's access to an article [0313] 3802: Current article [0314] 3804: Recommended article [0315] 3810: Tracking module [0316] 3820: Assembly of processors [0317] 3832: A memory device storing data relevant to information sources (articles) [0318] 3834: A memory device storing data relevant to clusters of users formed according to users' characteristics [0319] 3836: A memory device storing data relevant to users' communities formed according to users' traits [0320] 3842: Memory device storing similarity levels of article-pairs (pairs of information sources). A similarity vector of a specific article indicates a similarity level to each other article of a similarity level exceeding a prescribed value. [0321] 3844: Memory device storing a gravitation vector for each article. A gravitation vector of a specific article indicates a gravitation level to each other article of a gravitation level exceeding a prescribed value. [0322] 3846: Memory device storing an attraction vector for each article. An attraction vector of a specific article indicates an attraction level to each other article where the attraction level exceeds a prescribed value. [0323] 3848: Memory device storing an interest vector for each article. An interest vector of a specific article indicates an interest level relevant to each other article where the interest level exceeds a prescribed value. [0324] 3880: Memory devices storing software modules [0325] 3881: A memory device storing a software module for determining an appropriate information source (an appropriate article) to follow a currently accessed information source or article [0326] 3882: A memory device storing a software module for determining users' compliance scores [0327] 3883: Memory device storing software modules for updating mutual similarity levels of information sources or articles [0328] 3884: Memory device storing software modules for updating mutual gravitation levels of information sources or articles [0329] 3885: Memory device storing software modules for updating mutual attraction levels of information sources or articles [0330] 3886: Memory device storing software modules for updating mutual interest levels of information sources or articles [0331] 3900: Method of directing article selection [0332] 3910: Process of initializing inter-article affinity levels and affinity merits for each affinity measure [0333] 3920: Process of detecting user's access to a current article [0334] 3930: Process of recommending a complementing article to follow the current article based on respective affinity merits of a set of candidate complementing articles [0335] 3940: Process of detecting transition to a new article within a predefined time interval [0336] 3950: Process of updating inter-article transition scores [0337] 3960: Process of updating inter-article affinity level for each affinity measure [0338] 3970: Process of determining affinity merits corresponding to respective affinity levels [0339] 4020: Processes of determining affinity merits for a single affinity measure [0340] 4030: process of determining a specific affinity band of a specific affinity measure for a specific affinity level [0341] 4035: Process of selecting a method of determining affinity merits [0342] 4040: Process of increasing transition score for the specific affinity band determined in process 4030 [0343] 4050: Process of determining an affinity merit corresponding to the specific affinity band as a ratio of the transition score of the specific affinity band to the total transition score [0344] 4055: Process of selecting process 4060 if the new article is not the recommended complementing article or process 4070 inn the case of compliance (the new article being the complementing article) [0345] 4060: Process of increasing a noncompliance score corresponding to the specific affinity band [0346] 4070: Process of increasing a compliance score corresponding to the specific affinity band [0347] 4080: Process of determining an affinity merit corresponding to the specific affinity band as a ratio of the compliance score to the sum of compliance score and noncompliance score
DETAILED DESCRIPTION
[0348]
[0349] The new article selected according to method-I, option-1 of method-II, or option-2 of method-II is recommended to the user (process 170).
[0350]
[0351]
[0352]
[0353]
[0354]
[0355]
[0356]
[0357]
[0358]
[0359]
[0360]
[0361] Inter-article similarity levels 1250 are determined from article characterization data 1210 based on comparing articles' contents. Inter-article gravitation scores 1260 are determined from usage data 1220 based on tracking successive articles accessed by a same user regardless of the characteristics of the user. Inter-article attraction scores 1270 are determined from usage data 1220 based on tracking successive articles accessed by a same user taking into account the user's affiliation such as a cluster to which the user belongs and proximity of descriptors of the user to descriptors of the centroid of the cluster. Inter-article interest scores 1280 are determined from usage data 1220 based on tracking successive articles accessed by a same user taking into account the user's membership of communities. Composite affinity coefficients 1290 are determined according to similarity levels 1250, gravitation scores 1260, attraction scores 1270, and interest scores 1280.
[0362]
[0363] The objective is to guide a social-network user observing a specific article to switch to another article having a high affinity to the specific article. There may be numerous factors that influence the user's article selection. One of the factors conjectured to be effective is the inter-article similarity which may be based on content similarity. Another factor may be based on author's or advertiser's attributes.
[0364] Four affinity measures are considered for recommending an inter-article transition: (1) articles' similarity; (2), historical transition patterns of the entire population of users (gravitation); (3) historical transition patterns of users of same characteristics (attraction); and (4) historical transition patterns of users of the same traits.
[0365] Inter-article similarity levels are determined a priori. Inter-article transition patterns of users of the entire population of users define a gravitation measure. Thus, if a significant number of users has switched to article x.sub.1, after observing article x or any article of type y, then article xi would be considered a good candidate for transition from article x or any article of type y. Alternatively, if article x is of article type y, and if a significant number of users accessing articles of type y has switched to articles of type y.sub.1, then any article of type y.sub.1 would be considered a candidate for recommendation to access after accessing article x.
[0366] Inter-article transition patterns of users of the same cluster of users define an attraction measure. Thus, if a specific user is currently observing an article x of article type y, and a significant proportion of users belonging to the specific-user's cluster has switched to article x.sub.1 of type y.sub.1, after observing article x or any article of type y, then article x.sub.1 or any article of type y.sub.1, is considered a candidate for recommendation.
[0367] Inter-article transition patterns of users of the same traits define an interest measure. Thus, if a specific user is currently observing an article x of article type y, and a significant proportion of users having common traits with those of the specific user has switched to article x.sub.1 of type y.sub.1, after observing article x or any article of type y, then article x.sub.1 or any article of type y.sub.1, would be considered a candidate for recommendation.
[0368] Article selection is based on a composite affinity level based on similarity, gravitation, attraction, and interest. To evaluate the effectiveness of the method, users' compliance ratios over a moving window of a predefined number of transitions are determined, as illustrated in
[0369] For example, common traits of users accessing a specific article and users accessing a candidate article may determine the likelihood of a user selecting the candidate article following the specific article. A user may belong to one cluster but multiple trait communities. The intersection levels of trait communities of users who accessed an article x.sub.1 and trait communities of users who have accessed an article x.sub.2 may determine the likelihood of selecting article x.sub.2 to follow article x.sub.1. The same applies when article types, rather than specific articles, are considered.
[0370]
[0371]
[0372] A practical method of tracking compliance scores is to determine compliance count per affinity-measure band. For example, the number of transitions 1560a corresponds to band {0.250-0.375) of the normalized affinity measure and the number of transitions 1560b corresponds to band {0.875-1.0) of the normalized affinity measure.
[0373]
[0374]
[0375] The transition proportion corresponding to the bands of the similarity measure are denoted θ.sub.0, θ.sub.1, . . . , θ.sub.7 (reference 1714), where θ.sub.t=J.sub.t/(J.sub.0+J.sub.1, . . . +J.sub.7), 0≤t<8.
[0376] The transition proportion corresponding to the bands of the gravitation measure are denoted ϕ.sub.0, ϕ.sub.1, . . . , ϕ.sub.7 (reference 1724), where ϕ.sub.t=K.sub.t/(K.sub.0+K.sub.1, . . . +K.sub.7), 0≤t<8.
[0377] The transition proportion corresponding to the bands of the attraction measure are denoted ρ.sub.0, ρ.sub.1, ρ.sub.7 (reference 1734), where ρ.sub.t=L.sub.t/(L.sub.0+L.sub.1, . . . +L.sub.7), 0≤t<8.
[0378] The transition proportion corresponding to the bands of the interest measure are denoted ω.sub.0, ω.sub.1, . . . , ω.sub.7 (reference 1744), where ω.sub.t=H.sub.t/(H.sub.0+H.sub.1, . . . +H.sub.7), 0≤t<8.
[0379] Upon detecting an article transition, the indices 1710, 1720, 1730, and 1740 of bands of the similarity, gravitation attraction, and interest levels are determined. Each of the corresponding article-transition scores 1712, 1722, 1732, and 1742 is increased. The corresponding proportions 1714, 1724, 1734, and 1744 of article-transition are recomputed to be considered current merits; the proportion of article transitions for a specific band is a merit of a corresponding affinity measure.
[0380]
[0381]
[0382] The number of bands for each affinity measure is selected to be the same (eight) for ease of illustration. The compliance scores 1912 corresponding to the bands of the similarity measure are denoted J.sub.0, J.sub.1, . . . , J.sub.7. The compliance scores 1922 corresponding to the bands of the gravitation measure are denoted K.sub.0, K.sub.1, . . . , K.sub.7. The compliance scores 1932 corresponding to the bands of the attraction measure are denoted L.sub.0, L.sub.1, . . . , L.sub.7. The compliance scores 1942 corresponding to the bands of the interest measure are denoted H.sub.0, H.sub.1, . . . , H.sub.7.
[0383] Upon detecting an article transition following a recommendation, the indices 1910, 1920, 1930, and 1940 of bands of the similarity, gravitation attraction, and interest levels corresponding to the recommendation are determined. Each of the corresponding compliance scores 1912, 1922, 1932, and 1942 is increased. For a transition corresponding to affinity bands 5, 2, 4, and 5, each of compliance scores J.sub.5, K.sub.2, L.sub.4, and H.sub.5 is increased (references 1914, 1924, 1934, and 1944, respectively).
[0384]
[0385] Upon determining that a recommendation has not been followed, the indices 2010, 2020, 2030, and 2040 of bands of the similarity, gravitation attraction, and interest levels corresponding to the recommendation are determined. Each of the corresponding noncompliance scores 2012, 2022, 2032, and 2042 is increased. For noncompliance corresponding to affinity bands 2, 3, 1, and 4, each of noncompliance scores J.sup.(2), K.sup.(3), L.sup.(1), and H.sup.(4) is increased.
[0386]
[0387] The merits are determined as:
α.sub.x=J.sub.x/(J.sub.x+J.sup.(x));
β.sub.x=K.sub.x/(K.sub.x+K.sup.(x));
γ.sub.x=L.sub.x/(L.sub.x+L.sup.(x)); and
η.sub.x=H.sub.x/(H.sub.x+H.sup.(x));
[0388] references 2114, 2124, 2134, and 2144, respectively.
[0389]
[0390] Each of
[0391]
[0392]
[0393]
[0394]
[0395]
[0396]
[0397]
[0398] As mentioned earlier, inter-article similarity levels are determined a priori. The inter-article gravitation levels, attraction levels, and interest levels are determined from usage data as described above with reference to
[0399] For the candidate articles m.sub.0, m.sub.1, m.sub.2, m.sub.3, and m.sub.4, the levels of similarity to the current article are denoted s.sub.0, s.sub.1, s.sub.2, s.sub.3, and s.sub.4, respectively, the gravitation levels are denoted g.sub.0, g.sub.1, g.sub.2, g.sub.3, and g.sub.4, respectively, the attraction levels are denoted p.sub.0, p.sub.1, p.sub.2, p.sub.3, and p.sub.4, respectively, and the interest levels are denoted q.sub.0, q.sub.1, q.sub.2, q.sub.3, and q.sub.4, respectively.
[0400] The merit of each affinity measure corresponding to a respective affinity-measure level is determined according to transition proportion as illustrated in
[0401] Thus, the similarity merits a.sub.0, a.sub.1, a.sub.2, a.sub.3, and a.sub.4, corresponding to similarity levels s.sub.0, s.sub.1, s.sub.2, s.sub.3, and s.sub.4, respectively, are determined according to transition proportion as illustrated in
[0402] The gravitation merits b.sub.0, b.sub.1, b.sub.2, b.sub.3, and b.sub.4, corresponding to gravitation levels g.sub.0, g.sub.1, g.sub.2, g.sub.3, and g.sub.4, respectively, are determined according to transition proportion (reference 1724) or according to compliance (reference 2124).
[0403] The attraction merits c.sub.0, c.sub.1, c.sub.2, c.sub.3, and c.sub.4, corresponding to attraction levels p.sub.0, p.sub.1, p.sub.2, p.sub.3, and p.sub.4, respectively, are determined according to transition proportion (reference 1734) or according to compliance (reference 2134).
[0404] The interest merits d.sub.0, d.sub.1, d.sub.2, d.sub.3, and cd.sub.4, corresponding to attraction levels q.sub.0, q.sub.1, q.sub.2, pq.sub.3, and q.sub.4, respectively, are determined according to transition proportion (reference 1744) or according to compliance (reference 2144).
[0405] The selection of one of the candidates (m.sub.0, m.sub.1, m.sub.2, m.sub.3, and m.sub.4) may be based on a composite affinity level 2930 determined as a function of affinity levels of the four affinity measures (similarity, gravitation, attraction, and interest). Preferably, the selection may be based on a composite affinity merit 2940 determined as a function of affinity merits of the four affinity measures.
[0406]
[0407]
[0408] The figure illustrates exemplary cases of gravitation-interest correlation 3110; similarity-gravitation correlation 3120; gravitation-attraction correlation 3130; attraction-interest correlation 3140; similarity-attraction correlation 3150; and similarity-interest correlation 3160.
[0409]
[0410]
[0411] A case of only two affinity measures, the similarity level and the gravitation measure, is illustrated. Thus, each buffer cell stores a similarity level 3321 and a gravitation level 3322. For the first N transitions, a tuple of index λ, 0≤λ<N, is stored in buffer cell λ. As illustrated, the tuple of transition 7 is stored in cell 7 and the tuple of transition 15 is stored in cell 15.
[0412]
[0413]
[0414] Process 3510 initializes arrays U and V, used for storing values of a first affinity measure and values of a second affinity measure, to have zero entries. Each of arrays U and V has N entries, N being a number of transitions under consideration. The first affinity measure and second affinity measure may be any pair of the affinity measures under consideration; for example, similarity and attraction. An array Q of five entries holding values of intermediate variables is also initialized to have zero entries.
[0415] Process 3520 of receives, stores, and updates records of tracked data of a moving window. Process 3530 defines fill of arrays U and V. Initially, the number of entries within each of arrays U and V may be less than the buffer size of N entries. After N transitions, each of arrays U and V stores the most recent N entries. Process 3540 updates intermediate variables. Process 3550 computes a correlation coefficient.
[0416]
[0417]
[0418] At a given system age T, the mutual concurrence period of several articles may be less than T. Thus, the compliance ratio may be less than a compliance ratio for articles of mutual concurrence period T.
[0419]
[0420] A memory device 3832 stores data relevant to information sources. A memory device 3834 stores data relevant to user-cluster association, users' clustering data which may be acquired from external sources or generated within apparatus 3800 using acquired user characterization data and specific software modules. A memory device 3836 stores data relevant to user-communities association. The grouping of users into clusters based on user characteristics, or communities based on user traits, may be performed externally or within apparatus 3800. Memory device 3842 stores article similarity levels which may be determined externally and updated as users access new articles. The similarity data may be generated based on acquired article characteristics. Memory device 3844 stores a gravitation vector for article pairs based on identifiers of articles selected by any user following access to a current article. Memory device 3870 stores an attraction vector for each article for each user cluster based on identifiers of articles selected following each current article. Memory device 3848 stores an interest vector for each article based on identifiers of articles selected following each current article.
[0421] Memory devices 3880 store software modules. A memory device 3881 stores a software module containing instructions for determining an appropriate information source (an appropriate article) to follow a currently accessed information source (article). A memory device 3882 stores a software module containing instructions for determining users' compliance scores. Memory devices 3883, 3884, 3885, and 3886 store software modules for updating mutual similarity levels, gravitation levels, attraction levels, and interest levels, respectively, of information sources or articles.
[0422]
[0423]
[0424] Process 4040 increases transition score for the specific affinity band determined in process 4030. Process 4050 determines an affinity merit corresponding to the specific affinity band as a ratio of the transition score of the specific affinity band to the total transition score. Process 4055 determines compliance, or otherwise, with a recommendation of process 3930. Process 4060 is executed if the new article (process 3940) is not the recommended complementing article. Process 4070 is executed if the new article is the complementing article. Process 4060 increases a noncompliance score corresponding to the specific affinity band while Process 4070 increases a compliance score corresponding to the specific affinity band. Process 4080 determines an affinity merit corresponding to the specific affinity band as a ratio of the compliance score to the sum of compliance score and noncompliance score.
[0425] Although specific embodiments of the invention have been described in detail, it should be understood that the described embodiments are intended to be illustrative and not restrictive. Various changes and modifications of the embodiments shown in the drawings and described in the specification may be made within the scope of the following claims without departing from the scope of the invention in its broader aspect.