METHOD AND SYSTEM FOR RECOMMENDATION OF DISEASE-RELATED RESOURCES
20220392646 · 2022-12-08
Inventors
Cpc classification
G16H20/40
PHYSICS
International classification
Abstract
Proposed are concepts for generating recommendations of disease-related resources in different geographical regions. In particular, it is proposed to leverage information about the popularity (e.g. frequency, rate or regularity of use) of keywords associated with diseases across different geographic regions. Using such information, a relevancy value (i.e. geography weighting score score) for a disease-related resource with respect to a specific geographic region may be calculated. These relevance values may then be used to generate recommendations for the disease-related resources for the specific geographic region.
Claims
1. A method for recommendation of disease-related resources in different geographical regions, the method comprising: identifying, from disease-related resource documents, a plurality of keywords associated with a plurality of diseases respectively; determining trend indexes for the plurality of diseases in a geographic region based on popularity of the identified keywords in the geographic region, wherein the trend index for a disease is indicative of relevancy of the disease in the geographic region; calculating a geography weighting score for disease-related resources in the geographic region based on the determined trend indexes; and generating a recommendation of disease-related resources based on the calculated geography weighting score.
2. The method of claim 1, wherein the geography weighting score comprises a vector representing relevancy of disease-related resources to the determined trend indexes in the geographic region.
3. The method of claim 1, wherein the disease-related resources comprises a plurality of educational contents or medical resources at a medical facility, and wherein the disease-related resource documents comprise a description of the educational contents or the medical resources at the medical facility respectively.
4. The method of claim 1, wherein determining a trend index for a disease in a geographic region comprises: analysing statistical data about the provision of one or more of the identified keywords to a search engine.
5. The method of claim 1, wherein generating a recommendation of disease-related resources is further based on a user interest.
6. The method of claim 1, wherein calculating a geography weighting score for disease-related resources in the geographic region comprises: identifying which of the plurality of diseases each of the disease-related resources relate to; and for each of the disease-related resources, calculating a geography weighting score based on the determined trend index for each of diseases the disease-related resource is identified as relating to.
7. The method of claim 6, wherein calculating a geography weighting score for disease-related resources in the geographic region comprises: representing the disease-related resources and diseases each of the disease-related resources relate to as a first matrix; representing the plurality of diseases and the determined trend indexes for the plurality of diseases as a second matrix; and performing a matrix multiplication with the first matrix and the second matrix to obtain a vector representing the geography weighting score for each of the disease-related resources.
8. A computer program product comprising computer program code means which, when executed on a computing device having a processing system, cause the processing system to perform all of the steps of the method according to claim 1.
9. A system for recommending disease-related resources in different geographical regions, the system comprising: a keywords extractor configured to identify, from disease-related resource documents, a plurality of keywords associated with a plurality of diseases respectively; a trend index generator configured to determine trend indexes for the plurality of diseases in a geographic region based on popularity of the identified keywords in the geographic region, wherein the trend index for a disease is indicative of relevancy of the disease in the geographic region; a calculation unit configured to calculate a geography weighting score for disease-related resources in the geographic region based on the determined trend indexes; and a recommendation unit configured to generate a recommendation of disease-related resources based on the calculated geography weighting score.
10. The system of claim 9, wherein the calculation unit is configured to calculate a vector representing relevancy of disease-related resources to the determined trend indexes in the geographic region.
11. The system of claim 9, wherein the disease-related resources comprises a plurality of educational contents or medical resources at a medical facility, and wherein the disease-related resource documents comprise a description of the educational contents or the medical resources at the medical facility respectively.
12. The system of claim 9, wherein trend index generator comprises: a data analysis unit configured to analyse statistical data about the provision of one or more of the identified keywords to a search engine.
13. The system of claim 9, wherein the recommendation unit is configured to generate a recommendation of disease-related resources further based on a user interest.
14. The system of claim 9, wherein the calculation unit is configured to: identify which of the plurality of diseases each of the disease-related resources relate to; and for each of the disease-related resources, calculate a geography weighting score based on the determined trend index for each of diseases the disease-related resource is identified as relating to.
15. The system of claim 14, wherein the calculation unit comprises: a matrix generator configured to represent the disease-related resources and diseases each of the disease-related resources relate to as a first matrix; a vector generator configured to represent the plurality of diseases and the determined trend indexes for the plurality of diseases as a second matrix; and a multiplication unit configured to perform a matrix multiplication with the first matrix and the second matrix to obtain a vector representing the geography weighting score for each of the disease-related resources.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0022] For a better understanding of the invention, and to show more clearly how it may be carried into effect, reference will now be made, by way of example only, to the accompanying drawings, in which:
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0029] The invention will be described with reference to the Figures.
[0030] It should be understood that the detailed description and specific examples, while indicating exemplary embodiments of the apparatus, systems and methods, are intended for purposes of illustration only and are not intended to limit the scope of the invention. These and other features, aspects, and advantages of the apparatus, systems and methods of the present invention will become better understood from the following description, appended claims, and accompanying drawings. It should be understood that the Figures are merely schematic and are not drawn to scale. It should also be understood that the same reference numerals are used throughout the Figures to indicate the same or similar parts.
[0031] The invention provides concepts for generating recommendations of disease-related resources in different geographical regions. In particular, it is proposed to leverage information about the popularity (e.g. frequency, rate or regularity of use) of keywords associated with diseases across different geographic regions. Using such information, a relevancy value (i.e. geography weighting score score) for a disease-related resource with respect to a specific geographic region may be calculated. These relevance values may then be used to generate recommendations for the disease-related resources for the specific geographic region.
[0032] Proposed embodiments may therefore be employed to provide geographic-dependent trend-aware recommendations that help to improve allocation of disease-related resources across different geographic regions.
[0033] Proposed embodiments may thus, for example, facilitate generation of personalized/customized recommendations disease-related resources that are tailored/adapted to specific geographic regions (e.g. towns, cities, postal codes, states, counties, countries, continents, etc.). Information about disease trends in different regions may therefore be used to provide refined disease-related resource recommendations.
[0034] According to a concept of the invention, there is proposed a method and system for recommendation of disease-related resources in different geographical regions. Keywords associated with diseases respectively are extracted from disease-related resource documents. Trend indexes for the diseases in a geographic region are then determined based on popularity of the identified keywords in the geographic region. Geography weighting score for disease-related resources in the geographic region are then calculated based on the determined trend indexes. Using the calculated geography weighting score, a recommendation of disease-related resources can be generated.
[0035] Illustrative embodiments may, for example, be employed in relation to healthcare-related digital services, include medical training/educational platform and/or medical resource allocation systems.
[0036] Implementations in accordance with the present disclosure relate to various systems, adaptions, and/or methods for recommending disease-related resources in different geographical regions. According to proposed concepts, a number of possible solutions may be implemented separately or jointly. That is, although these possible solutions may be described below separately, two or more of these possible solutions may be implemented in one combination or another.
[0037] By way of example, a first simplified embodiment will now be described with reference to
[0038]
[0039] The method begins with the step 110 of identifying, from disease-related resource documents, a plurality of keywords associated with a plurality of diseases respectively. By way of example, step 110 of identifying keywords may employ one or more of a plurality of known Natural Language Processing (NLP) algorithms. Step 110 of identifying keywords may alternatively (or additionally) comprise providing the disease-related resource documents to a machine-learning algorithm (e.g. neural network) to obtain one or more keywords as prediction results from the machine-learning algorithm. In this way, keywords may be identified/extracted from written descriptions of (medical-related) subject-matter and/or medical resources.
[0040] Next, in step 120, trend indexes for the plurality of diseases in a geographic region are determined based on popularity of the identified keywords in the geographic region. A trend index for a disease comprises a value indicative of a relevancy (e.g. popularity, prevalence or commonness) of the disease in the geographic region. In the example of
[0041] The trend indexes determined in step 120 are then used in step 130 to calculate a geography weighting score for disease-related resources in the geographic region. Here, a geography weighting score comprises a vector representing relevancy of disease-related resources to the determined trend indexes in the geographic region.
[0042] More specifically, in the example embodiment of
[0043] Finally, in step 140, a recommendation of disease-related resources is generated based on the calculated geography weighting score.
[0044] It is to be understood that the method described above with reference to
[0045] By way of further example and explanation, an exemplary embodiment of a system for recommending disease-related resources in different geographical regions will now be described with reference to
[0046]
[0047] The keywords extractor 210 is configured to identify, from disease-related resource documents, a plurality of keywords associated with a plurality of diseases respectively. Specifically, in the example embodiment of
[0048] The keywords extractor 210 can be realized by a known NLP algorithm (e.g. TF-IDF, Co-occurrence, and deep learning approaches) combined with expert (i.e. supervisor) experience.
[0049] The keywords extractor 210 generates and outputs a list of keywords 215 related to the diseases of interest, which may, for example, comprise ‘cough’, ‘flu’, ‘fever’, ‘COPD’, ‘NIV’, etc.
[0050] The list of keywords 215 is provided to the trend index generator 220 of the system 200. The trend index generator 220 is configured to determine trend indexes for the plurality of diseases in a geographic region based on popularity of the identified keywords in the geographic region. In particular, the trend index for a disease is indicative of relevancy of the disease in the geographic region. Determined trend indexes can therefore summarize the popularity of the keywords/diseases in different regions.
[0051] The trend index generator 220 may be implemented by leveraging data from a database 225 or search engine. For instance, the trend index generator 220 may employ a data analysis unit which undertakes statistical analysis about the provision of one or more of the identified keywords to a database or search engine in order to determine the popularity of keywords.
[0052]
[0053] The exemplary trend index data of
[0054] Referring back to
[0055] The calculated geography weighting scores are provided to a recommendation unit 240 which is configured to generate a recommendation (e.g. list) 250 of disease-related resources based on the geography weighting scores Such recommendations may, for example, be generated based on inferences such as those detailed above with reference to the exemplary data of
[0056] Referring now to
[0057] In the first exemplary application (depicted in
[0058] For example, a matrix representation of the disease-related resources (e.g. Learning Courses) and diseases each of the disease-related resources relate may provided as a first matrix as follows:
TABLE-US-00001 Flu Cough Ventilator Course A 1 1 0 Course B 0 0 1 Course C 1 0 1 Course D 0 1 0 Course E 1 1 1
[0059] The above first matrix indicates a one-hot encoding map to represent whether a course is related to each keyword—if yes it is 1, else it is 0. Alternatively, the matrix can be achieved by a machine-learning algorithm, such as an NLP-algorithm can determine which topics are related to the content and even a relevance score can be predicted rather than the 0 and 1 here.
[0060] Also, an exemplary matrix representation of the plurality of diseases and the determined trend indexes for the plurality of diseases may be provided as a second (score) matrix as follows:
TABLE-US-00002 Popularity Score Flu 0.4 Cough 0.38 Ventilator 0.22
[0061] The above second matrix represents the normalized popularity score for each keyword.
[0062] By performing a matrix multiplication with the first matrix and second matrix, the following weighted score vector is obtained:
TABLE-US-00003 Weighted Score Course A 0.78 Course B 0.22 Course C 0.62 Course D 0.38 Course E 1
[0063] The weighted score vector above thus indicates how relevant a course in relation to the disease trends. The weighted score vector can therefore be used to modify the prediction 420 (e.g. re-rank the initial recommendation list) so as to provide recommendations based on the disease trends. In this way, the generated recommendations) may not only account for user interest(s) but also account for potential knowledge needs (in view of disease trends).
[0064] In the second exemplary application (depicted in
[0065] Accordingly, it will be understood that the proposed concept(s) may facilitate geographical-dependent resource recommendation. Embodiments may thus provide for personalized (i.e. user-specific) disease-related resource recommendations, wherein the recommendations take account of one or more disease trends in a geographical region.
[0066] Proposed methods and system (including those of
[0067]
[0068] The computer 500 includes, but is not limited to, PCs, workstations, laptops, PDAs, palm devices, servers, storages, and the like. Generally, in terms of hardware architecture, the computer 500 may include one or more processors 510, memory 520, and one or more I/O devices 570 that are communicatively coupled via a local interface (not shown). The local interface can be, for example but not limited to, one or more buses or other wired or wireless connections, as is known in the art. The local interface may have additional elements, such as controllers, buffers (caches), drivers, repeaters, and receivers, to enable communications. Further, the local interface may include address, control, and/or data connections to enable appropriate communications among the aforementioned components.
[0069] The processor 510 is a hardware device for executing software that can be stored in the memory 520. The processor 510 can be virtually any custom made or commercially available processor, a central processing unit (CPU), a digital signal processor (DSP), or an auxiliary processor among several processors associated with the computer 500, and the processor 510 may be a semiconductor based microprocessor (in the form of a microchip) or a microprocessor.
[0070] The memory 520 can include any one or combination of volatile memory elements (e.g., random access memory (RAM), such as dynamic random access memory (DRAM), static random access memory (SRAM), etc.) and non-volatile memory elements (e.g., ROM, erasable programmable read only memory (EPROM), electronically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), tape, compact disc read only memory (CD-ROM), disk, diskette, cartridge, cassette or the like, etc.). Moreover, the memory 820 may incorporate electronic, magnetic, optical, and/or other types of storage media. Note that the memory 520 can have a distributed architecture, where various components are situated remote from one another, but can be accessed by the processor 510.
[0071] The software in the memory 520 may include one or more separate programs, each of which comprises an ordered listing of executable instructions for implementing logical functions. The software in the memory 520 includes a suitable operating system (O/S) 550, compiler 540, source code 530, and one or more applications 560 in accordance with exemplary embodiments. As illustrated, the application 560 comprises numerous functional components for implementing the features and operations of the exemplary embodiments. The application 560 of the computer 500 may represent various applications, computational units, logic, functional units, processes, operations, virtual entities, and/or modules in accordance with exemplary embodiments, but the application 560 is not meant to be a limitation.
[0072] The operating system 550 controls the execution of other computer programs, and provides scheduling, input-output control, file and data management, memory management, and communication control and related services. It is contemplated by the inventors that the application 560 for implementing exemplary embodiments may be applicable on all commercially available operating systems.
[0073] Application 560 may be a source program, executable program (object code), script, or any other entity comprising a set of instructions to be performed. When a source program, then the program is usually translated via a compiler (such as the compiler 540), assembler, interpreter, or the like, which may or may not be included within the memory 520, so as to operate properly in connection with the O/S 550. Furthermore, the application 560 can be written as an object oriented programming language, which has classes of data and methods, or a procedure programming language, which has routines, subroutines, and/or functions, for example but not limited to, C, C++, C#, Pascal, BASIC, API calls, HTML, XHTML, XML, ASP scripts, JavaScript, FORTRAN, COBOL, Perl, Java, ADA, .NET, and the like.
[0074] The I/O devices 570 may include input devices such as, for example but not limited to, a mouse, keyboard, scanner, microphone, camera, etc. Furthermore, the I/O devices 570 may also include output devices, for example but not limited to a printer, display, etc. Finally, the I/O devices 570 may further include devices that communicate both inputs and outputs, for instance but not limited to, a NIC or modulator/demodulator (for accessing remote devices, other files, devices, systems, or a network), a radio frequency (RF) or other transceiver, a telephonic interface, a bridge, a router, etc. The I/O devices 570 also include components for communicating over various networks, such as the Internet or intranet.
[0075] If the computer 500 is a PC, workstation, intelligent device or the like, the software in the memory 520 may further include a basic input output system (BIOS) (omitted for simplicity). The BIOS is a set of essential software routines that initialize and test hardware at startup, start the O/S 550, and support the transfer of data among the hardware devices. The BIOS is stored in some type of read-only-memory, such as ROM, PROM, EPROM, EEPROM or the like, so that the BIOS can be executed when the computer 500 is activated.
[0076] When the computer 500 is in operation, the processor 510 is configured to execute software stored within the memory 520, to communicate data to and from the memory 520, and to generally control operations of the computer 500 pursuant to the software. The application 560 and the O/S 550 are read, in whole or in part, by the processor 510, perhaps buffered within the processor 510, and then executed.
[0077] When the application 560 is implemented in software it should be noted that the application 560 can be stored on virtually any computer readable medium for use by or in connection with any computer related system or method. In the context of this document, a computer readable medium may be an electronic, magnetic, optical, or other physical device or means that can contain or store a computer program for use by or in connection with a computer related system or method.
[0078] The application 560 can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. In the context of this document, a “computer-readable medium” can be any means that can store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The computer readable medium can be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium.
[0079] The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
[0080] According to embodiments, a computer-readable storage medium stores a computer program comprising computer program code configured to cause one or more physical computing devices to carry out a method as described above when the program is run on the one or more physical computing devices.
[0081] Storage media may include volatile and non-volatile computer memory such as RAM, PROM, EPROM, and EEPROM. Various storage media may be fixed within a computing device or may be transportable, such that the one or more programs stored thereon can be loaded into a processor.
[0082] All of part of a schematic model according to an embodiment may be stored on a storage medium. Data according to an embodiment may be stored on the same storage medium or a different storage medium. The schematic model and/or data may be transmitted as a signal modulated onto an electromagnetic carrier wave. The signal may be defined according to a standard for digital communications. The carrier wave may be an optical carrier, a radio-frequency wave, a millimeter wave, or a near field communications wave. It may be wired or wireless.
[0083] To the extent that an embodiment is implemented partly or wholly in hardware, the blocks shown in the diagrams of
[0084] As used herein, a memory includes one or more of a non-transient computer readable medium; a magnetic disk or other magnetic storage medium; an optical disk or other optical storage medium; a random access memory (RAM), read-only memory (ROM), or other electronic memory device or chip or set of operatively interconnected chips; an Internet/Intranet server from which the stored instructions may be retrieved via the Internet/Intranet or a local area network; or so forth. Further, as used herein, a processor includes one or more of a microprocessor, a microcontroller, a graphic processing unit (GPU), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), and the like; a user input device includes one or more of a mouse, a keyboard, a touch screen display, one or more buttons, one or more switches, one or more toggles, and the like; a display device includes one or more of a liquid crystal display (LCD), an light emitting diode (LED) display, a plasma display, a projection display, a touch screen display, and the like; and databases include one or more memories.
[0085] The invention has been described with reference to the preferred embodiments. Modifications and alterations may occur to others upon reading and understanding the preceding detailed description. It is intended that the invention be construed as including all such modifications and alterations insofar as they come within the scope of the appended claims or the equivalents thereof.
[0086] It will also be understood that the disclosed methods may be computer-implemented methods. As such, there is also proposed a concept of a computer program comprising code means for implementing any described method when said program is run on a processing system.
[0087] The skilled person would be readily capable of developing a processor for carrying out any herein described method. Thus, each step of a flow chart may represent a different action performed by a processor, and may be performed by a respective module of the processing processor.
[0088] A proposed system may make use of a processor to perform data processing. The processor can be implemented in numerous ways, with software and/or hardware, to perform the various functions required. The processor typically employs one or more microprocessors that may be programmed using software (e.g. microcode) to perform the required functions. The processor may be implemented as a combination of dedicated hardware to perform some functions and one or more programmed microprocessors and associated circuitry to perform other functions.
[0089] Examples of circuitry that may be employed in various embodiments of the present disclosure include, but are not limited to, conventional microprocessors, application specific integrated circuits (ASICs), and field-programmable gate arrays (FPGAs).
[0090] In various implementations, the processor may be associated with one or more storage media such as volatile and non-volatile computer memory such as RAM, PROM, EPROM, and EEPROM. The storage media may be encoded with one or more programs that, when executed on one or more processors and/or controllers, perform the required functions. Various storage media may be fixed within a processor or controller or may be transportable, such that the one or more programs stored thereon can be loaded into a processor.
[0091] Variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed invention, from a study of the drawings, the disclosure and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage. A computer program may be stored/distributed on a suitable medium, such as an optical storage medium or a solid-state medium supplied together with or as part of other hardware, but may also be distributed in other forms, such as via the Internet or other wired or wireless telecommunication systems. If the term “adapted to” is used in the claims or description, it is noted that the term “adapted to” is intended to be equivalent to the term “configured to”. Any reference signs in the claims should not be construed as limiting the scope.