METHOD AND DEVICE FOR IDENTIFYING SPECIFIC REGION IN MICROORGANISM TARGET FRAGMENT AND USE THEREOF
20230129284 · 2023-04-27
Assignee
Inventors
- CONG JI (SHANGHAI, CN)
- JUNBIN SHAO (SHANGHAI, CN)
- YAN LIU (SHANGHAI, CN)
- XIA QI (SHANGHAI, CN)
- YUDAN JIN (SHANGHAI, CN)
- QITENG LI (SHANGHAI, CN)
Cpc classification
G16B25/20
PHYSICS
International classification
Abstract
The present disclosure provides a method for identifying a specific region in a microorganism target fragment, including the following operations: S100, respectively comparing a microorganism target fragment with genome sequences of one or more comparison strain one-to-one, and removing fragments of which the similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n; S200, respectively comparing the first-round cut fragments T.sub.1-T.sub.n with the remaining comparison strains, and removing fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as a candidate specific region of the microorganism target fragment; and S300, verifying and obtaining a specific region. The method is high in accuracy and sensitivity which can identify the subspecies level and a dual-verification module is provided. The present disclosure is suitable for identifying a specific region in the whole genome of any pathogen.
Claims
1. A method for identifying a specific region in a microorganism target fragment, at least comprising: S100, respectively comparing a microorganism target fragment with whole genome sequences of one or more comparison strains one-to-one, and removing fragments of which a similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n, wherein n is an integer greater than or equal to 1; S200, respectively comparing the first-round cut fragments T.sub.1-T.sub.n with whole genome sequences of the remaining comparison strains, and removing fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as a candidate specific region of the microorganism target fragment; and S300, verifying and obtaining a specific region: determining whether the candidate specific region meets the following requirements: 1) searching in public databases to find whether there are other species of which a similarity to the candidate specific region is greater than the preset value; 2) respectively comparing the candidate specific region with whole genome sequences of the comparison strains and a whole genome sequence of a host of a source strain of the microorganism target fragment, to find whether there are fragments with a similarity greater than the preset value; if the candidate specific region does not meet the above requirements, the candidate specific region is a specific region of the microorganism target fragment.
2. The method for identifying a specific region in a microorganism target fragment according to claim 1, further comprising one or more of the following: a. the method is capable of distinguishing whether the source strain of the microorganism target fragment and a comparison strain belong to a same species or a same subspecies; b. the similarity refers to a product of a coverage rate and a matching rate of the microorganism target fragment, and the coverage rate=(length of similar sequence fragment/(end value of the microorganism target fragment−starting value of the microorganism target fragment+1))%; c. in operation S200, the first-round cut fragments T.sub.1-T.sub.n are respectively compared with whole genome sequences of the remaining comparison strains by group iteration; d. the method further comprises the following operation: S110, comparing selected adjacent microorganism target fragments in pairs; if the similarity after comparison is lower than the preset value, issuing an alarm and displaying screening conditions corresponding to a target strain.
3. The method for identifying a specific region in a microorganism target fragment according to claim 2, wherein the first-round cut fragment T.sub.n being compared with whole genome sequences of the remaining comparison strains by group iteration includes the following operations: S210, dividing the remaining comparison strains into P groups, each group including a plurality of comparison strains; S211, simultaneously comparing the first-round cut fragment T.sub.n with the whole genome sequences of each comparison strain in the first group one-to-one, and removing fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a first-round candidate sequence library of the first-round cut fragment T.sub.n, and S212, simultaneously comparing a previous-round candidate sequence library of the first-round cut fragment T.sub.n with whole genome sequences of each comparison strain in the next group one-to-one, and removing fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a next-round candidate sequence library of the first-round cut fragment T.sub.n; repeating operation S212 from the first-round candidate sequence library until a Pth-round candidate sequence library is obtained as a candidate specific sequence library of the first-round cut fragment T.sub.n; wherein a collection of all the candidate specific sequence libraries of the first-round cut fragments is the candidate specific region.
4. The method for identifying a specific region in a microorganism target fragment according to claim 3, wherein a number of comparison strains contained in a comparison strain group is set according to a hardware configuration of a computing environment.
5. The method for identifying a specific region in a microorganism target fragment according to claim 1, further comprising: after operation S200, performing operations S100 and S200 to obtain candidate specific regions of each microorganism target fragment in a target sequence, taking a collection of the candidate specific regions of each microorganism target fragment as candidate specific regions of the target sequence.
6. A device for identifying a specific region in a microorganism target fragment, comprising: a first-round cut fragment obtaining module, configured to respectively compare a microorganism target fragment with genome sequences of one or more whole comparison strains one-to-one, and remove fragments of which a similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n, wherein n is an integer greater than or equal to 1; a candidate specific region obtaining module, configured to respectively compare the first-round cut fragments T.sub.1-T.sub.n with whole genome sequences of remaining comparison strains, and remove fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as candidate specific regions of the microorganism target fragment; and a specific region verifying and obtaining module, configured to determine whether the candidate specific region meets the following requirements: 1) searching in public databases to find whether there are other species of which a similarity to the candidate specific region is greater than the preset value; 2) respectively comparing the candidate specific region with whole genome sequences of the comparison strains and a whole genome sequence of a host of a source strain of the microorganism target fragment, to find whether there are fragments with a similarity greater than the preset value; if the candidate specific region does not meet the above requirements, the candidate specific region is a specific region of the microorganism target fragment.
7. The device for identifying a specific region in a microorganism target fragment according to claim 6, further comprising one or more of the following: a. the device is capable of distinguishing whether the source strain of the microorganism target fragment and a comparison strain belong to a same species or a same subspecies; b. the similarity refers to a product of a coverage rate and a matching rate of the microorganism target fragment, and the coverage rate=(length of similar sequence fragment/(end value of the microorganism target fragment−starting value of the microorganism target fragment+1))%; c. in the candidate specific region obtaining module, the first-round cut fragments T.sub.1-T.sub.n are respectively compared with whole genome sequences of the remaining comparison strains by group iteration; d. the first-round cut fragment obtaining module further includes the following submodules: a raw data similarity comparison submodule, configured to compare selected adjacent microorganism target fragments in pairs; if the similarity after comparison is lower than the preset value, an alarm is issued and screening conditions corresponding to the target strain are displayed.
8. The device for identifying a specific region in a microorganism target fragment according to claim 7, wherein when the first-round cut fragment T.sub.n is compared with whole genome sequences of the remaining comparison strains by group iteration, the candidate specific region obtaining module includes the following submodules: a comparison strain grouping submodule, configured to divide the remaining comparison strains into P groups, each group including a plurality of comparison strains; a first-round candidate sequence library obtaining submodule, configured to simultaneously compare the first-round cut fragment T.sub.n with the whole genome sequences of each comparison strain in the first group one-to-one, and remove fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a first-round candidate sequence library of the first-round cut fragment T.sub.n; a candidate specific region obtaining submodule, to simultaneously compare a previous-round candidate sequence library of the first-round cut fragment T.sub.n with whole genome sequences of each comparison strain in a next group one-to-one, and remove fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a next-round candidate sequence library of the first-round cut fragment T.sub.n, the candidate specific region obtaining submodule is repeated from the first-round candidate sequence library until a Pth-round candidate sequence library is obtained as a candidate specific sequence library of the first-round cut fragment T.sub.n; wherein a collection of all the candidate specific sequence libraries of the first-round cut fragments is the candidate specific region.
9. The device for identifying a specific region in a microorganism target fragment according to claim 8, wherein a number of comparison strains contained in a comparison strain group is set according to a hardware configuration of a computing environment.
10. The device for identifying a specific region in a microorganism target fragment according to claim 6, further comprising: a candidate specific region obtaining module of a target sequence, which executes the first-round cut fragment obtaining module of a target fragment and the candidate specific region obtaining module of a microorganism target fragment to obtain candidate specific regions of each microorganism target fragment in the target sequence, and takes a collection of the candidate specific regions of each microorganism target fragment as candidate specific regions of the target sequence.
11. A computer readable storage medium, which stores a computer program, wherein when executed by a processor, the computer program implements the method for identifying a specific region in a microorganism target fragment according to claim 1.
12. A computer processing device, comprising a processor and the computer readable storage medium according to claim 11, wherein the processor executes the computer program on the computer readable storage medium to implement a method for identifying a specific region in a microorganism target fragment, wherein the method comprises at least the following operations: S100, respectively comparing a microorganism target fragment with whole genome sequences of one or more comparison strains one-to-one, and removing fragments of which a similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n, wherein n is an integer greater than or equal to 1; S200, respectively comparing the first-round cut fragments T.sub.1-T.sub.n with whole genome sequences of the remaining comparison strains, and removing fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as a candidate specific region of the microorganism target fragment; and S300, verifying and obtaining a specific region: determining whether the candidate specific region meets the following requirements: 1) searching in public databases to find whether there are other species of which a similarity to the candidate specific region is greater than the preset value; 2) respectively comparing the candidate specific region with whole genome sequences of the comparison strains and a whole genome sequence of a host of a source strain of the microorganism target fragment, to find whether there are fragments with a similarity greater than the preset value; if the candidate specific region does not meet the above requirements, the candidate specific region is a specific region of the microorganism target fragment.
13. An electronic terminal, comprising a processor, a memory and a communicator; the memory stores a computer program, the communicator communicates with an external device, and the processor executes a computer program stored in the memory, so that the electronic terminal executes the method for identifying a specific region in a microorganism target fragment according to claim 1.
14. A use of the method for identifying a specific region in a microorganism target fragment according to claim 1 for identifying a specific region in a microorganism target fragment.
15. The use according to claim 14, further comprising one or more of the following: a. distinguishing whether the source strain of the microorganism target fragment and a comparison strain belong to the same species or the same subspecies; b. the microorganism includes one or more of bacterium, virus, fungus, amoeba, cryptosporidium, flagellate, microsporidium, piroplasma, plasmodium, toxoplasma, trichomonas and kinetoplastid.
16. A use of the device for identifying a specific region in a microorganism target fragment according to claim 6 for identifying a specific region in a microorganism target fragment.
17. A use of the computer readable storage medium according to claim 11 for identifying a specific region in a microorganism target fragment.
18. A use of the computer processing device according to claim 12 for identifying a specific region in a microorganism target fragment.
19. A use of the electronic terminal according to claim 13 for identifying a specific region in a microorganism target fragment.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0032] The embodiments of the present disclosure will be described below. Those skilled in the art can easily understand other advantages and effects of the present disclosure according to contents disclosed by the specification. The present disclosure may also be implemented or applied through other different specific implementation modes. Various modifications or changes may be made to all details in the specification based on different points of view and applications without departing from the spirit of the present disclosure.
[0033] In addition, it should be understood that one or more method operations mentioned in the present disclosure are not exclusive of other method operations that may exist before or after the combined operations or that other method operations may be inserted between these explicitly mentioned operations, unless otherwise stated. It should also be understood that the combined connection relationship between one or more operations mentioned in the present disclosure does not exclude that there may be other operations before or after the combined operations or that other operations may be inserted between these explicitly mentioned operations, unless otherwise stated. Moreover, unless otherwise stated, the numbering of each method step is only a convenient tool for identifying each method step, and is not intended to limit the order of each method step or to limit the scope of the present disclosure. The change or adjustment of the relative relationship shall also be regarded as the scope in which the present disclosure may be implemented without substantially changing the technical content.
[0034] Please refer to
[0035] As shown in
[0036] S100, respectively comparing a microorganism target fragment with whole genome sequences of one or more comparison strains one-to-one, and removing fragments of which a similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n, wherein n is an integer greater than or equal to 1;
[0037] S200, respectively comparing the first-round cut fragments T.sub.1-T.sub.n with whole genome sequences of the remaining comparison strains, and removing fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as a candidate specific region of the microorganism target fragment; and
[0038] S300, verifying and obtaining a specific region: determining whether the candidate specific region meets the following requirements:
[0039] 1) searching in public databases to find whether there are other species of which a similarity to the candidate specific region is greater than the preset value;
[0040] 2) respectively comparing the candidate specific region with whole genome sequences of the comparison strains and a whole genome sequence of a host of a source strain of the microorganism target fragment, to find whether there are fragments with a similarity greater than the preset value;
[0041] if the candidate specific region does not meet the above requirements, the candidate specific region is a specific region of the microorganism target fragment.
[0042] The method of the present disclosure is capable of distinguishing whether the source strain of the microorganism target fragment and the comparison strains belong to the same species or the same subspecies.
[0043] In the above operations, the similarity refers to a product of a coverage rate and a matching rate of the microorganism target fragment.
The coverage rate=(length of similar sequence fragment/(end value of the microorganism target fragment−starting value of the microorganism target fragment+1))%;
[0044] The matching rate refers to the identity value when the microorganism target fragment is compared with the comparison strain. The identity value of the two compared sequences may be obtained by software such as needle, water or blat.
[0045] The length of similar sequences refers to the number of bases that the matched fragment occupies in the target fragment when two sequences are compared, that is, the length of the matched fragment.
[0046] The preset value of the similarity may be determined as needed. The higher the preset value of the similarity, the fewer fragments will be removed. The recommended preset value of the similarity should exceed 95%, such as 96%, 97%, 98%, 99% or 100%.
[0047] The specific sequence is shown in operation S100 in
[0048] The coverage rate and matching rate of microorganism target fragments may be calculated by software such as needle, water or blat.
[0049] For example, a calculation result is shown in
Coverage rate of sequence A=(187/(187−1+1))*100%=100%
[0050] The matching rate of sequence A and sequence B is equal to 98.4%.
[0051] Then the similarity between A and B=100%*98.4%=98.4%.
[0052] The microorganism target fragment and the comparison strains in operation S100 are all derived from public databases, which are mainly selected from NCBI (https://www.ncbi.nlm.nih.gov).
[0053] Further, the method includes the following operations: S110, comparing the selected adjacent microorganism target fragments in pairs; if the similarity after comparison is lower than the preset value, issuing an alarm and displaying the screening conditions corresponding to the target strain.
[0054] Abnormal data caused by human errors or other reasons can be filtered. The microorganism target fragment in operation S100 may be a whole genome of a microorganism or a gene fragment of a microorganism.
[0055] In operation S200, in order to speed up the comparison, in a preferred embodiment, the first-round cut fragments T.sub.1-T.sub.n are respectively compared with whole genome sequences of the remaining comparison strains by group iteration.
[0056] Specifically, as shown in
[0057] S210, dividing the remaining comparison strains into P groups, each group including a plurality of comparison strains;
[0058] S211, simultaneously comparing the first-round cut fragment T.sub.n with the whole genome sequences of each comparison strain in the first group one-to-one, and removing fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a first-round candidate sequence library of the first-round cut fragment T.sub.n;
[0059] S212, simultaneously comparing a previous-round candidate sequence library of the first-round cut fragment T.sub.n with whole genome sequences of each comparison strain in the next group one-to-one, and removing fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a next-round candidate sequence library of the first-round cut fragment T.sub.n; repeating operation S212 from the first-round candidate sequence library until a Pth-round candidate sequence library is obtained as a candidate specific sequence library of the first-round cut fragment T.sub.n;
[0060] a collection of all the candidate specific sequence libraries of the first-round cut fragments is the candidate specific region.
[0061] In order to avoid multi-thread blocking, the number of comparison strains contained in a comparison strain group should be set according to the hardware configuration of the computing environment. The number may be the number of threads set according to the total configuration of the operating environment. Generally, the number of threads may be 1-50. Specifically, the number of threads may be 1-4, 4-8, 8-10, 10-20, or 20-50. Preferably, the number of threads is 4. In the embodiment shown in
[0062] For example, as shown in
[0063] Secondly, simultaneously comparing the microorganism target fragment 2 in the target sequence with the sequences 1-8 in the 588 comparison strains, performing the first-round cutting to remove the matched sequences, and obtaining the first-round specific sequence library after a comprehensive summary; then, simultaneously comparing the first-round specific sequence library with the sequences 9-16 in the 588 comparison strains, performing the second-round cutting to remove the matched sequences, and obtaining the second-round specific sequence library after a comprehensive summary; then, simultaneously comparing the second-round specific sequence library with the sequences 17-24 in the 588 comparison strains, performing the third-round cutting to remove the matched sequences, and obtaining the third-round specific sequence library after a comprehensive summary; . . . , performing sequentially, until the 73th-round specific sequence library is simultaneously compared with the sequences 585-588 in the 588 comparison strains, the matched sequences is removed by performing the 74th-round cutting, and the 74th-round specific sequence library, i.e., the specific sequence library of the target fragment 2, is obtained after a comprehensive summary.
[0064] Performing sequentially, until the comparison of the microorganism target fragment 2541 in the target sequence and the 588 comparison strains are completed. The cut fragments obtained are the candidate specific regions of the microorganism target fragments.
[0065] In a preferred embodiment, the operation S200 further includes:
[0066] performing operations S100 and S200 to obtain candidate specific regions of each microorganism target fragment in the target sequence, taking a collection of the candidate specific regions of each microorganism target fragment as candidate specific regions of the target sequence.
[0067] The target sequence may include multiple target fragments. The multiple target fragments may be fragments obtained by screening from the genome of microorganisms through other screening operations, for example, multi-copy fragments of specific microorganisms.
[0068] In operation S300, the public databases are mainly selected from NCBI (https://www.ncbi.nlm.nih.gov). The algorithm for searching in the public database may be the blast algorithm.
[0069] Further, before performing operations S100, S200 and S300, the cutting size is set according to the hardware configuration of the computing environment, and the data to be calculated is cut in units. Specifically, in operation S100, the data to be calculated is the target fragments. In operation S200, the data to be calculated is the current-round specific sequence library after removing the matched sequences in each iteration. In operation S300, the data to be calculated is the candidate specific region.
[0070] After cutting in units, the number of units*the configuration required to run a unit file cannot exceed the total configuration of the operating environment.
[0071] Cutting in units refers to dividing the total number of the to-be-cut sequences by the number of threads, and m is recorded as the number of units after cutting in units. Each thread runs the same number of computing tasks in a multi-thread operating environment to ensure efficient computing under optimal performance conditions.
[0072] As shown in
[0073] a first-round cut fragment obtaining module, configured to respectively compare a microorganism target fragment with whole genome sequences of one or more comparison strains one-to-one, and remove fragments of which a similarity exceeds a preset value, to obtain a plurality of residual fragments as first-round cut fragments T.sub.1-T.sub.n, wherein n is an integer greater than or equal to 1;
[0074] a candidate specific region obtaining module, configured to respectively compare the first-round cut fragments T.sub.1-T.sub.n with whole genome sequences of remaining comparison strains, and remove fragments of which the similarity exceeds a preset value, to obtain a collection of residual cut fragments as candidate specific regions of the microorganism target fragment; and
[0075] a specific region verifying and obtaining module, configured to determine whether the candidate specific region meets the following requirements:
[0076] 1) searching in public databases to find whether there are other species of which a similarity to the candidate specific region is greater than the preset value;
[0077] 2) respectively comparing the candidate specific region with whole genome sequences of the comparison strains and a whole genome sequence of a host of a source strain of the microorganism target fragment, to find whether there are fragments with a similarity greater than the preset value;
[0078] if the candidate specific region does not meet the above requirements, the candidate specific region is a specific region of the microorganism target fragment.
[0079] The device of the present disclosure is capable of distinguishing whether the source strain of the microorganism target fragment and the comparison strain belong to the same species or the same subspecies.
[0080] The similarity refers to a product of a coverage rate and a matching rate of the target fragment. The coverage rate=(length of similar sequence fragment/(end value of the microorganism target fragment−starting value of the microorganism target fragment+1))%.
[0081] In the candidate specific region obtaining module, the first-round cut fragments T.sub.1-T.sub.n are respectively compared with whole genome sequences of the remaining comparison strains by group iteration.
[0082] When the first-round cut fragment T.sub.n is compared with whole genome sequences of the remaining comparison strains by group iteration, the candidate specific region obtaining module includes the following submodules:
[0083] a comparison strain grouping submodule, configured to divide the remaining comparison strains into P groups, each group including a plurality of comparison strains;
[0084] a first-round candidate sequence library obtaining submodule, configured to simultaneously compare the first-round cut fragment T.sub.n with the whole genome sequences of each comparison strain in the first group one-to-one, and remove fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a first-round candidate sequence library of the first-round cut fragment T.sub.n;
[0085] a candidate specific region obtaining submodule, to simultaneously compare a previous-round candidate sequence library of the first-round cut fragment T.sub.n with whole genome sequences of each comparison strain in a next group one-to-one, and remove fragments of which the similarity exceeds the preset value, to obtain a plurality of residual fragments as a next-round candidate sequence library of the first-round cut fragment T.sub.n; the candidate specific region obtaining submodule is repeated from the first-round candidate sequence library until a Pth-round candidate sequence library is obtained as a candidate specific sequence library of the first-round cut fragment T.sub.n;
[0086] a collection of all the candidate specific sequence libraries of the first-round cut fragments is the candidate specific region.
[0087] The number of comparison strains contained in a comparison strain group is set according to the hardware configuration of the computing environment.
[0088] In an embodiment, the device further includes:
[0089] a candidate specific region obtaining module of the target sequence, which executes the first-round cut fragment obtaining module of the target fragment and the candidate specific region obtaining module of the target fragment to obtain candidate specific regions of each target fragment in the target sequence, and takes a collection of the candidate specific regions of each target fragments as candidate specific regions of the target sequence.
[0090] Optionally, the first-round cut fragment obtaining module further includes the following submodules: a raw data similarity comparison submodule, to compare the selected adjacent microorganism target fragments in pairs; if the similarity after comparison is lower than the preset value, an alarm is issued and the screening conditions corresponding to the target strain are displayed.
[0091] Since the principles of the device in the present embodiment is basically the same as that of the above-mentioned method embodiment, the definitions of the same features, the calculation methods, the enumeration of the embodiments, and the enumeration of the preferred embodiments may be used interchangeably, thus will not be described again.
[0092] It should be noted that the division of each module of the above apparatus is only a division of logical functions. In actual implementation, the modules may be integrated into one physical entity in whole or in part, or may be physically separated. These modules may all be implemented in the form of processing component calling by software. These modules may also be implemented entirely in hardware. It is also possible that some modules are implemented in the form of processing component calling by software, and some modules are implemented in the form of hardware. For example, the obtaining module may be a separate processing element, or may be integrated into a chip, or may be stored in a memory in the form of program code. The function of the above obtaining module is called and executed by one of the processing elements. The implementation of other modules is similar. In addition, all or part of these modules may be integrated or implemented independently. The processing elements described herein may be an integrated circuit with signal processing capabilities. In the implementation process, each operation of the above method or each of the above modules may be implemented by an integrated logic circuit of hardware in the processor element or instruction in a form of software.
[0093] For example, the above modules may be one or more integrated circuits configured to implement the above method, such as one or more application specific integrated circuits (ASIC), or one or more digital signal processors (DSP), or one or more field programmable gate arrays (FPGA), or graphics processing unit (GPU). As another example, when one of the above modules is implemented in the form of calling program codes of a processing element, the processing element may be a general processor, such as a central processing unit (CPU) or other processors that may call program codes. As another example, these modules may be integrated and implemented in the form of a system-on-a-chip (SOC).
[0094] Some embodiments of the present disclosure further provide a computer readable storage medium, which stores a computer program. When executed by a processor, the program implements the above-mentioned method for identifying specific regions in microorganism target fragments.
[0095] Some embodiments of the present disclosure provide a computer processing device, including a processor and the above-mentioned computer readable storage medium. The processor executes the computer program on the computer readable storage medium to implement the operations of the above-mentioned method for identifying specific regions in microorganism target fragments.
[0096] Some embodiments of the present disclosure provide an electronic terminal, including a processor, a memory and a communicator; the memory stores a computer program, the communicator communicates with an external device, and the processor executes the computer program stored in the memory, so that the electronic terminal executes and implements the above-mentioned method for identifying specific regions in microorganism target fragments.
[0097]
[0098] The system bus mentioned above may be a peripheral component interconnect (PCI) bus or an extended industry standard architecture (EISA) bus, etc. The system bus may include address bus, data bus, control bus and so on. For convenience of representation, only a thick line is used in the figure, but it does not mean that there is only one bus or one type of bus. The communication interface is used to implement communication between the database access device and other devices (such as a client, a read-write library, and a read-only library). The memory may include a random access memory (RAM), or may also include a non-volatile memory, such as at least one disk memory.
[0099] The above-mentioned processor may be a general processor, including a central processing unit (CPU), a network processor (NP), and the like. The above-mentioned processor may also be a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a graphics processing unit (GPU) or other programmable logic devices, discrete gate or transistor logic devices, or discrete hardware components.
[0100] Those of ordinary skill will understand that all or part of the operations to implement the various method embodiments described above may be accomplished by hardware associated with a computer program. The computer program may be stored in a computer readable storage medium. The program, when executed, performs the operations including the above method embodiments. The computer readable storage mediums may include, but are not limited to, floppy disks, optical disks, compact disc read-only memories (CD-ROM), magneto-optical disks, read only memories (ROM), random access memories (RAM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), magnetic cards or optical cards, flash memories, or other types of medium or machine-readable media suitable for storing machine-executable instructions. The computer readable storage medium may be a product that is not accessed to a computer device, or a component that has been accessed to a computer device for use.
[0101] In terms of specific implementation, the computer programs may be routines, programs, objects, components, data structures or the like that perform specific tasks or implement specific abstract data types.
[0102] The above-mentioned method for identifying a specific region in a microorganism target fragment, the device for identifying a specific region in a microorganism target fragment, the computer readable storage medium, the computer processing device or the electronic terminal may be used in the PCR detection of microorganisms,
[0103] and specifically, in the screening of template sequences.
[0104] The present disclosure provides a use of the above-mentioned method for identifying a specific region in a microorganism target fragment, the above-mentioned device for identifying a specific region in a microorganism target fragment, the above-mentioned computer readable storage medium, the above-mentioned computer processing device, or the above-mentioned electronic terminal for identifying a specific region in a microorganism target fragment.
[0105] The use is to distinguish whether the source strain of the microorganism target fragment and the comparison strain belong to the same species or the same subspecies.
[0106] The microorganism includes one or more of bacterium, virus, fungus, amoeba, cryptosporidium, flagellate, microsporidium, piroplasma, plasmodium, toxoplasma, trichomonas and kinetoplastid.
[0107] The above-mentioned embodiments are merely illustrative of the principle and effects of the present disclosure instead of limiting the present disclosure. Modifications or variations of the above-described embodiments may be made by those skilled in the art without departing from the spirit and scope of the disclosure. Therefore, all equivalent modifications or changes made by those who have common knowledge in the art without departing from the spirit and technical concept disclosed by the present disclosure shall be still covered by the claims of the present disclosure.