Copy number variant leading to virus resistance

11434500 · 2022-09-06

Assignee

Inventors

Cpc classification

International classification

Abstract

The present invention relates to a genetic determinant which may comprise at least two copies of a combination of two closely linked RDR1 genes, which two closely linked RDR1 genes are inversely oriented, and which genetic determinant leads to virus resistance when present in a plant. In one embodiment, of the RDR1 genes in the combination is represented by SEQ ID NO: 1 or has at least 70% sequence identity, and one of the RDR1 genes in the combination is represented by SEQ ID NO: 3 or has at least 70% sequence identity; or one of the RDR1 genes in the combination encodes a protein represented by SEQ ID NO: 2 or a protein that has at least 70% sequence identity, and one of the RDR1 genes encodes a protein represented by SEQ ID NO: 4 or a protein that has at least 70% sequence identity.

Claims

1. A method for selecting a virus resistant plant, comprising determining the copy number of a combination of two linked RNA-dependent RNA polymerase 1 (RDR1) genes that are inversely oriented in the genome of the plant, and selecting a plant that comprises at least two copies of said combination as a virus resistant plant comprising a DNA sequence which leads to virus resistance when present in a plant, wherein the combination of the two linked RDR1 genes comprises at least one RDR1 gene that is represented by SEQ ID NO: 1 or has a sequence identity of at least 90% thereof, and at least one RDR1 gene that is represented by SEQ ID NO: 3 or has a sequence identity of at least 90% thereof.

2. The method as claimed in claim 1, wherein the virus is of the family Potyviridae, Bromoviridae, and/or of the family Virgaviridae.

3. The method as claimed in claim 1, wherein the virus resistant plant is a Cucumis sativus plant which is resistant to Cucumber Green Mottle Mosaic Virus.

4. The method as claimed in claim 1, wherein the combination of two closely linked RDR1 genes comprises CsRDR1_II represented by SEQ ID NO: 1, or a gene that has at least 90% sequence identity thereof, and CsRDR1_I, represented by SEQ ID NO: 3, or a gene that has at least 90% sequence identity thereof, and wherein the selected virus resistant plant is a Cucumis sativus plant comprising at least two copies of said combination, the presence of which leads to resistance to Cucumber Green Mottle Mosaic Virus.

5. A plant selected by the method of claim 1, wherein the plant is resistant to Cucumber Green Mottle Mosaic Virus and optionally resistant to Cucumber Vein Yellowing Virus.

6. A seed capable of growing into the plant of claim 5.

7. The method of claim 1, wherein the plant comprises at least three copies of said combination.

8. The method of claim 4, wherein the plant comprises at least three copies of said combination.

9. The method as claimed in claim 1, wherein the two linked RDR1 genes that are inversely oriented are not more than 3000 bp apart.

10. The method as claimed in claim 9, wherein copies of combinations of two RDR1 genes are not more than 6000 bp apart.

11. A plant selected by the method of claim 1, wherein the plant is resistant to Cucumber Green Mottle Mosaic Virus and Cucumber Vein Yellowing Virus.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the office upon request and payment of the necessary fee.

(2) The following detailed description, given by way of example, but not intended to limit the invention solely to the specific embodiments described, may best be understood in conjunction with the accompanying drawings.

(3) FIGS. 1A-1G—Genomic sequences of SEQ ID NO: 1, SEQ ID NO: 3, and SEQ ID NO: 5, including the coding sequence (CDS), which starts with the start codon ATG (bold in the sequence), and the sequence about 2 kb upstream of the start codon including the 5′-UTR and the promoter. Sequences are of Cucumis sativus CsRDR1_II, CS RDR1_I, and CsRDR1_II having an indel upstream of the start codon respectively.

(4) FIG. 2—Protein sequences of SEQ ID NO: 2 and SEQ ID NO: 4, generated by the CDS's of CsRDR1_II and CsRDR1_I respectively, whereby CsRDR1_II with the indel, represented by SEQ ID NO: 5, codes for the same protein as CsRDR1_II since the CDS is the same.

(5) FIGS. 3A-3D—Read depth analysis of sequencing data from various lines that were susceptible to both CVYV and CGMMV, were resistant to CGMMV and susceptible to CVYV, or were resistant to both viruses.

(6) A: WGS read mapping of Geno3 to reference genome (pacbio);

(7) B: WGS read mapping to BF11 reference (pacbio), CGMMV susceptible (S) and CVYV susceptible (S) lines;

(8) C: WGS read mapping to BF11 reference (pacbio), CGMMV resistant (R) and CVYV susceptible (S) lines;

(9) D: WGS read mapping to BF11 reference (pacbio), CGMMV resistant (R) and CVYV resistant (R) lines.

(10) FIG. 4—Possible locations of copies of the combination of RDR1 genes within the genome, in relation to each other. For ‘Geno4’ a genetic determinant with 3 copies is depicted, and for Geno2′ and Geno3′ genetic determinants having 2 copies are depicted.

DETAILED DESCRIPTION OF THE INVENTION

(11) Copy number variants (CNVs) are relatively recently identified as one of the major potential sources of genetic variation. The approach for determining the presence of CNVs in a genome to be able to identify their effect is however very different from identification of other variations within genes, such as modifications that are present within genes. As for the latter, usually analysis of sequences leads to the identification of differences between the sequences in the comparison. These differences are subsequently used for the development of markers that are linked to a genomic region that comprises a modification, or markers that comprise the mutation itself. This is however not feasible for establishing the presence of CNVs, since these are not based on differences in the nucleotide sequence of a gene. A copy number variant has to be identified by determining the repetition of specific sequences within a genome, and especially sequences that form genes or parts of genes. These variations in copy number can be present close to each other, for example on the same chromosome, but they can also be positioned on different locations in the genome. The majority of genetic variation that is caused by CNVs, and especially their impact on and relation to specific phenotypic traits, is not yet revealed.

(12) In a preferred embodiment the genetic determinant of the invention comprises at least three copies of the combination of two closely linked, inversely oriented, RDR1 genes.

(13) In one embodiment the sequence of at least one of the RDR1 genes of the combination is represented by SEQ ID NO: 1, which is the sequence of CsRDR1_II, or has a sequence with a sequence identity of, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% to SEQ ID NO: 1. Said similar sequence should underlie a functionally homologous gene of the CsRDR1_II gene. Alternatively, at least one of the RDR1 genes of the combination has a sequence that encodes a protein that is represented by SEQ ID NO: 2, or encodes a protein that has a sequence identity of, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% to SEQ ID NO: 2.

(14) In one embodiment, the sequence of at least one of the RDR1 genes of the combination is represented by SEQ ID NO: 3, which is the sequence of CsRDR1_I, or has a sequence with a sequence identity of, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% to SEQ ID NO: 3. Said similar sequence should underlie a functionally homologous gene of the CsRDR1_I gene. Alternatively, at least one of the RDR1 genes of the combination has a sequence that encodes a protein that is represented by SEQ ID NO: 4, or encodes a protein that has a sequence identity of, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% to SEQ ID NO: 4.

(15) In a particular embodiment the combination comprises one RDR1 gene represented by SEQ ID NO: 1 or by a sequence having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and one RDR1 gene represented by SEQ ID NO: 3 or by a sequence having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.

(16) In an alternative embodiment the combination comprises one RDR1 gene that encodes a protein as represented by SEQ ID NO: 2 or encodes a protein having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and one RDR1 gene that encodes a protein as represented by SEQ ID NO: 4 or encodes a protein having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.

(17) A genetic determinant comprising the particular combination of SEQ ID NO: 1 and SEQ ID NO: 3 leads to resistance to Cucumber Green Mottle Mosaic Virus when present in a Cucumis sativus plant. In a preferred embodiment said genetic determinant comprises at least three copies of the particular combination of SEQ ID NO: 1 and SEQ ID NO: 3.

(18) In one embodiment the combination comprises one RDR1 gene represented by SEQ ID NO: 1 or by a sequence having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and one RDR1 gene that encodes a protein as represented by SEQ ID NO: 4 or encodes a protein having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.

(19) In one embodiment the combination comprises one RDR1 gene that encodes a protein as represented by SEQ ID NO: 2 or encodes a protein having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and one RDR1 gene represented by SEQ ID NO: 3 or by a sequence having, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.

(20) SEQ ID Nos. 1 and 3 represent the wild-types of the CsRDR1_II and CsRDR1_I genes of Cucumis sativus, and the corresponding proteins represent the wild-type proteins. Genes that are the functional homologue of CsRDR1_I or CsRDR1_II in other crops have at least 70% up to 99% sequence identity with one of these RDR1 genes of Cucumis sativus.

(21) In certain instances the expression of at least one of the RDR1 genes of the combination in the genetic determinant can be increased as compared to the expression when only a single version of the wild-type is present. The expression of one or both of the RDR1 genes of the combination can for example be increased due to the presence of at least two copies, optionally three, four, or more copies of the combination in the genetic determinant.

(22) The expression of at least one of the RDR1 genes can alternatively be increased due to a modification in the wild-type nucleotide sequence of said gene. Such a modification comprises for example a modification upstream of the start codon of the gene, in particular a modification in the promoter or the 5′-UTR.

(23) The increased expression can be an increase of the mRNA level of the RDR1 gene, or an increase of the level of the RDR1 protein, or an increase of the activity of the RDR1 protein.

(24) Increased expression of a gene that is present in a plant can be measured in steady state situation, which in relation to the function of this gene means a situation wherein no virus infection is present in the plant. Alternatively increased expression of a gene that is incorporated in a plant can be measured in an infected state situation, whereby a virus infection is present in the plant.

(25) In a specific embodiment the modification upstream of the start codon of one of the RDR1 genes in the combination resulting in increased expression and virus resistance is an indel. The indel that leads to increased expression is suitably an indel resulting in a modified gene sequence represented by SEQ ID NO: 5. Downstream from the start codon ATG, SEQ ID NO: 5 has the same sequence as SEQ ID NO: 1. The invention also relates to a genetic determinant whereby the sequence upstream of the start codon of one of the CsRDR1 genes in the combination has, in order of increased preference, at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% sequence identity to the sequence upstream of the start codon of SEQ ID NO: 5.

(26) In a particular embodiment the genetic determinant comprises one CsRDR1 gene represented by SEQ ID NO: 3 or having, in order of increased preference, at least 90%, 95%, 98%, 99% sequence identity thereto, and one CsRDR1 gene represented by SEQ ID NO: 5 or having, in order of increased preference, at least 90%, 95%, 98%, 99% sequence identity thereto.

(27) A genetic determinant comprising the particular combination of SEQ ID NO: 3 and SEQ ID NO: 5 leads to resistance to Cucumber Green Mottle Mosaic Virus and to Cucumber Vein Yellowing Virus when present in a Cucumis sativus plant. In a preferred embodiment said genetic determinant comprises at least three copies of the particular combination of SEQ ID NO: 3 and SEQ ID NO: 5. The CsRDR1 genes represented by SEQ ID NO: 3 and SEQ ID NO: 5 need not necessarily have the exact sequence of these SEQ ID's in said resistant Cucumis sativus plant, but can also show at least 90%, 95%, 98%, 99% sequence identity thereto.

(28) As used herein, the percentage ‘sequence identity’ is the percentage of nucleotides or amino acids that is identical between two sequences after proper alignment of those sequences. The person skilled in the art is aware of how to align sequences. To obtain the most significant result, the best possible alignment that gives the highest sequence identity score should be obtained. The sequences are compared over the length of the shortest sequence in the assessment.

(29) A high percentage of sequence identity is commonly assumed to point to a homologous sequence. A genetic determinant comprising RDR1 genes having a sequence identity percentage as claimed is part of the invention if said similar sequence is functionally homologous. Functionally homologous means that it is a gene sequence that leads to a protein that has a similar function as the RDR1 genes that were identified in Cucumis sativus. A similar sequence is a sequence has at least 70%, up to 99%, sequence identity to SEQ ID NO: 1 and/or SEQ ID NO: 3 and/or SEQ ID NO: 5. For this invention ‘functionally homologous’ means that the gene or protein is involved in virus resistance.

(30) An ‘indel’ as used herein can represent an insertion, a deletion, or a combination of both. Preferably, the indel in one of the RDR1 genes in the combination, resulting in increased expression, comprises at least a deletion.

(31) The presence of the genetic determinant of the invention in a plant suitably leads to resistance to a virus of the family Potyviridae, Bromoviridae, and/or Virgaviridae. Virus species belonging to these families that cause major problems by infecting a large number of cultivated crops are for example, but not limited to, Cucumber Vein Yellowing Virus (CVYV), Cucumber Mosaic Virus (CMV), Zucchini Yellow Mosaic Virus (ZYMV), Papaya Ringspot Virus (PRSV), Watermelon Mosaic Virus (WMV), Cucumber Green Mottle Mosaic Virus (CGMMV), Tobacco Mosaic Virus (TMV), Tomato Mosaic Virus (ToMV), Pepper Mild Mottle Virus (PMMoV), Pepper Mottle Virus (PepMoV), Potato Virus Y (PVY), Soybean Mosaic Virus (SMV), and Maize Dwarf Mosaic Virus (MDMV).

(32) Plant species that have in their genome RDR1 genes that are homologous to SEQ ID NO: 1 and/or SEQ ID NO: 3, and are therefore particularly suitable for acquiring a genetic determinant of the invention, belong to various plant families such as Cucurbitaceae, Solanaceae, Brassicaceae, Apiaceae, Fabaceae, Amaranthaceae, and Asteraceae. Crop species suitable for acquiring a genetic determinant of the invention can specifically be selected from any of the following: Phaseolus vulgaris, Beta vulgaris, Brassica oleracea, Daucus carota, Lactuca sativa, Cucumis melo, Cucumis sativus, Cucumis pepo, Spinacia oleracea, Solanum lycopersicum, Capsicum annuum, and Citrullus lanatus.

(33) The present invention relates to a method for producing a virus resistant plant comprising introducing a genetic determinant that has at least two copies of a combination of two closely linked RDR1 genes, which two closely linked RDR1 genes are inversely oriented, in a plant. The genetic determinant can be introduced from another plant which comprises the genetic determinant through commonly used breeding techniques such as crossing and selection when the plants are sexually compatible. Such introduction can be from a plant of the same species, that usually can be crossed easily, or from a plant of a related species. Difficulties in crossing can be overcome through techniques known in the art such as embryo rescue, or cis-genesis can be applied. Suitably markers can be developed for the genetic determinant to follow the incorporation of that genetic determinant into another plant.

(34) The above method can in particular be used to introduce the genetic determinant of the invention into a plant species that is suitable for incorporation of such genetic determinant. In a particular embodiment the genetic determinant of the invention can be introduced from a Cucumis sativus plant comprising the genetic determinant into a Cucumis sativus plant lacking the genetic determinant using standard breeding methods. In Cucumis sativus the genetic determinant can comprise two, three, four or more copies of the combination of two inversely oriented RDR1 genes. Introduction of the genetic determinant in Cucumis sativus leads to resistance to Cucumber Green Mottle Mosaic Virus. When one of the RDR1 genes in the combination is represented by SEQ ID NO: 5, the presence of the genetic determinant in Cucumis sativus leads to resistance to Cucumber Green Mottle Mosaic Virus and to resistance to Cucumber Vein Yellowing Virus.

(35) Alternatively the genetic determinant of the invention can be introduced by increasing the copy number of closely linked inversely oriented RDR1 genes that are already present in the genome of a plant, or they can be transferred from another, sexually incompatible, plant, for example by using transgenic modification. Techniques that can suitably be used for modification of the copy number of a gene or a combination of genes, or for the transfer of multiple copies of RDR1 genes from other plants, comprise general plant transformation techniques known to the skilled person, such as the use of an Agrobacterium-mediated transformation method. Other genome editing methods such as the use of a CRISPR/Cas system might also be employed.

(36) The invention further provides a plant comprising the genetic determinant of the invention, which plant is resistant to one or more viruses due to the presence of the genetic determinant. A plant of the invention is preferably resistant to one or more viruses of the family Potyviridae, Bromoviridae, and/or of the family Virgaviridae.

(37) The invention also relates to a seed comprising the genetic determinant of the invention, wherein the plant grown from the seed is resistant to one or more viruses, in particular to one or more viruses of the family Potyviridae, Bromoviridae, and/or the family Virgaviridae.

(38) A plant or a seed of the invention is a plant or a seed in which two or more copies of a combination of two inversely oriented RDR1 genes are present, which presence results in virus resistance, for example a plant or a seed of a species selected from the group consisting of Phaseolus vulgaris, Beta vulgaris, Brassica oleracea, Daucus carota, Lactuca sativa, Cucumis melo, Cucumis sativus, Spinacia oleracea, Solanum lycopersicum, and Citrullus lanatus.

(39) A Cucumis sativus plant comprising the genetic determinant of the invention preferably is resistant against Cucumber Green Mottle Mosaic Virus, optionally in combination with resistance to Cucumber Vein Yellowing Virus.

(40) The present invention also relates to a method for selecting a virus resistant plant, comprising determining the copy number of a combination of two RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies of said combination as a virus resistant plant. A plant comprising at least two copies of said combination comprises the genetic determinant of the invention. In a preferred embodiment a plant comprising at least three copies of the combination is selected as a virus resistant plant.

(41) Various methods based on sequencing of the genome have been developed to identify copy number variants (CNVs), and it is known to the person skilled in the art how to establish the presence of copy number variants within a genome of a plant. Straightforward strategies for CNV detection based on next generation sequencing data are for example (1) read depth analysis, (2) split-read analysis, (3) discordantly mapped read pair analysis, and (4) de novo genome assembly.

(42) In the present research a tool was used to detect local copy number variation of the combination of the two RDR1 genes that were identified in the region of relevance in Cucumis sativus. Subsequently qPCR was used to also determine the expression level of the genes in the combination. Results of both the variation in copy number that was identified, and of the expression analysis, were linked to virus resistance of the plants (Example 1). In this ways the relation between genotype and phenotype was established.

(43) In one embodiment the invention relates to a method for selecting a virus resistant plant, comprising determining the copy number of a combination of two RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies of said combination as a virus resistant plant, wherein the combination of two closely linked RDR1 genes comprises at least one RDR1 gene that is represented by SEQ ID NO: 1 or has a sequence identity of at least 70% thereto, and/or at least one RDR1 gene that is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto.

(44) In one embodiment the invention relates to a method for selecting a virus resistant plant, comprising determining the copy number of a combination of two RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies of said combination as a virus resistant plant, wherein the combination of two closely linked RDR1 genes comprises at least one RDR1 gene that is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto, and/or at least one RDR1 gene that is represented by SEQ ID NO: 5 or has a sequence identity of at least 70% thereto.

(45) In one embodiment the invention relates to a method for selecting a Cucumis sativus plant that is resistant to Cucumber Green Mottle Mosaic Virus, comprising determining the copy number of a combination of two RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies, preferably at least three copies, of said combination as a CGMMV resistant plant, wherein the combination of two closely linked RDR1 genes comprises CsRDR1_II, represented by SEQ ID NO: 1, or a gene that has at least 90% sequence identity thereto, and CsRDR1_I, represented by SEQ ID NO: 3, or a gene that has at least 90% sequence identity thereto.

(46) In one embodiment the invention relates to a method for selecting a Cucumis sativus plant that is resistant to Cucumber Green Mottle Mosaic Virus and to Cucumber Vein Yellowing Virus, comprising determining the copy number of a combination of two RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies, preferably at least three copies, of said combination as a CGMMV and CVYV resistant plant, wherein the combination of two closely linked RDR1 genes comprises a modified CsRDR1_II, represented by SEQ ID NO: 5, or a gene that has at least 90% sequence identity thereto, and CsRDR1_I, represented by SEQ ID NO: 3, or a gene that has at least 90% sequence identity thereto.

(47) Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined in the appended claims.

(48) The present invention will be further illustrated in the following Examples which are given for illustration purposes only and are not intended to limit the invention in any way.

EXAMPLES

Example 1: Identification of Copy Number Variants for RDR1 Genes in Cucumis sativus Related to Virus Resistance

(49) In previous research two closely linked, inversely oriented, RDR1 genes were identified on chromosome 5 of the Cucumis sativus genome. At least one of these RDR1 genes was, when modified, determined to be involved in conferring resistance to CVYV. Although there appeared to be a linkage with resistance to CGMMV, recombinants having only CGMMV or only CVYV resistance could be observed. Markers that were developed for the modified RDR1 gene were predictive for the CVYV resistance, but not always for the CGMMV resistance.

(50) Whole genome sequencing (WGS) was subsequently done on a number of lines that were susceptible, or had one or both resistances. More than 250 additional markers were developed based on sequence differences in the region of interest. Again, also the analysis of these data and the application of the new markers did not result in better markers for the CGMMV resistance. This led to the assumption that something different than a modification in or around the gene on the chromosome 5 region might be leading to the CGMMV resistance, and it was decided to take a different approach.

(51) The WGS data of the various lines were then mapped and subsequently analyzed using WGS read alignment visualization tools, and compared against an internally generated reference genome sequence of Cucumis sativus. Using this, the read depth of the sequence with the two RDR1 genes was observed. It was highly interestingly found that the read-depth of the specific sequence that has the two RDR1 genes was around two times or even three times higher for certain material when compared to the reference genome (FIGS. 3A-D).

(52) To confirm this information a large number of 61 lines having CGMMV and CVYV resistance in various combinations, or being susceptible to both, were resequenced and a qPCR was performed using the available data. The qPCR data were analyzed using the ‘delta delta Cq’ method (ddCq), with efficiency correction. The threshold was calculated using suitable software, which is commonly known, for both target genes CsRDR1_I and CsRDR1_II, as well as for the reference gene. The following formula was used to calculate the eventual fold changes, which also indicates copy number variation:

(53) Sample A Sample B = ( 1 + E ref ) CqA ref - CqB ref ( 1 + E tar ) CqA tar - CqB tar .

(54) The different methods eventually led to the same result, and confirmed the presence of copy number variants for the combination of the two closely linked, inversely oriented, RDR1 genes in the genome of Cucumis sativus.

Example 2: Determination of the Location of the Different Copies

(55) The location within the Cucumis sativus genome of the different copies was determined by a combined strategy, which includes split-read analysis, discordantly mapped read pair analysis, and de novo genome assembly. Based on this analysis, it was determined that the multiple copies of the RDR1 combination in the Cucumis sativus genome in some backgrounds are present as tandem repeats. This was determined by split-read analysis, but also verified with the use of an MGB assay that confirmed the presence of overlapping sequences. Other backgrounds showed that the copies were present on the same chromosome, but with around 1000 up to 6000 bp in between. This was done through de novo genome assembly. A third result showed that the copies can be even further away, possibly with around 200 kb in between, but a location on a different chromosome is also still feasible. To obtain this result, split-read analysis was combined with discordantly mapped read pair analysis. The combined results are visualized in FIG. 4.

Example 3: Linking Resistance to the Presence of Copy Number Variation

(56) The lines that were analyzed in Examples 1 and 2 were also phenotyped for resistance to CVYV and to CGMMV. A bio-assay was performed using commonly known inoculation and observation methods for evaluating the resistance. For CGMMV two repetitions were carried out. CVYV resistance score was based on several bio-assays in different years.

(57) Subsequently for each line the genotypic data indicating the copy number and the presence or absence of an indel, and the phenotypes indicating virus resistance were compared with each other to be able to draw conclusions. Results of certain representative lines are presented in Table 1.

(58) ‘Geno1’ refers to plants in which only one version of the combination of two closely linked inversely oriented RDR1 genes is present, so there are no multiple copies. ‘Geno2’ is a genetic background in which 2 copies are present, but they are located far from each other in the genome, probably around 200 kb or even more in between. Geno3′ is a genetic background that does not have the indel in CsRDR1_II that is known to lead to CVYV. The copies in this background are located at a distance of between 1000 and 6000 bp from each other. ‘Geno4’ is a genetic background in which the copies of the combination of RDR1 genes are tandem duplications. Also, CsRDR1_II in this background has the indel that leads to CVYV resistance.

(59) ‘R’ means that in that test all plants were resistant. A score of 8/2/0 means that 8 plants are resistant, i.e. without symptoms, 2 plants show light symptoms, and 0 plants are susceptible.

(60) TABLE-US-00001 TABLE 1 Copy number in relation to CVYV and CGMMV resistance, and gene expression COPY NUMBER calculated Calculated copy EXPRESSION RESISTANCE Reference cnv WGS CNV qPCR number 14138 14137 CVYV CGMMV-t1 CGMMV-t2 Geno1_1 1.1 0.87 −1.31 −0.81 S S S Geno1_2 1.11 0.93 −0.64 −0.55 S S S Geno2_1 2.19 1.92 2x 0.9 −1.32 S 8/2/0 8/2/0 Geno3_1 2.02 1.86 2x 1.05 −0.33 S R R Geno4_1 1.08 1.02 2.29 −0.68 R S S Geno4_2 2.13 1.99 2x 3.73 −1.14 R R R Geno4_3 3.21 3.13 3x 3.56 −0.47 R R R Geno4_4 3.03 3.2 3x 4.54 −0.22 R R 3/7/0

(61) Based on these results it was concluded that the presence of multiple copies of the combination of RDR1 genes leads to resistance to CGMMV. When the indel in one of the two RDR1 genes is present (Geno4) it gives only CVYV resistance when just 1 version is present (Geno4_1). Only when two or more copies are present there is resistance to both CVYV and CGMMV. No copies and no indel gives susceptibility to both viruses (Geno1). Geno2_1 and Geno3_1 show that CGMMV resistance can be present independent of CVYV resistance.

(62) The invention is further described by the following numbered paragraphs:

(63) 1. Genetic determinant comprising at least two copies of a combination of two closely linked RDR1 genes, which two closely linked RDR1 genes are inversely oriented, and which genetic determinant leads to virus resistance when present in a plant.

(64) 2. Genetic determinant of paragraph 1, comprising three, four, or more copies of the combination of two closely linked inversely oriented RDR1 genes.

(65) 3. Genetic determinant of paragraph 1 or 2, wherein a) at least one of the RDR1 genes in the combination is represented by SEQ ID NO: 1 or has a sequence identity of at least 70% thereto; or b) at least one of the RDR1 genes in the combination encodes a protein represented by SEQ ID NO: 2 or encodes a protein that has a sequence identity of at least 70% to SEQ ID NO: 2.

(66) 4. Genetic determinant of paragraph 1 or 2, wherein a) at least one of the RDR1 genes in the combination is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto; or b) at least one of the RDR1 genes in the combination encodes a protein represented by SEQ ID NO: 4 or encodes a protein that has a sequence identity of at least 70% to SEQ ID NO: 4.

(67) 5. Genetic determinant as paragraphed paragraph 3 or 4, wherein a) one of the RDR1 genes in the combination is represented by SEQ ID NO: 1 or has a sequence identity of at least 70% thereto, and one of the RDR1 genes in the combination is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto; or b) one of the RDR1 genes in the combination encodes a protein represented by SEQ ID NO: 2 or a protein that has a sequence identity of at least 70% thereto, and one of the RDR1 genes encodes a protein represented by SEQ ID NO: 4 or a protein that has a sequence identity of at least 70% thereto.

(68) 6. Genetic determinant of any of the paragraphs 1-5, wherein at least one of the RDR1 genes in the combination has an indel upstream of the start codon.

(69) 7. Genetic determinant of any of the paragraphs 1-6, wherein the distance between the two RDR1 genes that are inversely present is not more than 3000 bp.

(70) 8. Genetic determinant of any of the paragraphs 1-7, wherein the distance between copies of combinations of two RDR1 genes is not more than 6000 bp, preferably not more than 1000 bp, most preferably 0 bp.

(71) 9. Genetic determinant of any of the paragraphs 1-8, wherein one of the RDR1 genes is CsRDR1_II, represented by SEQ ID NO: 1, or has at least 90% sequence identity thereto, and one of the RDR1 genes is CsRDR1_I, represented by SEQ ID NO: 3, or has at least 90% sequence identity thereto, the presence of which genetic determinant in a Cucumis sativus plant leads to resistance to Cucumber Green Mottle Mosaic Virus.

(72) 10. Genetic determinant of any of the paragraphs 6-8, wherein one of the RDR1 genes is CsRDR1_I, represented by SEQ ID NO: 3, or has at least 90% sequence identity thereto, and one of the RDR1 genes is a modified CsRDR1_II, represented by SEQ ID NO: 5, or has at least 90% sequence identity thereto, the presence of which genetic determinant in a Cucumis sativus plant leads to resistance to Cucumber Green Mottle Mosaic Virus and Cucumber Vein Yellowing Virus.

(73) 11. Method for producing a virus resistant plant comprising introducing the genetic determinant of any of the paragraphs 1-10 into a plant.

(74) 12. Method for selecting a virus resistant plant, comprising determining the copy number of a combination of two closely linked RDR1 genes that are inversely present, and selecting a plant that comprises at least two copies, preferably at least three copies of said combination as a virus resistant plant comprising the genetic determinant of any of the paragraphs 1-10.

(75) 13. Method of paragraph 11 or 12 wherein the plant belongs to a species selected from the group consisting of Phaseolus vulgaris, Beta vulgaris, Brassica oleracea, Daucus carota, Lactuca sativa, Cucumis melo, Cucumis sativus, Cucumis pepo, Spinacia oleracea, Solanum lycopersicum, Capsicum annuum, and Citrullus lanatus.

(76) 14. Method of paragraph 11, 12, or 13 wherein the virus is of the family Potyviridae, Bromoviridae, and/or of the family Virgaviridae.

(77) 15. Method of any of the paragraphs 11-14, wherein the combination of two closely linked RDR1 genes comprises at least one RDR1 gene that is represented by SEQ ID NO: 1 or has a sequence identity of at least 70% thereto, and/or at least one RDR1 gene that is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto.

(78) 16. Method of any of the paragraphs 11-14, wherein the combination of two closely linked RDR1 genes comprises at least one RDR1 gene that is represented by SEQ ID NO: 3 or has a sequence identity of at least 70% thereto, and/or at least one RDR1 gene that is represented by SEQ ID NO: 5 or has a sequence identity of at least 70% thereto.

(79) 17. Method of any of the paragraphs 11-16, wherein the virus resistant plant is a Cucumis sativus plant which is resistant to Cucumber Green Mottle Mosaic Virus, and optionally resistant to Cucumber Vein Yellowing Virus.

(80) 18. Method of paragraph 15 or 17, wherein the combination of two closely linked RDR1 genes comprises CsRDR1_II, represented by SEQ ID NO: 1, or a gene that has at least 90% sequence identity thereto, and CsRDR1_I, represented by SEQ ID NO: 3, or a gene that has at least 90% sequence identity thereto, and wherein the selected virus resistant plant is a Cucumis sativus plant comprising two copies, preferably three or more copies, of said combination, the presence of which leads to resistance to Cucumber Green Mottle Mosaic Virus.

(81) 19. Method of paragraph 16 or 17, wherein the combination of two closely linked RDR1 genes comprises a modified CsRDR1_II, represented by SEQ ID NO: 5, or a gene that has at least 90% sequence identity thereto, and CsRDR1_I, represented by SEQ ID NO: 3, or a gene that has at least 90% sequence identity thereto, and wherein the selected virus resistant plant is a Cucumis sativus plant comprising at least two copies, preferably at least three copies, of said combination, the presence of which leads to resistance to Cucumber Green Mottle Mosaic Virus resistance and to Cucumber Vein Yellowing Virus.

(82) 16. Plant, which is resistant to one or more viruses due to the presence in its genome of the genetic determinant of any of the paragraphs 1-10.

(83) 17. Seed, wherein a plant grown from the seed is resistant to one or more viruses due to the presence in its genome of the genetic determinant of any of the paragraphs 1-10.

(84) 18. Plant of paragraph 16, or seed of paragraph 17, wherein the virus is of the family Potyviridae, Bromoviridae, and/or of the family Virgaviridae.

(85) 19. Plant of paragraph 16 or 18, or seed of paragraph 17 or 18, which is a plant or a seed of a species selected from the group consisting of Phaseolus vulgaris, Beta vulgaris, Brassica oleracea, Daucus carota, Lactuca sativa, Cucumis melo, Cucumis sativus, Cucumis pepo, Spinacia oleracea, Solanum lycopersicum, Capsicum annuum, and Citrullus lanatus.

(86) 20. Plant of paragraph 16, 18, or 19 which is a Cucumis sativus plant comprising the genetic determinant of paragraph 9 or 10, wherein the genetic determinant comprises at least two copies, preferably at least three copies of the combination, which Cucumis sativus plant is resistant against Cucumber Green Mottle Mosaic Virus, and is optionally resistant against Cucumber Vein Yellowing Virus.

(87) 21. Cucumis sativus plant of paragraph 20 comprising the genetic determinant of paragraph 10 which plant is resistant against Cucumber Green Mottle Mosaic Virus and Cucumber Vein Yellowing Virus due to the presence of the genetic determinant.

(88) Having thus described in detail preferred embodiments of the present invention, it is to be understood that the invention defined by the above paragraphs is not to be limited to particular details set forth in the above description as many apparent variations thereof are possible without departing from the spirit or scope of the present invention.