OPTICAL DEVICES AND METHODS
20220300798 · 2022-09-22
Inventors
- James Tan You SIAN (Oxford (Oxfordshire), GB)
- Zengguang CHENG (Oxford (Oxfordshire), GB)
- Harish BHASKARAN (Oxford (Oxfordshire), GB)
Cpc classification
G06N3/0675
PHYSICS
G02F1/0118
PHYSICS
G06N3/049
PHYSICS
G02F1/0126
PHYSICS
International classification
Abstract
An optical associative learning element (200) comprising a first waveguide (202), a second waveguide (204) and a modulating element (206), wherein: a cascaded first (208) and second (210) directional coupler are formed from a portion (212) of the first (202) and second (204) waveguides in which the first (202) and second (204) waveguides are substantially parallel, evanescently coupled and separated by a gap; the modulating element (206) is evanescently coupled to the second waveguide (204) in the second directional coupler (210) and is arranged to modify a transmission or absorption characteristic of the second waveguide (204) dependent on the state of the modulating element (206); and the state of the modulating element (206) is adjustable between a first and second state by an optical field carried by the first (202) and/or second (204) waveguide.
Claims
1. An optical associative learning element comprising a first waveguide, a second waveguide and a modulating element, wherein: a cascaded first and second directional coupler are formed from a portion of the first and second waveguides in which the first and second waveguides are substantially parallel, evanescently coupled and separated by a gap; the modulating element is evanescently coupled to the second waveguide in the second directional coupler and is arranged to modify a transmission or absorption characteristic of the second waveguide dependent on the state of the modulating element; and the state of the modulating element is adjustable between a first and second state by an optical field carried by the first and/or second waveguide.
2. The optical associative learning element according to claim 1, wherein modulating element is configured to modify the amount of coupling between the first and second waveguides in the second directional coupler dependent on the state of the modulating element.
3. The optical associative learning element according to claim 1, wherein the modulating element comprises a phase change material, the modulating element comprising a compound or alloy of a combination of elements selected from the following list of combinations: GeSbTe, VO.sub.x, NbO.sub.x, GeTe, GeSb, GaSb, AgInSbTe, InSb, InSbTe, InSe, SbTe, TeGeSbS, AgSbSe, SbSe, GeSbMnSn, AgSbTe, AuSbTe, and AlSb.
4. (canceled)
5. The optical associative learning element according to claim 1, wherein the second waveguide is tapered in the portion corresponding to the second directional coupler, such that a width of the second waveguide in the first directional coupler is greater than a corresponding width of the second waveguide in the second directional coupler.
6. The optical associative learning element according to claim 5, wherein the width of the second waveguide in the first directional coupler is in the range 1.05 μm to 1.15 μm and the width of the second waveguide in the second directional coupler is in the range 0.95 μm to 1.04 μm and the second waveguide tapers over a distance in the range 0.4 μm to 0.6 μm.
7. The optical associative learning element according to claim 1 wherein the length of the first directional coupler is in the range 1.5 μm to 3.0 μm and the length of the second directional coupler is in the range 10 μm to 20 μm.
8. The optical associative learning element according to claim 1, wherein the second directional coupler is arranged such that: when the modulating element is in the first state, the first waveguide provides a first output intensity I.sub.1 when an optical field having intensity I.sub.0 is introduced into the first waveguide prior to the first directional coupler, and a second output intensity I.sub.2 when an optical field having intensity I.sub.0 is introduced into the second waveguide prior to the first directional coupler; and when the modulating element is in the second state, the first waveguide provides a third output intensity I.sub.3 when an optical field having intensity I.sub.0 is introduced into the first waveguide prior to the first directional coupler, and a fourth output intensity I.sub.4 when an optical field having intensity I.sub.0 is introduced into the second waveguide prior to the first directional coupler, wherein the magnitude of the difference between I.sub.4 and I.sub.3, |I.sub.4−I.sub.3|, is less than the magnitude of the difference between I.sub.2 and I.sub.1, |I.sub.2−I.sub.1|.
9. The optical associative learning element according to claim 8, wherein the magnitude of the difference between I.sub.4 and I.sub.3 is less than or equal to 10% of the magnitude of the difference between I.sub.2 and I.sub.1, preferably less than or equal to 5% of the magnitude of the difference between I.sub.2 and I.sub.1, more preferably less than or equal to 1% of the magnitude of the difference between I.sub.2 and I.sub.1.
10. (canceled)
11. A photonic chip comprising: the optical associative learning element according to claim 1; an input coupler for coupling optical fields into the photonic chip; and a splitter arranged to divide an output of the input coupler into first and second spatial paths on the photonic chip, wherein the first spatial path is coupled to the first waveguide of the optical associative learning element and the second spatial path is coupled to the second waveguide of the optical associative learning element, and the first and second spatial paths are arranged to introduce an optical phase delay between optical fields arriving at the first directional coupler of the optical associative learning element.
12. The photonic chip according to claim 11, wherein the optical phase delay and the first directional coupler of the optical associative learning element are arranged such that optical intensity is accumulated in the second waveguide at the interface between the first and second directional couplers of the learning element when both the first and second waveguides carry optical fields contemporaneously.
13. An optical system, comprising: the photonic chip according to claim 11:44; a light source coupled to the input coupler and arranged to provide optical fields to the optical assortative learning element via the first and second spatial paths; and a detector arrangement coupled to the first waveguide of the optical associative learning element at the output of the second directional coupler thereof.
14. The optical system according to claim 13, wherein: the light source comprises a first laser, a second laser and an optical combiner, the first laser is arranged to produce first optical pulses having a first wavelength; the second laser is arranged to produce second optical pulses having a second wavelength, different from the first wavelength; the optical combiner is arranged to receive the first and second optical pulses from the first and second lasers and combine them into a common spatial mode; an output of the optical combiner is coupled to the input coupler of the photonic chip; the first and second spatial paths on the photonic chip comprise a first ring resonator and a second ring resonator respectively, the first ring resonator is arranged to select said first wavelength from said first portion and the second ring resonator is arranged to select said second wavelength from said second portion; and the outputs of the first and second ring resonators are coupled to the first and second waveguides respectively of the optical associative learning element, prior to the first directional coupler.
15. The optical system according to claim 14, wherein the detector arrangement comprises a beam splitter, a first optical tuneable filter, a second optical tuneable filter, a first photodiode and a second photodiode, wherein: the beam splitter is arranged to split the optical intensity from the first waveguide into a first spatial mode and a second spatial mode; the first optical tuneable filter is arranged to select the first wavelength in the first spatial mode; the second optical tuneable filter is arranged to select the second wavelength in the second spatial mode; the first photodiode is arranged to detect optical intensity after the first optical tuneable filter; and the second photodiode is arranged to detect optical intensity after the second optical tuneable filter.
16. The optical system according to claim 13, further comprising a controller arranged to: control the light source to produce a pre-determined sequence of optical fields; receive one or more readouts from the detector arrangement; and determine a learning status of the optical associative learning element based on the one or more readouts.
17. An optical artificial neural network, comprising a plurality of optical associative learning elements according to claim 1, wherein at least two of the optical associative learning elements are coupled together.
18. The optical artificial neural network according to claim 17, wherein the output of the first waveguide of a first one of the plurality of optical associative learning elements is coupled to the input of the first or second waveguide of a second one of the plurality of optical associative learning elements.
19. A method of performing an associative learning operation in the optical domain using a device comprising a first waveguide, a second waveguide and a modulating element, wherein: a cascaded first and second directional coupler are formed from the portion of the first and second waveguides in which the first and second waveguides are substantially parallel, evanescently coupled and separated by a gap; the modulating element is evanescently coupled to the second waveguide in the second directional coupler and is arranged to modify a transmission or absorption characteristic of the second waveguide dependent on the state of the modulating element, the method comprising: providing first and second optical fields contemporaneously to the first and second waveguides respectively thereby modifying a state of the modulating element.
20. The method according to claim 19, wherein modifying a state of the modulating element comprises changing the state of the modulating element from a more crystalline state to a less crystalline state, such as an amorphous state.
21. (canceled)
22. The method according to claim 19, further comprising selecting a relative optical phase delay between the first and second optical fields in order to maximize an accumulated optical intensity in the second waveguide at the interface between the first directional coupler and the second directional coupler.
23. The method according to claim 19, further comprising determining a learning status of the device by determining optical transmittance factors through the device for optical fields coupled to inputs of the first and second waveguides separately, wherein the device is deemed to be in a post-learning state if said optical transmittance factors are within 10% of each other.
24-25. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0084] Example embodiments will be described, by way of example only, with reference to the drawings, in which:
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103] It should be noted that the Figures are diagrammatic and not drawn to scale. Relative dimensions and proportions of parts of these Figures have been shown exaggerated or reduced in size, for the sake of clarity and convenience in the drawings. The same reference signs are generally used to refer to corresponding or similar feature in modified and different embodiments.
DETAILED DESCRIPTION
[0104]
[0105] Classical conditioning was initially described in Ivan Pavlov's dog experiment in 1927. In the experiment, food was the UCS that triggered an unconditioned response (UCR) i.e., the dog's salivation; while the ringing bell sound was the NS or CS. The bell (NS/CS) only triggered the salivation response R after the ringing bell was associated by repletion with food. Thus, these initially distinct responses eventually converged to a single response after similar stimuli co-occurrence, which associated the stimuli.
[0106] Two main roles of the simplified neural circuitry of
[0107] According to the present disclosure, with reference to
[0108] The optical associative learning element 200 is arranged to accumulate optical intensity in the second waveguide 204 at the interface 214 between the first 208 and second 210 directional couplers when both the first 202 and second 204 waveguides carry optical fields contemporaneously, i.e. when the optical fields carried by the first 202 and second 204 waveguides are substantially overlapping in time. For example, the first directional coupler 208 is arranged such that when a first optical field is carried by the first waveguide 202 and contemporaneously a second optical field is carried by the second waveguide 204, the first directional coupler 208 transfers at least a portion of the intensity of the first optical field from the first waveguide 202 to the second waveguide 204, such that the total optical intensity in the second waveguide 204 at the interface 214 between the first 208 and second 210 directional couplers is greater than the total optical intensity in the second waveguide 204 at the start of the first directional coupler 208.
[0109] In this manner, the net optical energy/intensity from both the inputs (UCS and NS/CS) is converged in the lower waveguide 204 of the first coupler 208. This can cause a fractional volume of the modulating element 206 to be switched to a different state. For example, the modulating element may comprise a phase change material (PCM) and the converged optical energy/intensity causes a fractional volume of the modulating element 206 to be switched from a crystalline state to an amorphous state. With more converging learning optical fields (e.g. pulses), a larger volume of material of the modulating element 206 switches from crystalline to amorphous which could be considered to correspond to a switching from a “before learning” state to an “after learning state”. This is illustrated in
[0110] In some embodiments, the PCM 206 deposited on the second waveguide 204 is a germanium antimony tellurium alloy Ge.sub.2Sb.sub.2Te.sub.5 (GST). In general, the modulating element 206 comprises a material comprising a compound or alloy of a combination of element selected from the following list of combinations: GeSbTe, VOx, NbOx, GeTe, GeSb, GaSb, AgInSbTe, InSb, InSbTe, InSe, SbTe, TeGeSbS, AgSbSe, SbSe, GeSbMnSn, AgSbTe, AuSbTe, and AlSb. GST is well-suited as it has a low structural phase transition time (sub-ns amorphization and few-ns crystallization time), high cycling endurance (˜10.sup.12 cycles), and long retention time (>10 years at room temperature). In some embodiments, a thin capping layer of indium tin oxide (ITO) may be additionally deposited on the PCM cell to prevent oxidation, and to localize optically-induced heat for PCM structural phase switching.
[0111] The first directional coupler 208 performs the function of determining the input optical intensity combinations (input to the first 202 and second 204 waveguides) that sufficiently trigger the associative learning process. Meanwhile, the second directional coupler 210 regulates the output response R which is measured from the output of the first waveguide 202. The lower (second) waveguide 204 of the first directional coupler 208 is the site where optical energy from the UCS and NS/CS inputs accumulates for structural phase switching to occur in the modulating element 206, thereby regulating the output response R. It is desirable that the optical associative learning element 200 associatively learns only upon two-input incidence, i.e. when optical fields are present in the first 202 and second waveguides 204 contemporaneously. As mentioned above, the first directional coupler 208 is configured to accumulate optical intensity in the lower waveguide 204 at the interface 214 for switching the state of the modulating element 206—which constitutes the learning process. The regulation of the output response R is performed by the second coupler 210. This is measured upon one-input incidence, i.e. a single optical field incident either in the first waveguide 202 or the second waveguide 204.
[0112]
[0113] In embodiments, the photonic chip 416 comprises a first ring resonator 410 and a second ring resonator 412 in the first 422 and second 424 spatial paths respectively. The light source 402 comprises a first laser 404, a second laser 406 and an optical combiner 408. The first laser 404 is arranged to produce first optical pulses having a first wavelength B. The second laser 406 is arranged to produce second optical pulses having a second wavelength A, in general different from the first wavelength. The optical combiner 408 is arranged to receive the first and second optical pulses from the first and second lasers and combine them into a common spatial mode, e.g. in a single fiber optical cable or waveguide. The first ring resonator is arranged to receive a first portion of the output intensity of the optical combiner and the second ring resonator is arranged to receive a second portion of the output intensity of the optical combiner. The first ring resonator is arranged to select said first wavelength from said first portion and the second ring resonator is arranged to select said second wavelength from said second portion. The outputs of the first and second ring resonators are coupled to the first 202 and second 204 waveguides of the learning element 200 respectively, prior to the first directional coupler 208. The first 404 and second 406 lasers represent the UCS and NS/CS stimuli. After passing through the ring resonators, only UCS or NS/CS is selected for each waveguide at the resonant wavelength B or A and sent to the element 200. Thus, control of wavelengths helps regulate the device operation.
[0114] The optical phase difference or optical delay between the optical fields carried by the first 202 and second waveguides 204 in the learning element 200 affects how energy is coupled in the first directional coupler 208 and therefore the extent to which optical energy/intensity is accumulated in the lower waveguide 204. The relative time delay of the optical phases between the UCS (upper waveguide 202) and NS/CS (lower waveguide 204) inputs may be denoted Δt+t.sub.UCS−t.sub.NS/CS, where t.sub.UCS and t.sub.NS/CS are the times at which the respective optical field input signals UCS and NS/CS are referenced to the same point in phase. In embodiments, phase delay control is achieved using an on-chip photonic layout 416 which contains the learning element 200 in addition to the ring resonators 410, 412 and spatial paths 422 and 424. The layout 416 locks the time delay of the phases (phase delay) as a function of spatial path length difference from the optical splitter 418, contained on the layout 416 and arranged to receive the output of the optical combiner 408, to the first 202 and second 204 waveguide inputs of the element 200. Given the broadband response of the optical element, the relative time delay of the phases to the waveguide inputs of the element can be precisely defined with respect to the input wavelength to the layout. On the other hand, to enable single input incidences to the element, a respective ring resonator 410, 412 is coupled to the two waveguide paths prior to the input ports of the element. The single UCS (NS/CS) input is incident when the input to the on-chip layout is of ring B (A) resonant wavelength λ.sub.B (λ.sub.A). By precisely defining the optical wavelength of the input laser source, the on-chip layout sorts both the single- and two-input incidences to the element. Simultaneous real-time monitoring of the element is carried out by using a photodetector 414 to measure the output transmission and thereby determine the learning element response R.
[0115] Example physical parameters of the learning element 200 were determined using coupled mode theory. The first directional 208 coupler effectively performs the function of determining the input optical field combinations that sufficiently trigger the associative learning process, whilst the second directional coupler 210 is used for regulating the output response R. In other words, two-input coupling to the second waveguide 204 at the interface 214 between the first 208 and second 210 directional couplers should be enhanced and ‘one-input coupling’ from the first waveguide 202 to the second waveguide 204 should be impeded by exploiting the critical coupling length contrast between the one-input case and two-input case. On the other hand, in the second directional coupler 210, the difference in output response R due to the loss contrast between the PCM 206 structural states that represent the before and after learning cases is exploited.
[0116] In one example, the first directional coupler 208 has a length of 2 μm and the second directional coupler 210 has a length of 15 μm. The first waveguide 202 is a plain waveguide consistently of nominal width 0.9 μm. In the second waveguide 204, the width of the segment corresponding to the first directional coupler 208 is 0.9 μm, whereas the width of the segment corresponding to the second directional coupler 210 is tapered from 0.9 μm to 0.8 μm. This tapering compensates for the non-zero permittivity of the PCM 206 which contributes to the effective refractive index of the waveguide. The tapering therefore provides optimal inter-waveguide coupling with a waveguide separation gap of 0.1 μm. However, it should be appreciated that the tapering is not essential and the learning element can still function without it. It should be appreciated that the learning element 200 capitalizes on PCM optical loss contrast between the two phases (crystalline and amorphous) to absorb and direct the optical field before and after the learning process. The use of directional couplers ensures the applicability of the element over a broad optical wavelength range.
[0117] Simulations were performed based on the above exemplary dimensions of the learning element 200 using three-dimensional finite difference time domain FDTD numerical simulation, the results of which are shown in
[0118]
[0119] Experimental results are presented in
[0120] At the start of the experiment, UCS pump input pulses at 14.5 mW power were sent into the learning element 200 (in the first waveguide 202) in events 1 and 2. It was observed that the readouts remained at the baselines. The readouts likewise remained the same when only NS pump input pulses at 14.5 mW power were sent into the learning element 200 (in the second waveguide 204) in events 3 and 4. However, when both UCS and NS pump pulses were sent together with a fixed phase delay (according to this example, Δt=0.825 fs) at 6.6 mW each (13.2 mW total) in event 5, the transmission change (ΔTr Tr.sub.0) for the UCS and NS probe readouts changed by ˜−4% and ˜+4% respectively. As the input pump pulse power was increased from 6.6 mW to 14.5 mW each in events 6-8, the probe readouts further changed by nearly −7% and +7% respectively, both of which were well above the UCR/CR response threshold at Tr˜0.07. The experiment confirms the association of input NS/CS (analogous to the ringing bell in Pavlov's dog experiment) to input UCS (analogous to the food in Pavlov's dog experiment) through its output CR which is the learned response from UCR (analogous to salivation in Pavlov's dog experiment), after the temporal pairing of UCS and NS pump inputs in events 5-8 that caused the PCM to switch towards a more amorphous state (in contrast to that in events 1-4).
[0121] The reversibility of the associative learning process is further shown in
[0122] With reference to
[0123] It should be appreciated that in embodiments, Δt influences the accumulated optical pump field at the interface 214 between the first 208 and second 210 directional couplers of the learning element 200.
[0124] The Δt-dependence on the coupling provides greater on-demand control to generalize, discriminate and scale the pulse wavelengths that can induce the learning process, when both inputs are sent to the learning element 200. Given the sinusoidal/modular nature of Δt, sending both the pump pulses can produce the same output probe response at a set of predetermined regularly-spaced wavelengths, in contrast to single-input incidence case. The wavelength-insensitive feature of the element upon single-input incidence is due to the non-temporally resonant cascaded structures (i.e. the cascaded first 208 and second 210 directional couplers) that make up the element, whose broadband response is limited only by the change in coupling strength as the wavelength is varied. The timing-dependent plasticity of the associative learning element is consistent with the STDP rule albeit at a different order, thus permitting the associative implementation of input-input temporal contiguity in photonic neuromorphic systems.
[0125] Table 1 summarizes the minimum active volume and learning energy of other associative learning devices, except that of the synthetic biological genetic device which cannot be determined. These known electronic and optoelectronic associative learning devices range from ˜0.1 to 10.sup.10 μm.sup.3 in active volume and consume ˜2.63 to 105 nJ of energy per learning event. In comparison, the all-optical associative learning element 200 according to the present disclosure exhibits favourable characteristics in terms of dimensions and energy usage, with a low active volume at 0.12 μm.sup.3 and minimum learning energy at 1.8 nJ. In an embodiment, the single-element device is of 3 μm×17 μm area dimensions.
TABLE-US-00001 TABLE 1 active volume and learning energy of associative learning devices Active volume Min. learning energy Type (μm.sup.3) (nJ) Electronic Memresistive i. Chalcogenide 0.2-15 4.7 × 10.sup.4 8 2.63 ii. Manganite ~0.1 1.35 × 10.sup.3 1.25 × 10.sup.10 1.02 × 10.sup.5 iv. Nickelate 4.7 × 10.sup.3 7.20 × 10.sup.5 4.8 × 10.sup.4 2.04 × 10.sup.5 v. Metal oxide 900 4.5 × 10.sup.3 vi. Organic ~0.5 9.75 × 10.sup.3 Electrochemical 6 × 10.sup.3 6 × 10.sup.4 9.6 × 10.sup.5 125 Memcapacitive 80.7 ~30 Optoelectronic 1.62 × 10.sup.3 ~2.1 × 10.sup.3 Learning element of 0.12 1.8 the present disclosure
[0126] The associative learning element 200 of the present disclosure may be employed as a building block in artificial neural networks, with reduced energy consumption, as is apparent from the data presented in Table 1. Conventional artificial neural networks on the Hebbian learning rule adopt the backpropagation algorithm, with an inherent nonlinear scaling (O(N.sup.2˜N.sup.3)) of computational effort with the synaptic connection number N. In contrast, the computational effort in neural networks that are built on associative learning scales linearly (O(N)) with N (see U.S. Pat. No. 5,588,091). Considering the typically large training input datasets required to solve a particular machine learning task, it follows that the number of iterations needed to achieve convergence can be significantly reduced by using associative learning elements; thus substantially downscaling the training time and energy usage of neural network. Therefore it should be appreciated that the present disclosure also provides photonic neural networks built on the optical associative learning element 200 according to the present disclosure, with applications in noisy pattern recognition and classification, for example.
[0127] The relation between the learning element 200 output response R and input stimuli S can be expressed in the compact matrix notation R=M.sub.(II) M.sub.(I) S, where the 2×1 column vector S=(UCS, NS/CS).sup.T, while the 2×2 and 1×2 matrices that describe the first 208 and second 210 directional couplers respectively are given by:
in which s is the waveguide mode coupling coefficient, θ.sub.b=cosh.sup.−1(γ.sub.crys/4κ), θ.sub.a=sin.sup.−1(γ.sub.am/4κ), l.sub.1 is the length of the first directional coupler 208 and l.sub.2 is the length of the second directional coupler 210.
[0128] In the first directional coupler 208, when two identical inputs E.sub.0 of the same wavelength λ.sub.0 are sent into the learning element 200, the total field coupled to the respective waveguides at the interface 214 between the first 208 and second 210 directional couplers is the product of matrix M.sub.(I) and column vector (e.sup.−ωΔt, 1).sup.T, where ω=(2πc/λ.sub.eff), c is the vacuum speed of light, λ.sub.eff=λ.sub.0/n.sub.eff is the effective wavelength in the waveguide, and n.sub.eff is the effective refractive index in the waveguide. It follows that the field intensity at the second waveguide 204 at the interface 214 is |E.sub.l1|.sup.2.sub.two=E.sub.0.sup.2 (1+sin(2κl.sub.1) sin(ωΔt)). In comparison, for one-input incidence, the coupled field intensity is |E.sub.l1|.sup.2.sub.one=E.sub.0.sup.2 sin.sup.2(κl.sub.1). Thus, the critical coupling (maximum energy transfer) length of the first directional coupler 208 is l.sub.crit=π/κ for one-input incidence and l.sub.crit/2 (at ωΔt=π/2) for two-input incidence. Given κ=0.157 μm.sup.−1 in embodiments of the present disclosure, this gives |E.sub.l1|.sup.2.sub.two=1.588 (for ωΔt=π/2) and |E.sub.l1|.sup.2.sub.one=0.095 at l.sub.1=2 μm. From a PCM 206 switching energy threshold perspective, the ratio |E.sub.l1|.sup.2.sub.two/(1−|E.sub.l1|.sup.2.sub.one)=1.755 is indicative of associative learning because of the significant energy surplus upon two-input incidence relative to the maximum energy from one-input incidence.
[0129] In the second directional coupler 210, the relative change in output response R, which is measured for one-input incidences, can be estimated largely based on M.sub.(II) because |E.sub.l1|.sup.2.sub.one in the first cascade (l.sub.1=2 μm) is negligibly low. Thus, the ratio η=R.sub.NS/CS/R.sub.UCS|.sup.2 can be approximated as η≈|M.sub.(II)12/M.sub.(II)11|.sup.2. This leads to η.sub.b≈|sinh (κl.sub.2 sinh θ.sub.b)/sinh (κl.sub.2 sinh θ.sub.b+θ.sub.b)|.sup.2 and η.sub.a≈sin(κl.sub.2 cos θ.sub.a)/cos (κl.sub.2 cos θ.sub.a−θ.sub.a)|.sup.2 (subscript ‘b’ and ‘a’ denote before and after learning). Additionally, the output transmission difference between UCR.sub.b and UCR.sub.a can be denoted as Δ|R|.sup.2=|M.sub.(II)11b|.sup.2−|M.sub.(II)11a|.sup.2 where the alphabetic subscripts likewise denote the learning states. Given γ.sub.crys=7.65κ and γ.sub.am=0.24κ according to the present disclosure, η.sub.b≈0.072 and η.sub.a≈1.006 at l.sub.2=15 μm. Therefore it is possible to attain η.sub.b<<η.sub.a due to the unbounded sinh and positive unbounded cosh functions which cause η.sub.b.fwdarw.0 with the substantially large γ.sub.crys. The set of relations η.sub.b<<η.sub.a and η.sub.a≈1 is the second signature of associative learning because the output R upon NS/CS input incidence transitions from a significantly low value (η.sub.b<<η.sub.a) to that of UCS (η.sub.a≈1) which remains within the same transmission range (Δ|R|.sup.2<0.5).
[0130] In embodiments, the optical associative learning element 200 was fabricated on a Si.sub.3N.sub.4/SiO.sub.2 platform. Electron beam lithography (JEOL 5500FS, JEOL Ltd.) was used at 50 kV to define the Si.sub.3N.sub.4 structure on the Ma-N 2403 negative-tone resist-coated substrate. After the development process, reactive ion etching (PlasmaPro 80, Oxford Instruments) was performed in CHF.sub.3/O.sub.2/Ar to etch down 330 nm of Si.sub.3N.sub.4. A subsequent step of electron beam lithography was implemented on a poly(methyl methacrylate) (PMMA) positive resist-coated substrate to open a window for the PCM cell. This was followed by the sputter-deposition of 10-nm GST/10-nm ITO on the substrate. The element characterization process was performed using a high resolution emission gun SEM (Hitachi S-4300 SEM system—Ibaraki, Japan) with low accelerating voltage (1 to 3 kV) at a working distance of ˜13 mm.
[0131] An exemplary optical setup 1000 employing an optical associative learning element 200 according to the present disclosure is illustrated schematically in in
[0132]
TABLE-US-00002 TABLE 2 parameters of on-chip layout Dimension Dimensions Parameter (μm) Parameter (μm) V.sub.1 150 H.sub.2 242.5 V.sub.2 73.5 H.sub.3 12.35 V.sub.3 39 R.sub.1 45 H.sub.1 78.5 R.sub.2 100
TABLE-US-00003 TABLE 3 parameters of learning element Width Length Parameter (μm) Parameter (μm) d.sub.n 1.1 l.sub.T 17 d.sub.T 1.0 l.sub.1 1.75 d.sub.nT 0.5 l.sub.2 14.75
TABLE-US-00004 TABLE 4 parameters of ring resonators Radius Width Parameter (μm) Parameter (μm) R.sub.A 35 d.sub.rA 0.85 R.sub.B 45 d.sub.rB 0.9
[0133] In some embodiments, the optical associative learning element 200 consists of two cascaded optical directional couplers 208 and 210. For brevity and consistency, these are referred to as cascade I and II in the following paragraphs. The directional couplers, made up of two parallel channel optical waveguides 202 and 204 in close proximity, allow optical energy exchange between the guided modes of adjacent waveguides. The lower waveguide 204 of cascade I (segment L.sub.1) is the site where optical energy from the UCS and NS/CS inputs accumulate for PCM 206 structural phase switching to occur at the lower waveguide 204 of cascade II (segment L.sub.2), thus regulating the output response R of the element 200.
[0134] For the case of a lossy bottom waveguide with similar propagation constants, one can theoretically treat the optical modes in the element starting from the coupled-mode equations da/dx=iκb and db/dx=iκa−(γ/2)b where the normalized x-direction spatially dependent mode amplitudes of the coupled upper and lower waveguides are denoted by a and b; κ is the coupling coefficient, and γ is the loss coefficient of mode b due to the PCM. To ensure the relevance of these equations, the difference in propagation constant is compensated by tapering the second waveguide 204 on which the PCM patch 206 is deposited, which is comparable to using a lossy material with diminishing real permittivity in passive parity-time symmetric directional couplers. Because cascades I and II are respectively without and with the PCM 206, the modes in cascade II are first solved for and then conveniently it is possible to obtain the solution for cascade I by letting γ.fwdarw.0, before cascading both matrices to solve for the output R with respect to the UCS and NS/CS inputs.
[0135] For γ/4κ≤1, given the [−1, +1] range of a sine function, let γ/4κ=sin θ to arrive at
where a.sub.0 and b.sub.0 are the fields a(x=0) and b(x=0) which we relate to the general notations a(x) and b(x) after applying initial boundary condition to the equations. For γ/4κ≤1, let γ/4κ=cosh θ given [1, =∞] range of a hyperbolic cosine function. Following through the same procedure, this gives
[0136] From equation 3 the input-output relation of cascade I is obtained by letting γ.fwdarw.0.
[0137] To describe the output R as a function of the inputs UCS and CS, one can multiply the 2×2 matrix in equation 3 after letting γ.fwdarw.0 for cascade I by that of equation 3 when γ/4κ≤1 or equation 4 when γ/4κ>1 for cascade II. The 2×2 matrix in cascade II can be reduced to a 1×2 matrix because only the output field on the upper waveguide 202 of cascade II represents the output R. The equation for the overall system can thus be concisely written as R=M.sub.(II) M.sub.(I) S where S=(UCS, NS/CS).sup.T is the column vector that denotes the respective inputs to the element while the matrices M.sub.(I) and M.sub.(II) respectively describe the optical coupling tendencies in the cascaded sections of the lengths x=l.sub.1 and x=l.sub.2.
in which θ=sin.sup.−1(γ/4κ) when γ/4κ≤1 and θ=cosh.sup.−1(γ/4κ) when γ/4κ>1. Here, the inputs to the first and second cascades are respectively at x.sub.1=0 and x.sub.2=0.
[0138] From eigenmode simulations of the structure, estimates of the parameter values were obtained as x=0.157 μm.sup.−1, γ.sub.crys=7.65κ and γ.sub.am=0.24κ using the eigenvalue splitting equation Δβ.sub.±=2i (κ.sup.2+(γ/4κ).sup.2).sup.1/2 which directly follows from the coupled mode equations, where γ.sub.crys and γ.sub.am are the loss coefficient γ when the PCM is at crystalline and amorphous structural phases. Because γ.sub.crys/4κ>1 and γ.sub.am/4κ≤1, equation 5 and equation 6 can be written respectively as equations 1 and 2 above.
[0139] When two optical inputs of the same magnitude E.sub.0 and wavelength λ.sub.0 are launched into the element 200, the total field coupled to the respective waveguides at L.sub.1 is scaled by the product of matrix M.sub.(I) and column vector (e.sup.−iωΔt 1). The inputs to the element can thus be rewritten as a.sub.0=E.sub.0e.sup.−ωΔt and b.sub.0=E.sub.0 where the angular frequency ω=(2πc/n.sub.eff λ.sub.0), c is the vacuum speed of light, and n.sub.eff is the waveguide effective refractive index. It follows that the field coupled to the lower and upper waveguide in the first cascade are respectively given by
|E.sub.upper|.sup.2=|E.sub.0e.sup.−iωΔtM.sub.(I)11+E.sub.0M.sub.(I)12|.sup.2=E.sub.0.sup.2(1−sin(2κl.sub.1)sin(ωΔt)) (7)
|E.sub.lower|.sup.2=|E.sub.0e.sup.−ωΔtM.sub.(I)21+E.sub.0M.sub.(I)22|.sup.2=E.sub.0.sup.2(1+sin(2κl.sub.1)sin(ωΔt)) (8)
[0140] At κl.sub.1=π/4 when ωΔt=π/2, it follows that the field coupled to the upper and lower waveguide are respectively 0 and 2E.sub.0, which is indicative of two-input critical coupling. This implies that the two-input critical coupling length at l.sub.1=π/4κ is half the single-input critical coupling length at l.sub.1=π/2κ.
[0141] With reference to
[0142] To conveniently turn on/off the UCS (NS/CS) input and to precisely define the time delay of the phases between the UCS (NS/CS) inputs, the associative learning element 200 is integrated on an on-chip structure 416 as described above. When the input to the on-chip structure is of ring B (A) resonant wavelength λ.sub.B (λ.sub.A), the UCS (NS/CS) inputs are incident to the associative learning element 200.
[0143] While the single-input probe readouts are carried out at the resonant wavelengths λ.sub.A and λ.sub.B, the two-input pump signals (which induce associative learning) are let incident at the non-resonant wavelengths of the ring resonators. The time delay between the inputs Δt can be conveniently defined from the spectrum in
[0144] The simulation results can be further corroborated by equations 7 and 8, which give |E.sub.L1|.sup.2.sub.lower.fwdarw.0 when Δt=2.475 fs at x.sub.2=5 μm and |E.sub.L2|.sup.2.sub.lower.fwdarw.1.588 when Δt=0.825 fs at x.sub.2=2 μm (at the interface 214). To compare the field magnitude at the interface 214 in the second waveguide 204, the electric field profile of the vertical cross-section was retrieved, shown in
[0145] In
TABLE-US-00005 TABLE 5 Fitting parameters of input-input synaptic weight Fitting Model and Parameters FIG. 9, panel (b) (different pump energies) A.sub.f × (1 − cos(ω.sub.f × Δt)) 1.3 nJ 1.8 nJ 2.4 nJ 2.9 nJ A.sub.f Mean 0.01836 0.02529 0.02972 0.03204 Std. 0.00153 0.00134 0.00137 0.00133 deviation ω.sub.f (×10.sup.15 Mean 5.49905 5.36012 5.18749 5.00685 rads.sup.−1) Std. 0.05849 0.0383 0.03369 0.02976 deviation Adjusted R-squared 0.90708 0.96292 0.9711 0.97621
[0146] The results disclosed herein show that after the associative learning process, input CS comes to suggest input UCS, which reflects the typical one-way associative learning process NS/CS.fwdarw.UCS. Additionally, with reference to
[0147] Based on these exemplary dimensions, three-dimensional FDTD numerical simulations of the structure were performed before and after the learning process to corroborate the χ and η calculations above.
[0148] For optical neuromorphic computing applications that require the ability to handle rapid bursts of traffic and heavy loads with little or no notice, it is desirable to have a scalable monolithic hardware system architecture. The optical associative learning element 200 according to the present disclosure can serve as a building block in a neuromorphic network. As disclosed herein, the all-optical associative learning element 200 can be integrated onto a platform (i.e. photonic chip 416) which locks the phase difference between the UCS and NS/CS as a function of the input optical frequency after the optical input through e.g. an apodized grating coupler was divided equally by the on-chip optical splitter 418. This approach capitalizes on the fact that the all-optical associative learning element 200 consists of cascaded first 208 and second 210 directional couplers, which have been found to be robust to stimuli wavelength difference within a reasonably wide wavelength range. An all-optical phase shifter may be introduced on a first layer of the neuromorphic network. Subsequent layers may require only judicious determination of the path length between one associative learning node to another (as demonstrated herein) once the operating optical wavelength has been determined for the prospective scalable neuromorphic network. Several all-optical artificial neural network architectures based on the associative learning element 200 are disclosed herein.
[0149] Typical artificial neural networks originate from the Hebbian learning rule, which describes how neuronal activities affect the connections between neurons i.e., biological neural plasticity. The rule states that the synaptic weight of a neural connection is adjusted based on the relative timing between the activities from two neurons on either sides of a synapse (pre-synaptic and post-synaptic activities). An example of a scheme to artificially implement the spike-based formulation of the Hebbian learning rule, known as spike-timing dependent plasticity (STDP) is shown in
[0150] On the other hand, associative learning for machine learning is based on empirical evidences of the learning process in the marine snail Hermissenda crassicornis and the hippocampus of the rabbit. Inspired by the learning process in these biological neural systems, a distinctively unique type of artificial neural network based on associative learning has been proposed, with the basic neural connection shown in
[0151] Although the appended claims are directed to particular combinations of features, it should be understood that the scope of the disclosure of the present invention also includes any novel feature or any novel combination of features disclosed herein either explicitly or implicitly or any generalisation thereof, whether or not it relates to the same invention as presently claimed in any claim and whether or not it mitigates any or all of the same technical problems as does the present invention.
[0152] Features which are described in the context of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub combination. The applicant hereby gives notice that new claims may be formulated to such features and/or combinations of such features during the prosecution of the present application or of any further application derived therefrom.
[0153] For the sake of completeness it is also stated that the term “comprising” does not exclude other elements or steps, the term “a” or “an” does not exclude a plurality and reference signs in the claims shall not be construed as limiting the scope of the claims.