Method for processing substrates, in particular wafers, masks or flat panel displays, with a semi-conductor industry machine

20210352835 · 2021-11-11


    Abstract

    A method for processing substrates, in particular wafers, masks or flat panel displays, with a semi-conductor industry machine, wherein a computer-supported process is used to determine the presence and/or position and/or orientation of the substrate. A system designed to execute the method is also provided. The computer-supported process includes an artificial neural network.

    Claims

    1. Method for processing substrates, in particular wafers, masks or flat panel displays, with a semi-conductor industry machine, wherein an artificial neural network is used to determine the presence and/or position and/or orientation and/or type of a substrate, to determine the presence and/or position and/or orientation and/or type of the substrate based on at least one image, which shows a location in or on the semi-conductor industry machine or in the environment of the machine where a substrate may be located when operating the semi-conductor industry machine, wherein the at least one image is taken by at least one acquisition unit, and wherein the artificial neural network generates and/or allows for an information data set comprising information about the determined presence and/or position and/or orientation and/or type of the substrate and/or generates and/or allows for a control command, which is used to directly control the semi-conductor industry machine, or is used by the machine's control system, or is passed on to a higher-level control system, or is passed on to a user who draws conclusions from this information for his actions operating the machine, or who passes on this information to control systems or other users, or is saved for later or further evaluation.

    2. Method according to claim 1, wherein the artificial neural network forms at least one model that comprises at least one convolutional layer and/or at least one neuronal layer, and may comprise additional components, such as activation layers, and wherein the artificial neural network can be taught, and/or learned data from a preceding learning process is used, and/or wherein the artificial neural network uses at least one of the methods of regression, machine learning or deep learning.

    3. Method according to claim 1, wherein the information data set comprises information to make it possible to determine or derive the presence of the substrate in a slot, or on a tray, or on a robot, or on an end effector, or on a processing station in the machine, or at another location where the mask may be located in the semi-conductor industry machine, and/or the spatial orientation of the substrate with respect to the side of a slot facing a machine robot, or in relation to a tray, or in relation to a processing station, or in relation to another part of the machine, and/or the spatial location of the substrate with respect to a slot, or to a tray, or to a robot, or to an end effector, or to another location where the substrate may be located in the semi-conductor industry machine, and/or the type of substrate, and/or the presence of a pellicle on a substrate, in particular on a mask, and/or the side of a deposited substrate, in particular a mask on which a pellicle is located and/or the orientation of a pellicle in relation to a reference axis of a tray, a cassette, a robot, an end effector, and/or the presence and/or the position and/or the orientation of a substrate.

    4. Method according to claim 1, wherein the artificial neural network detects incorrect positions of the substrate, including whether the incorrect position concerns a substrate located over several layers of a tray, and/or whether the incorrect position concerns at least two substrates located directly on top of each other, and/or whether the incorrect position concerns a deviation from a specified target position of a substrate, and/or whether the incorrect position concerns a substrate not correctly positioned on all provided support points of a tray.

    5. Method according to claim 1, wherein the artificial neural network detects substrate types, or where the generated information data set can be used to derive the substrate types, including whether the substrate has a substrate holder, and/or which type of a substrate holder, and/or whether the substrate has a structure, and/or which type of a structure, and/or whether the substrate has a pellicle, and/or whether the substrate is a wafer, and/or whether the substrate is a mask, and/or whether the substrate is a flat panel display, wherein the presence and/or position and/or orientation of the substrate and/or of the substrate type can also be derived from the generated information data set, and wherein the artificial neural network is configured and taught to detect substrate types and outputs the information data set required to detect or derive the substrate types.

    6. Method according to claim 1, wherein the substrate can also be located or is located in cassettes and/or on trays and/or on an end effector and/or on a chuck and/or on a processing station of the machine and/or on a positioning unit.

    7. Method according to claim 1, wherein the at least one image is generated by at least one acquisition unit permanently or temporarily integrated in the machine, and/or arranged on the machine or in the environment of the machine, and/or wherein the at least one acquisition unit is installed in a fixed location in relation to the machine, and/or is located on moving machine elements, preferably on a robot arm or on an end effector of a robot or on a positioning unit.

    8. Method according to claim 1, wherein the acquisition unit comprises an optical sensor, an ultrasonic sensor, a distance sensor, a reflex sensor, a radar sensor, an imaging camera or video camera, or an infrared camera.

    9. Method according to claim 1, wherein two acquisition units, in particular two cameras, are provided, which are located at the front of an end effector, wherein the optical axes of the two acquisition units are parallel or at an angle to each other, which is between 5° and 90°, preferably between 10° and 80° and particularly preferably between 15° and 75°.

    10. Method according to claim 1, wherein the artificial neural network acquires more than one image, in particular at least two images, preferably of the same location, which were generated by different acquisition units and/or acquisition units that are arranged differently.

    11. Method according to claim 1, wherein after generating the information data set at least one characteristic is analyzed in a second step by means of another image recognition method, for example a rule-based method or a method for edge detection, or by means of a sensor.

    12. Method according to claim 1, wherein a lighting device is provided which is designed to emit electromagnetic radiation in the direction of the substrate or the location in or on the machine, preferably when an image is captured, wherein the electromagnetic radiation is preferably at least in the visible wavelength range and/or in a wavelength range in which the acquisition unit is sensitive, wherein preferably the lighting device is permanently or temporarily integrated in the machine and/or is mounted to the machine, wherein the lighting device is installed in a fixed location with respect to the machine, and/or wherein the lighting device is located on moving elements of the machine, preferably on a robot arm or on an end effector of a robot or on a positioning unit.

    13. Method according to claim 1, wherein the artificial neural network is housed in a processing unit as a component of a computer architecture, and wherein the computer architecture also comprises at least one component for computer hardware and for a computer operating system.

    14. Method according to claim 1, wherein at least the components of the processing unit, computer hardware and operating system are integrated in a system controller of the semi-conductor industry machine, or in a robot or in an end effector of a robot, or in a robot controller, or in a machine control system, or in a control system at a higher level than the machine, or in a control system not assigned to the machine, preferably in a cloud, or another computer at any location worldwide.

    15. Method according to claim 13, wherein at least the components of the processing unit, computer hardware and the operating system are arranged in at least two different locations, and wherein the artificial neural network and/or the application of the artificial neural network run over more than one of these locations, or wherein the artificial neural network can also be trained on a different computer architecture than the application of the artificial neural network.

    16. Method for providing a taught or trained artificial neural network for use in a method according to claim 1, comprising the steps: provision of a processing unit, which has or comprises at least one artificial neural network, preferably for image processing, training the artificial neural network by capturing and providing at least one image, preferably a plurality of at least 20 images, preferably at least 100 images, particularly preferably at least 1,000 images, wherein preferably a plurality of at least two images is captured and/or provided for training and teaching, which in their expression differ at least in one parameter or influencing factor.

    17. Method for providing a taught or trained artificial neural network according to claim 16 with at least one different expression in at least one of the following parameters: the presence of the substrate at a location; the type of the substrate; the position of the substrate in relation to a target position; the orientation of the substrate in relation to a reference in the machine; the number of substrates in the slot and/or in the cassette in total; the color and/or the transmission behavior of the substrate; the dimensions of the substrate; the condition of the edges of the substrate; the presence of identification tags; the condition of the surface of the substrate; the presence of a pellicle as well as the location, position and orientation of the pellicle in relation to the substrate; the lighting conditions (light intensity and/or light direction); the type, color and/or condition of the background; the image sharpness; the focus; the reflection of other objects on the substrate; light scattering from the environment; and training the artificial neural network based on a categorization or classification of the images.

    18. Method according to claim 16, wherein images are captured and stored during the application of the artificial neural network in the semi-conductor industry machine in order to use these in at least one initial or at least one new learning process to improve the result of the artificial neural network.

    19. Method according to claim 16, wherein the machine has means to grip, hold, transport and/or deposit a substrate, preferably a robot, particularly preferably comprising a moving element or a robot arm and/or an end effector, or a positioning unit.

    20. Method for monitoring or controlling handling systems, preferably comprising a robot or a moving element, preferably a robot arm and/or an end effector, and/or a positioning unit, wherein the handling system preferably has means for gripping, transporting and/or depositing a substrate, wherein at least one image depicting a location in or on the handling system or in the environment of the handling system is captured in a digitized form by an artificial neural network, and wherein the artificial neural network analyzes the image and generates an information data set and/or a control command, which is used to directly or supportively control, align, train and/or monitor the handling system, and/or wherein this information data set is used to align, train or monitor the handling system.

    21. Method according to claim 20, wherein the at least one image taken by an acquisition unit and/or from a database is fed to the artificial neural network.

    22. Method according to claim 20, wherein the information data set contains information about the presence or position or orientation of an object in the image, or about the type of object in the image, in particular the presence of trays, cassettes, or of parts, of markings, stickers, labels or reference marks, or about possible obstacles in the movement area of the handling system, such as doors or load locks, or about the presence of processing stations or about the spacing between at least one object in the image to a reference point of the handling system, or about the position of the object in the image, or about the dimensions of the object in the image, wherein the objects may also be substrates or parts of substrates.

    23. Method according to claim 20, wherein images are captured and stored during the application of the artificial neural network in order to use these in at least one initial or at least one new learning process to improve the result of the artificial neural network.

    24. Method according to claim 20, wherein geometric methods, preferably methods of triangulation, are used to determine the position and/or orientation and/or spacing and/or dimensions of the object, wherein data of the information data set generated by the trained artificial neural network is used.

    25. Handling system, system or machine for processing substrates in the semi-conductor industry, in particular for processing wafers, masks or flat panel displays, designed to execute a method according to claim 1, preferably comprising: a semi-conductor industry machine, a robot, a moving element, preferably a robot arm and/or an end effector, or a positioning unit, a processing unit, which comprises at least one trained artificial neural network, and an acquisition unit for capturing at least one image.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    [0215] The invention is described in more detail below using preferred embodiments and referring to the accompanying figures.

    [0216] FIG. 1 a schematic example of a design with an opened cassette to accommodate wafers with correctly arranged wafers and an acquisition unit,

    [0217] FIG. 2 a schematic image of an opened cassette from FIG. 1,

    [0218] FIG. 3 a schematic depiction of the process of training and teaching the artificial neural network using a design example of a cassette populated with wafers,

    [0219] FIG. 4 a schematic setup of a “CNN” network of the cassette occupancy in a very simplified representation,

    [0220] FIG. 5a a schematic application of the trained artificial neural network for detection of a cassette occupancy,

    [0221] FIG. 5b a schematic section of a classification table based on a design example,

    [0222] FIG. 6 a schematic design example of another setup with an acquisition unit and a lighting unit,

    [0223] FIG. 7 a schematic design example for a suitable software architecture,

    [0224] FIG. 8 a schematic design example for a computer architecture,

    [0225] FIG. 9 a schematic design example of an opened cassette with wafers in the correct position and in an incorrect position,

    [0226] FIG. 10 a schematic design example of a receptacle for wafers with a wafer placed in the correct position,

    [0227] FIGS. 11a, 11b a schematic design example of a receptacle for wafers with a wafer placed in the correct position (FIG. 11a) and in an incorrect position (FIG. 11b),

    [0228] FIGS. 12a, 12b another schematic design example of a receptacle for wafers with a wafer placed in the correct position (FIG. 12a) and in an incorrect position (FIG. 12b),

    [0229] FIGS. 13a-13f another schematic design example of a receptacle for wafers with different pellicle designs,

    [0230] FIGS. 14a-14f a schematic design example of an image recognition based on wafers with different pellicle designs,

    [0231] FIGS. 15a, 15b another schematic design example with different arrangement of the acquisition unit,

    [0232] FIG. 16 a schematic view of a system for processing semiconductor elements,

    [0233] FIG. 17 a design example with a single camera housing with two cameras,

    [0234] FIG. 18 a design example in which a camera with integrated lighting is integrated in an end effector,

    [0235] FIG. 19 another design example of the invention in which the cameras are arranged on the arms of the end effector,

    [0236] FIG. 20 a side view and a top view from above of the key components of a semi-conductor industry machine for processing semiconductor elements,

    [0237] FIG. 21 the system shown in FIG. 20 at rest in a top view,

    [0238] FIG. 22 the system shown in FIG. 20 in an operating state in a top view,

    [0239] FIG. 23 the system shown in FIG. 20 in another operating state in a top view,

    [0240] FIGS. 24a, 24b each a design example of a substrate with a superstructure using the example of a mask,

    [0241] FIGS. 25a, 25b each a design example of a substrate with a substrate holder using the example of a wafer, and

    [0242] FIG. 26 a design example with an upper and lower shell.

    DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

    [0243] In the following detailed description of preferred embodiments, the same reference signs designate essentially the same parts in or on these embodiments for the sake of clarity. To better illustrate the invention, however, the preferred embodiments shown in the figures are not always drawn to scale.

    [0244] The invention generally concerns a semi-conductor industry machine for processing a substrate, in particular a wafer, a photolithographic mask or a flat panel display. The processing can concern the transport of a substrate between two or more stations, for example a deposit location or a processing station, or also the processing of a substrate.

    [0245] The invention also concerns a handling system, wherein the handling system comprises a robot or a moving element, preferably a robot arm and/or an end effector, and/or a positioning unit, and wherein the handling system preferably has means for gripping, transporting and/or depositing a substrate. The handling system can concern the transport of a substrate between two or more stations, for example a deposit location or a processing station, or also the processing of a substrate.

    [0246] According to the invention, at least one image depicting a location in or on the handling system or in the environment of the handling system is captured in a digitized form by an artificial neural network,

    [0247] and the artificial neural network analyzes the image and generates an information data set and/or a control command, which directly or supportively controls, aligns, trains and/or monitors the handling system.

    [0248] The image can also be fed to the artificial neural network from a database.

    [0249] FIG. 1 shows a schematic design example with a storage 10 in the form of an opened cassette for receiving substrates 12, in the design example for the reception of wafers, in a top view.

    [0250] The example only shows one section of the storage 10, which has a total of 25 individual deposit locations 11 or slots for receiving an individual substrate 12 each.

    [0251] The design example only shows 9 such deposit locations 11. Of the deposit locations 11, five deposit locations 11 are occupied with substrates 12 in the example and four deposit locations 11 are not occupied. In the design example, all deposit locations 11 that are occupied are correctly occupied. In other words, all wafers 12 are correctly and flawlessly deposited.

    [0252] The computer architecture and the semi-conductor industry machine are not shown in the design example.

    [0253] The semi-conductor industry machine may comprise a robot that can comprise a moving element, preferably a robot arm and/or an end effector. In addition, the semi-conductor industry machine can also comprise a positioning unit that can be used to move or transport the substrate. The substrate can not only comprise wafers as in the design example, but also, for example, masks or flat panel displays.

    [0254] As a result, the semi-conductor industry machine can comprise a device for picking up and/or moving and/or depositing a substrate. The design example depicted accordingly shows a section of a system as per the invention.

    [0255] The dashed line 21 marks the recording area of the acquisition unit 20. In the example, the acquisition unit 20 is a camera.

    [0256] The acquisition unit 20 records the depicted image area, i.e. in the example the entire opened cassette, and generates a digital representation, which is provided to the artificial neural network.

    [0257] In addition to the imaging camera shown, the acquisition unit 20 may comprise an optical sensor, an ultrasonic sensor, a distance sensor, a reflex sensor, a radar sensor, a video camera, or an infrared camera.

    [0258] The generated image is fed to an artificial neural network to determine the presence and/or position and/or the orientation of the substrate and, based on this, the artificial neural network creates an information data set comprising information about the determined presence and/or position and/or orientation and/or type of the substrate and/or a control command, which is used to directly control the semi-conductor industry machine, or that is used by the machine's control system or is passed on to a higher-level control system.

    [0259] The information data set can additionally or alternatively be passed on to a user who uses this information to draw conclusions for his actions operating the machine, or who can pass on this information to control systems or other users, or the information data set can be used for later or further evaluation.

    [0260] The information data set accordingly comprises information

    [0261] to make it possible to determine or derive the presence of the substrate in a slot, or on a tray, or on a robot, or on an end effector, or on a processing station in the machine, or at another location where the mask may be located in the semi-conductor industry machine,

    [0262] and/or the spatial orientation of the substrate with respect to the side of a slot facing a machine robot, or in relation to a tray, or in relation to a processing station, or in relation to another part of the machine,

    [0263] and/or the spatial location of the substrate with respect to a slot, or to a tray, or to a robot, or to an end effector, or to another location where the substrate may be located in the semi-conductor industry machine,

    [0264] and/or the type of substrate,

    [0265] and/or the presence of a pellicle on a substrate, in particular on a mask,

    [0266] and/or the side of a deposited substrate, in particular a mask, on which a pellicle is located,

    [0267] and/or the orientation of a pellicle in relation to a reference axis of a tray, a cassette, a robot, an end effector,

    [0268] and/or the presence and/or the position and/or the orientation of a substrate.

    [0269] It therefore makes it possible to detect incorrect positions of the substrate 12, comprising

    [0270] whether the incorrect position concerns a substrate 12 located over several layers of a tray, and/or

    [0271] whether the incorrect position concerns at least two substrates 12 located directly on top of each other, and/or

    [0272] whether the incorrect position concerns a deviation from a specified target position of a substrate 12, and/or

    [0273] whether the incorrect position concerns a substrate 12 not correctly positioned on all provided support points of a tray.

    [0274] The substrate 12 can, for example, be located in cassettes, as shown in the design example, and/or on trays and/or on an end effector and/or on a chuck and/or on a processing station of the machine and/or on a positioning unit.

    [0275] The artificial neural network can also record more than one image, in particular at least two images, preferably of the same location, but which are taken by different acquisition units and/or acquisition units arranged differently.

    [0276] FIG. 2 schematically shows an image or an overall image 30 of the opened cassette from FIG. 1 in a top view.

    [0277] The image 30 is divided into individual images in the example, each showing a deposit location 11 or a slot. In the example shown, the lowermost six slot images 31a, 31b, 31c, 31d, 31e and 31f are labeled for the sake of clarity.

    [0278] FIG. 3 shows a schematic depiction of the process of training the artificial neural network using a design example of a cassette populated with substrates 12, in the example with wafers. The individual steps are marked (1) to (8) and are shown in the following.

    [0279] In a first step (1), one or more images are taken of the opened cassette with 25 deposit locations for wafers and with different slot occupancies. For this purpose, images can also be taken from different viewing angles.

    [0280] In a second step (2), the 25 individual slot images in the example are created from this. This can be done manually through image sections of the overall image 30. However, it is also possible, for example, to create image areas of the overall image 30 via "bounding boxes," which can then be processed by the artificial neural network. The image sections can easily be varied in size; the ideal size depends, among other factors, on the configuration of the artificial neural network.
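The sectioning of the overall image 30 into individual slot images can be sketched as follows. This is a minimal illustration in Python using numpy only; it assumes a fixed camera/cassette geometry in which each slot occupies an equal horizontal band of the frame, and the image dimensions are invented for the example. A real setup would use calibrated image sections or "bounding boxes" as described above.

```python
import numpy as np

def slice_slot_images(overall_image, n_slots=25):
    """Cut a cassette image into equal-height horizontal slot sections.

    Assumes the slots are stacked vertically and evenly spaced in the
    frame, a simplification of the fixed camera/cassette geometry.
    """
    height = overall_image.shape[0]
    slot_height = height // n_slots
    return [overall_image[i * slot_height:(i + 1) * slot_height]
            for i in range(n_slots)]

# Example: a synthetic 500x300 grayscale image yields 25 sections of 20 rows.
image = np.zeros((500, 300), dtype=np.uint8)
sections = slice_slot_images(image)
```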

    [0281] In a third step (3), labeling occurs, i.e. an allocation to classes: occupied slot or open slot. This classification is done manually. For example, the images can be stored in different folders, which are assigned to the classes. However, it is also possible, for example, to allocate the labels as well as the “bounding boxes” to separate files each assigned to the individual images. If additional classifications, such as the type of substrate, and/or “cross-slotted” wafer, and/or “double slotted” wafer, are to be determined, these classes are also defined here.
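The folder-based labeling described in the third step can be sketched as follows. The class names `slot_empty` and `slot_full` are taken from the design example (FIG. 5b); the directory layout and file names are assumptions for illustration.

```python
import tempfile
from pathlib import Path

# Class names from the design example; the folder-per-class layout is assumed.
CLASSES = ["slot_empty", "slot_full"]

def label_from_folders(root):
    """Collect (image file name, class index) pairs from per-class folders."""
    root = Path(root)
    samples = []
    for idx, name in enumerate(CLASSES):
        for img in sorted((root / name).glob("*.png")):
            samples.append((img.name, idx))
    return samples

# Demonstration with one empty placeholder "image" file per class folder.
root = Path(tempfile.mkdtemp())
for name in CLASSES:
    (root / name).mkdir()
    (root / name / "slot_01.png").touch()

samples = label_from_folders(root)
```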

    [0282] In a fourth step (4), the different expressions of the parameters are generated via augmentation of the slot images. In the design example, the images are augmented in the brightness parameter. In other words, the image brightness is varied for training purposes in order to incorporate fluctuations in image brightness into the training when applying the artificial neural network. The scaling is also augmented. In other words, the size of the image is varied, similar to zooming in or out, to be able to incorporate fluctuations in the size of the depicted slots and wafers into the training when applying the artificial neural network, for example due to small, tolerance-induced differences in the spacing of the wafers and slots from the acquisition unit.

    [0283] It is not mandatory to perform these types of augmentation. The number of images for training the artificial neural network in the design example is about 150. However, in the case of several parameters that should be taken into consideration, and/or in the case of several possible expressions, significantly more images may be helpful, which may amount to 10,000 images, for example, depending on the application.
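The two augmentations named in the fourth step, brightness variation and scaling, can be sketched with numpy only. The value ranges and the crop-based "zoom" are assumptions for illustration; a real pipeline would typically resize the crop back to the original size (for example with Pillow or OpenCV) or use a library's own augmentation facilities.

```python
import numpy as np

rng = np.random.default_rng(0)

def augment_brightness(image, low=0.8, high=1.2):
    """Scale pixel intensities by a random factor to simulate lighting changes."""
    factor = rng.uniform(low, high)
    return np.clip(image.astype(np.float32) * factor, 0, 255).astype(np.uint8)

def augment_scale(image, crop_fraction=0.9):
    """Crop a centered window and treat it as a slightly zoomed-in view.

    A real pipeline would resize the crop back to the input size; that
    step is omitted here to stay dependency-free.
    """
    h, w = image.shape[:2]
    ch, cw = int(h * crop_fraction), int(w * crop_fraction)
    top, left = (h - ch) // 2, (w - cw) // 2
    return image[top:top + ch, left:left + cw]

slot = np.full((20, 300), 128, dtype=np.uint8)   # synthetic slot image
bright = augment_brightness(slot)
zoomed = augment_scale(slot)
```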

    [0284] In a fifth step (5), the images are then divided into images for training ("Train" images) and for testing and validating ("Test" images) the iterations of the training process (FIGS. 5a, 5b). However, this allocation can also be automatically controlled by a program, wherein the artificial neural network uses different images in the individual training runs ("Train" images) and tests and verifies the result with other images ("Test" images).
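The division into "Train" and "Test" images in the fifth step can be sketched as a simple shuffled split; the 80/20 ratio and the file names are assumptions for illustration.

```python
import random

def train_test_split(samples, test_fraction=0.2, seed=42):
    """Shuffle and split a sample list into training and test subsets."""
    items = list(samples)
    random.Random(seed).shuffle(items)   # fixed seed keeps the split reproducible
    n_test = int(len(items) * test_fraction)
    return items[n_test:], items[:n_test]

# About 150 images, as in the design example.
samples = [f"slot_{i:03d}.png" for i in range(150)]
train, test = train_test_split(samples)
```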

    [0285] In a sixth step (6), the artificial neural network is configured or created. In this example, a "CNN" network is trained, i.e. a "convolutional" neural network, containing convolutional and neuronal layers, among others. In the example, this network has two output neurons that stand for the two defined classes (occupied slot and open slot). If additional classes are required, for example the "cross slotted" and/or "double slotted" wafer classes, and/or wafers with a substrate holder, corresponding neurons must be added.

    [0286] The learning process occurs in the seventh step (7). The network (6) performs the learning process with the set learning parameters (such as the number of iterations, also called "epochs," the learning rate and other parameters known for CNN networks).

    [0287] As a result of the learning process, the so-called “weights file” is finally created in the eighth step (8), containing the parameterization of the individual elements and connections of the CNN network.
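The seventh and eighth steps, iterating over "epochs" with a set learning rate and finally persisting a "weights file," can be illustrated with a deliberately tiny stand-in model. This sketch trains a single-layer logistic classifier on synthetic data in plain numpy rather than an actual CNN, so the learning loop and the saved parameterization stay inspectable; all data and hyperparameters are invented for the example.

```python
import os
import tempfile
import numpy as np

rng = np.random.default_rng(1)

# Synthetic stand-in data: 100 flattened "slot image" feature vectors with
# 10 values each; class 1 is simply brighter on average (invented data).
X = rng.normal(size=(100, 10))
y = (X.mean(axis=1) > 0).astype(np.float64)

w = np.zeros(10)
b = 0.0
learning_rate = 0.1   # learning parameter, step (7)
epochs = 200          # number of iterations ("epochs"), step (7)

losses = []
for _ in range(epochs):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))    # sigmoid prediction
    p_safe = np.clip(p, 1e-7, 1 - 1e-7)
    losses.append(float(-np.mean(y * np.log(p_safe)
                                 + (1 - y) * np.log(1 - p_safe))))
    grad = p - y                               # cross-entropy gradient w.r.t. logits
    w -= learning_rate * (X.T @ grad) / len(y)
    b -= learning_rate * float(grad.mean())

# Step (8): persist the learned parameterization as a "weights file".
fd, weights_path = tempfile.mkstemp(suffix=".npz")
os.close(fd)
np.savez(weights_path, w=w, b=b)
```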

    [0288] FIG. 4 schematically shows the setup of this “CNN” network or a model of this network using the example of the cassette occupancy in a very simplified representation.

    [0289] For this purpose, the so-called "Keras" platform is used; however, a variety of other platforms can be used, such as "PyTorch" or others.

    [0290] The designations (1) to (5) specify the different layers or hierarchical levels in FIG. 4.

    [0291] (1) Input layer: this is where the image is imported into the network.

    [0292] (2) Convolutional layer(s): a convolutional process with different filters is applied to the image. In the design example, two convolutional layers are used, with a filter size of 3×3, 32 and 64 filters, ReLU activation function, and max pooling with size 2×2. However, a variety of other configurations is possible, for example with respect to the number of layers, the number and size of the filters, or the activation functions.

    [0293] (3) Flatten layer: conversion of the three-dimensional result of the last convolutional layer into a one-dimensional format for transfer to the neuronal layer.

    [0294] (4) Neuronal layer: two neuronal layers with 128 and 64 neurons and ReLU activation function are applied in the example. Dropouts were used to avoid saturation. However, a variety of other configurations is possible, for example with respect to the number of layers, the number of neurons, the activation functions or the dropouts. It is also possible to forgo the neuronal layers entirely, which also concerns the classification layer; in this case, the classification can also be generated by a convolutional layer.

    [0295] (5) Neuronal layer for classification: two neurons for the classes "slot occupied" and "slot open." In the learning process, the network parameters are optimized through an iterative target-actual comparison of the created classification. These parameters are then used in the application of the classification. A softmax activation was used to generate a float value representing the image's affiliation with the two classes (similar to a "scoring": slot open, slot occupied).
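The simplified network of FIG. 4 can be traced layer by layer in a short numpy sketch of the forward pass: two 3×3 convolutional layers with 32 and 64 filters, ReLU activations, 2×2 max pooling, a flatten step, two neuronal layers with 128 and 64 neurons, and a final softmax layer with two output neurons. The weights here are random (untrained) and the 28×28 input size is an assumption for illustration; the design example itself uses the Keras platform rather than hand-written numpy.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv2d(x, filters):
    """'Valid' convolution of an (H, W, C) image with (k, k, C, F) filters."""
    k = filters.shape[0]
    H, W, _ = x.shape
    out = np.empty((H - k + 1, W - k + 1, filters.shape[3]))
    for i in range(H - k + 1):
        for j in range(W - k + 1):
            patch = x[i:i + k, j:j + k, :]
            out[i, j] = np.tensordot(patch, filters, axes=([0, 1, 2], [0, 1, 2]))
    return out

def maxpool(x, size=2):
    """2x2 max pooling; trailing rows/columns that do not fit are trimmed."""
    H, W, C = x.shape
    H2, W2 = H // size, W // size
    x = x[:H2 * size, :W2 * size]
    return x.reshape(H2, size, W2, size, C).max(axis=(1, 3))

def relu(x):
    return np.maximum(x, 0.0)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Random (untrained) parameters; a 28x28 single-channel input is assumed.
image = rng.normal(size=(28, 28, 1))
conv1 = rng.normal(size=(3, 3, 1, 32)) * 0.1        # (2) first convolutional layer
conv2 = rng.normal(size=(3, 3, 32, 64)) * 0.1       # (2) second convolutional layer
dense1 = rng.normal(size=(5 * 5 * 64, 128)) * 0.01  # (4) neuronal layer, 128 neurons
dense2 = rng.normal(size=(128, 64)) * 0.1           # (4) neuronal layer, 64 neurons
dense3 = rng.normal(size=(64, 2)) * 0.1             # (5) classification layer

h = maxpool(relu(conv2d(image, conv1)))   # 28x28 -> 26x26x32 -> 13x13x32
h = maxpool(relu(conv2d(h, conv2)))       # -> 11x11x64 -> 5x5x64
v = h.reshape(-1)                         # (3) flatten layer, 1600 values
v = relu(v @ dense1)
v = relu(v @ dense2)
probs = softmax(v @ dense3)               # scores for "slot open"/"slot occupied"
```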

    [0296] FIG. 5a schematically shows the application of the trained artificial neural network to detect a cassette occupancy, and FIG. 5b schematically shows a section of a data set created by the application, with assignments of the individual slot images to the classes "slot_empty" (slot without wafer) and "slot_full" (slot with wafer) based on the design example, wherein the assignment is shown with values between 0 and 1.

    [0297] The individual steps (1) to (3) are as follows:

    [0298] (1) An image is taken of the opened cassette.

    [0299] (2) The 25 individual slot images are created. The corresponding image sections of the 25 individual slots are automatically created from the image of the opened cassette. This is easily possible in the design example shown, and also in a practical application, if there is a fixed geometric assignment of camera and cassette via pre-defined positions. It may also be convenient here that the acquisition unit is located on the end effector of a robot, which can be moved into the correct position for frontal image acquisition of the cassette.

    [0300] However, it is also possible to determine the position of the cassette or the slots using object detection methods, for example, which comprise location recognition, and to derive the image sections from this.

    [0301] (3) As described further above with the steps "Training the network" and "Setup of the network," the CNN network is parameterized with the "weights file" generated when teaching the network. The slot images are individually analyzed one after the other in the artificial neural network, wherein the filtering in the convolutional layers and the linking in the neuronal layers generated in the learning process are applied to the individual images, and wherein the assignment of the slot image to the two classes (open slot, occupied slot) is done via the last neuronal layer by the neurons assigned to the classes. This process is also called classification or prediction.
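A classification table like the one in FIG. 5b can be post-processed into a slot occupancy map by assigning each slot image the class with the highest softmax score. The score values below are invented for illustration.

```python
# Hypothetical per-slot class scores as produced by the softmax layer,
# ordered [slot_empty, slot_full] (compare the table in FIG. 5b).
scores = {
    1: [0.98, 0.02],
    2: [0.03, 0.97],
    3: [0.91, 0.09],
}

CLASSES = ["slot_empty", "slot_full"]

def classify(scores):
    """Assign each slot the class with the highest softmax score (argmax)."""
    return {slot: CLASSES[max(range(len(s)), key=s.__getitem__)]
            for slot, s in scores.items()}

occupancy = classify(scores)
```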

    [0302] FIG. 6 schematically shows a design example of another setup with an acquisition unit 20, a lighting unit 50 and a computer architecture 40 in a top view.

    [0303] The acquisition unit 20 can be permanently integrated or mounted in or on the machine, as shown in the design example, or be temporarily integrated or mounted in or on the machine. In the design example, the acquisition unit 20 is permanently arranged in the environment of the machine. The computer architecture 40 can be connected by wire, in this case via cables 41. Of course, wireless connections between the computer architecture 40, acquisition unit 20 and/or lighting unit 50 are also possible.

    [0304] The acquisition unit 20 can be installed in a fixed location with respect to the machine and/or the location of the image acquisition, as shown in the design example. The acquisition unit 20 or another, second acquisition unit 20 can also be arranged on moving elements of the machine, preferably on a robot arm or on an end effector of a robot or on a positioning unit.

    [0305] In the design example shown, the images for training and for the application were taken with a camera and transferred to the computer architecture.

    [0306] However, it is also possible, for example, to use existing stored images from databases for training purposes. It is also possible to directly connect the acquisition unit 20 for capturing images and recording videos for learning purposes and for applying the artificial neural network to the computer architecture, or to use an acquisition unit 20 connected via a network.

    [0307] For certain application cases, room lighting and therefore the ambient light may already be sufficient. In general, however, it is helpful if a lighting unit 50 is provided as shown in the design example. This is especially useful if images are to be taken within a machine that is generally closed in which there is no ambient light.

    [0308] The lighting device 50 is provided to emit electromagnetic radiation 51 in the direction of the substrate or the location in or on the machine, preferably at the time an image is being captured. The electromagnetic radiation 51 is preferably located at least in the visible wavelength range and/or in a wavelength range in which the acquisition unit is sensitive. The lighting device 50 may be a lamp, for example a lamp with LEDs or a halogen lamp.

    [0309] The lighting device 50 can be permanently or temporarily integrated in the machine and/or mounted on the machine, or be installed in a fixed location on the machine, as in this design example. Alternatively or additionally, the lighting device 50 may be located on moving elements of the machine, preferably on a robot arm or on an end effector of a robot or on a positioning unit.

    [0310] The computer architecture 40 may have different computer configurations, for example a PC, an industrial PC, an embedded controller such as a Raspberry Pi, or a programmable logic controller (PLC).

    [0311] It is understood that these computer configurations should be sized accordingly to be able to provide the required computing power. Ideally, the image analysis by the artificial neural network occurs in real time or at least in near real time. It is also understood that the corresponding communication interfaces should be available.

    [0312] FIG. 7 shows a schematic design example for a suitable software architecture. It is to be taken into consideration here that a plurality of operating systems, environments and libraries are available, which may be useful for the method as per the invention. The software architecture shown is therefore just one design example among several possible design examples.

    [0313] In the design example, the software architecture comprises an “OpenCV” library. This is used to import images, to process images or to generate image sections. Alternatively, other libraries or methods can also be used for this task.

    [0314] Furthermore, the software architecture comprises a “Keras” library. This contains the program packages of the artificial neural network used for the design example, such as for the creation and application of the CNN model, or for the implementation of the learning process. Alternatively, other libraries can also be used, such as “Pytorch,” “Caffe” or similar.
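    To illustrate what such a library computes internally — the filtering in a convolutional layer followed by the linking in a neuronal layer, as described for the design example — a minimal pure-Python sketch with made-up weights (in the real method the values come from the “weights file”) could look as follows:

```python
import math

# Minimal illustration (not the patent's actual network): one small
# convolutional filter with ReLU activation, followed by a dense
# ("neuronal") layer whose two output neurons correspond to the
# classes "slot_empty" and "slot_full". All weights are invented.

def conv2d(img, kernel):
    """Valid 2D convolution followed by a ReLU activation layer."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(img), len(img[0])
    out = []
    for i in range(h - kh + 1):
        row = []
        for j in range(w - kw + 1):
            s = sum(img[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            row.append(max(s, 0.0))  # ReLU
        out.append(row)
    return out

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    t = sum(es)
    return [e / t for e in es]

img = [[0.0] * 6 for _ in range(6)]
for r in (2, 3):                      # toy "slot with wafer" image
    img[r] = [1.0] * 6
kernel = [[1.0], [-1.0]]              # vertical edge filter
features = [v for row in conv2d(img, kernel) for v in row]
w_empty = [-1.0] * len(features)      # invented class-neuron weights
w_full = [1.0] * len(features)
scores = softmax([sum(a * b for a, b in zip(w_empty, features)),
                  sum(a * b for a, b in zip(w_full, features))])
print(scores)                         # (p_slot_empty, p_slot_full)
```

    In Keras, the same structure would be expressed declaratively with `Conv2D` and `Dense` layers; the sketch only makes the filtering and linking explicit.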

    [0315] Moreover, the software architecture in the design example comprises a “Python” environment for programming. Alternatively, other programming environments or programming languages can be used, such as C++, Java or MATLAB.

    [0316] The aforementioned components of the software architecture are also referred to as a processing unit comprising the artificial neural network.

    [0317] Finally, the software architecture in the design example is based on a “Windows” operating system.

    [0318] Alternatively, other operating systems can also be used, such as Linux, macOS for Apple computers or Raspbian for Raspberry Pi computers.

    [0319] FIG. 8 schematically shows a design example of a computer architecture setup and its integration into a semi-conductor industry machine, which is marked in the example with a dashed line 80.

    [0320] In the sense of the invention, it is therefore possible and also provided to centrally or decentrally implement the computer architecture or the associated components, comprising the processing unit, the computer hardware and the computer's operating system, in a suitable way.

    [0321] The components of the processing unit, the hardware components, the programs, databases, the operation, data storage, or also other components can either be centrally located on a computer here, or be spread out across several computers or different locations.

    [0322] This also applies to the individual process steps associated with the artificial neural network, such as the generation of data for the learning process, training or application of the method for processing substrates.

    [0323] An example is used to illustrate the integration of the artificial neural network for detecting masks and pellicles.

    [0324] In the example, a camera is provided on an end effector 81. The at least one camera captures images that are used to train the artificial neural network, as well as images that are needed during system operation to detect masks/pellicles.

    [0325] The images for training purposes are transferred to another, external computer 88, on which the processing unit is installed and where the CNN model is located in the example. After training, the “weights file” is transferred to the robot controller 83, where a processing unit is also installed and the CNN model is located.

    [0326] In the application, i.e. during operation of the semi-conductor industry machine 80, the camera in the end effector 81 imports the images and transfers them to the robot controller 83, where the artificial neural network then performs the analysis.

    [0327] As a result, an information data set is generated with information regarding whether a mask is present or not, whether a pellicle is present or not, and possibly the location and orientation of the pellicle. This information is then sent to the system controller 85, where it is used by the machine control system user software running there.
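    The information data set generated in this step could, purely as an illustration, be packaged as follows before being sent to the system controller 85; the field names are assumptions, not taken from the patent:

```python
from dataclasses import dataclass, asdict
from typing import Optional

# Hypothetical structure for the information data set of paragraph [0327];
# the field names are illustrative, not the patent's actual format.
@dataclass
class MaskInfo:
    mask_present: bool
    pellicle_present: bool
    pellicle_location: Optional[str] = None    # e.g. "top_transverse"
    pellicle_orientation_deg: Optional[float] = None

def to_controller_message(info: MaskInfo) -> dict:
    """Serialize the data set for transfer to the system controller."""
    return asdict(info)

msg = to_controller_message(MaskInfo(True, True, "top_transverse", 0.0))
print(msg)
```

    The resulting dictionary could be forwarded to the machine control system user software, for example as JSON.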

    [0328] Accordingly, the processing unit may be integrated in the robot 82, in or on the end effector 81 with integrated processing unit, but also in the robot controller 83 with integrated processing unit, in a robot handling system 84 or in the robot system 91 with integrated processing unit, in a system controller 85 with integrated processing unit, in the semi-conductor industry machine 86 with integrated processing unit, in a control system 87 with integrated processing unit not assigned to the machine, such as a factory control system, or another computer 88 with integrated processing unit or in a cloud 89.

    [0329] Various examples are specified below, which are intended to illustrate the different types of typical incorrect positions of substrates.

    [0330] For example, FIG. 9 schematically depicts the design example from FIG. 1 with an opened cassette as storage 10 with a total of three substrates 12 in the correct position each in a separate deposit location 11. The reference signs 121 and 122 each denote a substrate 12 in an incorrect position. In the case of substrate 121, this is a wafer that is positioned in a so-called “cross-slotted” position, i.e. it is located skewed over two deposit locations. In the case of substrate 122, this is two wafers that are positioned in a so-called “double-slotted” position in one deposit location 11. In both cases, these are incorrect positions, which prevent further processing of the substrate and may require the user to intervene manually.

    [0331] To detect the incorrect positions shown, for example:

    [0332] slot images with correct wafer placement are taken,

    [0333] slot images with the incorrect “cross-slotted” position are taken, and

    [0334] slot images with the incorrect “double-slotted” position are taken.

    [0335] The model of the artificial neural network can be defined with the classes “slot without wafer,” “slot with correctly placed wafer,” “slot with double-slotted wafer” and “slot with cross-slotted wafer.” The model can be trained accordingly and the “weights file” can be created. In this way, the artificial neural network can be used to detect the cassette mapping and incorrect positions in machine operation.
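    The four classes can be prepared for training in the usual way by one-hot encoding the class labels; the class identifiers and file names below are illustrative assumptions:

```python
# Sketch of the four-class labeling for the incorrect-position detector.
# The class meanings follow paragraph [0335]; the identifier strings and
# file names are invented for illustration.
CLASSES = [
    "slot_empty",            # slot without wafer
    "slot_correct",          # slot with correctly placed wafer
    "slot_double_slotted",   # two wafers in one deposit location
    "slot_cross_slotted",    # wafer skewed over two deposit locations
]

def one_hot(label):
    """Encode a class label as the target vector for CNN training."""
    vec = [0.0] * len(CLASSES)
    vec[CLASSES.index(label)] = 1.0
    return vec

# Hypothetical training pairs: (slot image file, class label).
samples = [("img_0001.png", "slot_correct"),
           ("img_0002.png", "slot_cross_slotted")]
targets = [one_hot(label) for _, label in samples]
print(targets)
```

    Training on such pairs produces the “weights file”, which then lets the network distinguish empty, correct, double-slotted and cross-slotted slots in operation.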

    [0336] FIG. 10 schematically shows a design example of a tray 100 for substrates 12 with substrate 12 placed in the correct position in an oblique view. The tray 100 comprises a total of four position points 101 placed on the corners, on which the substrate 12, a mask in the example, should be located when positioned correctly.

    [0337] FIGS. 11a and 11b schematically depict a design example of the support for wafers with a wafer placed in the correct position (FIG. 11a) and in an incorrect position (FIG. 11b), which is based on the tray 100 from FIG. 10, in a top view. The example is intended to illustrate the problems prevalent in the state of the art when using rule-based image processing to detect a mask.

    [0338] As shown in the example, a rule-based method requires that a discrete area be defined, which in the example is marked with the dashed border 110. While the placed substrate 12 in FIG. 11a can be detected using the rule-based image recognition system, the incorrect position in FIG. 11b leads to the substrate 12 not being detected, since the substrate 12 is located outside of the defined, discrete area 110.

    [0339] FIGS. 12a and 12b show another schematic design example of a receptacle for wafers with a wafer placed in the correct position (FIG. 12a) and in an incorrect position (FIG. 12b), wherein an artificial neural network is used instead of rule-based image recognition. This may be a CNN network as described further above, or a different network. A schematic image section 120 of the entire mask can be used for teaching and detection during image recognition via artificial neural network as per the invention. This makes it possible to also detect a substrate 12 that significantly deviates from its target position. In this example, the substrate 12 has sufficient characteristics of a typical substrate 12, which permits an allocation, even in the case of a clear incorrect position.

    [0340] The mask in the example of FIG. 12b is thus detected, even though it is in an incorrect position. Depending on the design of the artificial neural network, the mask can also be classified as being in an incorrect position.

    [0341] FIGS. 13a, 13b, 13c, 13d, 13e and 13f schematically depict different typical embodiments of a tray 100 for substrates 12 in an oblique view. In this example, the substrate 12 is a mask, which is shown without pellicle (FIG. 13b) or with pellicle 13 (FIGS. 13c to 13f). FIG. 13a shows the tray 100 without substrate 12.

    [0342] In FIG. 13c, the pellicle 13 is arranged on top of the mask transverse to the front side 102 of the tray 100. In other words, the mask with the pellicle is oriented so that the pellicle is located transversely on the top side. In FIG. 13d, the pellicle 13 is arranged on top longitudinally on the mask.

    [0343] In FIG. 13e, the pellicle 13 is arranged below the mask transversely to the front side 102 of the tray 100. In FIG. 13f, finally, the pellicle 13 is arranged below the mask longitudinally. The designations “on top” or “below” refer to a deposited substrate, wherein “below” then refers to the bottom. The designations “longitudinally” and “transversely” refer to the front side 102 of the tray 100.

    [0344] FIGS. 14a, 14b, 14c, 14d, 14e and 14f correspondingly show, in schematic form, the respective image sections 130 that the artificial neural network uses to detect pellicles. It would also be possible and sufficient to use parts of the image sections shown, provided these clearly describe the situation. For example, it would also be sufficient to use the right or left half of the images.

    [0345] In the example of FIG. 14a, corresponding to the example of FIG. 13a, it is detected that no substrate 12 is present on the tray 100. In the example of FIG. 14b, a substrate without pellicle is detected, corresponding to the example of FIG. 13b.

    [0346] The examples explained above show, based on just two simple parameters, “substrate present/not present” and “pellicle present/not present,” or in which position a detected pellicle is located, the complexity that a rule-based method would entail, in which each position would have to be stored in corresponding rules.

    [0347] If a system parameter then changes, for example because the ambient lighting conditions change when the machine is used in a different environment, or because material or optical properties of substrates or pellicles change, all of the rules would have to be redefined and stored again, which can lead to a tremendous amount of work and expense.

    [0348] So, for example, it is conceivable that a new end effector is installed on the semi-conductor industry machine, which also has an additional controllable rotational axis and which makes it possible to grip and transport even slightly skewed substrates 12. If this makes it possible, for example, to grip substrates up to a skewed position of the substrate at an angle of up to 2°, 5° or 7° with respect to the horizontal, to correct the skewed position or to transport them, then such new rules cannot be defined and stored easily in a rule-based system.

    [0349] The use of an artificial neural network in the sense of the invention makes it possible, on the other hand, to use some images showing the corresponding skewed positions of the substrate 12, and/or changing system parameters, to re-train or additionally train the artificial neural network and thus to significantly and quickly increase the semi-conductor industry machine's efficiency.
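    As a minimal stand-in for such re-training, the following sketch additionally trains a single logistic “neuron” with existing weights on a few new samples (e.g. features derived from images of slightly skewed substrates); a real network would update its full weights file in the same spirit, just with many more parameters:

```python
import math

# Illustrative re-training of paragraph [0349]: instead of redefining
# rules, existing weights are additionally trained on new samples.
# The features, labels and weights here are invented for illustration.

def predict(w, x):
    """Logistic output of a single neuron."""
    z = sum(wi * xi for wi, xi in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))

def retrain(w, new_samples, lr=0.5, epochs=500):
    """Continue gradient-descent training from existing weights."""
    w = list(w)
    for _ in range(epochs):
        for x, y in new_samples:
            err = predict(w, x) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
    return w

w_old = [0.1, -0.2]                            # weights from earlier training
new = [([1.0, 0.9], 1.0),                      # new "skewed but grippable" sample
       ([1.0, 0.1], 0.0)]                      # new "not grippable" sample
w_new = retrain(w_old, new)
print(predict(w_new, [1.0, 0.9]), predict(w_new, [1.0, 0.1]))
```

    After the additional training, the updated weights classify the new situations correctly without any rule having been written.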

    [0350] FIGS. 15a and 15b schematically depict another design example with a different arrangement of the acquisition unit 20 in an oblique view.

    [0351] In the design example of FIG. 15a, the acquisition unit 20, an imaging camera in the example, is arranged on an end effector 150. The advantage of this arrangement is that at least one gripper arm 151 is located in the field of view of the acquisition unit 20.

    [0352] In the design example of FIG. 15b, the acquisition unit 20, also an imaging camera in the example, is mounted in a non-changing position on the semi-conductor industry machine.

    [0353] The advantage of the acquisition unit 20 arranged in this way is that at least the relevant objects, in particular the tray 100, the robot or a certain area of interest, are in the field of view of the acquisition unit 20.

    [0354] The robot and the robot arm are not shown in the design examples for the sake of clarity. Several acquisition units 20 can also be attached in different alignments, whose images are then combined and evaluated, or whose images are evaluated individually and then the results of the evaluation are compared with each other.

    [0355] FIG. 16 shows a schematic view of a system 1 for processing semiconductor elements.

    [0356] The system 1 comprises an end effector 150 with the arms 162a, 162b via which a substrate 12, in the example a mask or a wafer, can be deposited on and/or picked up from a deposit 160.

    [0357] The end effector 150 is part of a robot (not shown) and is moved in several spatial directions by the robot. In particular, a robot with a radial and theta axis can be used.

    [0358] According to the invention, the system comprises at least one acquisition unit, which is arranged on the end effector 150 so it can move with it in this design example.

    [0359] In this design example, two cameras 161a, 161b are provided as the acquisition unit. The optical axes of the two acquisition units can be aligned at an angle to each other or also parallel to each other. In the design example, the two optical axes of both acquisition units are aligned parallel to each other, but laterally offset from one another, in order to be able to image the arms 162a and 162b.

    [0360] The cameras 161a and 161b are located on the front of the end effector 150 in this design example between or next to the arms 162a and 162b.

    [0361] The images of the cameras 161a and 161b are transmitted to an electronic device, in this design example to a portable tablet 6.

    [0362] In this design example, the views of the cameras 161a, 161b on the tablet are shown separately as view 8a and view 8b. This can be realized, for example, via a split screen.

    [0363] For example, it is possible that the camera 161a is aimed at the arm 162a, whereas the camera 161b is aimed at the front edge of the substrate 12.

    [0364] In this way, the motion sequence of the end effector 150 can be monitored via the camera 161a.

    [0365] Camera 161b can be used to easily determine if there is an offset when picking up or depositing the substrate 12, or whether the substrate 12 is in a position, such as a skewed position of 5°, which still allows for processing.
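    The tolerance decision described here can be sketched as a simple check; the 5° limit follows the example in the text, while the function and constant names are assumptions:

```python
# Sketch of the tolerance decision from paragraph [0365]: the skew of
# the substrate detected in the image of camera 161b is compared with
# the skew the end effector can still correct. The 5 degree limit is
# the example value from the text; real limits depend on the machine.

MAX_SKEW_DEG = 5.0

def processing_allowed(detected_skew_deg: float) -> bool:
    """True if the detected skew still allows the substrate to be processed."""
    return abs(detected_skew_deg) <= MAX_SKEW_DEG

print(processing_allowed(3.2), processing_allowed(-6.5))
```

    The boolean result could be part of the information data set that the control system uses to decide whether to continue, correct the position, or alert the user.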

    [0366] In addition, the cameras 161a and 161b can be used to check whether a substrate 12 is on the end effector 150.

    [0367] Additional sensors can then be dispensed with.

    [0368] FIG. 17 shows a design example in which a single camera housing 160 contains two cameras 161a and 161b in a corner.

    [0369] It can be seen that each of the cameras 161a and 161b captures an edge emanating from the corner 171. The two cameras 161a and 161b are advantageously arranged so that their viewing directions or optical axes are not parallel to each other, but are at an angle to each other.

    [0370] The angle of the optical axes can be formed in a plane that is defined by the arms 162a and 162b, i.e. the gripping device of the end effector. In the design example, the optical axis of the at least one acquisition unit is located in this plane in order to capture at least the one arm 162a, 162b well. However, it is also possible that the optical axis of the second acquisition unit is not located in this plane. For example, an impending collision can then be detected earlier when approaching an object.

    [0371] This angle can be designed differently and is advantageously between 5° and 90°, preferably between 10° and 80° and particularly preferably between 15° and 75°. In the example, the angle is between approximately 20° and 30°. This design allows for a particularly compact configuration of the camera housing 170 and therefore a simple and space-saving assembly on the end effector.

    [0372] An offset can also easily be detected when picking up or depositing. The camera housing 170 is preferably mounted on the end effector.

    [0373] In this way, a compact and easy-to-integrate possibility can be created to arrange the acquisition units on the end effector so that they can be moved with the end effector and at the same time provide the necessary information during operation or when moving the end effector.

    [0374] The camera housing 170 can also comprise lighting 9 (e.g. LEDs) for the field of view of the cameras 161a, 161b. The lighting 9 is preferably arranged here so that the main direction of the light beam is parallel to the viewing direction of the cameras 161a, 161b.

    [0375] FIG. 18 shows a design example in which a camera 161 with integrated lighting 9 is integrated in the end effector 150.

    [0376] For this purpose, the support of the end effector 150, from which the arms 162a, 162b protrude, has a recess 17 within which the camera 161 is arranged. The camera 161 in this embodiment of the invention is installed at an angle in the housing of the end effector 150.

    [0377] In this way, the camera 161, which can have a rod-shaped housing (not shown), takes up an otherwise largely unused space in the housing of the end effector 150. Moreover, the angled arrangement can provide a large field of view. The optical axis of the camera 161 can in particular have an angle of 30° to 60° to the main extension direction of the arms 162a, 162b.

    [0378] The at least one camera 161 is located in the recess 17 in an angled surface, so that the camera is aimed at the corner 16 of the substrate 12.

    [0379] It is understood that a camera can preferably also be present in the second recess 17, which is obscured in this view.

    [0380] FIG. 19 shows another embodiment of the invention, in which the cameras 161a, 161b are arranged on the arms 162a, 162b of the end effector 150.

    [0381] FIG. 20 shows a side view and a top view from above (shown below) of the key components of a semi-conductor industry machine 200 for processing semiconductor elements.

    [0382] The machine 200 comprises a storage 10 for substrates 12, in the example a cassette for masks, which in this case is shown in particular as a FOUP (“front opening unified pod”) that is opened or closed with a SMIF (“standard mechanical interface”) load port.

    [0383] A robot 212 with the end effector 150 can be used to remove substrates 12 from the storage 10 and transport them further to the processing station 213.

    [0384] The processing station 213 comprises a movable stage 218 in this design example, on which there is a chuck for holding the mask.

    [0385] The mask processing machine shown here may in particular comprise an inspection device 214, which is located on a plate 219 that is supported by insulators 220 so as to be isolated from vibration.

    [0386] At least the end effector 150 of the robot 212 comprises a camera that the robot 212 can use to capture images in operation that can be processed by the artificial neural network.

    [0387] At the same time, the occupancy of the storage 10 with substrates can be checked here. The robot 212 only approaches receptacles in which there is a substrate 12.

    [0388] FIG. 21 shows the system shown in FIG. 20 at rest. As shown in FIG. 20, the end effector with its arms 162a, 162b is moved to below the mask to pick up the substrate 12 or the mask from the storage 10 or the cassette.

    [0389] The mask is lifted with the robot 212 (FIG. 22) and moved into the target position on the deposit 4 shown in FIG. 23. The target position is monitored by the camera mounted on the end effector.

    [0390] The semi-conductor industry machine may also contain a vacuum chamber. It is understood that the methods as per the invention can also be executed within vacuum chambers, provided that components located within a vacuum chamber, such as the acquisition unit or, if necessary, the lighting, are designed to be vacuum-compatible.

    [0391] For example, the acquisition unit 20 can be installed without tools by using clips.

    [0392] The image recognition by the artificial neural network can be used to very precisely and efficiently control the semi-conductor industry machine or an end effector of a robot.

    [0393] The method as per the invention thus makes it possible to process substrates, in particular wafers, masks or flat panel displays.

    [0394] As a supplement, an end effector can be used in which the substrate, i.e. the mask, the wafer or the flat panel display, is positioned on the receptacle by motors integrated in the end effector, for example using a piezo drive. This makes it possible to position the substrate more precisely than via the remote drive in the robot. In another embodiment of the invention, the end effector itself comprises actuators for finely adjusting the target position.

    [0395] In particular, the advantages of using the artificial neural network come to bear here, which make it possible to detect tolerable and easy-to-adjust deviations from target positions of the substrate, i.e. deviations from an ideally deposited substrate, and to decide whether and under which adjustments to the end effector processing is possible.

    [0396] The robot itself can, as provided in one embodiment of the invention, also have its motion sequence finely adjusted via image recognition.

    [0397] FIGS. 24a and 24b each show a design example of a substrate with a structure using the example of a mask, and FIGS. 25a and 25b each show a design example of a substrate with a substrate holder using the example of a wafer, in an oblique view. These design examples depict possible superstructures 14 of a substrate 12, which, according to the invention, can be recognized by classifying the type of substrate 12.

    [0398] FIG. 24a schematically shows a substrate 12, in the example a mask, with a structure 14; FIG. 24b shows the mask from FIG. 24a with a structure 14 and a pellicle 13 transversely below.

    [0399] FIG. 25a schematically shows a substrate 12, in the example a wafer, with a substrate holder 251 for holding the substrate 12 or wafer; FIG. 25b shows the wafer from FIG. 25a with a structure 14. A structure 14 may, for example, comprise a calibration device for the machine, measuring equipment or other components.

    [0400] Finally, FIG. 26 shows a substrate holder with an upper shell 260 and a lower shell 261.

    [0401] The method as per the invention therefore, according to one embodiment, allows the artificial neural network to also detect substrate types, or the generated information data set to be used to derive the substrate types, comprising

    [0402] whether the substrate 12 has a substrate holder 251, and/or

    [0403] whether the substrate 12 has a structure 14, and/or

    [0404] whether the substrate 12 has a pellicle 13, and/or

    [0405] whether the substrate 12 is a wafer, and/or

    [0406] whether the substrate 12 is a mask, and/or

    [0407] whether the substrate 12 is a flat panel display.

    [0408] The presence and/or position and/or orientation of the substrate 12 and/or the substrate type can also be derived from the generated information data set.

    [0409] Furthermore, the artificial neural network can be configured and taught to detect substrate types and to issue the required information data set to detect or derive the substrate types.
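    A possible, purely illustrative mapping from a predicted class label to the substrate-type entries of the information data set could look as follows; the label names and attribute keys are assumptions, not taken from the patent:

```python
# Illustrative mapping from a predicted class label to the substrate-type
# information of paragraphs [0401]-[0407]. The labels and attributes are
# invented; a trained network could use any equivalent class scheme.
SUBSTRATE_TYPES = {
    "wafer_with_holder": {"kind": "wafer", "holder": True,
                          "structure": False, "pellicle": False},
    "wafer_with_structure": {"kind": "wafer", "holder": True,
                             "structure": True, "pellicle": False},
    "mask_with_pellicle": {"kind": "mask", "holder": False,
                           "structure": False, "pellicle": True},
    "flat_panel_display": {"kind": "flat_panel_display", "holder": False,
                           "structure": False, "pellicle": False},
}

def derive_type(predicted_label: str) -> dict:
    """Derive the substrate-type entries of the information data set."""
    return SUBSTRATE_TYPES[predicted_label]

info = derive_type("mask_with_pellicle")
print(info["kind"], info["pellicle"])
```

    The derived attributes can then be attached to the information data set and evaluated by the control system or the user.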

    LIST OF REFERENCE SIGNS

    [0410] 1 Handling system

    4 Deposit

    6 Tablet

    8a, 8b View

    9 Lighting

    10 Storage

    [0411] 11 Deposit location

    12 Substrate

    13 Pellicle

    14 Structure

    16 Corner

    17 Recess

    [0412] 20 Acquisition unit
    21 Image area
    30 Overall image
    31-31f Slot images
    40 Computer architecture

    41 Cable

    [0413] 50 Lighting unit
    80 Handling system
    81 End effector

    82 Robot

    [0414] 83 Robot controller
    84 Robot handling system
    85 System controller
    86 Semi-conductor industry machine
    87 Control system

    88 Computer

    89 Cloud

    [0415] 91 Robot system

    100 Tray

    [0416] 101 Position points
    102 Front side of the tray
    110 Discrete area of the image recognition
    120 Image section
    121 Substrate in incorrect position (“cross-slotted”)
    122 Substrate in incorrect position (“double-slotted”)
    130 Image section
    150 End effector

    151 Gripper arm

    160 Deposit

    161a, 161b Camera

    [0417] 162a, 162b Gripper arm of the end effector
    170 Camera housing

    200 Machine

    212 Robot

    [0418] 213 Processing station
    214 Inspection device

    218 Stage

    219 Plate

    220 Insulator

    [0419] 251 Substrate holder

    260, 261 Shell