ARTIFICIAL INTELLIGENCE NEURAL NETWORK APPARATUS AND DATA CLASSIFICATION METHOD WITH VISUALIZED FEATURE VECTOR
20210224604 · 2021-07-22
Assignee
Inventors
Cpc classification
G06F18/214
PHYSICS
G06V10/765
PHYSICS
G06V10/758
PHYSICS
International classification
Abstract
An artificial intelligence neural network apparatus, comprising: a labeled learning database having data of a feature vector composed of N elements; a first feature vector image converter configured to visualize the data in the learning database to form an imaged learning feature vector image database; a deep-learned artificial intelligence neural network configured to use a learning feature vector image in the learning feature vector image database to perform an image classification operation; an inputter configured to receive a test image, and generate test data based on the feature vector; and a second feature vector image converter configured to visualize the test data and convert the visualized test data into a test feature vector image. The deep-learned artificial intelligence neural network is configured to determine a class of the test feature vector image.
Claims
1. An artificial intelligence neural network apparatus coupled with a visualized feature vector, comprising: a labeled learning database having data of a feature vector composed of N elements; a first feature vector image converter configured to visualize the data in the labeled learning database to form an imaged learning feature vector image database; a deep-learned artificial intelligence neural network configured to use a learning feature vector image in the learning feature vector image database to perform an image classification operation; an inputter configured to receive a test image, and generate test data based on the feature vector; and a second feature vector image converter configured to visualize the test data and convert the visualized test data into a test feature vector image, wherein the deep-learned artificial intelligence neural network is configured to determine a class of the test feature vector image.
2. The artificial intelligence neural network apparatus of claim 1, wherein the deep-learned artificial intelligence neural network is deep-learned by supervised learning using the learning feature vector image stored in the learning feature vector image database.
3. The artificial intelligence neural network apparatus of claim 1, wherein the first feature vector image converter includes: a pattern image storage configured to store pattern images of a relationship between element x.sub.i and other elements {x.sub.j|j=i+1, i+2, . . . , N} for data represented by a feature vector composed of the N elements {x.sub.i|i∈1, 2, . . . , N}; an address generator configured to calculate an address for reading the pattern image from the pattern image storage; an element x.sub.i visualizer configured to obtain visualized cross correlation images {A.sub.ij|j=i+1, i+2, . . . , N} by reading a pattern image corresponding to the address generated from the address generator and mapping the pattern image read into a two-dimensional space; a first addition operator configured to generate a local pattern image B.sub.i by synthesizing cross correlation images {A.sub.ij|j=i+1, i+2, . . . , N} obtained from the element x.sub.i visualizer; and a second addition operator configured to generate a feature vector image by synthesizing a local pattern image {B.sub.i|i=1, 2, . . . , N−1} obtained from the first addition operator.
4. The artificial intelligence neural network apparatus of claim 3, wherein the first addition operator further includes: a multiplier configured to perform multiplication of weight W.sub.ij by the cross correlation image A.sub.ij; and an adder configured to perform addition to the bias b.sub.i, wherein the weight W.sub.ij and the bias b.sub.i are customized while being learned by a supervised learning of the deep-learned artificial intelligence neural network.
5. A processor implemented artificial intelligence neural network method, comprising: visualizing data in a labeled learning database to form an imaged learning feature vector image database, wherein the labeled learning database has the data composed of N elements; using a learning feature vector image in the learning feature vector image database to perform an image classification operation; receiving a test image and generating test data based on the feature vector; visualizing the test data and converting the visualized test data into a test feature vector image; and determining a class of the test feature vector image.
6. The method of claim 5, further including: storing pattern images of a relationship between element x.sub.i and other elements {x.sub.j|j=i+1, i+2, . . . , N} for data represented by a feature vector composed of the N elements {x.sub.i|i∈1, 2, . . . , N}; calculating an address for reading the pattern image from the pattern image storage; obtaining visualized cross correlation images {A.sub.ij|j=i+1, i+2, . . . , N} by reading a pattern image corresponding to the address generated from the address generator and mapping the pattern image read into a two-dimensional space; generating a local pattern image B.sub.i by synthesizing cross correlation images {A.sub.ij|j=i+1, i+2, . . . , N} obtained from the element x.sub.i visualizer; and generating a feature vector image by synthesizing a local pattern image {B.sub.i|i=1, 2, . . . , N−1} obtained from the first addition operator.
7. The method of claim 6, further including: performing multiplication of weight W.sub.ij by the cross correlation image A.sub.ij; and performing addition to the bias b.sub.i, wherein the weight W.sub.ij and the bias b.sub.i are customized while being learned by a supervised learning of a deep-learned artificial intelligence neural network.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027] Throughout the drawings and the detailed description, the same reference numerals refer to the same elements. The drawings may not be to scale, and the relative size, proportions, and depiction of elements in the drawings may be exaggerated for clarity, illustration, and convenience.
DETAILED DESCRIPTION OF THE EMBODIMENT
[0028] The following detailed description is provided to assist the reader in gaining a comprehensive understanding of the methods, apparatuses, and/or systems described herein. However, various changes, modifications, and equivalents of the methods, apparatuses, and/or systems described herein will be apparent after an understanding of the disclosure of this application. For example, the sequences of operations described herein are merely examples, and are not limited to those set forth herein, but may be changed as will be apparent after an understanding of the disclosure of this application, with the exception of operations necessarily occurring in a certain order. Also, descriptions of features that are known in the art may be omitted for increased clarity and conciseness.
[0029] The features described herein may be embodied in different forms, and are not to be construed as being limited to the examples described herein. Rather, the examples described herein have been provided merely to illustrate some of the many possible ways of implementing the methods, apparatuses, and/or systems described herein that will be apparent after an understanding of the disclosure of this application.
[0030] Throughout the specification, when an element, such as a layer, region, or substrate, is described as being “on,” “connected to,” or “coupled to” another element, it may be directly “on,” “connected to,” or “coupled to” the other element, or there may be one or more other elements intervening therebetween. In contrast, when an element is described as being “directly on,” “directly connected to,” or “directly coupled to” another element, there can be no other elements intervening therebetween.
[0031] As used herein, the term “and/or” includes any one and any combination of any two or more of the associated listed items.
[0032] Although terms such as “first,” “second,” and “third” may be used herein to describe various members, components, regions, layers, or sections, these members, components, regions, layers, or sections are not to be limited by these terms. Rather, these terms are only used to distinguish one member, component, region, layer, or section from another member, component, region, layer, or section. Thus, a first member, component, region, layer, or section referred to in examples described herein may also be referred to as a second member, component, region, layer, or section without departing from the teachings of the examples.
[0033] The terminology used herein is for describing various examples only, and is not to be used to limit the disclosure. The articles “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. The terms “comprises,” “includes,” and “has” specify the presence of stated features, numbers, operations, members, elements, and/or combinations thereof, but do not preclude the presence or addition of one or more other features, numbers, operations, members, elements, and/or combinations thereof.
[0034] The features of the examples described herein may be combined in various ways as will be apparent after an understanding of the disclosure of this application. Further, although the examples described herein have a variety of configurations, other configurations are possible as will be apparent after an understanding of the disclosure of this application.
[0035] An object to be achieved by the present disclosure is to provide an artificial intelligence neural network apparatus coupled with a visualized feature vector and a data classification method thereof using the same capable of automatically classifying data by visualizing and imaging data based on a feature vector and applying the data based on the feature vector to CNN.
[0036] Further, another object to be achieved by the present disclosure is to provide an apparatus and method capable of classifying data based on a feature vector using the existing deep learning neural network by visualizing a feature vector pre-selected by a human and learning and classifying the visualized feature vector using the deep learning neural network.
[0037] Further, still another object to be achieved by the present disclosure is to provide an advantage of greatly improving feature vector extraction efficiency as well as greatly improving a learning speed of a deep learning neural network by selecting a feature vector pre-selected by a human and providing the selected feature vector to the deep learning neural network in a form that the feature vector pre-selected by a human and their own feature vector extraction ability of the deep learning neural network are coupled.
[0038] However, the technical problems to be achieved by the embodiments of the present disclosure are not limited to the technical problems as described above, and other technical problems may exist.
[0039] According to an aspect of the present disclosure, there is provided an artificial intelligence neural network apparatus coupled with a visualized feature vector includes: data that is represented by a feature vector composed of N elements; a learning database that is labeled which class the learning database belongs to and composed of data represented by the feature vector; a feature vector to image conversion unit 1 that visualizes data stored in the learning database to form an imaged learning feature vector image database; an artificial intelligence neural network that is deep-learned by supervised learning using a learning feature vector image stored in the learning feature vector image database, and then performs an image classification operation; a data input unit that receives a test image to be classified and generates test data represented by the feature vector; and a feature vector to image conversion unit 2 that visualizes the test data and converts the visualized test data into a test feature vector image, in which the deep-learned artificial intelligence neural network classifies which class the test feature vector image belongs to.
[0040] Further, the feature vector to image conversion unit includes: a pattern image storage unit that stores pattern images showing a relationship between element xi and other elements {xj□j=i+1, i+2, . . . , N} for data represented by a feature vector composed of N elements {xi□i□1, 2, . . . , N}; an address generation unit that calculates an address for reading the pattern image from the pattern image storage unit; an element xi visualization unit that obtains visualized cross correlation images {Aij□j=i+1, i+2, . . . , N} by reading a pattern image corresponding to an address generated from the address generation unit, from the pattern image storage unit, and mapping the read pattern image to a two-dimensional space; an addition operation unit 1 that generate a local pattern image Bi by synthesizing cross correlation images {Aij□j=i+1, i+2, . . . , N} obtained from the element xi visualization unit; and an addition operation unit 2 that generates a feature vector image by synthesizing a local pattern image {Bi□i=1, 2, . . . , N−1} obtained from the addition operation unit 1.
[0041] Further, the addition operation unit 1 that generates the local pattern image Bi further includes: a multiplier that performs multiplication of weight Wij by the cross correlation image Aij; and an adder that performs addition to the bias bi, in which the weight Wij and the bias bi are optimized (customized) while being learned by the supervised learning of the artificial intelligence neural network.
[0042] The means for solving the problem described above are merely and should not be construed as limiting the present disclosure. In addition to the embodiments described above, additional embodiments may exist in the drawings and detailed description of the disclosure.
[0043] As described above, the present disclosure relates to the artificial intelligence neural network apparatus for classifying data and the classification method based on the same, and more particularly, provides the apparatus and method capable of effectively classifying data by coupling the visualized feature vector with the artificial intelligence neural network.
[0044]
[0045] Referring to
[0046] According to an embodiment of the present disclosure, data (not illustrated) may be represented by a feature vector composed of N elements.
[0047] The learning database 10 is labeled which class it belongs to and may be composed of data represented by the feature vector.
[0048] The feature vector to image conversion unit 1 12 may visualize data stored in the learning database 10 to form the imaged learning feature vector image database 14.
[0049] The artificial intelligence neural network 20 is deep-learned by supervised learning using a learning feature vector image stored in the learning feature vector image database 14, and then may perform an image classification operation.
[0050] The data input unit 32 may receive a test image 30 to be classified and generate test data represented by the feature vector.
[0051] In addition, the feature vector to image conversion unit 2 34 may visualize the test data and convert the visualized test data into a test feature vector image.
[0052] An artificial intelligence neural network apparatus 100 is characterized by classifying which class the test feature vector image belongs to by the deeply learned artificial intelligence neural network 20.
[0053] The present disclosure uses any one selected from entropy, histogram, histogram of oriented gradients (HOG), wavelet transform, and dimensionality reduction techniques to extract feature vectors pre-selected by a human.
[0054] For the dimensionality reduction, any one of principal component analysis, linear discriminant analysis (LDA), factor analysis, multi-dimensional scaling (MDS), singular value decomposition (SVD), isometric feature mapping (Isomap), locally linear embedding (LLE), Hessian Eigenmapping (HLLE), and spectral embedding (Laplacian Eigenmaps) techniques may be used.
[0055] The artificial intelligence neural network 20 of the present disclosure includes a neural network capable of deep learning, and may use a convolutional neural network (CNN) and a recurrent neural network (RNN).
[0056] In the present disclosure, the artificial intelligence neural network is a neural network for allowing deep learning, and is configured by combining any one or more layers or elements selected from a convolution layer, a pooling layer, a ReLu layer, a transpose convolution layer, an unpooling layer, a 1×1 convolution layer, skip connection, a global average pooling (GAP) layer, a fully connected layer, a support vector machine (SVM), a long short term memory (LSTM), Atrous convolution, Atrous spatial pyramid pooling, separable convolution, and bilinear upsampling. In an example, the artificial intelligence neural network further includes an operation unit for a batch normalization operation in front end of the ReLu layer.
[0057] In the present disclosure, the deep learning of the artificial intelligence neural network may use a backpropagation algorithm technique that reduces an error between an output result and an actual value in the neural network, and may use any one algorithm selected from optimization algorithms such as stochastic gradient descent with momentum, Adagrad, Adam, and RMSProp algorithms. Herein, it is noted that use of the term ‘may’ with respect to an example or embodiment, e.g., as to what an example or embodiment may include or implement, means that at least one example or embodiment exists where such a feature is included or implemented while all examples and embodiments are not limited thereto.
[0058] For the description of the present disclosure, assuming that the given feature vector is composed of N elements, each data may be represented by a feature vector composed of {x.sub.1, x.sub.2, . . . , x.sub.N} which are N elements.
[0059]
[0060] According to the embodiment of the present disclosure, an address generation unit 90a may calculate an address for reading pattern images showing a relationship between element x.sub.i and other elements {x.sub.j|j=i+1, i+2, . . . , N} for data represented by the feature vector composed of N elements {x.sub.i|i∈1, 2, . . . , N} from the pattern image storage unit 90.
[0061] In addition, element xi visualization units 70a, 70b, and 70c read the pattern image corresponding to the address generated by the address generation unit 90a from the pattern image storage unit 90, and as a result, may acquire cross correlation images A.sub.ij 60a, 60b, and 60c visualized by being mapped to a two-dimensional space.
[0062] The feature vector to image conversion unit 1 12 and the feature vector to image conversion unit 2 34 include addition operation units 1 52a, 52b, and 52c that generate local pattern images B.sub.i 72a, 72b, and 72c by synthesizing the cross correlation images A.sub.ij 60a, 60b, and 60c respectively obtained from the element x.sub.i visualization units 70a, 70b, and 70c, and an addition operation unit 2 70 that generates a feature vector image G 77 by synthesizing the local pattern images B.sub.i 72a, 72b, and 72c obtained from the addition operation unit 1.
[0063] The element x.sub.i visualization unit obtains {A.sub.ij|j=i+1, i+2, . . . , N}, which are the cross correlation images.
[0064] The addition operation unit 1 obtains {B.sub.i|i=1, 2, . . . , N−1}, which are the local pattern images.
[0065] For example, the cross correlation image A.sub.12 60a reads the pattern image showing the relationship between the element x.sub.1 and the element x.sub.2 from the pattern image storage unit 90 to indicate the cross correlation image visualized by being mapped to the two-dimensional space, and the cross correlation image A.sub.12 60b indicates the cross correlation image between the element x.sub.1 and the element x.sub.2.
[0066] In one example, the local pattern images B.sub.i 72a, 72b, and 72c are obtained by synthesizing the obtained cross correlation images A.sub.ij 60a, 60b, and 60c as in the following Equation 1.
[0067] In the above Equation 1, weight W.sub.ij and bias b.sub.i are optimized (customized) while being learned by user defined variables or supervised learning applied according to application fields.
[0068] In an example, the addition operation unit 1 for generating the local pattern image B.sub.i may include a multiplier that performs multiplication of the weight W.sub.ij by the cross correlation image A.sub.ij and an adder that performs addition to the bias b.sub.i.
[0069] In addition, the addition operation unit 1 provides the artificial intelligence neural network apparatus coupled with the visualized feature vector that is optimized while the weight W.sub.ij and the bias b.sub.i are deep-learned by the supervised learning of the artificial intelligence neural network.
[0070] In an example, the feature vector image G 77 is obtained by synthesizing the local pattern images B.sub.i 72a, 72b, and 72c using the following Equation 2.
[0071]
[0072] In addition, the local pattern image B.sub.2 72b is a two-dimensional image obtained by synthesizing cross correlation images A.sub.2j visualizing the relationship between the element x.sub.2 and other elements {x.sub.j|j=3, . . . , N}, and is represented by the following Equation 4.
[0073] In this way, a local pattern image B.sub.N−1 72c is a two-dimensional image obtained from cross correlation image A.sub.N−1, N visualizing a relationship between element x.sub.N−1 and other elements x.sub.N, and is represented by the following Equation 5.
[0074] In addition, the feature vector image G 77 is a two-dimensional image obtained by synthesizing the obtained local pattern images 72a, 72b, and 72c, and is represented by the above Equation 2.
[0075]
[0076] For example, assuming that the four elements of the feature vector={ear size, weight, skin color, and eye size}, x.sub.1=ear size, x.sub.2=body weight, x.sub.3=skin color, and x.sub.4=eye size. In this case, the cross correlation image A.sub.12 60a is obtained by reading the pattern image stored in the corresponding address and mapping the read pattern image to the two-dimensional space by using x.sub.1, new, and x.sub.2, new obtained by applying values of the element x.sub.1 and the element x.sub.2 to the following Equation 6 or Equation 7 as an address for selecting one of the pattern images stored in the pattern image storage unit 90.
[0077] Reference numeral 60b is a cross correlation image A.sub.13 formed by the element x.sub.1 and the element x.sub.3.
[0078] Reference numeral 60c is a cross correlation image A.sub.14 formed by the element x.sub.1 and the element x.sub.4.
[0079] Reference numeral 72a is a local pattern image B.sub.1 generated by synthesizing the three obtained cross correlation images A.sub.12 60a, A.sub.13 60b, and A.sub.14 60c by the addition operation unit 1 52a.
[0080] In an example, the cross correlation image of the present disclosure may be obtained by mapping each data based on the feature vector to the two-dimensional space in the form of a pattern image.
[0081] In order to determine the pattern image to be mapped, the pattern image storage unit 90 storing various pattern images and an address for selecting one of the pattern images stored in the pattern image storage unit 90 are required.
[0082] The address is obtained by the address generation unit 90a, and values x.sub.i, new and x.sub.j, new obtained by standardization that has the element x.sub.i and element x.sub.j and depends on the following Equation 6 are used as an address.
[0083] In Equation 6, when x.sub.i.sup.DB and x.sub.j.sup.DB are the elements x.sub.i and x.sub.j components of the feature vector stored in the learning database 10, μ.sub.i.sup.DB is a mean value of x.sub.i.sup.DB, μ.sub.j.sup.DB is a mean value of x.sub.j.sup.DB, σ.sub.i.sup.DB is a standard deviation of x.sub.i.sup.DB, and σ.sub.j.sup.DB is a standard deviation of x.sub.j.sup.DB.
[0084] That is, when data based on the feature vector is mapped to the two-dimensional space in the form of the pattern image, the pattern images stored in the addresses x.sub.i, new and x.sub.j, new are read from the pattern image storage unit 90 and mapped.
[0085] If the addresses are out of the address range of the pattern image storage unit 90, in an example, a null pattern is read and mapped.
[0086] In an example, another aspect of the address generation unit 90a is to determine the addresses by the following Equation 7.
[0087] In the above Equation 7, the number of classes refers to the number of classes (categories) to be classified by the artificial intelligence neural network 20.
[0088] In an example, the addresses are multiplied by a predetermined scaling factor to cover the address range of the pattern image storage unit 90, and then converted into an integer value by rounding to be used.
[0089]
[0090] The cross correlation image A.sub.ij is generated by reading the pattern images corresponding to the address values given by the above x.sub.i, new and x.sub.j, new.
[0091] Reference numeral 92 denotes pattern image storage locations 92a, 92b, and 92c for generating cross correlation images A.sub.12, A.sub.13, . . . , A.sub.1N, as a pattern storage location that stores the pattern image for generating the cross correlation image A.sub.1j.
[0092] Reference numeral 92a denotes a pattern storage location for generating the cross correlation image A.sub.13 as a pattern storage location that stores the pattern image for generating the cross correlation image A.sub.12. In addition, reference numeral 92c denotes a pattern storage location for generating the cross correlation image A.sub.IN.
[0093] Reference numeral 94 denotes pattern image storage locations 94a, 94b, and 94c for generating cross correlation images A.sub.22, A.sub.23, . . . , A.sub.2N as a pattern storage location that stores the pattern image for generating the cross correlation image A.sub.2j.
[0094] Reference numeral 96 denotes a pattern image storage location 96a that stores a pattern image for generating a cross correlation images A.sub.N−1, N.
[0095]
[0096] The artificial intelligence neural network apparatus 100, a learning database 10, a feature vector to image conversion unit 1 12, a learning feature vector image database 14, an artificial intelligence neural network 20, a test image 30, a data input unit 32, and a feature vector to image conversion unit 2 34, and the pattern image storage unit 90 in
[0097] The methods illustrated in
[0098] Instructions or software to control computing hardware, for example, one or more processors or computers, to implement the hardware components and perform the methods as described above may be written as computer programs, code segments, instructions or any combination thereof, for individually or collectively instructing or configuring the one or more processors or computers to operate as a machine or special-purpose computer to perform the operations that are performed by the hardware components and the methods as described above. In one example, the instructions or software include machine code that is directly executed by the one or more processors or computers, such as machine code produced by a compiler. In another example, the instructions or software includes higher-level code that is executed by the one or more processors or computer using an interpreter. The instructions or software may be written using any programming language based on the block diagrams and the flow charts illustrated in the drawings and the corresponding descriptions in the specification, which disclose algorithms for performing the operations that are performed by the hardware components and the methods as described above.
[0099] The instructions or software to control computing hardware, for example, one or more processors or computers, to implement the hardware components and perform the methods as described above, and any associated data, data files, and data structures, may be recorded, stored, or fixed in or on one or more non-transitory computer-readable storage media. Examples of a non-transitory computer-readable storage medium include read-only memory (ROM), random-access memory (RAM), flash memory, CD-ROMs, CD-Rs, CD+Rs, CD-RWs, CD+RWs, DVD-ROMs, DVD-Rs, DVD+Rs, DVD-RWs, DVD+RWs, DVD-RAMs, BD-ROMs, BD-Rs, BD-R LTHs, BD-REs, magnetic tapes, floppy disks, magneto-optical data storage devices, optical data storage devices, hard disks, solid-state disks, and any other device that is configured to store the instructions or software and any associated data, data files, and data structures in a non-transitory manner and provide the instructions or software and any associated data, data files, and data structures to one or more processors or computers so that the one or more processors or computers can execute the instructions. In one example, the instructions or software and any associated data, data files, and data structures are distributed over network-coupled computer systems so that the instructions and software and any associated data, data files, and data structures are stored, accessed, and executed in a distributed fashion by the one or more processors or computers.
[0100] While this disclosure includes specific examples, it will be apparent after an understanding of the disclosure of this application that various changes in form and details may be made in these examples without departing from the spirit and scope of the claims and their equivalents. The examples described herein are to be considered in a descriptive sense only, and not for purposes of limitation. Descriptions of features or aspects in each example are to be considered as being applicable to similar features or aspects in other examples. Suitable results may be achieved if the described techniques are performed in a different order, and/or if components in a described system, architecture, device, or circuit are combined in a different manner, and/or replaced or supplemented by other components or their equivalents. Therefore, the scope of the disclosure is defined not by the detailed description, but by the claims and their equivalents, and all variations within the scope of the claims and their equivalents are to be construed as being included in the disclosure.