Active Learning for Few-Shot Learning
20260119962 ยท 2026-04-30
Inventors
- Xiao LIN (Union City, CA, US)
- Yi YAO (Princeton, NJ, US)
- Meng YE (Pennington, NJ, US)
- Yunye GONG (West Windsor, NJ, US)
- Giedrius Tomas BURACHAS (Redwood City, CA, US)
- Nikoletta BASIOU (Redwood City, CA, US)
- Ajay DIVAKARAN (Monmouth Junction, NJ, US)
Cpc classification
G06V20/70
PHYSICS
International classification
G06V10/774
PHYSICS
Abstract
A method and apparatus for adapting a pretrained machine learning model using active learning for improved task performance in a target domain includes embedding a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, analyzing the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space, labeling the selected at least one unlabeled class data representation, and adapting the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled unlabeled class data representation.
Claims
1. A method for adapting a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained, comprising: embedding or projecting a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, the embeddings space including embedded, respective vector representations of labeled class data and unlabeled class data; analyzing the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space; labeling the selected at least one unlabeled class data representation; and adapting the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation.
2. The method of claim 1, wherein the machine learning model comprises an image classifier, the embedded labeled class data and unlabeled class data comprise labeled image representations and unlabeled image representations, and the target class data comprises unlabeled test image representations.
3. The method of claim 1, wherein at least one unlabeled class data representation that maximizes a coverage score in the embedding space for the at least one unlabeled target class data representation is selected for labeling.
4. The method of claim 1, further comprising: determining an area of coverage for each labeled class data representation in the embedding space; determining an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations; and selecting, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
5. The method of claim 1, further comprising: reanalyzing the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
6. The method of claim 1, wherein the selected at least one unlabeled class data representation is labeled with the assistance of a human.
7. The method of claim 1, wherein the selected at least one unlabeled class data representation is labeled using a machine learning model.
8. An apparatus for adapting a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained, comprising: a processor; and a memory accessible to the processor, the memory having stored therein at least one of programs or instructions executable by the processor to configure the apparatus to: embed or project a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, the embeddings space including embedded, respective vector representations of labeled class data and unlabeled class data; analyze the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space; label the selected at least one unlabeled class data representation; and adapt the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation.
9. The apparatus of claim 8, wherein the machine learning model comprises an image classifier, the embedded labeled class data and unlabeled class data comprise labeled image representations and unlabeled image representations, and the target class data comprises unlabeled test image representations.
10. The apparatus of claim 8, wherein at least one unlabeled class data representation that maximizes a coverage score in the embedding space for the at least one unlabeled target class data representation is selected for labeling.
11. The apparatus of claim 8, wherein the apparatus is further configured to: determine an area of coverage for each labeled class data representation in the embedding space; determine an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations; and select, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
12. The apparatus of claim 8, wherein the apparatus is further configured to: reanalyze the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
13. The apparatus of claim 8, wherein the selected at least one unlabeled class data representation is labeled with the assistance of a human.
14. The apparatus of claim 8, wherein the selected at least one unlabeled class data representation is labeled using a machine learning model.
15. A non-transitory computer readable storage medium having stored thereon instructions that when executed by a processor perform a method for adapting a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained, comprising: embedding or projecting a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, the embeddings space including embedded, respective vector representations of labeled class data and unlabeled class data; analyzing the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space; labeling the selected at least one unlabeled class data representation; and adapting the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation.
16. The non-transitory computer readable storage medium of claim 15, wherein the machine learning model comprises an image classifier, the embedded labeled class data and unlabeled class data comprise labeled image representations and unlabeled image representations, and the target class data comprises unlabeled test image representations.
17. The non-transitory computer readable storage medium of claim 15, wherein at least one unlabeled class data representation that maximizes a coverage score in the embedding space for the at least one unlabeled target class data representation is selected for labeling.
18. The non-transitory computer readable storage medium of claim 15, wherein the method further comprises: determining an area of coverage for each labeled class data representation in the embedding space; determining an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations; and selecting, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
19. The non-transitory computer readable storage medium of claim 15, wherein the method further comprises: reanalyzing the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
20. The non-transitory computer readable storage medium of claim 15, wherein the selected at least one unlabeled class data representation is labeled with at least one of a machine learning model or the assistance of a human.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0013] So that the manner in which the above recited features of the present principles can be understood in detail, a more particular description of the principles, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments in accordance with the present principles and are therefore not to be considered limiting of its scope, for the principles may admit to other equally effective embodiments.
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024] To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. The figures are not drawn to scale and may be simplified for clarity. It is contemplated that elements and features of one embodiment may be beneficially incorporated in other embodiments without further recitation.
DETAILED DESCRIPTION
[0025] Embodiments of the present principles generally relate to methods, apparatuses and systems for adapting a pretrained machine learning model/classifier for, for example, targeted tasks and/or for few-shot learning using novel active learning techniques. While the concepts of the present principles are susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and are described in detail below. It should be understood that there is no intent to limit the concepts of the present principles to the particular forms disclosed. On the contrary, the intent is to cover all modifications, equivalents, and alternatives consistent with the present principles and the appended claims. For example, although embodiments of the present principles will be described primarily with respect to a specific embedding space and specific distance measurement techniques with regards to image classification, embodiments of the present principles can be implemented with substantially any embedding space using substantially any distance measurement techniques for adapting a machine learning model to perform targeted tasks.
[0026] In the description herein, the phrase class data and the like is intended to describe data associated with the performance of a portion of a task, an entire task, or a group of related tasks that share a common category. As such, when class data is referenced herein, the class data can be referring data associated with the performance of a portion of a task, an entire task, or a group of related tasks that share a common category.
[0027] Embodiments of the present principles provide, at least, a technical solution to the technical problem of how to adapt a machine learning model for targeted tasks for which the model was not trained by providing a method, apparatus and system including novel active leaning techniques which adapt a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained at least as described herein.
[0028] In some embodiments, novel active learning techniques of the present principles are based on distance measures between vectors in an embedding space. Such active learning techniques of the present principles can be applied to a pre-trained machine learning model, such as an image classifier/classification model, to adapt the pre-trained machine learning model for improved learning to enable the machine learning model to perform targeted tasks for which the machine learning model was not specifically pretrained, for example, image classification, object detection, and/or few-shot learning. For example, embodiments of the present principles can be implemented using two active learning approaches, distance-entropy and distance-goal, that are based on distance measurements between vectors in an embedding space. For both approaches, a two-stage training paradigm, namely pre-training and adaptation, can be adopted for adapting a pre-trained machine learning model, such as an image classifier/classification model or an object detection model, to better detect images and/or objects in a specific class, or in addition, in some embodiments for few-shot learning. In such embodiments of the present principles, a pre-trained embedding space is well defined and, as such, the distance between vectors in such an embedding space becomes a more robust metric.
[0029] For embodiments of a distance-entropy active learning technique of the present principles, a vector representation of unlabeled class data, such as in some embodiments an unlabeled/unclassified image embedded in an embedding space, that is equidistant from a number of selected vector representations of respective labeled class data, such as in some embodiments classified images embedded in the embedding space, can be selected for classification.
[0030] For embodiments of a distance-goal active learning technique of the present principles, a vector representation of at least one unlabeled target class data, such as test/query samples, can be embedded in an embedding space associated with a pretrained machine learning model. An unlabeled class data representation embedded in an embedding space that, if classified, would provide better classification amongst selected unlabeled target class data representations, such as unlabeled test samples, can be selected for classification. In accordance with the present principles, the labeling/classification of the selected unlabeled/unclassified class data embedded in the embedding space improves the performance of a task associated with target data representations for an associated, pretrained machine learning, for which the machine learning model was not previously trained. For example, in some embodiments, the labeling/classification of the selected unlabeled/unclassified image representations in the embedding space improves a gain per label query associated with the embedding space, which enables few-shot learning. Embodiments of the novel active learning techniques of the present principles are described in greater detail below.
[0031]
[0032] An active learning training system of the present principles, such as the active learning training system 100 of
[0033] Alternatively or in addition, in some embodiments in which an embedding space is not predetermined (i.e., is not communicated to the active learning training system of the present principle from a machine learning model), the embedding module 110 of the active learning training system 100 of
[0034]
[0035] With reference to the embedding space 200 in
[0036] In accordance with the present principles, the distance measurement module 120 of the active learning training system 100 of
[0037] For example and with reference to the embedding space 200 of
[0038] In some embodiments of the present principles, the labeling module 125 can cause a selected unlabeled image to be classified/labeled. For example, in some embodiments of the present principles, the labeling module 125 can cause a visual representation of the selected unlabeled image to be presented to a user of an active learning training system of the present principles, such as the active learning training system 100 of
[0039] Alternatively or in addition, in some embodiments, a labeling module of the present principles, such as the labeling module 125 of
[0040] In some embodiments in accordance with the present principles, to label selected, unlabeled images, the machine learning model/system 127 can implement suitable machine learning techniques to learn commonalities in sequential application programs and for determining from the machine learning techniques at what level sequential application programs can be canonicalized. In some embodiments, machine learning techniques that can be applied to learn commonalities in sequential application programs can include, but are not limited to, regression methods, ensemble methods, or neural networks and deep learning such as Seq2Seq Recurrent Neural Network (RNNs)/Long Short-Term Memory (LSTM) networks, Convolution Neural Networks (CNNs), graph neural networks applied to the abstract syntax trees corresponding to the sequential program application, Transformer networks, and the like. In some embodiments a supervised machine learning (ML) classifier/algorithm could be used such as, but not limited to, Multilayer Perceptron, Random Forest, Naive Bayes, Support Vector Machine, Logistic Regression and the like. In addition, in some embodiments, the ML classifier/algorithm of the present principles can implement at least one of a sliding window or sequence-based techniques to analyze data.
[0041] A machine learning model/system of the present principles, such as the machine learning model/system 127 of the labeling module 125 of the, active learning training system 100 of
[0042] Although in the embodiment of
[0043]
[0044] In the embodiment of
[0045] More specifically, in some embodiments of the present principles, the distance measurement module 120 of the active learning training system 100 of
[0046] Referring back to the embedding space 200 of
[0047] For example, in the embodiment of
[0048] In embodiments of the present principles, a vector representation of an unlabeled image in the embedding space is selected for classification based on distance measurements in the embedding space. For example, in some embodiments, distances between vector representations of unlabeled images (represented by light, smaller circles) within the area of determined reduced coverage (larger darker circle in
[0049] For example, in some embodiments, a coverage measure can be implemented to determine at least how a selected unlabeled image in an embedding space, if labeled, will improve the coverage of at least one unlabeled test image (example) in the embedding space. In some embodiments, the coverage measure of the present principles of how well a training set
covers test set
can be determined according to equation two (2), which follows:
depicts a set of current training examples, already selected and labeled, i depicts the i-th example,
depicts a set of unlabeled test examples from the target domain, {x.sub.k} depicts a set of unlabeled examples that active learning is selecting from, v(.Math.) depicts the embedding function, and <.Math.,.Math.> depicts the similarity function. In some embodiments of the present principles, a cosine similarity:
is implemented for the distance measurements. Alternatively or in addition, in some embodiments, an inverse of Euclidean distance can also be used: <a, b>=ab.sub.2.
[0050] The coverage measure of the present principles evaluates, for every unlabeled test example from the target domain
what is the maximum embedding similarity across all training examples (or from a distance point of view, minimum distance). The log-sum-exp structure implements a soft maximum over similarities.
[0051] In some embodiments of the present principles, a temperature parameter T can be used to control how soft the maximum is. For small T, high similarity examples will contribute exponentially more. For large T, high similarity examples will contribute linearly proportional to the similarity value. Empirically, T can typically be set to 0.1 for cosine similarity.
[0052] It should be noted that certain few-shot learning algorithms, e.g. Prototypical Networks, also implement a temperature T on cosine similarity (often called scaled cosine distance s.Math.<v.sub.0, v.sub.1>, in which s is equivalent to 1/T), and in few-shot learning T can be optimized through cross validation. However, in the active learning of the present principles, cross validation was not possible. However, in some embodiments, a same temperature T that's optimal in few-shot learning was implemented. In some embodiments of the preset principles, an unlabeled example x.sub.k that maximizes an improvement in coverage measure is selected according to the equations, which follow:
[0056] In some embodiments, the temperature T variable is not implemented in the calculations of the present principles, and, as such, is removed from the equations above.
[0057] In some embodiments, the active learning of the present principles can include a selection of multiple unlabeled images that maximize coverage. In such embodiments, a greedy algorithm can be used in which in each iteration (e.g. iteration t) the example
with maximum improvement is selected. Then
can be added and an improvement computation can be run again and example
can be subsequently selected.
[0058] Referring back to the embodiment of
[0059] In accordance with the present principles, once at least one unlabeled class data representation, such as an unlabeled image representation, is labeled, the pretrained machine learning model can be adapted for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation. In some embodiments of the present principles, a labeling module of the present principles, such as the labeling module 125 of
[0060] In some embodiments of the present principles, an embedding space associated with an adapted, pretrained machine learning model, such as the embedding space 200, is reanalyzed to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space. In some embodiments, the embedding space of the pretrained machine learning model can be reanalyzed by the distance measurement module 120 of the active learning training system 100 of
[0061]
[0062] At 404, the embedding space is analyzed to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space. The method 400 can proceed to 406.
[0063] At 406, the selected at least one unlabeled class data representation is labeled. The method 400 can proceed to 408.
[0064] At 408, the pretrained machine learning model is adapted for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation. The method 400 can be exited.
[0065] In some embodiments, the machine learning model comprises an image classifier, the embedded labeled class data and unlabeled class data comprise labeled image representations and unlabeled image representations, and the target class data comprises unlabeled test image representations.
[0066] In some embodiments, at least one unlabeled class data representation that maximizes a coverage score in the embedding space for the at least one unlabeled target class data representation is selected for labeling.
[0067] In some embodiments the method further includes determining an area of coverage for each labeled class data representation in the embedding space, determining an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations, and selecting, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
[0068] In some embodiments, the method further includes reanalyzing the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
[0069] In some embodiments, the selected at least one unlabeled class data representation is labeled with the assistance of a human. In some embodiments, the selected at least one unlabeled class data representation is labeled using a machine learning model.
[0070] In some embodiments, an apparatus for adapting a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained includes a processor and a memory accessible to the processor. In such embodiments, the memory has stored therein at least one of programs or instructions which when executed by the processor configures the apparatus to embed or project a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, the embeddings space including embedded, respective vector representations of labeled class data and unlabeled class data, analyze the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space, label the selected at least one unlabeled class data representation, and adapt the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation.
[0071] In some embodiments, the apparatus is further configured to determine an area of coverage for each labeled class data representation in the embedding space, determine an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations, and select, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
[0072] In some embodiments, the apparatus is further configured to reanalyze the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
[0073] In some embodiments, a non-transitory computer readable storage medium has stored thereon instructions that when executed by a processor perform a method for adapting a pretrained machine learning model using active learning for improved task performance in a target domain in which the pretrained model was not originally trained. In some embodiments, the method includes embedding or projecting a vector representation of at least one unlabeled target class data in an embedding space associated with the pretrained machine learning model, the embeddings space including embedded, respective vector representations of labeled class data and unlabeled class data, analyzing the embedding space to select, for labeling, at least one unlabeled class data representation based on a distance measurement in the embedding space that identifies an unlabeled class data representation that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space, labeling the selected at least one unlabeled class data representation, and adapting the pretrained machine learning model for improved task performance in a domain of the at least one unlabeled target class data for which coverage was improved by retraining the pretrained machine learning model using the labeled at least one unlabeled class data representation.
[0074] In some embodiments, the method further includes determining an area of coverage for each labeled class data representation in the embedding space, determining an area in the embedding space having limited coverage by at least one labeled class data representation for at least one of the at least one unlabeled class data representations, and selecting, for labeling, at least one unlabeled class data representation in the area of limited coverage based on a distance measurement in the embedding space that identifies an unlabeled class data representation in the limited coverage area that, if labeled, improves a coverage for at least one unlabeled target class data representation in the embedding space.
[0075] In some embodiments, the method further includes reanalyzing the embedding space to select, for labeling, at least one unlabeled class data representation using the newly labeled class data representation that was previously unlabeled in the embedding space.
[0076] Experiments were conducted using an active learning training system of the present principles, such as the active learning training system 100 of
[0077]
[0078]
[0079] Embodiments of the present principles can be implemented in any technical applications in which machine learning systems, such as image classification and object detections system can be applied including, but not limited to healthcare industries to, for example, assist in identifying diseases and abnormalities, security industries for, for example, feature detection of humans and objects, data analytics industries to identify documents, autonomous driving vehicles industries to, for example, map environments, and may more.
[0080] As depicted in
[0081] For example,
[0082] In the embodiment of
[0083] In different embodiments, the computing device 700 can be any of various types of devices, including, but not limited to, a personal computer system, desktop computer, laptop, notebook, tablet or netbook computer, mainframe computer system, handheld computer, workstation, network computer, a camera, a set top box, a mobile device, a consumer device, video game console, handheld video game device, application server, storage device, a peripheral device such as a switch, modem, router, or in general any type of computing or electronic device.
[0084] In various embodiments, the computing device 700 can be a uniprocessor system including one processor 710, or a multiprocessor system including several processors 710 (e.g., two, four, eight, or another suitable number). Processors 710 can be any suitable processor capable of executing instructions. For example, in various embodiments processors 710 may be general-purpose or embedded processors implementing any of a variety of instruction set architectures (ISAs). In multiprocessor systems, each of processors 710 may commonly, but not necessarily, implement the same ISA.
[0085] System memory 720 can be configured to store program instructions 722 and/or data 732 accessible by processor 710. In various embodiments, system memory 720 can be implemented using any suitable memory technology, such as static random-access memory (SRAM), synchronous dynamic RAM (SDRAM), nonvolatile/Flash-type memory, or any other type of memory. In the illustrated embodiment, program instructions and data implementing any of the elements of the embodiments described above can be stored within system memory 720. In other embodiments, program instructions and/or data can be received, sent or stored upon different types of computer-accessible media or on similar media separate from system memory 720 or computing device 700.
[0086] In one embodiment, I/O interface 730 can be configured to coordinate I/O traffic between processor 710, system memory 720, and any peripheral devices in the device, including network interface 740 or other peripheral interfaces, such as input/output devices 750. In some embodiments, I/O interface 730 can perform any necessary protocol, timing or other data transformations to convert data signals from one component (e.g., system memory 720) into a format suitable for use by another component (e.g., processor 710). In some embodiments, I/O interface 730 can include support for devices attached through various types of peripheral buses, such as a variant of the Peripheral Component Interconnect (PCI) bus standard or the Universal Serial Bus (USB) standard, for example. In some embodiments, the function of I/O interface 730 can be split into two or more separate components, such as a north bridge and a south bridge, for example. Also, in some embodiments some or all of the functionality of I/O interface 730, such as an interface to system memory 720, can be incorporated directly into processor 710.
[0087] Network interface 740 can be configured to allow data to be exchanged between the computing device 700 and other devices attached to a network (e.g., network 790), such as one or more external systems or between nodes of the computing device 700. In various embodiments, network 790 can include one or more networks including but not limited to Local Area Networks (LANs) (e.g., an Ethernet or corporate network), Wide Area Networks (WANs) (e.g., the Internet), wireless data networks, some other electronic data network, or some combination thereof. In various embodiments, network interface 740 can support communication via wired or wireless general data networks, such as any suitable type of Ethernet network, for example; via digital fiber communications networks; via storage area networks such as Fiber Channel SANs, or via any other suitable type of network and/or protocol.
[0088] Input/output devices 750 can, in some embodiments, include one or more display terminals, keyboards, keypads, touchpads, scanning devices, voice or optical recognition devices, or any other devices suitable for entering or accessing data by one or more computer systems. Multiple input/output devices 750 can be present in computer system or can be distributed on various nodes of the computing device 700. In some embodiments, similar input/output devices can be separate from the computing device 700 and can interact with one or more nodes of the computing device 700 through a wired or wireless connection, such as over network interface 740.
[0089] Those skilled in the art will appreciate that the computing device 700 is merely illustrative and is not intended to limit the scope of embodiments. In particular, the computer system and devices can include any combination of hardware or software that can perform the indicated functions of various embodiments, including computers, network devices, Internet appliances, PDAs, wireless phones, pagers, and the like. The computing device 700 can also be connected to other devices that are not illustrated, or instead can operate as a stand-alone system. In addition, the functionality provided by the illustrated components can in some embodiments be combined in fewer components or distributed in additional components. Similarly, in some embodiments, the functionality of some of the illustrated components may not be provided and/or other additional functionality can be available.
[0090] The computing device 700 can communicate with other computing devices based on various computer communication protocols such a Wi-Fi, Bluetooth. (and/or other standards for exchanging data over short distances includes protocols using short-wavelength radio transmissions), USB, Ethernet, cellular, an ultrasonic local area communication protocol, etc. The computing device 700 can further include a web browser.
[0091] Although the computing device 700 is depicted as a general-purpose computer, the computing device 700 is programmed to perform various specialized control functions and is configured to act as a specialized, specific computer in accordance with the present principles, and embodiments can be implemented in hardware, for example, as an application specified integrated circuit (ASIC). As such, the process steps described herein are intended to be broadly interpreted as being equivalently performed by software, hardware, or a combination thereof.
[0092]
[0093] In the network environment 800 of
[0094] In some embodiments, a user can implement an active learning training system of the present principles in the computer networks 806 to adapt a pretrained machine learning model for active learning for, for example, few-shot learning in accordance with the present principles. Alternatively or in addition, in some embodiments, a user can implement an active learning training system of the present principles in the cloud server/computing device 812 of the cloud environment 810 in accordance with the present principles. For example, in some embodiments it can be advantageous to perform processing functions of the present principles in the cloud environment 810 to take advantage of the processing capabilities and storage capabilities of the cloud environment 810. In some embodiments in accordance with the present principles, a system for adapting a pretrained machine learning model for active learning for, for example, few-shot learning can be located in a single and/or multiple locations/servers/computers to perform all or portions of the herein described functionalities of a system in accordance with the present principles. For example, in some embodiments components of an active learning training system of the present principles, such as the embedding module 110, the distance measurement module 120, and the labeling module 125 of the active learning training system 100 of
[0095] Those skilled in the art will also appreciate that, while various items are illustrated as being stored in memory or on storage while being used, these items or portions of them can be transferred between memory and other storage devices for purposes of memory management and data integrity. Alternatively, in other embodiments some or all of the software components can execute in memory on another device and communicate with the illustrated computer system via inter-computer communication. Some or all of the system components or data structures can also be stored (e.g., as instructions or structured data) on a computer-accessible medium or a portable article to be read by an appropriate drive, various examples of which are described above. In some embodiments, instructions stored on a computer-accessible medium separate from the computing device 700 can be transmitted to the computing device 700 via transmission media or signals such as electrical, electromagnetic, or digital signals, conveyed via a communication medium such as a network and/or a wireless link. Various embodiments can further include receiving, sending or storing instructions and/or data implemented in accordance with the foregoing description upon a computer-accessible medium or via a communication medium. In general, a computer-accessible medium can include a storage medium or memory medium such as magnetic or optical media, e.g., disk or DVD/CD-ROM, volatile or non-volatile media such as RAM (e.g., SDRAM, DDR, RDRAM, SRAM, and the like), ROM, and the like.
[0096] The methods and processes described herein may be implemented in software, hardware, or a combination thereof, in different embodiments. In addition, the order of methods can be changed, and various elements can be added, reordered, combined, omitted or otherwise modified. All examples described herein are presented in a non-limiting manner. Various modifications and changes can be made as would be obvious to a person skilled in the art having benefit of this disclosure. Realizations in accordance with embodiments have been described in the context of particular embodiments. These embodiments are meant to be illustrative and not limiting. Many variations, modifications, additions, and improvements are possible. Accordingly, plural instances can be provided for components described herein as a single instance. Boundaries between various components, operations and data stores are somewhat arbitrary, and particular operations are illustrated in the context of specific illustrative configurations. Other allocations of functionality are envisioned and can fall within the scope of claims that follow. Structures and functionality presented as discrete components in the example configurations can be implemented as a combined structure or component. These and other variations, modifications, additions, and improvements can fall within the scope of embodiments as defined in the claims that follow.
[0097] In the foregoing description, numerous specific details, examples, and scenarios are set forth in order to provide a more thorough understanding of the present disclosure. It will be appreciated, however, that embodiments of the disclosure can be practiced without such specific details. Further, such examples and scenarios are provided for illustration, and are not intended to limit the disclosure in any way. Those of ordinary skill in the art, with the included descriptions, should be able to implement appropriate functionality without undue experimentation.
[0098] References in the specification to an embodiment, etc., indicate that the embodiment described can include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is believed to be within the knowledge of one skilled in the art to affect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly indicated.
[0099] Embodiments in accordance with the disclosure can be implemented in hardware, firmware, software, or any combination thereof. Embodiments can also be implemented as instructions stored using one or more machine-readable media, which may be read and executed by one or more processors. A machine-readable medium can include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computing device or a virtual machine running on one or more computing devices). For example, a machine-readable medium can include any suitable form of volatile or non-volatile memory.
[0100] In addition, the various operations, processes, and methods disclosed herein can be embodied in a machine-readable medium and/or a machine accessible medium/storage device compatible with a data processing system (e.g., a computer system), and can be performed in any order (e.g., including using means for achieving the various operations). Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense. In some embodiments, the machine-readable medium can be a non-transitory form of machine-readable medium/storage device.
[0101] Modules, data structures, and the like defined herein are defined as such for ease of discussion and are not intended to imply that any specific implementation details are required. For example, any of the described modules and/or data structures can be combined or divided into sub-modules, sub-processes or other units of computer code or data as can be required by a particular design or implementation.
[0102] In the drawings, specific arrangements or orderings of schematic elements can be shown for ease of description. However, the specific ordering or arrangement of such elements is not meant to imply that a particular order or sequence of processing, or separation of processes, is required in all embodiments. In general, schematic elements used to represent instruction blocks or modules can be implemented using any suitable form of machine-readable instruction, and each such instruction can be implemented using any suitable programming language, library, application-programming interface (API), and/or other software development tools or frameworks. Similarly, schematic elements used to represent data or information can be implemented using any suitable electronic arrangement or data structure. Further, some connections, relationships or associations between elements can be simplified or not shown in the drawings so as not to obscure the disclosure.
[0103] This disclosure is to be considered as exemplary and not restrictive in character, and all changes and modifications that come within the guidelines of the disclosure are desired to be protected.