SMART DATA INGESTION
20230195724 · 2023-06-22
CPC classification
G06F3/0605
PHYSICS
Abstract
A method, an apparatus, and a computer program for data ingestion. Content-related descriptors are retrieved. Each descriptor is associated with a value classification. Furthermore, a data record to be ingested is retrieved. A value classification is assigned to the data record based on the descriptors. The data record is then ingested in accordance with the assigned value classification.
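The four steps summarized above can be sketched in Python as follows. This is only a minimal illustration under stated assumptions, not the implementation of the disclosure: the names `Descriptor`, `DataRecord`, `assign_classification`, and `ingest_record` are hypothetical, descriptors are assumed to match against tags attached to a record, the classifications are taken from the hot/cold/archive example of claim 3, and the priority order and the default class are assumptions.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Descriptor:
    keyword: str         # content-related term (hypothetical: a tag on the record)
    classification: str  # associated value classification


@dataclass
class DataRecord:
    record_id: str
    tags: set[str] = field(default_factory=set)


# Assumption: when several descriptors match, the most valuable class wins.
PRIORITY = ["hot", "cold", "archive"]


def assign_classification(record, descriptors, default="archive"):
    """Assign a value classification to a record based on the descriptors."""
    matched = {d.classification for d in descriptors if d.keyword in record.tags}
    for classification in PRIORITY:
        if classification in matched:
            return classification
    return default  # assumption: unmatched records get the lowest-value class


def ingest_record(record, descriptors, sinks):
    """Classify the record, then ingest it into the sink for its class."""
    classification = assign_classification(record, descriptors)
    sinks.setdefault(classification, []).append(record.record_id)
    return classification


# Usage with made-up descriptors and records.
descriptors = [
    Descriptor("emergency_braking", "hot"),
    Descriptor("highway_cruise", "cold"),
]
sinks = {}
first = ingest_record(
    DataRecord("r1", {"highway_cruise", "emergency_braking"}), descriptors, sinks
)
second = ingest_record(DataRecord("r2"), descriptors, sinks)
```

Here `sinks` merely stands in for whatever ingestion targets an implementation provides; the point is that the classification is decided before the record is ingested.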
Claims
1. A method for ingesting data records, the method comprising: retrieving content-related descriptors, each descriptor being associated with a value classification; retrieving a data record to be ingested; assigning a value classification to the data record based on the descriptors; and ingesting the data record in accordance with the assigned value classification.
2. The method of claim 1, wherein the descriptors and the associated value classifications are obtained by analyzing usage of previously ingested data records or are specified by a user of the data records.
3. The method of claim 1, wherein the value classifications include two or more of hot, cold, and archive.
4. The method of claim 1, wherein ingesting the data record in accordance with the assigned value classification includes selecting one of a plurality of transmission channels for the data record based on the value classification.
5. The method of claim 4, wherein the transmission channels differ in bandwidth and transportation time.
6. The method of claim 1, wherein ingesting the data record in accordance with the assigned value classification includes selecting one of a plurality of storage solutions for the data record based on the value classification.
7. The method of claim 6, wherein the storage solutions differ in availability and cost.
8. A non-transitory computer readable storage medium storing instructions for ingesting data records, which, when executed by a processor of the computer, cause the computer to: retrieve content-related descriptors, each descriptor being associated with a value classification; retrieve a data record to be ingested; assign a value classification to the data record based on the descriptors; and ingest the data record in accordance with the assigned value classification.
9. The non-transitory computer readable storage medium of claim 8, wherein the descriptors and the associated value classifications are obtained by analyzing usage of previously ingested data records or are specified by a user of the data records.
10. The non-transitory computer readable storage medium of claim 8, wherein the value classifications include two or more of hot, cold, and archive.
11. The non-transitory computer readable storage medium of claim 8, wherein ingesting the data record in accordance with the assigned value classification includes selecting one of a plurality of transmission channels for the data record based on the value classification.
12. The non-transitory computer readable storage medium of claim 11, wherein the transmission channels differ in bandwidth and transportation time.
13. The non-transitory computer readable storage medium of claim 8, wherein ingesting the data record in accordance with the assigned value classification includes selecting one of a plurality of storage solutions for the data record based on the value classification.
14. The non-transitory computer readable storage medium of claim 13, wherein the storage solutions differ in availability and cost.
15. An apparatus for ingesting data records, the apparatus comprising: a descriptor retrieving unit configured to retrieve content-related descriptors, each descriptor being associated with a value classification; a data retrieving unit configured to retrieve a data record to be ingested; a data classification unit configured to assign a value classification to the data record based on the descriptors; and a data ingesting unit configured to ingest the data record in accordance with the assigned value classification.
16. The apparatus of claim 15, wherein the descriptors and the associated value classifications are obtained by analyzing usage of previously ingested data records or are specified by a user of the data records.
17. The apparatus of claim 15, wherein the value classifications include two or more of hot, cold, and archive.
18. The apparatus of claim 15, wherein for ingesting the data record in accordance with the assigned value classification, the data ingesting unit is configured to select one of a plurality of transmission channels for the data record based on the value classification.
19. The apparatus of claim 18, wherein the transmission channels differ in bandwidth and transportation time.
20. The apparatus of claim 15, wherein for ingesting the data record in accordance with the assigned value classification, the data ingesting unit is configured to select one of a plurality of storage solutions for the data record based on the value classification.
21. The apparatus of claim 20, wherein the storage solutions differ in availability and cost.
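The channel and storage selection recited in claims 4 to 7 (and in the corresponding medium and apparatus claims) can be illustrated with a short Python sketch. Everything in it is an assumption made for illustration: the class and function names, the selection rules, the 0.99 availability threshold, and all numeric values are hypothetical and are not taken from the disclosure. The sketch only shows how channels that differ in bandwidth and transportation time, and storage solutions that differ in availability and cost, might be chosen per value classification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Channel:
    name: str
    bandwidth_mbit_s: float  # sustained throughput
    transport_hours: float   # fixed delay before the data arrives


@dataclass(frozen=True)
class Storage:
    name: str
    availability: float      # fraction of time the data is readable
    cost_per_tb: float       # arbitrary cost units


def delivery_hours(channel, size_gb):
    """Transportation time plus transfer time for a record of size_gb."""
    transfer_seconds = size_gb * 8000 / channel.bandwidth_mbit_s  # 1 GB = 8000 Mbit
    return channel.transport_hours + transfer_seconds / 3600


def select_channel(classification, size_gb, channels):
    if classification == "hot":
        # Assumed rule: hot data takes whichever channel delivers soonest.
        return min(channels, key=lambda c: delivery_hours(c, size_gb))
    # Assumed rule: other data takes the bulk (highest-bandwidth) channel.
    return max(channels, key=lambda c: c.bandwidth_mbit_s)


def select_storage(classification, storages, min_cold_availability=0.99):
    if classification == "hot":
        return max(storages, key=lambda s: s.availability)
    if classification == "cold":
        eligible = [s for s in storages if s.availability >= min_cold_availability]
        if eligible:
            return min(eligible, key=lambda s: s.cost_per_tb)
        return max(storages, key=lambda s: s.availability)
    return min(storages, key=lambda s: s.cost_per_tb)  # archive: cheapest


# Illustrative, made-up figures.
CHANNELS = [
    Channel("mobile upload", bandwidth_mbit_s=50, transport_hours=0),
    Channel("postal HDD shipment", bandwidth_mbit_s=8000, transport_hours=48),
]
STORAGES = [
    Storage("cloud storage", availability=0.999, cost_per_tb=20),
    Storage("on-premises storage", availability=0.99, cost_per_tb=10),
    Storage("tape drive storage", availability=0.9, cost_per_tb=2),
]
```

With these figures, a small hot record is uploaded directly, whereas a very large hot record may still arrive sooner by shipped disk, because the transfer time over the slow link outweighs the fixed transportation time of the shipment.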
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
DETAILED DESCRIPTION
[0040] The present description illustrates the principles of the present disclosure. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the disclosure.
[0041] All examples and conditional language recited herein are intended for educational purposes to aid the reader in understanding the principles of the disclosure and the concepts contributed by the inventor to furthering the art and are to be construed as being without limitation to such specifically recited examples and conditions.
[0042] Moreover, all statements herein reciting principles, aspects, and embodiments of the disclosure, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
[0043] Thus, for example, it will be appreciated by those skilled in the art that the diagrams presented herein represent conceptual views of illustrative circuitry embodying the principles of the disclosure.
[0044] The functions of the various elements shown in the figures may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, systems on a chip, microcontrollers, read only memory (ROM) for storing software, random access memory (RAM), and nonvolatile storage.
[0045] Other hardware, conventional and/or custom, may also be included. Similarly, any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
[0046] In the claims hereof, any element expressed as a means for performing a specified function is intended to encompass any way of performing that function including, for example, a combination of circuit elements that performs that function or software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function. The disclosure as defined by such claims resides in the fact that the functionalities provided by the various recited means are combined and brought together in the manner which the claims call for. It is thus regarded that any means that can provide those functionalities are equivalent to those shown herein.
[0047]
[0048] A global positioning system (GPS) and navigation module 101 provides navigation processing and location data for the motor vehicle 10. Sensors 102 provide sensor data, which may comprise vehicle characteristic or parameter data, and may also provide environmental data pertaining to the motor vehicle 10, its interior, or its surroundings, such as temperature, humidity, and the like. Other sensors may include proximity sensors or cameras for sensing objects or traffic proximate to the motor vehicle 10. A radio/entertainment module 103 may provide data relating to audio/video media being played in the motor vehicle 10. The radio/entertainment module 103 may be integrated into or communicatively coupled to an entertainment unit configured to play AM/FM radio, satellite radio, compact discs, DVDs, digital media, streaming media, and the like. A communications module 104 allows any of the modules to communicate with each other or with external devices via a wired connection or a wireless protocol, such as LTE, 3G, Wi-Fi, Bluetooth, NFC, etc. The various modules 100-104 may be communicatively coupled to a data bus 105 for certain communication and data exchange purposes.
[0049] The motor vehicle 10 may further comprise a main processor 106 that centrally processes and controls data communication throughout the system of
[0050]
[0051]
[0052] The descriptor retrieving unit 22, the data retrieving unit 23, the data classification unit 24, and the data ingesting unit 25 may be controlled by a control unit 26. A local storage unit 27 is provided for storing data during processing. A user interface 29 may be provided for enabling a user to modify settings of the descriptor retrieving unit 22, the data retrieving unit 23, the data classification unit 24, the data ingesting unit 25, and the control unit 26. The descriptor retrieving unit 22, the data retrieving unit 23, the data classification unit 24, the data ingesting unit 25, and the control unit 26 can be embodied as dedicated hardware units. Of course, they may likewise be fully or partially combined into a single unit or implemented as software running on a processor, e.g. a CPU or a GPU.
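As one possible reading of the software variant mentioned above, the units 22 to 25 could be modeled as interchangeable callables that a control object invokes in sequence. The following Python sketch is purely illustrative: the class `IngestionApparatus`, its method `run_once`, and the record and descriptor formats are hypothetical and are not prescribed by the disclosure.

```python
from typing import Any, Callable, Dict, List


class IngestionApparatus:
    """Software stand-in for apparatus 20; each unit is an injected callable."""

    def __init__(
        self,
        retrieve_descriptors: Callable[[], Dict[str, str]],         # unit 22
        retrieve_record: Callable[[], Dict[str, Any]],              # unit 23
        classify: Callable[[Dict[str, Any], Dict[str, str]], str],  # unit 24
        ingest: Callable[[Dict[str, Any], str], None],              # unit 25
    ) -> None:
        self.retrieve_descriptors = retrieve_descriptors
        self.retrieve_record = retrieve_record
        self.classify = classify
        self.ingest = ingest
        self.local_storage: List[Dict[str, Any]] = []  # stands in for unit 27

    def run_once(self) -> str:
        """Role of control unit 26: run the units in order for one record."""
        descriptors = self.retrieve_descriptors()
        record = self.retrieve_record()
        self.local_storage.append(record)  # hold the record during processing
        classification = self.classify(record, descriptors)
        self.ingest(record, classification)
        self.local_storage.remove(record)
        return classification


# Usage with trivial placeholder units and made-up data.
ingested = []
apparatus = IngestionApparatus(
    retrieve_descriptors=lambda: {"lane_change": "hot"},
    retrieve_record=lambda: {"id": "r1", "tags": ["lane_change"]},
    classify=lambda rec, desc: next(
        (desc[t] for t in rec["tags"] if t in desc), "cold"
    ),
    ingest=lambda rec, cls: ingested.append((rec["id"], cls)),
)
result = apparatus.run_once()
```

Passing the units in as callables keeps each one replaceable, which mirrors the statement that the units may be separate, combined, or implemented in software.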
[0053] A block diagram of a second embodiment of an apparatus 30 according to the present principles for data ingestion is illustrated in
[0054] The processing device 32 as used herein may include one or more processing units, such as microprocessors, digital signal processors, or a combination thereof.
[0055] The local storage unit 27 and the memory device 31 may include volatile and/or non-volatile memory regions and storage devices such as hard disk drives, optical drives, and/or solid-state memories.
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062] It is to be understood that, while some of the constituent system components and method steps depicted in the accompanying figures are preferably implemented in software, the actual connections between the system components (or the process steps) may differ depending upon the manner in which the proposed method and apparatus are programmed. Given the teachings herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the proposed method and apparatus.
[0063] The disclosure is not restricted to the exemplary embodiments described above. There is scope for many different adaptations and developments that are also considered to belong to the disclosure.
REFERENCE NUMERALS
[0064] 10 Motor vehicle
[0065] 100 Engine/transmission module
[0066] 101 Global positioning system and navigation module
[0067] 102 Sensors
[0068] 103 Radio/entertainment module
[0069] 104 Communications module
[0070] 105 Data bus
[0071] 106 Main processor
[0072] 107 Storage
[0073] 108 Digital signal processor
[0074] 109 Display
[0075] 110 Input/output module
[0076] 20 Apparatus
[0077] 21 Input
[0078] 22 Descriptor retrieving unit
[0079] 23 Data retrieving unit
[0080] 24 Data classification unit
[0081] 25 Data ingesting unit
[0082] 26 Control unit
[0083] 27 Local storage unit
[0084] 28 Output
[0085] 29 User interface
[0086] 30 Apparatus
[0087] 31 Memory device
[0088] 32 Processing device
[0089] 33 Input
[0090] 34 Output
[0091] C Value classification
[0092] CS Cloud storage
[0093] D Descriptor
[0094] DA Data analyzer
[0095] DL Data logger
[0096] DMS Data management system
[0097] HDD Hard disk drive
[0098] OPS On-premises storage
[0099] PS Postal service
[0100] R Data record
[0101] STO Storage solution
[0102] TDS Tape drive storage
[0103] TPS Third-party server
[0104] UC1, UC2, Upload client
[0105] UC2
[0106] P1 Generation phase
[0107] P2 Valuation phase
[0108] P3 Ingestion phase
[0109] P4 Storage phase
[0110] P5 Utilization phase
[0111] S1 Retrieve content-related descriptors
[0112] S2 Retrieve data record to be ingested
[0113] S3 Assign value classification to data record
[0114] S4 Ingest data record in accordance with assigned value classification
[0115] S10 Capture data
[0116] S11 Drop corrupted data
[0117] S12 Store data records
[0118] S13 Load data records into data analyzer
[0119] S14 Request content-related descriptors
[0120] S15 Receive content-related descriptors
[0121] S16 Distribute data records according to value classification
[0122] S17 Analyze data records on the fly
[0123] S18 Store data records according to value classification
[0124] S19 Store high-bandwidth data records
[0125] S20 Upload low-bandwidth data records
[0126] S21 Receive data value information
[0127] S22 Cut data records from buffer
[0128] S23 Paste data records