METHOD AND SYSTEM FOR FACILITATING ACCESS TO RECORDED DATA
20230316743 · 2023-10-05
Inventors
Cpc classification
G06Q40/04
PHYSICS
G06F3/04842
PHYSICS
G06V10/24
PHYSICS
International classification
G06V10/24
PHYSICS
G06Q40/04
PHYSICS
G06F16/185
PHYSICS
Abstract
The present invention relates to a method and system for facilitating access to recorded data. The system comprises an interface and a processing device. The interface is arranged to receive data and the processing device is arranged to separate the received data in data subsets, compress each data subset and assign an identifier to each compressed data subset, thereby creating data units each comprising a compressed data subset and an associated identifier, the processing device further being arranged to establish an index on the basis of the assigned identifiers.
Claims
1. A system configured to access recorded data for detecting market abuses amongst market transactions, the system comprising: a processor; a communication interface; and a memory configured to store computer readable instructions that, when executed, cause the system to: receive data via the communication interface; apply a compression technique to compress the received data into compressed data: create data units from the compressed data; establish an index associated with the data units, wherein the index includes a plurality of index entries, and each index entry comprises: an identifier; and memory address parameters of memory locations that contain the data units, wherein the memory address parameters include at least a first memory address and a second memory address; search the memory, for the data units, using the established index; decompress the data units within a timing window; and generate a view that includes the decompressed data units within the timing window.
2. The system of claim 1, wherein each index entry further comprises timing parameters.
3. The system of claim 2, wherein the data units are decompressed within the timing window using the timing parameters.
4. The system of claim 2, wherein the first memory address is a starting memory address, the second memory address is an ending memory address, and the timing parameters include a start time parameter and an end time parameter.
5. The system of claim 1, wherein the view is generated in near real time.
6. The system of claim 1, wherein the index indicates a physical memory location, in the memory, of each data unit.
7. The system of claim 1, wherein the memory includes a first memory and a second memory, and the system is further caused to search in both the first memory, for real-time analysis, and the second memory, for historical analysis, using the established index.
8. The system of claim 1, wherein the compression technique utlizes a software library associated with any of zlib, bzip2, or snappy.
9. The system of claim 1, wherein the received data is separated into data subsets.
10. The system of claim 9, wherein the data subsets are configurable in size such that larger subsets translate to faster scan and smaller index while smaller subsets translate to slower scan and larger index.
11. A non-transitory computer readable storage medium configured to store computer readable instructions that, when executed by a processor of an information processing apparatus, cause the information processing apparatus to provide execution comprising: applying a compression technique to received date to compress the received data into compressed data; creating data units from the compressed data; establishing an index associated with the data units, wherein the index includes a plurality of index entries, and each index entry comprises: an identifier; and memory address parameters of memory locations that contain the data units; searching the memory, for the data units, using the established index; decompressing the data units within a timing window; and generating a view that includes the decompressed data units within the timing window.
12. The non-transitory computer readable storage medium of claim 11, wherein the memory address parameters include at least a first memory address and a second memory address.
13. The non-transitory computer readable storage medium of claim 12, wherein each index entry further comprises timing parameters.
14. The non-transitory computer readable storage medium of claim 13, wherein the first memory address is a starting memory address, the second memory address is an ending memory address, and the timing parameters include a start time parameter and an end time parameter.
15. The non-transitory computer readable storage medium of claim 11, wherein the memory includes a first memory and a second memory, and the system is further caused to search in both the first memory, for real-time analysis, and the second memory, for historical analysis, using the established index.
16. A method for accessing recorded data, the method comprising: at an information processing system having a processor and a memory: applying a compression technique to received date to compress the received data into compressed data; creating data units from the compressed data; establishing an index associated with the data units, wherein the index includes a plurality of index entries, and each index entry comprises: an identifier; and memory address parameters of memory locations that contain the data units; searching the memory, for the data units, using the established index; and decompressing the data units within a timing window.
17. The method of claim 16, further comprising: generating a view that includes the decompressed data units within the timing window.
18. The method of claim 16, wherein the memory address parameters include at least a first memory address and a second memory address.
19. The method of claim 18, wherein each index entry further comprises timing parameters.
20. The method of claim 19, wherein the first memory address is a starting memory address, the second memory address is an ending memory address, and the timing parameters include a start time parameter and an end time parameter.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0022] The invention is now described, by way of example, with reference to the accompanying drawings, in which:
[0023]
[0024]
[0025]
[0026]
[0027]
DETAILED DESCRIPTION
[0028] The invention will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided by way of example so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
[0029]
[0030] The processing device 3 is typically embodied in the form of a specific- or general-purpose computer comprising a processor 6 arranged to execute program code downloaded to a suitable storage area associated with the processor, such as a RAM, a Flash memory or a hard disk. In an embodiment of the present invention, the processor is arranged to at least partly carry out the method according to the present invention when appropriate program code comprising computer-executable components is downloaded to the memory and executed by the processor. The processor may alternatively be embodied in the form of an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a complex programmable logic device (CPLD), etc.
[0031] Typically, large amounts of data are received via the interface, and to subsequently be able to locate and process the data in a swift manner, the processing device 2 is arranged to separate the received data in a plurality of data subsets, compress each data subset and assign an identifier to each compressed data subset, whereby data units each comprising a compressed data subset and an associated identifier are created.
[0032] This is illustrated by means of
[0033] With reference to
[0034] As can be seen, both the primary memory 4 and the secondary memory 5 are depicted in
[0035] However, if a fully flexible system is to be provided, a primary as well as a secondary memory is included in the system. The primary memory 4 is typically located physically close to the processing device 3. The secondary memory 5 could be located close to the processing device, but could alternatively be remotely arranged from the processing device.
[0036] The flow chart of
[0037] With further reference to the flow chart of Figure, in step S207 it is determined whether data units should be transferred from the primary memory to the secondary memory. This is not necessarily dictated by the primary memory utilization degree reaching a certain predetermined level, but could alternatively be controlled by a data transfer instruction executed by the processing device, possible as a consequence of an operator requesting the data transfer. As previously has been mentioned, data units could be written directly to the secondary memory 5 from the processing device 3, without passing through the primary memory 4.
[0038] As was discussed in connection to
[0039] Further, the index is advantageously arranged to comprise timestamps for the data units, thereby indicating when the market transaction data was received via the system interface. For instance, a complete order book view can be provided at any given time of the day without loading and/or processing all transactions prior to window of interest. In this particular example, the index is embodied as “start_trans_time, end_trans_time, start_address, end_address”. Thus, by searching the index file for a particular time interval, for instance all transactions made between 11:02 and 11:04 a given day, the market supervisor will be given the physical location to data recorded during the required time interval, which subsequently can be decompressed and analysed.
[0040] Even though the invention has been described with reference to specific exemplifying embodiments thereof, many different alterations, modifications and the like will become apparent for those skilled in the art. The described embodiments are therefore not intended to limit the scope of the invention, as defined by the appended claims.