METHOD AND SYSTEM FOR CALCULATING ORIGIN-DESTINATION MATRICES EXPLOITING MOBILE COMMUNICATION NETWORK DATA
20230199513 · 2023-06-22
Assignee
Inventors
- Massimo COLONNA (Torino, IT)
- Piero LOVISOLO (Torino, IT)
- Dario PARATA (Torino, IT)
- Massimiliano PETRA (Torino, IT)
Cpc classification
H04W4/021
ELECTRICITY
International classification
Abstract
Matrices of movement in a geographic area are computed. The method subdivides the geographic area into geographic zones. Each entry of a matrix includes an indication of movements from one geographic zone to another in the respective time slot. The geographic area of interest is subdivided into geographic area portions, and mobile communication terminals in the geographic area calculate and provide respective geographic position estimates to a data processing system. An overall number of geographic position estimates are distributed by assigning to each geographic area portion a respective number of geographic position estimates. Covered geographic portions are determined among said plurality of geographic area portions, and each one of the determined covered geographic area portions is assigned a respective weight. A correspondence map establishing a correspondence between the network cells and the geographic zones is generated, and the weights are used in computing the matrices.
Claims
1. A method, implemented by a data processing system, for computing Origin-Destination matrices indicative of movements, in a geographic area of interest, of physical entities being users of mobile communication terminals configured to be adapted to interact with a mobile communication network comprising a plurality of network cells covering said geographic area of interest, each Origin-Destination matrix being related to a respective time slot of an observation time period and comprising a plurality of entries, the method comprising: subdividing said geographic area of interest in a plurality of geographic zones, each entry of an Origin-Destination matrix to be computed containing an indication of a number of movements from one geographic zone to another geographic zone of said plurality of geographic zones in said respective time slot; subdividing the geographic area of interest in a plurality of geographic area portions; causing mobile communication terminals served by the mobile communication network and situated in the geographic area of interest to calculate respective geographic position estimates and provide the calculated geographic position estimates to the data processing system; distributing an overall number of geographic position estimates received from the mobile communication terminals, on said geographic area portions by assigning to each geographic area portion of said plurality of geographic area portions a respective number of geographic position estimates corresponding to geographic positions estimates falling within said geographic area portion; for each of said plurality of network cells: determining, among said plurality of geographic area portions, covered geographic area portions falling within a coverage area of said network cell, and assigning to each one of the determined covered geographic area portions a respective weight which depends on said respective number of geographic position estimates compared to an overall number of geographic position estimates falling within all said determined covered geographic area portions; generating a correspondence map establishing a correspondence between said network cells and said geographic zones, said generating the correspondence map comprising, for the considered network cell, calculating a cell weight of the network cell on each geographic zone of said plurality of geographic zones by determining the covered geographic area portions belonging to each of said zones and summing the respective weights of the covered geographic area portions identified as belonging to said each of said geographic zones; receiving from the mobile communication network and storing in a repository indications of events of interaction between the mobile communication terminals and the network cells of the mobile communication network, each indication of events of interaction including an indication of the network cell in which the event of interaction occurred and a time indication of the occurred event of interaction; for each of said mobile communication terminals: extracting from the repository and analysing the respective indications of events of interaction stored in the repository; for selected couples of indications of events of interactions consecutive in time, extracted from the repository identifying the respective network cells in which the two events of interaction occurred, wherein the network cell in which the first in time event of interaction of the said couple of indications of events of interactions occurred is considered as origin network cell of a movement and the network cell in which the second in time event of interaction of said couple of indications of events of interactions is considered as destination network cell of a movement, wherein said identifying the respective network cells is based on said indication of the network cell in which the event of interaction occurred included in the indication of events of interaction stored in the repository; identifying as origin geographic zones of said plurality of geographic zones where the mobile communication terminal was located when the first in time event of interaction of the couple occurred those geographic zones in respect of which the cell weight of the network cell associated with the first in time event of interaction of said couple is greater than a predetermined minimum weight; identifying as destination geographic zones of said plurality of geographic zones where the mobile communication terminal was located when the second in time event of interaction of the couple occurred those geographic zones in respect of which the cell weight of the network cell associated with the second in time event of interaction of said couple is greater than a predetermined minimum weight, assigning to each movement from one of the identified origin geographic zones to one of the identified destination geographic zones a respective movement weight calculated as a function of the weight of the origin network cell on said origin geographic zone and of the weight of the destination network cell on said destination geographic zone; assigning to the entry of the Origin-Destination matrix corresponding to said one of the origin geographic zones and said one of the destination geographic zones a value determined on the basis of said movement weight, and providing said Origin-Destination matrices in output from said data processing system for the use, by a user of the data processing system, for estimating movements, in a geographic area of interest, of physical entities.
2. The method of claim 1, wherein said indications of events of interaction between the mobile communication terminals and the network cells of the mobile communication network includes indications about one or more among: interactions at the power-on and/or power-off of the mobile communication terminals; interactions at location area updates; interactions at the re-entrance of the mobile communication terminals into an area covered by the mobile communication network from an area not covered by the mobile communication network; interactions at the switch of the mobile communication terminal from a mobile network technology to another; interactions at incoming/outgoing phone calls; interactions at sending/receiving SMS and/or MMS; interactions at Internet access.
3. The method of claim 1, wherein said causing mobile communication terminals served by the mobile communication network and situated in the geographic area of interest to calculate respective geographic position estimates and provide the calculated geographic position estimates to the data processing system comprises selecting, among all the mobile communication terminals served by the mobile communication network and situated in the geographic area of interest, a subset of selected mobile communication terminals according to a selection criterion comprising at least one among: mobile communication terminals of users which have given their consent to activate their geographic localization; mobile communication terminals of users which have given their consent to take part to measurement campaigns to be performed for purpose of calculating presences of individuals on a geographic territory; mobile communication terminals of users who, at subscription, have declared a residence in a certain area inside the geographic area of interest; mobile communication terminals of users in respect of whom the operator of the mobile communication network has determined a residence or a working location in a certain area inside the geographic area of interest.
4. The method of claim 3, comprising configuring the subset of selected mobile communication terminals, said configuring comprising providing to the selected mobile communication terminals at least one among: an indication of a time interval during which the selected mobile communication terminals will have to calculate the respective geographic position estimates; an indication of a periodicity with which the selected mobile communication terminals will have to calculate the respective geographic position estimates; an indication of whether the selected mobile communication terminals will have to provide to the data processing system the respective calculated geographic position estimates immediately after the respective geographic position estimates have been calculated or the respective calculated geographic position estimates can be provided to the data processing system at a later time after they have been calculated.
5. The method of claim 1, wherein determining covered geographic area portions of said plurality of geographic area portions falling within a coverage area of said network cell comprises: obtaining a coverage area of said network cell; determining which of said plurality of geographic area portions belong to the coverage area of said network cell, wherein said determining comprises at least one among: assessing which of said plurality of geographic area portions have geographic coordinates of a center of gravity thereof falling inside the coverage area of said network cell; assessing which of said plurality of geographic area portions fall, for a given percentage of their area, inside the coverage area of the network cell.
6. The method of claim 1, wherein said determining the covered geographic area portions belonging to each of said zones comprises one among: assessing which of said geographic area portions have geographic coordinates of a center of gravity thereof falling inside the zone; assessing which of said geographic area portions fall, for a given percentage of their area, inside the zone.
7. The method of claim 1, wherein said distributing an overall number of geographic position estimates received from the mobile communication terminals on said geographic area portions and said generating a correspondence map are performed for each time slot of said observation time period, whereby said cell weight of the network cell provides an indication of a probability that a user of a mobile communication terminal that is located in the coverage area of the network cell is, in the considered time slot, located in a specific zone of said plurality of zones.
8. A data processing system comprising modules configured to perform, when operated, the method of claim 1.
Description
BRIEF DESCRIPTION OF THE ANNEXED DRAWINGS
[0085] These and other features and advantages of the solution according to the present disclosure will be better understood by reading the following detailed description of an embodiment thereof, provided merely by way of non-limitative example, to be read in conjunction with the attached drawings, wherein:
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS OF THE PRESENT DISCLOSURE
[0099] With reference to the drawings,
[0100] The system 100 is coupled with a mobile communication network 105, such as a (2G, 3G, 4G, 5G or higher/other generation) mobile telephony network, and is configured for receiving from the mobile communication network 105 positioning (geolocation) data of each UE (e.g., mobile phones, smartphones, tablet with 2G-3G-4G-5G or higher/other connectivity, etc.) of individuals located in a surveyed geographic area 107, schematized in
[0101] The mobile communication network 105 comprises a plurality of (two or more) communication stations 105a (e.g., radio base stations of the mobile telephony network) geographically distributed in the surveyed area 107. Each communication station 105a is adapted to manage communications of UE (not shown, such as for example mobile phones) in one or more served areas or network cells 105b (in the example at issue, three cells are served by each communication station 105a) as will be discussed in greater detail below.
[0102] Even more generally, each communication station 105a of the mobile communication network 105 is adapted to interact with any UE located within one of the cells 105b served by such communication station 105a (e.g., interactions at UE power on/off, at location area update, at incoming/outgoing phone calls, at sending/receiving messages, like SMS and/or MIMS, at Internet access etc.). Such interactions between the UE and the mobile communication network 105 will be generally denoted as events e.sub.v (v=1, . . . , V; V>0) in the following.
[0103] The surveyed geographic area 107 may be regarded as subdivided in a plurality of sectors, each one corresponding to a respective cell 105b of the (part of the) mobile communication network 105 superimposed on the surveyed area 107.
[0104] The system 100 comprises a Computation Engine 110 configured to be adapted to process data retrieved from the mobile communication network 105, and a Repository 115 (such as a database, a file system, etc.) configured to be adapted to store data regarding interactions between the UE and the mobile communication network 105. The Repository can also be used to store computation results generated by the Computation Engine 110 and, possibly, any processing data generated by and/or provided to the system 100 (generally in a binary format).
[0105] The data stored in the Repository 115 include data about the coverage area (coverage area data) of the cells 105b of the mobile communication network 105. For the purposes of the present disclosure, the way in which the coverage area data are expressed is not relevant: the coverage area of a cell can indifferently be expressed by way of the geographic coordinates of the center and the radius of a circle defining the cell coverage area or by way of areas delimited by polygons, regular or irregular, with any number of sides, or by way of the union of square pixels, joint or separated. Additionally, the present disclosure does not depend on the way adopted for the computation of the cells' coverage area, so that it is possible to exploit, indifferently, both simplified electromagnetic field propagation models (such as those described in Theodore S. Rappaport, “Wireless Communications”, Prentice Hall, 1996, Chapter 3 and Chapter 4, pages 69-196) and mobile network planning tools normally exploited by mobile communication network operators. Moreover, the coverage of the network cells can indifferently be calculated specifically, ad-hoc for the system 100 before the start of the operation of the system 100, or be coverage area data already available to the operator of the mobile communication network 105 for the network planning.
[0106] The system 100 is provided with an Administrator Interface (A.I.) 120 (e.g., implemented in a computer) configured and operable for modifying parameters and/or algorithms used by the computation engine 110 and/or accessing data stored in the repository 115.
[0107] The system 100 may comprise one or more User Interfaces (U.I.) 125 (e.g., a user terminal, or a software running on a remote terminal connected to the system 100) adapted to receive inputs from, and to provide output to a user of the system 100. In the present disclosure the expression “user of the system” may refer to one or more human beings and/or to external computing systems (such as a computer network, not shown) of a third party being subscriber of the services provided by the system 100 and enabled to access the system 100—e.g., under subscription of a contract with a service provider owner of the system 100, and possibly with reduced right of access to the system 100 compared to the right of access held by an administrator of the system 100 operating through the Administrator Interface 120.
[0108] The system 100 comprises a Localization Manager 130, shown in greater detail as 200 in
[0109] The Localization Manager 200 can also include a Position Repository 215 (such as a database, a file system, etc.) in which the position estimates (and related timestamps) provided by the UE are stored. The Position Repository 215 can also be physically integrated in the Repository 115, if suitable (e.g., in case the Localization Manager 200 is physically integrated with the other components of the system 100 or when the transfer of data between the Localization Manager 200 and the Repository 115 can be accomplished quickly without causing delays on the processing operations).
[0110] The (Communication Manager 205 of the) Localization Manager 200 is configured to be adapted to communicate with the UE through the mobile communication network 105. However, for the purposes of the present disclosure the specific way in which the communication between the Localization Manager 200 and the UE is accomplished, and the protocols adopted therefore, are not limitative. The dialogue between the Localization Manager 200 and the UE can for example take place through the User-plane, e.g., using communication protocols of the Internet world (like the TCP/IP), in a way totally similar to what normally happens between a terminal and a remote server, or through the Control-plane, encapsulating the messages exchanged between the Localization Manager 200 and the UE in signaling messages that apparatuses of the mobile communication network 105 exchange for performing their specific operations.
[0111] The system 100 may be implemented in any known manner; for example, the system 100 may comprise a single computer, or a network of distributed computers, either of physical type (e.g., with one or more main machines implementing the computation engine 110 and the repository 115 connected to other machines implementing Administrator and User Interfaces 120 and 125) or of virtual type (e.g., by implementing one or more virtual machines in a network of computers).
[0112] Additionally, some modules and/or functionalities of the system 100, such as the Localization Manager 200, can be part of, implemented in, the mobile communication network 105 or possibly be implemented in some apparatuses of the mobile communication network 105 or in the O&M system thereof. The different functionalities of the Localization Manager 200 can be implemented in different apparatuses, depending on the specific necessities of the mobile communication network operator and/or administrator of the system 100 (costs, availability of apparatus, available communication bandwidth for the links, etc.). For the purposes of the present disclosure, these aspects of implementation detail are not strictly relevant.
[0113] The system 100 is configured to be adapted to retrieve (and/or receive) from the mobile communication network 105 an event record er.sub.v (v=1, . . . , V; V being a positive integer) for each event e.sub.v (v=1, . . . , V) occurred between a UE and the mobile communication network 105 (through one of its communication stations 105a) within the surveyed geographic area 107. Each event record er.sub.v retrieved by the system 100 from the mobile communication network 105 may comprise—in a non-limitative manner—an identifier of the UE that is involved in the corresponding event e.sub.v (e.g., the UE identifier may be selected as one or more among the International Mobile Equipment Identity—IMEI, the International Mobile Subscriber Identity—IMSI and the Mobile Subscriber ISDN Number—MSISDN code), time data (also denoted as timestamps) indicating the time at which the corresponding event e.sub.v has occurred, and an identifier of the cell 105b involved in such event e.sub.v.
[0114] In one embodiment of the present invention, the UE identifier of the UE involved in the event record er.sub.v may be provided not in clear, i.e. as an encrypted information in order to ensure the privacy of the UE owner. In other words, if the need arises, the information (i.e., the identity of the owner of the UE corresponding to the UE identifier) may be encrypted/decrypted by implementing a suitable encryption/decryption algorithm, or a cryptographic hash function such as for example the algorithm SHA256 described in “Secure Hash Standard (SHS)”, National Institute of Standards and Technology FIPS—180-4, Mar. 6, 2012. In this way, the identity of the individual is not in clear, being replaced by a non-intelligible pseudonym.
[0115] The system 100 may retrieve (and/or receive) the event records er.sub.v related to a generic UE from the mobile communication network 105 by acquiring records of data generated and used in the mobile communication network 105 for its own purposes. For example, in case the mobile communication network 105 is a GSM network, Charging Data Records (CDRs), also known as call data records, and/or
[0116] Visitor Location Records (VLRs) may be retrieved from the mobile communication network 105 and reused as event records err. The CDR is a data record (usually used for billing purposes by a mobile telephony service provider operating through the mobile communication network 105) that contains attributes specific to a single instance of a phone call or other communication transaction performed between a UE and the mobile communication network 105. The VLRs are databases listing UE that have roamed into the jurisdiction of a Mobile Switching Center (MSC, not shown) of the mobile communication network 105, which is a management element of the mobile communication network 105 managing events over a plurality of communication stations 105a. Each communication station 105a in the mobile communication network 105 is usually associated with a respective VLR. If the mobile communication network 105 is an LTE network, records of data associated with the event records er.sub.v of a generic UE may be generated by a Mobility Management Entity, or MME, comprised in the mobile communication network 105, which is responsible for a UE tracking and paging procedure in LTE networks (where no VLR is implemented).
[0117] It should be noted that the method described in the present disclosure may be implemented by using any source of data (e.g., provided by one or more WiFi networks) from which it is possible to obtain event records er.sub.v comprising a univocal identifier of individuals (such as the UE identifier mentioned above), a position estimate of such individuals, and a time indication of an instant during which such event has occurred.
[0118] In operation, event records er.sub.v may be continuously retrieved by the system 100 from the mobile communication network 105. Alternatively, event records er.sub.v may be collected by the system 100 periodically, e.g. for a predetermined time period (e.g., every certain number of hours, or on a daily or weekly basis). For example, event records er.sub.v may be transferred from the mobile communication network 105 to the system 100 as they are generated, in a sort of “push” modality, or event records er.sub.v may be collected daily in the mobile communication network 105 and then packed and transferred to the system 100 periodically or upon request by the system 100.
[0119] The event records er.sub.v retrieved from the mobile communication network 105 are stored in the Repository 115, where they are made available to the Computation Engine 110 for processing. Event records er.sub.v generated by a same UE can be grouped together in the Repository 115, i.e. event records er.sub.v can be grouped together if they comprise a common UE identifier and can be denoted as event records group erg.sub.n (e.g., n=0, . . . , N, N≥0) hereinafter.
[0120] The computation engine 110 is configured to be adapted to execute an algorithm (described in the following) for computing O-D matrices on a geographic area. The algorithm is implemented by a software program product stored in a Memory Element 110a of the system 100, comprised in the Computation Engine 110 in the example of
[0121] The event records er.sub.v can be processed according to (as discussed in detail below) instructions provided by the system administrator (through the Administrator Interface 120), for example stored in the Repository 115, and, possibly, according to instructions provided by a user (through the User Interface 125).
[0122] The Computation Engine 110 provides the results of the processing performed on the event records er.sub.v to the user of the system 100 through the User Interface 125, and optionally stores such processing results in the Repository 115.
[0123] The system 100 may be adapted to retrieve (or receive) data about individuals not exclusively from a mobile telephony network 105. Alternatively or in addition, the system 100 may be configured to retrieve (or receive) data about individuals from one or more wireless computer networks, such as WLANs, operating in the surveyed area 107, provided that the UE of the individuals are capable to connect to such wireless computer networks and that the UE (at least a subset of all the UE located in the surveyed area 107) are capable of estimating their position by means of any geolocation technique (such as the GPS).
[0124]
[0125] The RoI 300 is a geographic region within which the O-D matrices are to be calculated by the system 100. For example, the RoI 300 may be either a district, a town, a city, or any other kind of geographic area. Moreover, the RoI 300 may comprise a number of sub-regions including non-adjacent geographical locations, such as for example a plurality of different cities, different counties and/or different nations (and so on). In addition, the RoI 300 may comprise a set of one or more predetermined locations (such as for example airports, bus/train stations, etc.).
[0126] Let it be assumed, as non-limiting example, that a traffic analysis (e.g., an analysis of people flow) over the RoI 300 is performed in order to identify movements of people and for computing O-D matrices referred to the movements of people identified by the traffic analysis.
[0127] The RoI 300 is subdivided into a plurality of traffic analysis zones, or simply zones z.sub.q (q=1, . . . , Q; where Q is a positive integer) in which it is desired to analyze traffic flows. In the example shown in
[0128] Each zone z.sub.q may be determined by using a zoning technique. According to this zoning technique, each zone z.sub.q may be delimited by administrative (city limits, national boundaries, etc.) and/or physical boundaries (such as rivers, railroads etc.) within the RoI 300 that may hinder the traffic flow and may comprise adjacent lots of a same kind (such as open space, residential, agricultural, commercial or industrial lots) which are expected to experience similar traffic flows. The zones z.sub.q may differ in size from one another. Each zone z.sub.q is modeled as if all traffic flows starting or ending therein were concentrated in a respective single point or centroid 310.sub.q (i.e., 310.sub.1, . . . , 310.sub.9) of that zone z.sub.q. In other words, the centroid 310.sub.q of the generic zone z.sub.q represents an ideal hub from which, or at which, any traffic flow starts or ends, respectively.
[0129] Anyway, it is pointed out that the solution according to embodiments of the present invention is independent from the criteria used to partition the RoI 300 into zones z.sub.q.
[0130]
[0131] The generic O-D matrix 400 is typically a square matrix having Q rows i and Q columns j. Each row and each column of the O-D matrix 400 are associated with a corresponding zone z.sub.q of the RoI 300; thus, considering the RoI 300 in the example of
[0132] Each row i of the O-D matrix 400 represents an origin zone z.sub.i for traffic flows of moving physical entities (for example land vehicles) while each column j of the O-D matrix 400 represents a destination zone z.sub.j for traffic flows of such moving physical entities. In other words, each generic element or entry mod(i,j) of the O-D matrix 400 contains an indication of the number of traffic flows starting in the zone z.sub.i (origin zone) and ending in the zone z.sub.1 (destination zone) in the corresponding time slot ts.
[0133] The main diagonal of the O-D matrix 400, which comprises the entries mod(i,j) having i=j (i.e., entries mod(i,j) having the same zone z.sub.i both as origin and destination zone), is usually left empty (e.g., with values of the entries set to 0) or the values of the main diagonal entries mod(i,j) are discarded since they do not depict a movement between zones of the region of interest (i.e., such entries do not depict a flow of people).
[0134] As outlined above, in order to obtain a more detailed and reliable traffic analysis, a predetermined observation time period TP of the traffic flows in the region of interest is also established and it is subdivided into one or more (preferably a plurality) of time slots ts.sub.m (m=1, . . . , M, where M is a positive integer). Each time slot ts.sub.m ranges from an initial time instant t.sub.0(m) to a next time instant t.sub.0(m+1) (excluded) which is the initial time instant of the next time slot ts.sub.m+1, or:
ts.sub.m=[t.sub.0(m),t.sub.0(m+1)).
[0135] Anyway, embodiments of the present invention featuring overlapping time slots are not excluded, as well as one (or more) missing time slot(s) during the predetermined observation time period TP. Also, the time slots ts.sub.m into which the observation tome period TP is subdivided may have different lengths (time duration) from one another. Additionally, the observation time period TP may include just one time slot ts.sub.m of length equal to the length of the time period TP.
[0136] For each one of the time slots ts.sub.m a respective O-D matrix 400m is computed that accounts for the people movements that have taken place during the time slot ts.sub.m. Therefore, a sequence or set of O-D matrices 500 of a number M of O-D matrices 400m, as shown in
[0137] In one embodiment of the present disclosure, values to which the observation time period TP and the time slots ts.sub.m are set may correspond to 24 hours and 1 hour, respectively. Naturally, the scope of the present disclosure is not limited by any specific values selected for the observation time period TP and the time slots ts.sub.m.
[0138] The procedure for computing the O-D matrices 400m is now described.
[0139] The ROI 300 on which the system 100 has to compute the O-D matrices 400m is completely tessellated by a tessellation 600, as shown in
[0140] The tessellation 600 of the RoI 300 is independent from the coverage of the RoI 300 by the network cells of the mobile communication network 105. The tessellation 600 is also independent from the partition on zones (number and size of the zones) of the RoI 300. In other words, there is no relationship between the tiles 605 and the network cells 105b, and between the tiles 605 and the zones z.sub.q. The tessellation 600 is for example inputted by the administrator of the system 100 through the Administrator Interface 120.
[0141] For the purposes of the present disclosure, the system 100 requests to the UE (possibly, to a subset thereof) that happens to be located in the RoI 300 to provide a periodical estimation of their position. The UE may for example exploit the GPS localization resources integrated therein for calculating their position estimates. The position estimates (hereafter also referred to as “position measurements”, since the position estimates are obtained by measuring one or more characteristics of radio signals received by the UE from the radio base stations, from GPS satellites or other GNSS, etc., depending on the localization technology adopted) are arranged as t-uples, e.g. triplets 700 of the type (UE_id, position (x,y), timestamp), depicted in
[0145] The field UE_id 705 may also be absent. If present, this field allows optimizing some performances of the system 100, as described later on. In one embodiment of the present disclosure, the value in the field UE_id 705 may be provided not in clear, e.g. as an encrypted information in order to ensure the privacy of the UE owner. In other words, if the need arises, the information (i.e., the identity of an individual corresponding to the UE whose univocal identifier is specified in the field UE_id 705) may be encrypted/decrypted by implementing a suitable encryption/decryption algorithm, or a cryptographic hash function, such as for example the algorithm SHA256 described in “Secure Hash Standard (SHS)”, National Institute of Standards and Technology FIPS—180-4, Mar. 6, 2012. In this way, the identity of the individual is not in clear, being replaced by a non-intelligible pseudonym.
[0146] The procedure implemented by the system 100 for computing the O-D matrices can be regarded as composed by two main phases, generically, but not necessarily, separated in time from each other.
[0147] In a first phase of the procedure, cells-zones correspondence maps are constructed. This first phase is based on a set of position estimates (measurements) performed by a set of UE within the RoI 300, exploiting any suitable geolocation technique available at the UE (e.g., the GPS). The measurement time period MP during which such position estimates are carried out is subdivided into time slots, for example the same time slots ts.sub.m used to subdivide the observation time period TP during which the set 500 of O-D matrices 400m is to be calculated on the RoI 300. For each time slot ts.sub.m of the measurement time period MP, a cells-zones correspondence map is calculated: thus, a number M of cells-zones correspondence maps are calculated.
[0148] In a second phase of the procedure, for each time slot ts.sub.m of the observation time period TP the movements between the zones z.sub.q into which the RoI 300 is subdivided are calculated, exploiting the cells-zones correspondence maps calculated in the same time slot ts.sub.m of the measurement time period MP. The set 500 of O-D matrices 400m is thus generated.
[0149] The term “map” used herein should be not be intended as referring to a common geographic map, rather the term shall be intended as a tool that associates one or more numerical values (obtained by processing data provided by the mobile communication network) to entities representing portions of the territory (like the tiles 605 of the tessellation 600 or the zones z.sub.q of the subdivision into zones 300).
[0150] Referring to
[0155] In greater detail, referring to
[0156] If the system 100 has to calculate a set 500 of O-D matrices 400n related to an observation time period TP included in the measurement time period MP and such computation has to be carried out in real time, the system 100 configures the UE to immediately transmit their position estimates, as soon as they are calculated (flag N/I set to I). Differently, in case the system 100 does not need to compute the set 500 of O-D matrices 400m in real time, or in case the observation time period TP is not joined to the measurement time period MP, the system 100 may configure the UE for a non-immediate transmission of their position estimates (flag N/I set to NI); this modality allows the UE to transmit two or more triplets 700 altogether, in a same block, or to exploit the establishment of communication links for other services, in both cases making the transmission more efficient, or a UE happening to be in an uncovered area can wait to return to a covered area and then transmit the position estimates.
[0157] The selection of the UE to be inserted in the set of UE which will have to calculate and provide to the system 100 their position estimates and to which the configuration parameters have to be sent, can be accomplished in different ways. One possibility is to send the configuration parameters to all the UE which happen to be within the cells 105b of the mobile communication network 105 inside the RoI 300, for example exploiting a broadcast message. Another possibility is that the administrator of the system 100, possibly in agreement with the operator of the mobile communication network 105, sends the configuration parameters only to a subset of those UE on the basis of one or more criteria, such as for example: [0158] UE of users which have given to the network operator their consent to activate their localization; [0159] UE of users which have given to the network operator or to the administrator of the system 100 their consent to take part to the measurement campaigns to be performed for the specific purposes of the system 100; [0160] UE of users which, at the subscription of the service contract with the network operator, have declared a residence in a given area (e.g., inside the RoI 300 or inside a specific zone of the RoI 300): [0161] UE of users in respect of which the network operator has determined a residence or a working location in a certain area (based on any of the known techniques, e.g. the technique described in U.S. Pat. No. 9,706,363, disclosing a method for identifying and locating at least one relevant location visited by at least one individual within a geographical area served by a wireless telecommunication network the method including: identifying and clustering events that occurred within a predefined distance from each other and having a similar probability, computing a weight value for each cluster of events identified, comparing the weight value with a threshold weight value, if the weight value is equal to, or greater than, the threshold weight value, identifying the relevant location as belonging to a selected typology of relevant location, and providing an indication of the position of the at least one relevant location based on recorded position data of the events of the cluster, or if the weight value is lower than the threshold weight value, identifying the relevant location as not belonging to the selected typology of relevant location).
[0162] The selection of the UE which will have to calculate and provide to the system 100 their position estimates is preferably made with the aim of complying with law prescriptions (e.g., concerning privacy of users) and/or to avoid unnecessary processing by the UE or the system 100 (for example, avoiding to include in the set of UE those UE which are expected to just quickly transit through the RoI 300, and which, if selected to take part to the measurement campaign, would send position estimates also when they have moved (far) away from the RoI 300). Additionally, the administrator of the system 100, based on the available hardware/software resources (CPU, RAM, etc.) of the system 100, can decide (e.g., to avoid overloading the system 100, particularly the Localization Manager 200 and the Computation Engine 110) to select, among all the possible UE, a suitably small number of UE. The UE can be selected in a random manner or based on specific requirements for the UE, for example based on the localization features and capabilities of the UE, e.g., in terms of achievable accuracy for the position estimate.
[0163] Referring to
[0164] For each general distribution map to be constructed and for each tile 605 of the tessellation 600 in respect of each general distribution map, a counter n_pos is initialized to an initial value, e.g. to 0 (zero); the counter n_pos will be used to count the number of position estimates received by the UE (of the selected set of UE) falling in said tile in the considered time slot ts.sub.m (step 1005).
[0165] Then, starting from the time instant MP.sub.start and till the time instant MP.sub.stop of the measurement time period MP during which the measurement campaign is carried out, the UE selected at step 905 estimate their own geographic position every D seconds and send the calculated position estimates (in accordance to the timing specified by the setting of the flag I/NI) to the Position Manager 210, e.g. in the form of triplets 700 as depicted in
[0166] For each time slot ts.sub.m, at the end of the same, or, alternatively, at the end of the measurement time period MP, the Position Manager 210 extracts from the Position Repository 215 all the received triplets 700 having a value in the respective field timestamp 715 included in the time slot ts.sub.m under consideration and sends the extracted triplets to the Computation Engine 110 (step 1015). In case the observation time period TP is included in the measurement time period MP and the system 100 has to process the data and create the set 500 of O-D matrices 400m in real time, the operation 1015 should start as soon as the timeslot ts.sub.m ends; otherwise the operation can be started after the end of the measurement time period MP.
[0167] The Computation Engine 110 receives from the Position Manager 210 the extracted triplets 700 and, for each received triplet, based on the geographic coordinates included in the field position (x,y) 710 thereof, it determines in which tile 605 of the tessellation 600 the considered triplet falls (step 1020). The Computation Engine 110 increases of a predetermined amount (e.g., 1) the counter n_pos corresponding to the determined tile 605 (step 1025). In the case that the geographic coordinates present in the field position (x,y) 710 of a triplet 700 under consideration are internal to the determined tile 605 there is a univocal assignment of the position of the corresponding UE to that tile (and the counter n_pos of that tile is increased of the predetermined amount); when instead the coordinates present in the field position (x,y) 710 of the triplet 700 are located on the boundary between two or more tiles 605 of the tessellation 600, there are two possibilities: 1) a first possibility is to randomly select one of the two or more concerned tiles and to increase of the predetermined amount (e.g., 1) the counter n_pos corresponding to the randomly selected tile 605; 2) another possibility is to increase the counters n_pos associated to all the two or more concerned tiles 605 of the predetermined amount (e.g., 1) divided by the number of concerned tiles 605. The choice between one or the other of these two possibilities can for example be made by the administrator of the system 100, through the Administrator Interface 120.
[0168] Once all the triplets 700 relating to the time slot ts.sub.m have been analysed by the Computation Engine 110, the general distribution map is obtained that, for the considered time slot ts.sub.m, associates, with each tile 605 of the tessellation 600, a value of the respective counter n_pos equal to the number of position estimates which have been received from the (selected) UE and which have geographic coordinates in the field position (x,y) 710 falling in said tile 605.
[0169] Then, the step 815 of construction of the cell-specific distribution maps for each time slot ts.sub.m starts. This step is schematically depicted in the flowchart of
[0170] For each network cell c.sub.1 of the mobile communication network 105 the Computation Engine 110 extracts from the Repository 115 the corresponding coverage area (step 1105) and determines all the K tiles 605 of the tessellation 600 that belong to the coverage area of the cell c.sub.1 (step 1110). One way to determine that a tile 605 belongs to the coverage area of the cell c.sub.1 is by assessing that the coordinates of the center of gravity of the tile 605 fall inside the coverage area of the cell c.sub.1, or that a given percentage (fraction) of the tile surface, for example a predetermined % value, falls inside the coverage area of the cell c.sub.1. The choice of the specific criterion/criteria for considering that a tile 605 belongs to the coverage area of the cell c.sub.1 is for example made by the administrator of the system 100 through the Administrator Interface 120.
[0171] For each one of the K tiles determined at step 1110, the Computation Engine 110 calculates a weight value weight % as the ratio between the numerical value of the counter n_pos for that tile 605 and the sum of the numerical values of the counters n_pos for all the K tiles:
weight %.sub.l,i=n_pos.sub.l,i/Σ.sub.j=1.sup.Kn_pos.sub.j (Equation 1)
with the index i=K (step 1115). The result of this operation is, for each time slot ts.sub.m, the cell-specific distribution map for the cell c.sub.1, which establishes a correspondence between each tile 605 of the tessellation 600 and a respective weight value calculated as in Equation 1. Such a weight value can be regarded as the probability that a user of a UE that is located in the coverage area of the cell c.sub.1 is, in the generic time slot ts.sub.m, specifically located in the i-th tile 605 of the tessellation 600. By repeating these operations for all the cells of the mobile communication network 105 a set of cell-specific distribution maps is obtained for every time slot ts.sub.m, one for every cell.
[0172] When the observation time period MP lasts 24 hours or, more generally, few days (the duration of the observation time period MP may depend on how often the mobile communication network operator modifies the coverage of the network) the system 100 can be configured by the system administrator in such a way that the operations 1105 and 1110 are performed only once, at the time the triplets 700 of the first time slot ts.sub.1 of the measurement time period MP are processed. The tiles 605 identified at that time are used, without changes, also during the processing performed for the other time slots from ts.sub.2 to ts.sub.M.
[0173] At this point, the step 820 of constructions of cells-zones correspondence maps (one for each time slot ts.sub.m) commences, as schematized by the procedure 1200 in the flowchart of
[0174] For each zone z.sub.q of the subdivision in zones of the RoI 300 the K′ tiles 605 of the tessellation 600 that belongs to the considered zone z.sub.q are identified (step 1205). One way to determine that a tile 605 belongs to the zone z.sub.q is by assessing that the coordinates of the center of gravity of the tile 605 fall inside the zone z.sub.q, or that a given percentage (fraction) of the tile surface, for example a predetermined % value, falls inside the zone z.sub.q. The choice of the specific criterion/criteria for considering that a tile 605 belongs to the zone z.sub.q is for example made by the administrator of the system 100 through the Administrator Interface 120. For simplifying the configuration and the management of the system 100, the specific criteria adopted (coordinates of the center of gravity of the tile 605 inside the zone z.sub.q, a given percentage of the tile 605 inside the zone z.sub.q or any other possible criterion) may be the same adopted in step 1110 for identifying the K tiles belonging to the coverage area of each cell of the mobile communication network 105.
[0175] In addition, step 1205, similarly to step 1110, can be performed just once, at the time of processing of the position estimates of the first time slot ts.sub.1 of the measurement time period MP.
[0176] Then, for each time slot ts.sub.m and for each cell c.sub.1 of the mobile communication network 105, by exploiting the cell-specific distribution map for that cell, calculated in the preceding step 815 and referring to the time slot ts.sub.m (steps 1105-1115), the weight (P.sub.l.fwdarw.q).sub.m, of the cell c.sub.1 on the zone z.sub.q is calculated by summing up the weights of the K′ tiles identified at step 1205 and obtained from the map (step 1210), i.e.:
(P.sub.l.fwdarw.q).sub.m=Σ.sub.i=1.sup.K′(weight %.sub.l,i).sub.m (Equation 2).
[0177] Such weight (P.sub.l.fwdarw.q).sub.m can be regarded as the probability that a user of a UE that is located in the coverage area of the cell c.sub.1 is, in the time slot ts.sub.m, located in the specific zone z.sub.q among all the zones of the partition in zones of the RoI 300.
[0178] By repeating this operation in every time slot ts.sub.m and for all the cells c.sub.1 of the mobile communication network 105, the cells-zones correspondence map is obtained, which is the map in which every zone z.sub.q of the subdivision in zones of the RoI 300 is associated with the total weight of each cell c.sub.1 of the mobile communication network 105 on such a zone in the time slot ts.sub.m.
[0179] The cells-zones correspondence maps, relating to the time slot ts.sub.m of the measurement time period MP, are exploited for calculating, in the corresponding time slot ts.sub.m of the observation time period TP, the zones of origin and destination of movements of UE users to be inserted in the O-D matrices to be computed. The operations of computation of the set 500 of O-D matrices 400m are schematized as 1300 in the flowchart of
[0180] Firstly, the Computation Engine 110 retrieves from the Repository 115 event records er.sub.v stored therein. The Computation Engine 110 may retrieve at a time one group erg.sub.n of event records er.sub.v generated by a same UE (step 1305).
[0181] To this purpose, the Computation Engine 110 may initialize a UE index variable n (e.g., n=0, . . . , N, N≥0) and then retrieve, e.g. sequentially, from the Repository 115 a corresponding n-th event records group erg.sub.n stored therein, related to the n-th UE.
[0182] Then, the system 100 may check if the retrieved event records group erg.sub.n and the event records contained therein satisfy data quality requirements before processing them (step 1310). The data quality requirements are designed for ensuring reliability and accuracy in the results provided by the system 100 obtained through computation involving probability and statistic computation.
[0183] As a non-limitative example, data quality requirements to be fulfilled may comprise one or more of the following: [0184] a total number of events e.sub.v, within the observation time period TP, associated with the event records group erg.sub.n is at least equal to, or greater than, a total number of events threshold; [0185] an average number of daily events e.sub.v, within the observation time period TP, associated with the event records group erg.sub.n is at least equal to, or greater than, a predetermined average number of events threshold, and/or [0186] a number of daily (or a different subinterval) events e.sub.v associated with the event records group erg.sub.n is at least equal to, or greater than, a daily (or a different subinterval) number of events threshold.
[0187] The data quality requirements may be set by the administrator of the system 100 through the Administrator Interface 120—even though alternative embodiments in which the data quality requirements may be at least partly set by the user of the system 100 through the User Interface 125 are not excluded.
[0188] In case the event records group erg.sub.n does not satisfy quality requirements, the event records group erg.sub.n is discarded and a next event records group erg.sub.n+1 (associated with another UE) is processed (unless the n-th user was already the last user having event records in the Repository 115).
[0189] The system 100 may also perform a selection among the retrieved event records group erg.sub.n in respect of the considered n-th UE. The selection made by the system 100 may for example be in accordance with the teachings of WO 2018/121842 A1, adapted to determine a mobility status of the considered UE, wherein the UE is in a motion or in a stay status. As disclosed in WO 2018/121842 A1 (to be intended as incorporated herein by reference), the determination of the mobility status can be done by determining, for each event of interaction, a served area of the mobile communication network pertaining to that UE at the occurrence of that event of interaction, and determining the mobility status of that UE at the occurrence of that event of interaction according to at least one among: a comparison between a distance between the served area and a first further served area of the mobile communication network pertaining to that UE at the occurrence of a first further event of interaction occurring before said event of interaction, and a first threshold distance depending on the served area and on the first further served area; a comparison between a distance between the served area and a second further served area of the mobile communication network pertaining to that UE at the occurrence of a second further event of interaction occurring after said event of interaction, and a second threshold distance depending on the served area and on the second further served area.
[0190] Other criteria for selecting the events records among those retrieved from the repository can be implemented, alternative or in addition to the above ones. For example, a simple decimation of the events records can be implemented, for reducing the number of events records to be processed.
[0191] In this way, selected events records are obtained.
[0192] Then, for each pair of selected event records er.sub.k and er.sub.k+1 belonging to the event records group erg.sub.n under consideration and consecutive in time, related to two consecutive timestamps t.sub.k e t.sub.k+1 within the observation time period TP, the two network cells c.sub.k and c.sub.k+1 (as indicated by the cell identifiers contained in the event records er.sub.k and er.sub.k+1) identify a movement of the UE user that origins in cell c.sub.k (origin cell) and ends in cell c.sub.k+1 (destination cell). The two cells c.sub.k and c.sub.k+1 (origin cell and destination cell) have to be associated to the zones z.sub.q of the partition in zones of the RoI 300, and the two timestamps t.sub.k e t.sub.k+1 have to be associated to time slots of the observation time period TP.
[0193] From the knowledge of the origin cell c.sub.k (as indicated by the cell identifier contained in the event record er.sub.k), exploiting the respective cells-zones correspondence map related to the time slot ts.sub.k, in which the timestamp t.sub.k (of the event record er.sub.k) falls, a zones set ORIG.sub.k={z.sub.o}.sub.k is built, which is comprised of the K′ zones z.sub.q of the partition in zones of the RoI 300 such that the weight (P.sub.k.fwdarw.o).sub.k of the cell c.sub.k on each zone of the set ORIG.sub.k={z.sub.o}.sub.k is different from 0 (more generally, the cell c.sub.k has a weight greater than a predetermined cell weight value and this predetermined cell weight value is preferably set by the administrator of the system 100 through the Administrator Interface 120) (step 1315). In other words, the set ORIG.sub.k={z.sub.o}.sub.k includes the (K′) zones for which the probability that the considered UE is located, in the time slot ts.sub.k, in such zones is not zero (or not lower than a predetermined probability).
[0194] Similarly, from the knowledge of the destination cell c.sub.k+1 (as indicated by the cell identifier contained in the event record er.sub.k+1), exploiting the respective cells-zones correspondence map related to the time slot ts.sub.k+1, in which the timestamp t.sub.k+1 falls, a zones set DEST.sub.k+1={z.sub.d}.sub.k+1 is built, which is comprised of the K″ zones z.sub.q of the partition in zones of the RoI 300 such that the weight (P.sub.k+1.fwdarw.d).sub.k+1 of the cell c.sub.k+1 on each zone of the set DEST.sub.k+1={z.sub.d}.sub.k+1 is different from 0 (more generally, the cell has a weight greater than a predetermined cell weight value and this predetermined cell weight value is preferably set by the administrator of the system 100 through the Administrator Interface 120) (step 1320). In other words, the set DEST.sub.k+1={z.sub.d}.sub.k+1 includes the (K″) zones for which the probability that the considered UE is located, in the time slot ts.sub.k+1, in such zones is not zero (or not lower than a predetermined probability).
[0195] Each zone z.sub.q of the zones set ORIG.sub.k is the origin of a movement of the UE user towards each zone z.sub.d of the zones set DEST.sub.k+1 so that, overall, between all the zones of the set ORIG.sub.k and all the zones of the set DEST.sub.k+1 there will be a number of K′*K″ of movements. A respective movement weight is assigned to each of such movements, the movement weight being calculated as a function of the weights (P.sub.k.fwdarw.o).sub.k and (P.sub.k+1.fwdarw.d).sub.k+1, for example the function being the product of the weights (step 1325):
(P.sub.od).sub.k,k+1=(P.sub.k.fwdarw.o).sub.k*(P.sub.k+1.fwdarw.d).sub.k+1 (Equation 3).
[0196] (P.sub.od).sub.k,k+1 is a joint probability, obtained as the product of two probabilities (P.sub.k.fwdarw.o).sub.k and (P.sub.k+1.fwdarw.d).sub.k+1, assuming a statistical independency of the latter, determined by the origin and destination events corresponding to the event records er.sub.k and er.sub.k+1.
[0197] It is pointed out that in case the two cells c.sub.k and c.sub.k+1 (origin cell and destination cell) coincide (i.e., the UE user has not moved from the cell where it is), the two sets ORIG.sub.k={z.sub.o}.sub.k and DEST.sub.k+1={z.sub.d}.sub.k+1 coincide and when the number of K′*K″ of movements is calculated, some of these movements will have a same zone z.sub.q both as origin and as destination. The contribution of these “movements” (actually, “non-movements”) appears in the diagonal of the O-D matrix to be calculated or it may be not included in the O-D matrix.
[0198] The time slot, and thus the O-D matrix 400m of the set 500 of O-D matrices to which the K′*K″ of movements are assigned, is determined as described in the following.
[0199] If both the event times t.sub.k and t.sub.k+1 belong to a same time slot ts.sub.m of the observation time period TP, each one of the K′*K″ movements from the origin zone z.sub.q to the destination zone z.sub.d determines an increase in the entry mod(o,d) of the O-D matrix 400m of the set 500 of O-D matrices associated with the time slot ts.sub.m, the increase being for example of an amount equal or corresponding to (P.sub.od).sub.k,k+1 (calculated according to Equation 3), i.e., all the movements are associated with the O-D matrix 400m.
[0200] If instead the event times t.sub.k and t.sub.k+1 belong to different time slots ts.sub.m and ts.sub.m+1, substantially three possible options are available for assigning values to the entries mod(i,j) in computing the O-D matrices 400m:
[0201] 1. each one of the K′*K″ movements from the origin zone z.sub.o to the destination zone z.sub.d determines an increase of the respective entries mod(o,d) of the O-D matrix 400m of the set 500 of O-D matrices associated with the time slot ts.sub.m, said increase being for example of an amount equal or corresponding to (P.sub.od).sub.k,k+1 (calculated as in Equation 3), i.e. all the movements are associated with the O-D matrix 400m corresponding to the time slot ts.sub.m, in which the event timestamp t.sub.k (of the event record er.sub.k) falls;
[0202] 2. each one of the K′*K″ movements from the origin zone z.sub.o to the destination zone z.sub.d determines an increase of the respective entries mod(o,d) of the O-D matrix 400m+1 of the set 500 of O-D matrices associated with the time slot ts.sub.m+1 said increase being of an amount equal or corresponding to (P.sub.od).sub.k,k+1 (calculated as in Equation 3), i.e. all the movements are associated with the O-D matrix 400m+1 corresponding to the time slot ts.sub.m+1, in which the event timestamp t.sub.k+1 (of the event record er.sub.k+1) falls; or
[0203] 3. each one of the K′*K″, with the respective weight (P.sub.od).sub.k,k+1 (calculated as in Equation 3) may be distributed between each one of the O-D matrices 400m÷400m+x of the set 500 associated with each one of the time slots ts.sub.m÷ts.sub.m+x (where x≥1) comprising at least partially a movement time interval [t.sub.k, t.sub.k+1] defined by the event times t.sub.k and t.sub.k+1. (i.e., the event time t.sub.k occurring during the time slot ts.sub.m and the event time t.sub.k+1 occurring during the time slot ts.sub.m+x). The K′*K″ movements can be associated in a proportional manner to each one of the time slots ts.sub.m÷ts.sub.m+x, and consequently assigned with the respective weight (P.sub.od).sub.k,k+1 to the corresponding matrices 400m÷400m+x, according to a (time) portion of the movement time interval [t.sub.k, t.sub.k+1] taking place during each one of the time slots ts.sub.m÷ts.sub.m+x.
[0204] For example, by considering two consecutive time slots ts.sub.m and ts.sub.m+1 comprising the movement time interval [t.sub.k, t.sub.k+1], a first (time) portion of the movement time interval [t.sub.k, t.sub.k+1], e.g. a first fraction, let it be said 60%, of the movement time interval [t.sub.k, t.sub.k+1], falls in the time slot ts.sub.m while a second (time) portion of the movement time interval [t.sub.k, t.sub.k+1], e.g. the remaining second fraction, in the example 40%, of the movement time interval [t.sub.k, t.sub.k+1], falls in the other time slot ts.sub.m+1, all the K′*K″ entries mod(o,d) of the O-D matrix 400m are increased by 0.6*(P.sub.od).sub.k,k+1, (i.e., by an amount equal or corresponding to the first fraction of the movement time interval [t.sub.k, t.sub.k+1]) while all the K′*K″ entries mod(o,d) of the other O-D matrix 400m+1 are increased by 0.4*(P.sub.od).sub.k,k+1 (i.e. by an amount equal or corresponding to the second fraction of the movement time interval [t.sub.k, t.sub.k+1]).
[0205] The first option privileges the initial time (t.sub.k) at which a movement is started, the second option privileges instead the final time (t.sub.k+1) of the movement, while the third option considers the duration of the movement time interval.
[0206] Preferably, the administrator of the system 100 through the Administrator Interface 120 and/or the user of the system 100 through the User Interface 125 may choose among the three options according to their needs.
[0207] Operations from 1310 to 1330 of the flowchart of
[0208] When all the UE have been analysed (UE index variable n=N), the system 100 provides in output the calculated set 500 of O-D matrices 400m to the user of the system 100, through the User Interface 125.
[0209] After such provision of the computation results the operation of the system 100 is concluded.
[0210] In other embodiments, the solution according to the present disclosure may comprise methods featuring different steps or some steps may be performed in a different order or in parallel.
[0211] If the system 100 is exploited for calculating the O-D matrices in two or more RoIs 300 at a time, during operation at step 1015 the Position Manager 210 can look and take into consideration, in addition to the field timestamp 715 of the triplet 700 containing the position estimate, also the identifier, contained in the field UE_id 705 of the triplet, of the UE that calculated the position estimate (if the field UE_id 705 is present) and retrieve from the Position Repository 215 only the position estimates coming from those UE that, with the operation at step 905, have been identified to be in a certain one of the RoIs 300.
[0212] If, while performing the operations at step 1015 for the time slot ts.sub.m under analysis, it is determined that no useful triplets 700 are stored in the Position Repository 215, or the available triplets 700 are in a number less than a minimum threshold number, the Position Manager 210 may extract from the Position Repository 215 also the triplets 700 included in the immediately adjacent (preceding and successive) time slots, until the overall number of extracted position estimates reaches the threshold number. All these triplets are then sent to the Computation Engine 110 for being processed, as if such triplets 700 were all internal to the time slot ts.sub.m under consideration. This option and the minimum threshold number can be decided and configured by the administrator of the system 100 through the Administration Interface 120.
[0213] In order to reduce the processing time of the system 100 or to reduce the need of communication bandwidth for the communication between the Position Manager 210 and the Computation Engine 110 when the two modules are physically separated from each other, the administrator of the system 100 can decide to reduce the number of, e.g. to decimate, the position estimates 700 to be processed by the Position Manager 210 before the transmission to the Computation Engine 110 (operation at step 1015), or to be processed by the Computation Engine 110 before starting the processing (operations at step 1020). In this way, just one triplet 700 (i.e., just one position estimate) is taken out of a number n of triplets 700 (i.e., n position estimates). Such a reduction in number of the position estimates to be subject to processing can be operated on a per-user (per-UE) basis, in case the field UE_id 705 is present in the triplets 700 (this means that one measurement is retained for each value of the field UE_id), or globally on all the position estimates (this means that one measurement is retained every n measurements, irrespective of the UE—identified by the value in the field UE_id—that performed the measurements).
[0214] For similar reasons, the Position Manager 210 or the Computation Engine 110 can be configured by the administrator of the system 100 (through the Administrator Interface 120) to calculate, from the set of position estimates extracted during the operations at step 1015, an average position for each UE, if the field UE_id 705 is present, and to exploit this single average position in the operations at step 1020. Also this option can be decided by the administrator of the system 100.
[0215] The set of measurements (position estimates) performed during a measurement campaign MP can be exploited without changes for calculating the set 500 of O-D matrices 400m related to more than one observation time periods TP. In other words, when a user of the system 100 requests to the system 100 the calculation of the set 500 of O-D matrices 400m on a certain time period TP, before starting a measurement campaign on a certain time period MP, the user may ascertain if in the system 100, particularly in the Position Repository 215, there are already stored triplets 700 obtained in a preceding time period MP on the RoI 300 of interest for the user, and decide to exploit the already available triplets 700 on the subdivision in zones of the RoI of that user. In this case, the system 100 starts directly with the operations at step 1015. Even more, if the tessellation and/or the subdivision in zones of the RoI of interest for that user of the system 100 are the same to that/those used previously, the user of the system 100 may decide to reuse the general distribution maps and the cells-zones correspondence maps already calculated previously. The processing burden for calculating the new set 500 of O-D matrices 400m is consequently reduced.
[0216] The solution according to the present disclosure is independent from the specific localization technique employed by the UE for estimating their positions: it is for example possible to employ a technique based on mobile radio technology, standard or not (Cell-Id, TOA, based on radio signal strength measurements, etc.), or a technique based on short-range communication capabilities of the UE (e.g., Wi-Fi, Bluetooth, etc.) or a satellite-localization technique (GPS, Galileo, etc.).
[0217] In an embodiment of the present disclosure, the UE can send, within the triplets 700, particularly within the field position (x,y) 710 of the triplets 700, also an indication of the uncertainty, or of the accuracy, associated with the estimated position. The Computation Engine 110, when performing the operations at step 1020, can decide to use only those position estimates that are accompanied by a value of uncertainty lower than a predefined uncertainty threshold, for example set by the administrator of the system 100 through the Administrator Interface 120. The uncertainty threshold can depend on the characteristic size of the generic tile 605 of the tessellation 600 (for example, in case of square tiles 605, the uncertainty threshold can be set equal to half the length of the tile side; for circular tiles, the uncertainty threshold can be set equal to the radius of the circumference delimiting the tile).
[0218] In an embodiment of the present disclosure, the operations at step 810 can be cooperatively carried out also with the participation of those UE that are not capable of estimating their position (either because they do not have installed a suitable software or because the software available to them always provided results affected by an a-priori unacceptable uncertainty). These can be UE that are only capable of measuring the characteristics of the radio signal received from the radio base stations 105a (e.g., serving radio base station, received radio signal power, adjacent radio base stations, etc.). The Position Manager 210, upon receiving these measurements, can calculate the position of the UE exploiting a localization algorithm and store the results in the Position Repository 215 (step 1010).
[0219] The system 100 can be implemented by means of an app/server architecture, which provides for having an app installed on the UE which dialogues with a remote server using, at high level, Internet protocols and, at a lower level, the User-plane of the mobile communication network 105. Alternatively, the solution according to the present disclosure can be actuated exploiting, at least in part, messages and functions of the MDT (Minimization of Drive Tests) technology, introduced in the Release 10 of the 3GPP standard for UMTS and LTE (see e.g. the 3GPP TS 37.320). For example, in case of immediate transmission (flag I/NI set to I) of the position estimates, it is possible to exploit the functionality called “Immediate MDT”, whereas in case of delayed transmission (flag I/NI set to NI) it is possible to exploit the functionality called “Logged MDT”.