A new study has revealed a measurement system that detects subway travellers’ movements to help public transport planners make smarter decisions around future investment and management. The study was released as part of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining in Sydney this week, and is titled Traffic Measurement and Route Recommendation System for Mass Rapid Transit (MRT). The system has been operating for several months on Singapore’s Mass Rapid Transit (MRT) network, which has 92 stations and five train lines. It looks at the number of people built up on station platforms, the flows of people making transfers at stations, and the expected travel time for each possible route. The authors of the study, from DataSpark (part of SingTel), developed an algorithm that detects individual subways trips from anonymised real-time location-based data from SingTel. As Singapore’s largest telecommunications company, it was able to tap into the mobile phone location data of 3.98 million customers, which was anonymised ID, latitude, longitude, timestamp and service type. Privacy was taken into account when conducting the study by anonymising the original unique ID for each mobile customer through a two-step non-reversible AES Encryption and Hashing process. “[This] makes it impossible to trace back to the original unique ID or the mobile customer,” the authors said. They also complied with Singapore’s Personal Data Protection Act (PDPA), and consent from customers was also given through SingTel’s Data Protection Policy. To ensure that the selected customer base is representative of Singapore’s population, the authors compared the distribution of residences of SingTel customers with Singapore’s 2010 census data. The two distributions highly linearly correlated, meaning it was quite a representative population. Looking at the most densely populated stations, which are indoors, the algorithm was able to distinguish SingTel cell towers for the indoor subway from all others when detecting individual trips and their locations. “We define an MRT trip as the sequence of indoor stations s = [s1, . . . sn] a passenger ? passes during their ride and the corresponding timestamps t = [t1, . . . tn] when each of these stations is reached. s1 is the boarding station, sn the station of disembarkation and s2, . . . sn – 1 stations ? passes during their ride.” To test the measurements that the algorithm produced against some ‘ground truth’ or real measurements, the authors compared with counts taken at popular shopping spot, Orchard MRT station, engaging a market research company to do this. “We determined the number of SingTel subscribers departing and arriving at Orchard MRT station during the same three days on an hourly basis using our MRT trip detection algorithm and market extrapolated these numbers with the scaling factor A. This takes into account both SingTel’s market share in Singapore as well as the mobile device penetration rate in Singapore. The comparison found that there was a strong correlation between the counts and the measurements the algorithm was able to produce. One of the key findings from the study was that travellers do not necessarily always decide to go the shortest route when planning their travel journey. “Sometimes, the shortest distance may also not be the quickest route due to congestion. As such, the most time efficient route between two MRT stations may be difficult to determine for commuters. “With the implementation of the trip database, we are able to determine the popularity of routes between stations (i.e. the number of people taking any route between two given stations), as well as the average travel time per route.” The current methods of measuring ‘crowdedness’ and how people flow through a subway network have their shortfalls, the authors said. Video or CCTV and Wi-Fi cannot measure travel times between stations and smart ticketing cards, which give tap on and tap off data points for each traveller, do not capture line transfers. “Cell phone location data are capable of addressing the information gaps in the approaches above.” The authors are looking to extend their measurement system to also include outdoor trips. Related content brandpost The steep cost of a poor data management strategy Without a data management strategy, organizations stall digital progress, often putting their business trajectory at risk. Here’s how to move forward. By Jay Limbasiya, Global AI, Analytics, & Data Management Business Development, Unstructured Data Solutions, Dell Technologies Jun 09, 2023 6 mins Data Management feature How Capital One delivers data governance at scale With hundreds of petabytes of data in operation, the bank has adopted a hybrid model and a ‘sloped governance’ framework to ensure its lines of business get the data they need in real-time. By Thor Olavsrud Jun 09, 2023 6 mins Data Governance Data Management feature Assessing the business risk of AI bias The lengths to which AI can be biased are still being understood. The potential damage is, therefore, a big priority as companies increasingly use various AI tools for decision-making. By Karin Lindstrom Jun 09, 2023 4 mins CIO Artificial Intelligence IT Leadership brandpost Rebalancing through Recalibration: CIOs Operationalizing Pandemic-era Innovation By Kamal Nath, CEO, Sify Technologies Jun 08, 2023 6 mins CIO Digital Transformation Podcasts Videos Resources Events SUBSCRIBE TO OUR NEWSLETTER From our editors straight to your inbox Get started by entering your email address below. Please enter a valid email address Subscribe