Abstract
Data are central to any Machine Learning (ML) application but often remain scattered in different parties’ databases, hindering the development of effective and reliable models. The reluctance to share valuable data assets due to competitive concerns and strict privacy laws, such as the General Data Protection Regulation (GDPR) in Europe, adds complexity to data-sharing. This is further complicated when dealing with spatio-temporal data, which can potentially reveal individual identities through movement patterns when merged with other data sources, as shown in Rossi et al. (2015), creating a barrier to enhancing ML training processes through a broader data sharing. Federated Learning (FL) has been proposed as a solution to address the challenge of data-sharing limitations by designing a secure way to collaboratively train an ML model without the need to share the raw data Yang et al. (2019). FL variants include Horizontal Federated Learning (HFL), which aims at obtaining ML models collaboratively from data partitioned in their sample space among different clients, and Vertical Federated Learning (VFL), in which the partition is in the feature space. FL could revolutionize location-based services that leverage GeoAI by enabling more effective information transfer without compromising user privacy.