Tensor Topic Models with Graphs and Applications on Individualized Travel Patterns

Abstract

Individualized passenger travel pattern is of significant research value since the abundant information from individual trajectory data could help discover the useful insights about the multi-clustering of origin, destination, time, etc., and the passenger cluster. However, this task is rather challenging, given the data is high-dimensional, with complex spatiotemporal structure, and exhibits sparse patterns. Moreover, individual travel patterns are also affected by external information, such as the distances and functions of locations and points of interest, which are ignored in most current works. To tackle these challenges, we proposed two novel frameworks based on topic models with the external information incorporated as graphs: Trips and passengers are formulated as tensor words and tensor documents to preserve data nature; To learn multiclustering, we proposed a graph-regularized tensor Latent Dirichlet Allocation model, with graph structure formulated as Laplacian penalty; To learn passenger clustering, we proposed a graph-based tensor Dirich-let Multinomial Mixture model with graph Laplacian penalty and L1 -norm penalty for cluster amount auto-determination.

Publication
The 37th IEEE International Conference on Data Engineering (ICDE 2021), Ph.D. Symposium Track
Avatar
Ziyue LI
Professor in Data Mining and Machine Learing

To be a inspiring data science researcher

Related