Inferring Socioeconomic Characteristics from Travel Patterns
DOI:
https://doi.org/10.5614/jpwk.2023.34.1.7
Keywords:
crowd-based big data, machine learning, socioeconomic characteristics, travel patterns
Abstract
Crowd-based big data is now widely used in transportation planning. These data sources provide valuable information for model validation; however, they cannot be used to estimate travel demand forecasting models, because such models require a link between travel patterns and the socioeconomic characteristics of the people making the trips, and this link is unavailable for privacy reasons. Uncovering the correlation between travel patterns and socioeconomic characteristics is therefore crucial if travel demand modelers are to leverage such data in model estimation. Different age, gender, and income groups may have specific travel behavior preferences. To extract and investigate these patterns, we used two data sets: the 2009 National Household Travel Survey and the 2007-2008 household survey of the Metropolitan Washington Council of Governments Transportation Planning Board. After preprocessing the data, a range of machine learning algorithms was used to synthesize the socioeconomic characteristics of travelers. In a comparison, the CatBoost model outperformed the other models. To further improve the results, a synthetic population and Bayesian updating were used, which considerably improved the estimation of income. This study showed that the conventional inference of travel demand from socioeconomic patterns can be reversed, creating an opportunity to utilize the plethora of crowd-based mobility data.
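The abstract does not spell out how the Bayesian updating step was carried out, so the following is only a minimal sketch of one common way to combine a classifier's income-class probabilities with a zone-level prior taken from a synthetic population. The function name `bayesian_update`, the three income classes, and all numbers are hypothetical and are not taken from the paper.

```python
import numpy as np


def bayesian_update(classifier_probs, training_prior, zone_prior):
    """Re-weight classifier class probabilities with a zone-specific prior.

    Assumption (not from the paper): the classifier output p(c | x) was
    learned under the class shares in `training_prior`, so dividing by that
    prior gives a quantity proportional to the likelihood p(x | c). The
    likelihood is then multiplied by the share of each class in the
    traveler's home zone (`zone_prior`, e.g. from a synthetic population)
    and renormalized.
    """
    classifier_probs = np.asarray(classifier_probs, dtype=float)
    training_prior = np.asarray(training_prior, dtype=float)
    zone_prior = np.asarray(zone_prior, dtype=float)

    likelihood = classifier_probs / training_prior   # proportional to p(x | c)
    unnormalized = likelihood * zone_prior           # prior times likelihood
    return unnormalized / unnormalized.sum(axis=-1, keepdims=True)


# Hypothetical example with three income classes: low, medium, high.
classifier_probs = [0.5, 0.3, 0.2]    # classifier output for one traveler
training_prior = [1 / 3, 1 / 3, 1 / 3]  # class shares in the training survey
zone_prior = [0.1, 0.3, 0.6]          # class shares in the home zone

posterior = bayesian_update(classifier_probs, training_prior, zone_prior)
predicted_class = int(np.argmax(posterior))
```

In this illustration the classifier alone would pick the low-income class, while the zone prior shifts the most probable class to high income. Whether the study used this exact form of updating is not stated in the abstract.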
License
Copyright (c) 2023 Journal of Regional and City Planning

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
A manuscript submitted to JRCP must be an original work of the author(s), must contain no element of plagiarism, and must not have been published or be under consideration for publication in other journals. The author(s) retain the copyright of the content published in JRCP. No request or consultation is needed for future re-use and re-publication of the content, as long as the author and the source are cited properly.