Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/104619
PIRA download icon_1.1View/Download Full Text
Title: Using machine learning to analyze longitudinal data : a tutorial guide and best-practice recommendations for social science researchers
Authors: Sheetal, A 
Jiang, Z
Di Milia, L
Issue Date: Jul-2023
Source: Applied psychology, July 2023, v. 72, no. 3, p. 1339-1364
Abstract: This article introduces the research community to the power of machine learning over traditional approaches when analyzing longitudinal data. Although traditional approaches work well with small to medium datasets, machine learning models are more appropriate as the available data becomes larger and more complex. Additionally, machine learning methods are ideal for analyzing longitudinal data because they do not make any assumptions about the distribution of the dependent and independent variables or the homogeneity of the underlying population. They can also analyze cases with partial information. In this article, we use the Household, Income, and Labour Dynamics in Australia (HILDA) survey to illustrate the benefits of machine learning. Using a machine learning algorithm, we analyze the relationship between job-related variables and neuroticism across 13 years of the HILDA survey. We suggest that the results produced by machine learning can be used to generate generalizable rules from the data to augment our theoretical understanding of the domain. With a technical guide, this article offers critical information and best-practice recommendations that can assist social science researchers in conducting machine learning analysis with longitudinal data.
Keywords: Big Five personality
Longitudinal data
Machine learning
Neuroticism
Solomonoff induction
XGBoost
Publisher: Wiley-Blackwell
Journal: Applied psychology 
ISSN: 0269-994X
DOI: 10.1111/apps.12435
Rights: © 2022 The Authors. Applied Psychology published by John Wiley & Sons Ltd on behalf of International Association of Applied Psychology.
This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
The following publication Sheetal, A., Jiang, Z., & Di Milia, L. (2023). Using machine learning to analyze longitudinal data: A tutorial guide and best‐practice recommendations for social science researchers. Applied Psychology, 72(3), 1339-1364 is available at https://doi.org/10.1111/apps.12435.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Sheetal_Using_Machine_Learning.pdf1.99 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

156
Last Week
11
Last month
Citations as of Nov 30, 2025

Downloads

941
Citations as of Nov 30, 2025

SCOPUSTM   
Citations

18
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

14
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.