Causal Representation Learning for Personalized Adaptive Learning Path Optimization

Li Qingmei; Siti Zayyana Ulfah

PDF

Published: May 18, 2026

👤 Li Qingmei

🏢 Master’s Program in Teacher Education, School of Postgraduate Studies, Universitas Pendidikan Indonesia, Bandung, Indonesia

👤 Siti Zayyana Ulfah

🏢 Master’s Program in Teacher Education, School of Postgraduate Studies, Universitas Pendidikan Indonesia, Bandung, Indonesia

Personalized adaptive learning systems commonly optimize learning paths using correlational representations derived from observational logs, which can internalize confounding and degrade under cohort or course shifts. This study introduces a unified framework for causal representation learning that jointly learns learner-state embeddings and optimizes sequencing policies under offline constraints. Experiments used LMS traces from 12 course offerings comprising 12,438 learners and 1.86 million interaction events, yielding 1,734,920 cleaned decision points. Descriptive analysis revealed a mid-semester attrition concentration, with the sharpest contraction in active learners between Weeks 4 and 7 and a semester-level dropout proxy rate of 17.6 percent. Causal effect estimation showed substantial treatment heterogeneity for difficulty assignment: the estimated effect of Hard versus Medium activities on next-step mastery ranged from minus 0.028 in the lowest baseline-skill quartile to plus 0.031 in the highest quartile, while the aggregate average treatment effect was near zero at plus 0.004 due to cancellation. The proposed representation improved next-step mastery predictability to an AUC of 0.83 while reducing environment leakage, with course-domain predictability declining to 0.46. Offline policy evaluation demonstrated consistent uplift across estimators, increasing doubly robust value from 0.407 under the logged baseline to 0.463 under the causal-representation policy, while maintaining higher support alignment with an effective sample size of 0.81 versus 0.73 for a correlational policy. Deployment-facing indicators improved concurrently, including retention proxy (10.8 active weeks versus 9.6) and reduced pacing instability (difficulty jump rate 8.4 percent versus 10.9 to 12.1 percent under ablations). Ablation studies confirmed that invariance constraints, counterfactual rollouts, and stable-dynamic disentanglement are structurally necessary to preserve both mastery and persistence benefits.

Qingmei, L., & Ulfah, S. Z. (2026). Causal Representation Learning for Personalized Adaptive Learning Path Optimization. Adaptive Learning, 2(1), 1–24. Retrieved from https://al.mbicore.com/index.php/al/article/view/5

Distributed Under Creative Commons CC-BY 4.0

Issue

Vol. 2 No. 1 (2026): Regular Issue February 2026

Section

Articles

Journal Metrics
Acceptance Rate	49%
Review Speed	45 days
Issue Per Year	4
Number of Volume	2
Number of Issues	5
Number of Articles	25
Number of Reviewers	5
Number of Contributors	57
Contributing Countries	6
No. of WoS Citations	18
No. of Scopus Citations	33
No. of Google Citations	53
Abstract Views	2,422 views
PDF Download	1,016

Tools
Reference Manager
Plagiarism Checker
Grammar Assistant

Article Sidebar

Main Article Content

Article Details