Multi-Agent Reinforcement Models for Context-Aware Adaptive Learning Systems

Main Article Content

👤 Mishael Winarta
🏢 Department of Information Systems, Faculty of AI and Data Science, Universitas Pelita Harapan, Indonesia
👤 Manwir Singh
🏢 Department of Information Systems, Faculty of AI and Data Science, Universitas Pelita Harapan, Indonesia

The increasing heterogeneity of learner behavior in digital education environments poses significant challenges to conventional adaptive learning systems, which often rely on static rules or single-objective optimization. This study proposes a context-aware adaptive learning framework based on coordinated Multi-Agent Reinforcement Learning (MARL) to address these limitations. The system decomposes pedagogical decision-making into multiple specialized agents responsible for content difficulty adaptation, feedback timing, and instructional modality, while a centralized coordination mechanism ensures policy stability during training. Learner context is explicitly modeled as a dynamic state representation derived from interaction logs, engagement indicators, and performance signals, enabling continuous and fine-grained adaptation. Comprehensive experiments were conducted to evaluate learning effectiveness, policy stability, context responsiveness, and learner engagement across adaptive and baseline configurations. The results show that the proposed multi-agent system achieves substantially higher cumulative mastery gains, with improvements of up to 64% for learners with low prior knowledge compared to non-adaptive baselines. Policy variance analysis demonstrates a reduction of more than 60% in coordinated agent configurations relative to uncoordinated multi-agent setups, confirming the effectiveness of the coordination strategy in mitigating non-stationarity. Context sensitivity evaluation indicates that the system detects and adjusts to contextual shifts within an average of 4–5 interaction steps, significantly outperforming rule-based adaptive approaches. In addition to cognitive outcomes, the system exhibits strong behavioral benefits. Engagement analysis reveals lower engagement variance (0.006) and reduced session drop-off rates (9%), indicating improved learner consistency over time. A holistic effectiveness comparison further confirms that the proposed approach delivers balanced gains across mastery progression, adaptation flexibility, engagement stability, and robustness, albeit with higher implementation complexity. These findings demonstrate that coordinated MARL provides a viable and scalable foundation for next-generation adaptive learning systems capable of robust, real-time personalization in dynamic educational contexts.

Winarta, M., & Singh, M. (2025). Multi-Agent Reinforcement Models for Context-Aware Adaptive Learning Systems. Adaptive Learning, 1(3), 192–212. Retrieved from https://al.mbicore.com/index.php/al/article/view/15

Article Details

Section
Articles

Similar Articles

<< < 1 2 3 > >> 

You may also start an advanced similarity search for this article.