Motivation: Why On Policy RL Matters for Modern AI If you've been following the latest developments in large language models (LLMs), you've probably heard of GRPO (Group Relative P...
Learn more
Posts tagged reinforcement learning.
Motivation: Why On Policy RL Matters for Modern AI If you've been following the latest developments in large language models (LLMs), you've probably heard of GRPO (Group Relative P...
Learn more
Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) migh...
Learn more
Supervised Learning Objective: Maximum Likelihood In supervised learning , the goal is to learn a function that maps an input to a label . This is typically done by maximizing the...
Learn more
The project is the core of my PhD work. It was heavily inspired by the cognitive science theories, such as the ones from . It's about developing agents equipped with human like ext...
Learn more