reinforcement learning

August 7, 2025 Taewoon Kim

From REINFORCE to PPO: The Complete On-Policy RL Journey

Understanding the evolution from basic policy gradients to modern LLM fine-tuning algorithms

Motivation: Why On-Policy RL Matters for Modern AI

If you've been following the latest developments in large language models (LLMs), you've probably heard of GRPO (Group Relative P...


February 6, 2025 Taewoon Kim

RL vs SL: Understanding Their Roles in Large Language Models

Bridging Reinforcement Learning and Supervised Learning in LLMs

Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) migh...


September 22, 2024 Taewoon Kim

Is supervised learning a special type of reinforcement learning?

The reason why reinforcement learning is such a hard problem

Supervised Learning Objective: Maximum Likelihood

In supervised learning, the goal is to learn a function that maps an input to a label. This is typically done by maximizing the...


December 21, 2023 Taewoon Kim

A machine with human-like memory systems

Can machines think like us?

This project is the core of my PhD work. It was heavily inspired by cognitive science theories, such as the ones from . It's about developing agents equipped with human-like ext...