Taewoon Kim Taewoon Kim
Blog Contact CV
Archive

Blog

Posts tagged reinforcement learning.

August 7, 2025 Taewoon Kim

From REINFORCE to PPO: The Complete On-Policy RL Journey

Understanding the evolution from basic policy gradients to modern LLM fine-tuning algorithms

Motivation: Why On Policy RL Matters for Modern AI If you've been following the latest developments in large language models (LLMs), you've probably heard of GRPO (Group Relative P...

From REINFORCE to PPO: The Complete On-Policy RL Journey
reinforcement-learning policy-gradients ppo grpo
Learn more
February 6, 2025 Taewoon Kim

RL vs SL: Understanding Their Roles in Large Language Models

Bridging Reinforcement Learning and Supervised Learning in LLMs

Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) migh...

RL vs SL: Understanding Their Roles in Large Language Models
reinforcement-learning supervised-learning llm machine-learning
Learn more
September 22, 2024 Taewoon Kim

Is supervised learning a special type of reinforcement learning?

The reason why reinforcement learning is such a hard problem

Supervised Learning Objective: Maximum Likelihood In supervised learning , the goal is to learn a function that maps an input to a label . This is typically done by maximizing the...

Is supervised learning a special type of reinforcement learning?
reinforcement learning supervised learning contextual bandits
Learn more
December 21, 2023 Taewoon Kim

A machine with human-like memory systems

Can machines think like us?

The project is the core of my PhD work. It was heavily inspired by the cognitive science theories, such as the ones from . It's about developing agents equipped with human like ext...

A machine with human-like memory systems
reinforcement learning AI human memory knowledge graph
Learn more
December 16, 2021 Taewoon Kim

IGLU: interactive grounded language understanding in a collaborative environment

We won the NeurIPS 2021 competition

Thank you, Hybrid Intelligence team! Check out

IGLU: interactive grounded language understanding in a collaborative environment
IGLU reinforcement learning NeurIPS competition
Learn more
Taewoon Kim
Privacy
Email GitHub LinkedIn X YouTube Scholar

This site uses analytics only if you say yes. See the privacy policy for details.