Blog | Taewoon Kim

From VAEs to Diffusion Models: A Step-by-Step Journey

Understanding the evolution of generative models through practical implementations
By Taewoon Kim
Posted on May 28, 2025

Motivation: Why Do We Need Generative Models? [Read More]
Tags:
Discrete vs. Continuous: A Tale of Two Approaches to Language Modeling

Balancing n-gram models and neural networks for next-token prediction
By Taewoon Kim
Posted on March 9, 2025

Modern Natural Language Processing (NLP) revolves around language modeling—the art of predicting the next token given the previous ones. Formally, if we have a sequence of tokens \(w_1, w_2, \dots, w_{n-1}\), we want to learn: [Read More]
Tags:
RL vs SL: Understanding Their Roles in Large Language Models

Bridging Reinforcement Learning and Supervised Learning in LLMs
By Taewoon Kim
Posted on February 6, 2025

Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) might seem worlds apart at first glance, their underlying optimization procedures share a striking similarity. In both cases, we aim to maximize an objective... [Read More]
Tags:
Understanding Maximum Likelihood Estimation

A Deep Dive into MLE, Loss Functions, and Beyond
By Taewoon Kim
Posted on February 5, 2025

Most of us, whether consciously or not, use Maximum Likelihood Estimation (MLE) in our daily machine learning workflows. When you’re training a model to predict labels in a supervised setting, or even when you’re using a self-supervised approach like masked language modeling, you’re often implicitly performing MLE under the hood.... [Read More]
Tags:
Sequential Decision-Making with Transformers

Offline RL, Behavior Cloning, and The Magic of Sequence Modeling
By Taewoon Kim
Posted on December 15, 2024

Sequential decision-making is a fundamental challenge in machine learning and AI. From planning your next vacation itinerary to training a robot to navigate a warehouse, we often face tasks where a series of actions must be taken to achieve a goal, often with delayed feedback or sparse rewards. [Read More]
Tags:
From 1+1 in Assembly to LLMs: The Evolution of Computing Abstraction

Tracing the Layers from Machine Code to Natural Language Interfaces
By Taewoon Kim
Posted on November 12, 2024

Computing has come a long way since the early days of punch cards and assembly language. With each new generation of programming paradigms, we’ve added layers of abstraction that make it easier for humans to interact with machines. In this post, we’ll explore how a simple operation like 1 +... [Read More]
Tags:
- computing
- programming languages
- abstraction layers
- assembly
- C
- Java
- Python
- LLMs
The Problems with p-values

Why Frequentist Significance Testing Falls Short
By Taewoon Kim
Posted on October 16, 2024

Statistical significance testing, specifically the use of p-values, has been the cornerstone of hypothesis testing for decades. However, this frequentist approach has critical flaws that can lead to misleading interpretations and false confidence in research results. In this post, we will break down why p-values often fall short and discuss... [Read More]
Tags:
Can the Transformer be viewed as a special case of a Graph Neural Network (GNN)?

A natural language text can be seen as a knowledge graph
By Taewoon Kim
Posted on October 15, 2024

In recent years, Transformers have dominated the field of natural language processing (NLP), while Graph Neural Networks (GNNs) have proven essential for tasks involving graph-structured data. Interestingly, the Transformer can be seen as a special case of GNNs, particularly as an attention-based GNN. This connection emerges when we treat natural... [Read More]
Tags:
Is supervised learning a special type of reinforcement learning?

The reason why reinforcement learning is such a hard problem
By Taewoon Kim
Posted on September 22, 2024

Supervised Learning Objective: Maximum Likelihood [Read More]
Tags:
Playing around with Hugging Face Llama 3.1 locally

4bit quantization is amazing
By Taewoon Kim
Posted on August 29, 2024

Llama 3.1 was released by Meta a month ago, and you can easily access it via Hugging Face. I used their Transformers quite extensively a few years ago, and haven’t used it for some years. It was already an amazing library back then but now it has become even easier.... [Read More]
Tags:
- llm
- llama
- hugging face
Training a GCN-based edge classifier

Exploring the challenges and strategies for effective edge classification with graph neural networks (GNNs)
By Taewoon Kim
Posted on August 21, 2024

In this post, I’ll walk you through the process of training a Graph Convolutional Network (GCN) for edge classification, a common task in graph-based machine learning applications. Edge classification involves predicting the type of relationship (or edge) between two nodes in a graph, which is particularly useful in areas like... [Read More]
Tags:
A machine with human-like memory systems

Can machines think like us?
By Taewoon Kim
Posted on December 21, 2023

The project “A Machine With Human-Like Memory Systems” is the core of my PhD work. It was heavily inspired by the cognitive science theories, such as the ones from Endel Tulving. It’s about developing agents equipped with human-like external memory systems, modeled using knowledge graphs. These agents are designed to... [Read More]
Tags:
- reinforcement learning
- AI
- human memory
- knowledge graph
- machine learning
- deep learning
- PhD
- LLM
- symbolic AI
- GOFAI
- humemai
Using ChatGPT is just so great

This really can do a lot of things, although it's still biased.
By Taewoon Kim
Posted on December 14, 2023

Generating images with text [Read More]
Tags:
- ChatGPT
- AI
- GPT4
- DALL-E
My new website

My boring old website had to be gone.
By Taewoon Kim
Posted on December 12, 2023

I thought that I can make a better website than the one I had. So I made this one! I’ll be posting my projects and other stuff here. I hope you like it! :) I was surprised that I could make this website in just a few hours. I used... [Read More]
Tags:
- website
Bloom: A 176b-parameter open-access multilingual language model

Unlocking the power of open-source language models
By Taewoon Kim
Posted on November 9, 2022

This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at https://arxiv.org/abs/2211.05100. [Read More]
Tags:
- llm
- prompt
- gpt
- bloom
Room classifier

A simple room classifier made using EfficientNet and PyTorch Lightning
By Taewoon Kim
Posted on February 28, 2022

Check out https://github.com/tae898/room-classification
Tags:
- room
- classifier
Promptsource: An integrated development environment and repository for natural language prompts

Revolutionizing NLP with a collaborative hub for crafting and sharing prompts
By Taewoon Kim
Posted on February 2, 2022

This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at https://arxiv.org/abs/2202.01279. [Read More]
Tags:
- llm
- prompt
- gpt
- promptsource
IGLU: interactive grounded language understanding in a collaborative environment

We won the NeurIPS 2021 competition
By Taewoon Kim
Posted on December 16, 2021

Thank you, Hybrid Intelligence team! Check out https://proceedings.mlr.press/v176/kiseleva22a/kiseleva22a.pdf [Read More]
Tags:
Multitask prompted training enables zero-shot task generalization

Achieving superior zero-shot generalization through explicit multitask prompting
By Taewoon Kim
Posted on October 15, 2021

This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at https://arxiv.org/abs/2110.08207. [Read More]
Tags:
- llm
- prompt
- gpt
EmoBERTa: Speaker-aware emotion recognition in conversation with RoBERTa

Achieving state-of-the-rrt emotion recognition in conversations with a simple RoBERTa-based approach
By Taewoon Kim
Posted on August 26, 2021

This is a paper that I wrote together with Piek Vossen. We were able to achieve SOTA back then by simply training a RoBERTa (a variant of BERT) with speaker tokens! Check out https://arxiv.org/abs/2108.12009 [Read More]
Tags:
- llm
- bert
- roberta
- emotion
- erc
Generalizing MLPs with dropouts, batch normalization, and skip connections

A simple age-gender classification model using Arcface embeddings and MLPs
By Taewoon Kim
Posted on August 18, 2021

I trained a simple age-gender classification model using ArcFace embeddings and MLPs. Check out the paper https://arxiv.org/abs/2108.08186 and the code https://github.com/tae898/age-gender/ [Read More]
Tags:
- mlp
- age
- gender

From VAEs to Diffusion Models: A Step-by-Step Journey

Understanding the evolution of generative models through practical implementations

Discrete vs. Continuous: A Tale of Two Approaches to Language Modeling

Balancing n-gram models and neural networks for next-token prediction

RL vs SL: Understanding Their Roles in Large Language Models

Bridging Reinforcement Learning and Supervised Learning in LLMs

Understanding Maximum Likelihood Estimation

A Deep Dive into MLE, Loss Functions, and Beyond

Sequential Decision-Making with Transformers

Offline RL, Behavior Cloning, and The Magic of Sequence Modeling

From 1+1 in Assembly to LLMs: The Evolution of Computing Abstraction

Tracing the Layers from Machine Code to Natural Language Interfaces

The Problems with p-values

Why Frequentist Significance Testing Falls Short

Can the Transformer be viewed as a special case of a Graph Neural Network (GNN)?

A natural language text can be seen as a knowledge graph

Is supervised learning a special type of reinforcement learning?

The reason why reinforcement learning is such a hard problem

Playing around with Hugging Face Llama 3.1 locally

4bit quantization is amazing

Training a GCN-based edge classifier

Exploring the challenges and strategies for effective edge classification with graph neural networks (GNNs)

A machine with human-like memory systems

Can machines think like us?

Using ChatGPT is just so great

This really can do a lot of things, although it's still biased.

My new website

My boring old website had to be gone.

Bloom: A 176b-parameter open-access multilingual language model

Unlocking the power of open-source language models

Room classifier

A simple room classifier made using EfficientNet and PyTorch Lightning

Promptsource: An integrated development environment and repository for natural language prompts

Revolutionizing NLP with a collaborative hub for crafting and sharing prompts

IGLU: interactive grounded language understanding in a collaborative environment

We won the NeurIPS 2021 competition

Multitask prompted training enables zero-shot task generalization

Achieving superior zero-shot generalization through explicit multitask prompting

EmoBERTa: Speaker-aware emotion recognition in conversation with RoBERTa

Achieving state-of-the-rrt emotion recognition in conversations with a simple RoBERTa-based approach

Generalizing MLPs with dropouts, batch normalization, and skip connections

A simple age-gender classification model using Arcface embeddings and MLPs