Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) migh...
Learn more
Posts tagged LLM.
Large Language Models (LLMs) have brought a new twist to the way we think about training algorithms. While traditional Supervised Learning (SL) and Reinforcement Learning (RL) migh...
Learn more
Llama 3.1 was released by Meta a month ago, and you can easily access it . I used their quite extensively a few years ago, and haven't used it for some years. It was already an ama...
Learn more
The project is the core of my PhD work. It was heavily inspired by the cognitive science theories, such as the ones from . It's about developing agents equipped with human like ext...
Learn more
This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at . Abstract : Large language models (LLMs) have been...
Learn more
This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at . Abstract : PromptSource is a system for creating,...
Learn more
This paper was a result of the Hugging Face BigScience research workshop that I participated in 2021. The paper can be found at . Abstract : Large language models have recently bee...
Learn more
This is a paper that I wrote together with . We were able to achieve SOTA back then by simply training a RoBERTa (a variant of BERT) with speaker tokens! Check out Abstract : We pr...
Learn more