Skip to content
View clam004's full-sized avatar
Block or Report

Block or report clam004

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
clam004/README.md

My scientific passion and engineering focus are on post-training or finetuning pretrained vision & language models, reinforcement learning, augmentation with up to date knowledge, long term & episodic memory and commmunicating these concepts in a non-intimidating, empathetic, yet technically meaningful way.

Tools

Python Python Go HF

Pinned

  1. minichatgpt minichatgpt Public

    annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation

    Jupyter Notebook 15 1

  2. adaptive-computation-time adaptive-computation-time Public

    The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset

    HTML 2

  3. unsupervised-speech-representation-learning unsupervised-speech-representation-learning Public

    This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that uses CPC to learn representations of sound files for the purpose…

    Jupyter Notebook 10

  4. intro_continual_learning intro_continual_learning Public

    This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence

    Jupyter Notebook 355 24

  5. RL-Chat-pytorch RL-Chat-pytorch Public

    reinforcement learning on a encoder-decoder GRU for chatbot dialogue generation

    Jupyter Notebook 19 5

  6. triton-ft-api triton-ft-api Public

    tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server

    Python 5