Skip to content
View thwu1's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report thwu1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. pairwise-proximal-policy-optimization pairwise-proximal-policy-optimization Public

    Python 3

  2. trlx trlx Public

    Forked from CarperAI/trlx

    A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

    Python 2 3

  3. visual-tool visual-tool Public

    Python 1

  4. alpha-zero-general alpha-zero-general Public

    Forked from suragnair/alpha-zero-general

    A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

    Jupyter Notebook 1

  5. llm2vec llm2vec Public

    Python

  6. reward_exp reward_exp Public

    Python