thwu1

Follow

🎯

Focusing

Tianhao Wu thwu1

🎯

Focusing

Follow

EECS PhD @ Berkeley

24 followers · 19 following

06:30 (UTC -07:00)
https://thwu1.github.io/tianhaowu/

Achievements

Achievements

Highlights

Pro

Popular repositories Loading

pairwise-proximal-policy-optimization pairwise-proximal-policy-optimization Public

Python 3
trlx trlx Public

Forked from CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 2 3
visual-tool visual-tool Public

Python 1
alpha-zero-general alpha-zero-general Public

Forked from suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 1
llm2vec llm2vec Public

Python
reward_exp reward_exp Public

Python