🔭 I'm a RLer + NLPer/2 + MLSyser/2.
RLer + MLSyser / 2 + NLPer / 2
Pinned Loading
-
OpenRLHF/OpenRLHF
OpenRLHF/OpenRLHF PublicAn Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
-
Awesome-LLM-Strawberry
Awesome-LLM-Strawberry PublicA collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
-
alpha-zero-gomoku
alpha-zero-gomoku PublicA Multi-threaded Implementation of AlphaZero (C++)
-
noisy-mappo
noisy-mappo PublicMulti-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
-
cuda-neural-network
cuda-neural-network PublicConvolutional Neural Network with CUDA (MNIST 99.23%)
551 contributions in the last year
Day of Week | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | May May | June Jun | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Contribution activity
June 2025
Created 20 commits in 2 repositories
Opened 4 pull requests in 2 repositories
hijkzzz/Awesome-LLM-Strawberry
2
merged
-
Update README.md
This contribution was made on Jun 12
-
Update README.md
This contribution was made on Jun 5
OpenRLHF/OpenRLHF
2
merged
-
Update README_zh.md
This contribution was made on Jun 11
-
Update README.md
This contribution was made on Jun 11