Skip to content
View hijkzzz's full-sized avatar
Block or Report

Block or report hijkzzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hijkzzz/README.md

🔭 I'm a Coding Lover.

Jian Hu's GitHub stats

Pinned

  1. OpenLLMAI/OpenRLHF OpenLLMAI/OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

    Python 1.1k 101

  2. pymarl2 pymarl2 Public

    Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

    Python 555 106

  3. alpha-zero-gomoku alpha-zero-gomoku Public

    A Multi-threaded Implementation of AlphaZero

    Python 351 48

  4. cuda-neural-network cuda-neural-network Public

    Convolutional Neural Network with CUDA (MNIST 99.23%)

    C++ 164 38

  5. deep-reinforcement-learning-notes deep-reinforcement-learning-notes Public

    Deep Reinforcement Learning Notes

    113 6

  6. noisy-mappo noisy-mappo Public

    Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

    Python 39 6