forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 352
Ongoing research training transformer language models at scale, including: BERT & GPT-2
License
deepspeedai/Megatron-DeepSpeed
ErrorLooks like something went wrong!
About
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Resources
License
Security policy
Stars
Watchers
Forks
Packages 0
No packages published
Languages
- Python 75.4%
- Shell 21.0%
- C++ 3.1%
- Cuda 0.3%
- C 0.1%
- HTML 0.1%