Skip to content

deepspeedai/Megatron-DeepSpeed

Error
Looks like something went wrong!

About

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Resources

License

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 75.4%
  • Shell 21.0%
  • C++ 3.1%
  • Cuda 0.3%
  • C 0.1%
  • HTML 0.1%