Skip to content

bigscience-workshop/Megatron-DeepSpeed

Error
Looks like something went wrong!

About

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Resources

License

Stars

Watchers

Forks

Packages

No packages published