-
Notifications
You must be signed in to change notification settings - Fork 223
Ongoing research training transformer language models at scale, including: BERT & GPT-2
License
bigscience-workshop/Megatron-DeepSpeed
ErrorLooks like something went wrong!
About
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Resources
License
Stars
Watchers
Forks
Packages 0
No packages published