Megatron-LM (1)
Resources about distributed training with Megatron-LM
GitHub: https://github.com/NVIDIA/Megatron-LM
NeMo documentation: https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
NeMo is a cloud-native generative AI framework built on top of Megatron-LM.
Overview of Megatron-Core: https://docs.nvidia.com/megatron-core/developer-guide/latest/index.html
Official APIs with formal product support…
Megatron-LM is largely based on the following three papers. Let’s take some notes on them.