Commit 488e7333, authored by Patrick Liu, committed by GitHub

Update TECHNICAL_DETAILS.md

parent d4ece92e
@@ -96,4 +96,4 @@ Each process keeps an isolated model, data loader, and optimizer.
Model parameters are only synchronized once at the beginning.
After a forward and backward pass, gradients will be allreduced among all GPUs,
and the optimizer will update model parameters.
-Since the gradients are all reduced, the model parameter stays the same for all processes after the iteration.
+Since the gradients are allreduced, the model parameter stays the same for all processes after the iteration.
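
The invariant described in this hunk can be illustrated with a small single-process simulation. This is only a sketch, not the project's training code: the helper names (`local_gradient`, `allreduce_mean`, `train`), the one-parameter linear model, and the choice to average gradients rather than sum them are all assumptions made for illustration. Each simulated worker starts from the same weight, computes a gradient on its own batch, receives the same allreduced gradient, and applies the same update.

```python
import random


def local_gradient(w, batch):
    """Gradient of the mean squared error for y ~ w * x on one worker's batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def allreduce_mean(values):
    """Stand-in for allreduce (assumed to average): every worker gets the same result."""
    avg = sum(values) / len(values)
    return [avg] * len(values)


def train(num_workers=4, steps=5, lr=0.01, seed=0):
    rng = random.Random(seed)
    # Parameters are synchronized once, at the beginning.
    weights = [0.5] * num_workers
    for _ in range(steps):
        # Each worker draws a different batch, as if from its own data loader.
        batches = [
            [(x, 3.0 * x) for x in (rng.uniform(-1, 1) for _ in range(8))]
            for _ in range(num_workers)
        ]
        # Local forward and backward pass on each worker.
        grads = [local_gradient(w, b) for w, b in zip(weights, batches)]
        # Gradients are allreduced, so every worker holds the same gradient.
        grads = allreduce_mean(grads)
        # Each worker's optimizer applies the same update to the same weight.
        weights = [w - lr * g for w, g in zip(weights, grads)]
    return weights
```

Calling `train()` returns one weight per simulated worker. Because every worker starts from the same value and applies the same allreduced gradient, the weights stay identical after every iteration without any further parameter synchronization.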