<aside> ✨ a monthly newsletter about ML topics and papers, delivered straight to your inbox. subscribe by emailing anything to [email protected]
</aside>
ML at Scale: Pipeline Parallelism
ML at Scale: Tensor Parallelism
How do Mixture of Expert Models Work?
Some selected works: