Welcome to the "Deep Learning with Mixed Precision Training" repository! Here, we provide comprehensive tutorials and code examples for training deep neural networks using mixed precision techniques in both PyTorch and TensorFlow.
The repository is organized into the following sections:
The `PyTorch_TensorFlow` directory contains a set of base tutorials and code examples based on the "Mixed-Precision using Tensor Cores Series". These tutorials walk you through the fundamentals of mixed precision training in PyTorch and TensorFlow and serve as an essential starting point for understanding the techniques used in subsequent sections.
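To give a flavor of what those fundamentals look like in practice, here is a minimal sketch of a PyTorch training loop using `torch.cuda.amp`; the model, data, and hyperparameters are placeholders for illustration, not code from this repository:

```python
# Minimal sketch of mixed precision training with torch.cuda.amp.
# The model, data, and hyperparameters below are placeholders.
import torch
import torch.nn as nn

device = "cuda"
model = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 10)).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()
scaler = torch.cuda.amp.GradScaler()  # scales the loss to avoid fp16 gradient underflow

for step in range(100):
    inputs = torch.randn(64, 1024, device=device)
    targets = torch.randint(0, 10, (64,), device=device)

    optimizer.zero_grad(set_to_none=True)
    # Ops inside autocast run in fp16 where it is safe, fp32 where it is not.
    with torch.cuda.amp.autocast():
        outputs = model(inputs)
        loss = criterion(outputs, targets)

    scaler.scale(loss).backward()   # backward pass on the scaled loss
    scaler.step(optimizer)          # unscales gradients, then takes the optimizer step
    scaler.update()                 # adjusts the scale factor for the next iteration
```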
In the `BERT_MLM_NSP_fp32_fp16` directory, you will find various methods for training BERT (Bidirectional Encoder Representations from Transformers) models. This section covers training BERT in both Masked Language Model (MLM) and Next Sentence Prediction (NSP) modes, as well as combined MLM + NSP training, using the techniques mentioned above to compare full float32 (fp32) precision with mixed precision float16 (fp16).
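For orientation, the sketch below shows what combined MLM + NSP training in fp16 can look like, assuming the Hugging Face `transformers` library and dummy inputs; it is a hedged illustration, not the repository's actual training script:

```python
# Hedged sketch of combined MLM + NSP training in fp16, assuming the Hugging Face
# `transformers` library; the inputs and labels below are dummies for illustration.
import torch
from transformers import BertConfig, BertForPreTraining, BertTokenizerFast

device = "cuda"
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForPreTraining(BertConfig()).to(device)  # randomly initialized, trained from scratch
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = torch.cuda.amp.GradScaler()

batch = tokenizer(["first sentence", "second sentence"],
                  ["its continuation", "an unrelated sentence"],
                  padding=True, return_tensors="pt").to(device)
mlm_labels = batch["input_ids"].clone()           # real training would mask ~15% of tokens
nsp_labels = torch.tensor([0, 1], device=device)  # 0 = is the next sentence, 1 = is not

optimizer.zero_grad(set_to_none=True)
with torch.cuda.amp.autocast():
    out = model(**batch, labels=mlm_labels, next_sentence_label=nsp_labels)
scaler.scale(out.loss).backward()  # out.loss = MLM loss + NSP loss
scaler.step(optimizer)
scaler.update()
```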
For more in-depth information on training BERT from scratch, you can find helpful tutorials from James Briggs.
Deep learning models have achieved remarkable success in various fields. However, training these models can be computationally expensive and time-consuming. Mixed precision training offers an efficient solution by leveraging different numerical precisions for specific computations.
In this repository, we focus on the following aspects:
- **Training Deep Networks**: We demonstrate how to train deep neural networks using mixed precision techniques in both PyTorch and TensorFlow.
- **Nvidia Helper Functions**: We provide examples of leveraging Nvidia helper functions, as introduced in the Mixed-Precision using Tensor Cores Series, to further optimize performance on Nvidia GPUs with Tensor Cores.
- **Automatic Mixed Precision in PyTorch**: We show how to use PyTorch's `torch.cuda.amp` package for automatic mixed precision training (see the training-loop sketch above).
- **PyTorch 2.x Compile Method**: We explore the compile method in PyTorch 2.x for mixed precision training (see the sketch after this list).
- **Sophia Optimizer Comparison**: We compare training time when using the recent Sophia optimizer.
- **Training BERT Model**: We go beyond simple examples and train/fine-tune BERT models from scratch in MLM (Masked Language Model) and NSP (Next Sentence Prediction) modes, comparing GPU VRAM and training time requirements for each method.
- **Tensor Float 32 (TF32) Capability**: We apply the methods to custom-designed deep networks to exploit the Tensor Float 32 capability of Nvidia Ampere GPUs (also covered in the sketch after this list).
- **Tips and Tricks**: For each method, we share tips and tricks to achieve optimal results.
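To make the compile and TF32 items above concrete, here is a minimal sketch that enables TF32 matmuls and runs a model through `torch.compile` under autocast; the network and shapes are stand-ins, not the repository's custom-designed models:

```python
# Hedged sketch: PyTorch 2.x compile plus TF32 matmuls (Ampere GPUs or newer).
# The model and tensor shapes below are placeholders for illustration.
import torch
import torch.nn as nn

# Allow TF32 for fp32 matmuls and cuDNN convolutions.
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.allow_tf32 = True
# Equivalent high-level switch for matmul precision in recent PyTorch versions:
torch.set_float32_matmul_precision("high")

device = "cuda"
model = nn.Sequential(nn.Linear(2048, 2048), nn.GELU(), nn.Linear(2048, 1000)).to(device)
compiled_model = torch.compile(model)  # PyTorch 2.x graph capture and kernel fusion

x = torch.randn(32, 2048, device=device)
with torch.cuda.amp.autocast():        # torch.compile and autocast can be combined
    y = compiled_model(x)
print(y.dtype)  # torch.float16 inside the autocast region
```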
To explore mixed precision training and train your deep neural networks more efficiently, follow these steps:
- Clone this repository to your local machine using `git clone https://github.com/your_username/your_repository.git`.
- Navigate to the repository folder and explore the `PyTorch_TensorFlow` directory for foundational tutorials.
- Dive into the `BERT_MLM_NSP_fp32_fp16` directory to discover advanced methods for training BERT models in MLM and NSP modes using mixed precision techniques.
- Run the provided scripts and notebooks in each directory to experiment with different precision settings and observe the training performance.
We welcome contributions from the community to expand the repository with more examples and techniques. If you encounter any issues, have innovative ideas, or wish to optimize existing code, feel free to open an issue or submit a pull request. Together, let's accelerate the world of deep learning with mixed precision training!
This repository is licensed under the MIT License.
Let's unleash the power of mixed precision training and push the boundaries of deep learning! Happy coding!