Benchmarking Suite for Loss Functions in T5 Training #36

s1k0ra · 2025-01-08T15:38:15Z

This suite benchmarks different loss functions in neural network training across three scenarios:
1. Standalone loss computation
2. Model forward pass
3. Complete training step (including gradient updates)

Supported loss functions:
• Standard Cross Entropy (CE)
• CE with Number Token Loss (NTL) using MSE
• CE with NTL using Wasserstein distance
• CE with NTL using Absolute Difference
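
For readers unfamiliar with the variants, here is a minimal pure-Python sketch of how each one could be computed for a single token position. It is not the code in `benchmarking/loss_function_benchmark.py`. It assumes a toy vocabulary made up only of the digit tokens 0 to 9, and the function names and the `ntl_weight` parameter are hypothetical. The reading of "Absolute Difference" as the absolute gap between expected and true value is also an assumption.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, target_index):
    """Standard CE for one position: negative log-probability of the label."""
    return -math.log(probs[target_index])


def expected_value(probs, token_values):
    """Numeric value the model predicts on average over the number tokens."""
    return sum(p * v for p, v in zip(probs, token_values))


def ntl_mse(probs, token_values, target_value):
    """Squared gap between the expected numeric value and the label value."""
    return (expected_value(probs, token_values) - target_value) ** 2


def ntl_abs_diff(probs, token_values, target_value):
    """Absolute gap between expected and label value (assumed definition)."""
    return abs(expected_value(probs, token_values) - target_value)


def ntl_wasserstein(probs, token_values, target_value):
    """Wasserstein-1 distance to a one-hot label, which reduces to E|V - y|."""
    return sum(p * abs(v - target_value) for p, v in zip(probs, token_values))


def combined_loss(logits, token_values, target_index, ntl_fn, ntl_weight=0.3):
    """CE plus a weighted NTL term; ntl_weight is a hypothetical default."""
    probs = softmax(logits)
    ce = cross_entropy(probs, target_index)
    ntl = ntl_fn(probs, token_values, token_values[target_index])
    return ce + ntl_weight * ntl
```

Unlike plain CE, all three NTL terms grow with the numeric distance between prediction and label, so predicting 5 for a label of 4 is penalized less than predicting 9.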

The suite includes utility functions for generating synthetic data, timing benchmarks, and logging results. It supports experimentation with different configurations and outputs CSV reports of benchmark statistics.
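
The timing and reporting utilities could look roughly like the sketch below. It is a standard-library illustration, not the suite's actual implementation. The names `benchmark`, `summarize` and `write_csv`, the CSV column names, and the default run counts are all hypothetical. The optional `sync` callback stands in for a device synchronization call such as `torch.cuda.synchronize`, so that asynchronous GPU work is finished before the clock is read.

```python
import csv
import statistics
import time

FIELDS = ["case", "runs", "mean_s", "std_s"]  # hypothetical column names


def benchmark(fn, *, warmup=3, repeats=10, sync=None):
    """Call fn() `warmup` times untimed, then return `repeats` timings in seconds."""
    for _ in range(warmup):
        fn()
    timings = []
    for _ in range(repeats):
        if sync is not None:
            sync()  # drain pending device work before starting the clock
        start = time.perf_counter()
        fn()
        if sync is not None:
            sync()  # wait for the timed work to finish
        timings.append(time.perf_counter() - start)
    return timings


def summarize(case, timings):
    """Reduce raw timings to one report row with mean and standard deviation."""
    std = statistics.stdev(timings) if len(timings) > 1 else 0.0
    return {
        "case": case,
        "runs": len(timings),
        "mean_s": statistics.mean(timings),
        "std_s": std,
    }


def write_csv(rows, stream):
    """Write report rows to any text stream (an open file or io.StringIO)."""
    writer = csv.DictWriter(stream, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
```

A run over the three scenarios would then call `benchmark` once per scenario (loss only, forward pass, full training step) and pass the summarized rows to `write_csv`. When writing to a file, open it with `newline=""` as the `csv` module documentation recommends.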

benchmarking/loss_function_benchmark.py

jannisborn

LGTM 🚀

s1k0ra added 26 commits December 4, 2024 19:08

created benchmarking for loss functions

ee092c9

added plots and number share for loss benchmarking

1920741

Merge branch 'loss-benchmarking' of github.com:tum-ai/number-token-loss into loss-benchmarking

cd8c5b3

deleted unwanted files

5d61ee5

fixed logging

ca4746c

changed benchmark cases, improved plots, added std

bc8363e

changed import paths

53237cd

reordered data generation

0586006

moved model to gpu, created run up for benchmark

d7c8249

changed storing of yaml file

eae1a53

added cuda sync in benchmarking

e00ec3e

added benchmark for influence of number share

37c4f5d

added plots of latest gpu benchmarks

b9b39f3

renamed loss plots, added relative speed-up plot

d1bda72

added multiple measurement points in benchmarking

15cbfc1

complete cleanup

9a8a434

renamed timer point

5c2a0ba

added plotting for different parts of benchmark

57d2413

removed old files

95afe1c

changed tokenizer for ce loss

c2c55e4

added warmup runs

bea958f

minor change in input generation

632f820

minor change in input generation

653af60

added paper plot

b228c91

refined comment

31bbb28

Merge remote-tracking branch 'origin/main' into loss-benchmarking

4f319d5

s1k0ra requested review from jannisborn, zausin33 and Larspennig January 8, 2025 15:38

zausin33 reviewed Jan 8, 2025

benchmarking/loss_function_benchmark.py (outdated, resolved)

s1k0ra and others added 3 commits January 8, 2025 22:20

fused loss functions into models to avoid overhead

4420384

removed unused line and added png to gitignore

b325d62

Delete benchmarking/benchmarking_plots.png

8b1d44e

zausin33 approved these changes Jan 8, 2025

s1k0ra added 2 commits January 9, 2025 00:23

removed comment

30414fb

Merge branch 'loss-benchmarking' of github.com:tum-ai/number-token-loss into loss-benchmarking

4a078f5

jannisborn approved these changes Jan 9, 2025

zausin33 merged commit dc4eabe into main Jan 9, 2025
2 checks passed

zausin33 deleted the loss-benchmarking branch January 9, 2025 13:10
