feat: E4M3fnuz FP8 format added #281

maktukmak · 2024-08-15T21:44:38Z

Currently, the E4M3 FP8 format implemented is the ARM-Intel-Nvidia style. However, there is another style, IEEE 754 (torch name is float8_e4m3fnuz), which has a different bit configuration and different min-max values. This pull request aims to incorporate this style. The unit tests currently pass on CPU because the CPU supports both styles. However, they will fail when run on other devices. I need guidance on how to design the tests so that they only run a specific style based on the device. Once I have this information, I can complete this PR.
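
To make the difference in bit configuration and range concrete, here is a minimal sketch in plain Python (no torch) that decodes a single byte under each style. The names `decode_fp8_e4m3` and `finite_max` are hypothetical and not part of this PR. The sketch assumes the usual definitions of the two formats: exponent bias 7 with NaN at `0x7F`/`0xFF` for `float8_e4m3fn`, and exponent bias 8 with a single NaN at `0x80` and no negative zero for `float8_e4m3fnuz`.

```python
import math


def decode_fp8_e4m3(byte: int, fnuz: bool = False) -> float:
    """Decode one 8-bit pattern as E4M3 (1 sign, 4 exponent, 3 mantissa bits).

    fnuz=False follows the float8_e4m3fn layout, fnuz=True the
    float8_e4m3fnuz layout (assumed definitions, see the lead-in).
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a value in [0, 255]")

    sign = -1.0 if byte & 0x80 else 1.0
    exponent = (byte >> 3) & 0xF
    mantissa = byte & 0x7

    if fnuz:
        bias = 8
        # The pattern that would be negative zero encodes the only NaN.
        if byte == 0x80:
            return math.nan
    else:
        bias = 7
        # All exponent and mantissa bits set encodes NaN (either sign).
        if exponent == 0xF and mantissa == 0x7:
            return math.nan

    if exponent == 0:
        # Subnormal: no implicit leading one.
        return sign * (mantissa / 8.0) * 2.0 ** (1 - bias)
    return sign * (1.0 + mantissa / 8.0) * 2.0 ** (exponent - bias)


def finite_max(fnuz: bool) -> float:
    """Largest finite value representable in the chosen style."""
    values = (decode_fp8_e4m3(b, fnuz) for b in range(256))
    return max(v for v in values if not math.isnan(v))
```

Under these assumptions, `finite_max(False)` is 448.0 and `finite_max(True)` is 240.0, so a tensor quantized against one range cannot simply be reinterpreted in the other style.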

dacorvo · 2024-08-20T12:32:16Z

@maktukmak thank you for this pull-request.
As a first comment, please amend your commit to make it conventional (feat: e4m3fnuz added). This is a bit tedious but I want people to use meaningful commit messages and the simplest way was to use a predefined CI workflow that enforces conventional commit messages (I may improve this in the future to make it more flexible).
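
As a rough local check before pushing, a commit header can be tested against a conventional-commit pattern. This is only a sketch: the list of allowed types and the helper name `is_conventional` are assumptions for illustration, and the actual CI workflow may enforce different rules.

```python
import re

# Assumed set of commit types; the CI workflow may accept a different list.
CONVENTIONAL_HEADER = re.compile(
    r"^(build|chore|ci|docs|feat|fix|perf|refactor|revert|style|test)"
    r"(\([^()\s]+\))?"  # optional scope, e.g. feat(tensor): ...
    r"!?: \S.*$"        # optional breaking-change marker, then the subject
)


def is_conventional(header: str) -> bool:
    """Return True if the first line of a commit message matches the pattern."""
    return CONVENTIONAL_HEADER.match(header) is not None
```

With this pattern, `feat: e4m3fnuz added` is accepted, while `E4M3fnuz FP8 format added` is rejected.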
To exclude e4m3fnuz tests for a specific device, you have two options:

- duplicate the tests for these types and use the pytest.mark.skip_device decorator,
- inside each test, start by testing the device and qtype, and skip the test explicitly with pytest.skip.
You will find examples of both solutions in the existing test files (the first solution is used to skip float8 tests for MPS, and the second for tinygemm if CUDA version is less than 2.1).
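
The second option can be sketched as follows. This is an illustration only: the table `UNSUPPORTED_QTYPES`, the helper `should_skip`, and the test name are hypothetical, devices and qtypes are plain strings rather than torch objects, and the only pairing taken from this thread is CUDA with e4m3fnuz.

```python
import pytest

# Hypothetical table of (device type -> qtype names to skip).
# Only the CUDA / float8_e4m3fnuz pairing comes from this thread.
UNSUPPORTED_QTYPES = {"cuda": {"float8_e4m3fnuz"}}


def should_skip(device_type: str, qtype_name: str) -> bool:
    """Return True when the qtype must not be tested on this device type."""
    return qtype_name in UNSUPPORTED_QTYPES.get(device_type, set())


@pytest.mark.parametrize("qtype_name", ["float8_e4m3fn", "float8_e4m3fnuz"])
@pytest.mark.parametrize("device_type", ["cpu", "cuda"])
def test_float8_style(device_type, qtype_name):
    # Check device and qtype first, and skip explicitly if unsupported.
    if should_skip(device_type, qtype_name):
        pytest.skip(f"{qtype_name} is not supported on {device_type}")
    # The real test body would go here; this placeholder only confirms
    # that unsupported combinations never reach it.
    assert not should_skip(device_type, qtype_name)
```

Keeping the check at the top of the test avoids duplicating the test for each style, at the cost of repeating the guard in every test that takes a qtype.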

maktukmak · 2024-08-20T18:51:01Z

@dacorvo, I excluded CUDA for e4m3fnuz in the tests using the second option, and changed the commit messages.

github-actions · 2024-09-13T01:59:52Z

This PR is stale because it has been open 15 days with no activity. Remove stale label or comment or this will be closed in 5 days.

maktukmak · 2024-09-13T20:36:09Z

@dacorvo, I fixed the style, so it may pass the checks now.

dacorvo

Thanks for this pull-request!

dacorvo · 2024-09-17T12:03:56Z

Rebased and merged as #310

maktukmak requested a review from dacorvo as a code owner August 15, 2024 21:44

maktukmak force-pushed the add_e4m3fnuz branch from 7076e64 to 8c61881 Compare August 20, 2024 18:16

root added 2 commits August 20, 2024 18:19

feat: e4m3fnuz added

8c61881

test: exclude cuda for e4m3fnuz

9b1441f

maktukmak changed the title ~~[Draft] E4M3fnuz FP8 format added~~ E4M3fnuz FP8 format added Aug 22, 2024

maktukmak changed the title ~~E4M3fnuz FP8 format added~~ feat: E4M3fnuz FP8 format added Aug 22, 2024

Merge branch 'main' into add_e4m3fnuz

9eeabbc

github-actions bot added the Stale label Sep 13, 2024

fix: style correction

908eeb6

github-actions bot removed the Stale label Sep 14, 2024

Merge branch 'main' into add_e4m3fnuz

553737b

dacorvo approved these changes Sep 17, 2024

dacorvo mentioned this pull request Sep 17, 2024

feat: e4m3fnuz added #310

Merged

dacorvo closed this Sep 17, 2024
