Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DO-NOT-MERGE] PR encompassing all changes needed to support neuron on Axlearn #919

Open
wants to merge 34 commits into
base: main
Choose a base branch
from

Conversation

apoorvtintin
Copy link
Contributor

@apoorvtintin apoorvtintin commented Jan 13, 2025

This PR contains all of the changes needed to run fuji family of models on TRN2/1. This PR is actively being broken down to create smaller and manageable PRs that can be reviewed/merged to Axlearn.

Smaller PRs created

This PR replaces the previous do-not-merge PR from a different fork. Sorry for the last minute change

patrick-toulme and others added 30 commits December 10, 2024 13:11
* fix neuron attention to make unit tests run

* remove top level iport from flash attention utils

* revert LNC change
* fix regressions and messages

* tpu_health_check.py import

* not defined variables

* full name for neuron
* added back skipping to test_pipeline_summary_writer

* fix host_array tests
Copy link
Contributor

@ruomingp ruomingp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ack.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants