Add Tensor Parallel support for ALL models #34789
Comments
If it's okay, I want to take Gemma and Gemma2. @ArthurZucker
@ArthurZucker I will add support for Granite. Thanks
Hey @ArthurZucker! I am going to work on Mistral.
Hey @ArthurZucker! I am going to start working on Qwen2.
For tensor parallelism, when reshaping the hidden states into heads, why is num_heads set to -1? This means the heads themselves are distributed across devices, which is fine. But an alternative approach could be to keep num_heads the same per device and let head_dim be the dynamic (-1) dimension instead. Is there a problem with the latter?
I think it might affect the rotary embeddings, so it's better to split along the heads.
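To make that trade-off concrete, here is a minimal, hypothetical sketch (plain PyTorch, not the library's actual implementation) of why sharding whole heads keeps rotary embeddings intact: the cos/sin tables rotate feature pairs *within* head_dim, so as long as each rank holds complete heads the rotation is identical to the single-GPU case.

```python
# Hypothetical sketch, not transformers' actual code: why the per-rank reshape
# uses -1 for the number of heads while keeping head_dim fixed.
import torch

def split_heads_per_rank(qkv: torch.Tensor, head_dim: int) -> torch.Tensor:
    # qkv: (batch, seq_len, local_hidden), where local_hidden is this rank's
    # shard of num_heads * head_dim after a column-parallel projection.
    # The -1 resolves to num_heads // tp_size, so each rank holds whole heads.
    batch, seq_len, _ = qkv.shape
    return qkv.view(batch, seq_len, -1, head_dim).transpose(1, 2)

def apply_rotary(x: torch.Tensor, cos: torch.Tensor, sin: torch.Tensor) -> torch.Tensor:
    # Rotary embeddings rotate pairs of features within head_dim. Because
    # head_dim is unchanged on every rank, cos/sin tables of shape
    # (seq_len, head_dim) apply identically with or without tensor parallelism.
    # If head_dim itself were sharded instead, each rank would only see part of
    # the rotation frequencies and the result would no longer match.
    x1, x2 = x[..., : x.shape[-1] // 2], x[..., x.shape[-1] // 2 :]
    rotated = torch.cat((-x2, x1), dim=-1)
    return x * cos + rotated * sin
```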
Just opening this to add support for all models following #34184
Let's bring support to all models! 🤗
It would be great to add support for more architectures such as
... and many more
For anyone who wants to contribute, just open a PR and link it to this issue, and ping me for a review!! 🤗
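As a rough illustration of what such a PR tends to involve (following the pattern introduced in #34184; the exact attribute name, keys, and supported values shown here are an assumption and should be checked against the current code of the model you are porting), the model's config declares a tensor-parallel plan mapping projection layers to column-wise or row-wise sharding:

```python
# Hypothetical sketch of a tensor-parallel plan for a decoder-style model,
# in the spirit of #34184. Verify the attribute name, module paths, and
# accepted values against the model you are actually porting.
base_model_tp_plan = {
    "layers.*.self_attn.q_proj": "colwise",  # shard output features (whole heads) across ranks
    "layers.*.self_attn.k_proj": "colwise",
    "layers.*.self_attn.v_proj": "colwise",
    "layers.*.self_attn.o_proj": "rowwise",  # shard input features, all-reduce the output
    "layers.*.mlp.gate_proj": "colwise",
    "layers.*.mlp.up_proj": "colwise",
    "layers.*.mlp.down_proj": "rowwise",
}
```

Column-wise layers split the output dimension (so each rank computes a subset of heads or intermediate features) and the matching row-wise layer splits the input dimension and reduces the partial results, which is why the attention and MLP blocks pair colwise projections with a final rowwise one.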