
PyO3: Always broadcast +, -, /, * #1101

Merged

Conversation

LLukas22
Contributor

Always broadcast magic methods, similarly to PyTorch.

A question I still have: does it have any drawbacks to always use the broadcasting variants of the operations here? Would it be faster to first check the shapes and then perform either the normal or the broadcasted version of the operation?
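For reference, a minimal sketch (not the exact code in this PR) of the idea: the Python magic method dispatches straight to the broadcasting variant. `PyTensor` here is a hypothetical newtype around candle's `Tensor`, used only for illustration.

```rust
// Minimal sketch, not the code from this PR: a PyO3-wrapped tensor whose
// __add__ always dispatches to broadcast_add. `PyTensor` is a hypothetical
// newtype around candle_core::Tensor for illustration purposes.
use candle_core::Tensor;
use pyo3::exceptions::PyValueError;
use pyo3::prelude::*;

#[pyclass(name = "Tensor")]
struct PyTensor(Tensor);

#[pymethods]
impl PyTensor {
    fn __add__(&self, rhs: &PyTensor) -> PyResult<PyTensor> {
        // broadcast_add behaves like a plain element-wise add when the shapes
        // already match, so no explicit shape check is needed up front.
        let out = self
            .0
            .broadcast_add(&rhs.0)
            .map_err(|e| PyValueError::new_err(e.to_string()))?;
        Ok(PyTensor(out))
    }
}
```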

@LaurentMazare
Collaborator

There should be no performance impact, as broadcast_add etc. should do the right thing. The main reason for candle to use non-broadcasting ops by default is that it's very easy to shoot yourself in the foot with broadcasting, so we require broadcasting to be explicit. I'm pretty convinced this is the right call for the Rust API: even if we try to mimic the PyTorch API in Rust, one can still use broadcast_add etc. to opt in to the broadcasting behavior.
I'm a bit torn when it comes to the Python API. On one hand it would be nice to be as close as possible to the PyTorch version, but on the other hand this doesn't seem like a sane default to me, and sticking to non-broadcasting by default certainly has its upsides.
Any thoughts?
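To illustrate the distinction on the Rust side, here is a small sketch (assuming candle_core as the dependency; shapes picked arbitrarily): plain add requires matching shapes, while broadcast_add is the explicit opt-in.

```rust
// Sketch of the explicit opt-in in candle's Rust API (shapes chosen for illustration).
use candle_core::{DType, Device, Result, Tensor};

fn main() -> Result<()> {
    let dev = Device::Cpu;
    let a = Tensor::ones((2, 3), DType::F32, &dev)?;
    let b = Tensor::ones((1, 3), DType::F32, &dev)?;

    // Plain `add` expects identical shapes, so this returns a shape error.
    assert!(a.add(&b).is_err());

    // `broadcast_add` is the explicit opt-in; `b` is broadcast along dim 0.
    let c = a.broadcast_add(&b)?;
    assert_eq!(c.dims(), &[2, 3]);
    Ok(())
}
```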

@LLukas22
Contributor Author

I'm 100% on your side regarding the Rust API: not broadcasting should be the default behavior. Regarding the Python wrapper I'm also a bit torn. The problem is that PyTorch does a lot of things implicitly to make the API more "pythonic", while candle does nearly everything explicitly since it lives in the Rust ecosystem. This puts us in a bit of a lose-lose situation. We basically have two options:

  1. Design the Python wrapper implicitly and close to PyTorch. That way we have minimal friction between the two ecosystems, and it should be easier for PyTorch users to bring their code with them and start hacking around in candle. The drawback is that, similarly to PyTorch, we lose some of the explicit behavior, but that's a problem Python has always had.
  2. Keep the wrapper close to candle's API, meaning we want most things to be explicit. This results in bigger differences and more friction between the ecosystems, as we basically need some sort of "migration guide". The advantage is that we can focus more on the safety and explicit behavior we get from this approach, which would be better for production systems.

In my opinion option 1 would be the way to go, as I see the Python wrapper as something I can play with and try things out in before writing them up in Rust. I'm not expecting it to be safe or explicit; I just want to hack stuff together, and if it explodes in my face, so be it. Regarding option 2, if we want to go that way we can keep the wrapper extremely simple, as we can discard most of the implicit features. But if we already need a "migration guide" to port models from PyTorch to our wrapper, it would probably be advisable to point users directly towards the Rust implementation, because it does type checking, safety, and explicit behavior far better than Python ever could.

I basically like the implicit way more, as it's a better fit for Python and you just have less to think about when using it, but I definitely wouldn't use it for a production system.
What are your thoughts on this?

@LaurentMazare
Collaborator

Ok, let's give it a try and we'll see how it goes. Maybe the best would be to have something a bit more like jax, i.e. one Python API that is very low level and matches the c++ side, and one higher-level Python API that tries to be Python-idiomatic, but that seems like a bit too much effort at this point.

@LaurentMazare LaurentMazare merged commit b355ab4 into huggingface:main Oct 17, 2023
10 of 12 checks passed
EricLBuehler pushed a commit to EricLBuehler/candle that referenced this pull request Oct 25, 2023