Gridsearch normalisation not working for language after the latest update #349

young-x-skyee · 2024-07-29T08:27:16Z

After merging the main branch into the language branch today, the warning messages appear far too many times, which did not happen before...

/imaging/projects/cbu/kymata/analyses/tianyi/kymata-core/kymata-core-data/output/fc2_test/decoder/log/slurm_log_4.txt

young-x-skyee · 2024-07-29T09:38:00Z

The problem seems to be in the new vector.py file, because when I revert only that file it works again.

young-x-skyee · 2024-07-29T09:39:06Z

Just to clarify, I'm working on the kymata-language branch and the .sh file I'm using is

/imaging/projects/cbu/kymata/analyses/tianyi/kymata-core/submit_gridsearch_models_fc2_decoder.sh

neukym · 2024-07-29T09:42:59Z

And the commit hash for it is 64fcdad.

caiw · 2024-07-30T20:43:33Z

That x /= _normalize_magnitude(x) is inside a context manager which should raise an error on a divide-by-zero:

with np.errstate(divide="raise"):
    x /= _normalize_magnitude(x)

so I reckon this means it's dividing by nan, or perhaps inf. Looking at _normalize_magnitude(), it's just doing this:

def _normalize_magnitude(x: NDArray) -> NDArray:
    """Reusable magnitude function for use in `normalize`."""
    return np.sqrt(np.sum(x**2, axis=-1, keepdims=True))

So if it's producing a nan, it must be because the input contains a nan. Or it's possible that the input is too large and the sum-squared is going to inf. I did put in a gross hack which multiplies up the input by 10^6 in case it has zero magnitude (to avoid divide-by-zero errors on very small inputs to normalize(), where the magnitude can go to zero because of float16 precision), so if the input was very large, then multiplying by 10^6 could make it too big. But that should only happen if the input had a zero magnitude in one slice.
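To make the float16 precision point concrete, here is a minimal sketch that reuses the magnitude function quoted above. The sample values are made up for illustration and are not taken from the data in this issue. It shows how squaring in float16 can collapse a small but non-zero input to zero magnitude, and push a moderately large input to inf, while the same values in float64 behave normally.

```python
import numpy as np

def _normalize_magnitude(x):
    """Magnitude along the last axis, as quoted in the comment above."""
    return np.sqrt(np.sum(x**2, axis=-1, keepdims=True))

# Illustrative values only: both are representable in float16,
# but their squares are not.
tiny = np.array([1e-4, 1e-4], dtype=np.float16)
large = np.array([300.0, 300.0], dtype=np.float16)

# Silence the overflow/underflow warnings so only the results are shown.
with np.errstate(over="ignore", under="ignore"):
    tiny_mag16 = _normalize_magnitude(tiny)    # squares underflow to 0
    large_mag16 = _normalize_magnitude(large)  # squares overflow to inf

# The same values widened to float64 have ordinary finite magnitudes.
tiny_mag64 = _normalize_magnitude(tiny.astype(np.float64))
large_mag64 = _normalize_magnitude(large.astype(np.float64))

print(tiny_mag16, large_mag16, tiny_mag64, large_mag64)
```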

So if my reasoning is right (and it may not be!), it seems like the changes to vector.py should at most have exchanged some divide-by-zero warnings for some divide-by-nan/inf warnings...
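One detail that may be relevant here: NumPy treats a non-zero value divided by zero (the `divide` category) and 0/0 (the `invalid` category) as separate floating-point error categories, so `np.errstate(divide="raise")` on its own raises for the first but only warns for the second. The sketch below is an illustration with a hypothetical helper, `try_divide`, and is not code from vector.py.

```python
import warnings
import numpy as np

def try_divide(numerator, denominator):
    """Divide under divide="raise" and report (result, error, warning_messages)."""
    num = np.asarray(numerator, dtype=float)
    den = np.asarray(denominator, dtype=float)
    result, error = None, None
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        try:
            # invalid="warn" is NumPy's default; it is spelled out here for clarity.
            with np.errstate(divide="raise", invalid="warn"):
                result = num / den
        except FloatingPointError as exc:
            error = str(exc)
    return result, error, [str(w.message) for w in caught]

nonzero_by_zero = try_divide([1.0], [0.0])     # "divide" category: raises
zero_by_zero = try_divide([0.0], [0.0])        # "invalid" category: warns, gives nan
anything_by_nan = try_divide([1.0], [np.nan])  # nan propagates into the result

for name, outcome in [("1/0", nonzero_by_zero), ("0/0", zero_by_zero), ("1/nan", anything_by_nan)]:
    print(name, outcome)
```

If that is what is happening, an all-zero slice would produce warnings rather than an exception; adding `invalid="raise"` to the context manager would be one way to check.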

@young-x-skyee Does the above shed any light on the issue? Does the emeg data you're loading in contain nans? Or perhaps it contains some dead channels which are all constant, and therefore can't be normalized without triggering a warning?
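A quick way to answer those questions might look like the following sketch. The helper `diagnose_for_normalize` and the example array are hypothetical, and it assumes normalisation runs along the last axis, as in the magnitude function quoted above.

```python
import numpy as np

def diagnose_for_normalize(x):
    """Summarise properties of x that could upset normalisation along the last axis."""
    x = np.asarray(x, dtype=np.float64)
    # nan/inf in the input would otherwise trigger warnings in these reductions.
    with np.errstate(invalid="ignore", over="ignore"):
        magnitude = np.sqrt(np.sum(x**2, axis=-1))
        value_range = x.max(axis=-1) - x.min(axis=-1)
    return {
        "n_nan": int(np.isnan(x).sum()),
        "n_inf": int(np.isinf(x).sum()),
        # Slices whose magnitude is exactly zero (all-zero slices).
        "zero_magnitude_slices": np.argwhere(magnitude == 0).tolist(),
        # Slices where every value is identical (for example dead channels).
        "constant_slices": np.argwhere(value_range == 0).tolist(),
    }

# Made-up example: rows stand in for channels, columns for time samples.
data = np.array([
    [1.0, 2.0, 3.0],     # ordinary channel
    [0.0, 0.0, 0.0],     # all zeros
    [5.0, 5.0, 5.0],     # constant, non-zero
    [1.0, np.nan, 2.0],  # contains a nan
])
report = diagnose_for_normalize(data)
print(report)
```

Note that a constant non-zero slice still has a non-zero magnitude; it would only cause trouble if the data is mean-centred before normalising.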

caiw · 2024-07-30T20:46:24Z

Having said the above, I'm now using full floats for function values, not float16s, so my awful hack might not be necessary any more. You could try commenting out

if (_normalize_magnitude(x) == 0).any():
    x *= 1_000_000

from normalize(). If that fixes it, please submit a pull request which deletes those lines and I'll test on the functions I was running which previously required it. If not, we can dig further into it.

caiw · 2024-07-30T20:53:13Z

Note to self: better yet, normalize should do something like x.astype(float) before calling _normalize_magnitude, to ensure the problem is always avoided. Need to think about how to deal with the inplace arg though.
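A rough sketch of that idea, assuming it is acceptable to do the arithmetic in float64 and cast back when writing in place. `normalize_sketch` is a hypothetical name and this is not the actual `normalize` in vector.py; raising on 0/0 as well as on x/0 is a choice made for this sketch.

```python
import numpy as np
from numpy.typing import NDArray

def _normalize_magnitude(x: NDArray) -> NDArray:
    """Magnitude along the last axis, as quoted earlier in the thread."""
    return np.sqrt(np.sum(x**2, axis=-1, keepdims=True))

def normalize_sketch(x: NDArray, inplace: bool = False) -> NDArray:
    """Hypothetical normalize: do the arithmetic in float64 whatever x's dtype."""
    wide = x.astype(np.float64)  # astype copies by default, so x is untouched here
    # Raise on both x/0 and 0/0 instead of only warning.
    with np.errstate(divide="raise", invalid="raise"):
        wide /= _normalize_magnitude(wide)
    if inplace:
        x[...] = wide  # write back, casting to x's original dtype
        return x
    return wide

# Made-up float16 input whose magnitude would be zero if computed in float16.
small = np.array([1e-4, 1e-4], dtype=np.float16)
out = normalize_sketch(small)          # float64 result, small unchanged
normalize_sketch(small, inplace=True)  # small overwritten, still float16
print(out, small)
```

With `inplace=True` the result is cast back to the input's dtype, so an integer input would be truncated; whether that should be an error is one of the open questions about the inplace arg.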

young-x-skyee added the labels 🪲 bug (Something isn't working), ❓ discussion needed (Extra discussion is needed before work can commence), and gridsearch (Related to the gridsearch) Jul 29, 2024

young-x-skyee assigned caiw Jul 29, 2024

caiw assigned young-x-skyee Jul 30, 2024
