Trained Resample with Siglip Got inconvergence loss #22

lucasjinreal · 2024-04-23T03:23:01Z

Hi, I adopt this Resampler module to LLaVa without slicing, and replace the vision encoder from CLIP to siglip, the loss can not converge.

Any thought about this?

guozonghao96 · 2025-01-04T03:24:39Z

We also encountered this problem recently. We found that when using vicuna v1.5 as LLM and Siglip-Large as ViT, the model can not converge. The final loss of pretraining stage is about 2.3~2.5, which make the final model degeneration to a bad performance. After using Qwen2-7B as LLM, there is no non-converge problem and a good performance than Vicuna v1.5-7B. Maybe there is something wrong when training on vicuna v1.5, but we do not found the true reason on it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trained Resample with Siglip Got inconvergence loss #22

Trained Resample with Siglip Got inconvergence loss #22

lucasjinreal commented Apr 23, 2024

guozonghao96 commented Jan 4, 2025

Trained Resample with Siglip Got inconvergence loss #22

Trained Resample with Siglip Got inconvergence loss #22

Comments

lucasjinreal commented Apr 23, 2024

guozonghao96 commented Jan 4, 2025