Add fine-tuning documentation #76

johnbradley · 2024-12-17T13:19:13Z

Provide documentation on how users could fine-tune BioCLIP.

johnbradley · 2024-12-17T13:22:59Z

@samuelstevens Suggested some example code from biobench:

binary SVM on vision models: https://github.com/samuelstevens/biobench/blob/main/biobench/newt/__init__.py
ridge classifier (kind of a linear probe): https://github.com/samuelstevens/biobench/blob/main/biobench/plantnet/__init__.py
simpleshot (sort of an upgraded KNN): https://github.com/samuelstevens/biobench/blob/main/biobench/birds525/__init__.py

johnbradley · 2024-12-18T21:43:22Z

My understanding of the three example above: These use only the visual part of the BioCLIP model and train SVM/ridge classifier/simpleshot based on the image embeddings. No text embeddings are used.

samuelstevens · 2024-12-19T23:26:52Z

Yep, that's correct. biobench is a method to evaluate vision-only models like the vision encoder from BioCLIP, or DINOv2, etc.

@samuelstevens

Adds a notebook that trains a SVM that predicts species based on image embeddings created by BioCLIP. The SVM logic is based on https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76.

@samuelstevens

Adds a notebook that trains a SVM that predicts species based on image embeddings created by BioCLIP. The SVM logic in init_svc() is copied from https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]> Co-authored-by: Copilot <[email protected]>

@samuelstevens

Adds a notebook that trains a SVM that predicts species based on image embeddings created by BioCLIP. The SVM logic in init_svc() is copied from https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]> Co-authored-by: Copilot <[email protected]> Co-authored-by: Hilmar Lapp <[email protected]>

@samuelstevens

Adds a notebook that trains a RidgeClassifier that predicts species based on image embeddings created by BioCLIP. The RidgeClassifier logic is simplified from init_svc() in https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]>

@samuelstevens

Adds a notebook that trains a RidgeClassifier that predicts species based on image embeddings created by BioCLIP. The RidgeClassifier logic is simplified from init_svc() in https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]>

@samuelstevens

Adds a notebook that trains a Simpleshot to predicts species based on image embeddings created by BioCLIP. The Simpleshot logic is based on from https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]>

@samuelstevens

Adds a notebook that trains a RidgeClassifier that predicts species based on image embeddings created by BioCLIP. The RidgeClassifier logic is simplified from init_svc() in https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]> Co-authored-by: Copilot <[email protected]>

johnbradley · 2025-01-23T15:09:25Z

@samuelstevens I want to add some documentation comparing the different approaches above. How would you suggest a user go about deciding between the three options?

hlapp · 2025-01-23T19:37:04Z

Maybe we should try and ask Copilot or ChatGPT, or as seems to be the latest rave, the most recent version of DeepSeek 😉 (Though only half joking)

@samuelstevens

Adds a notebook that trains a Simpleshot to predicts species based on image embeddings created by BioCLIP. The Simpleshot logic is based on from https://github.com/samuelstevens/biobench by @samuelstevens. Part of issue #76. Co-authored-by: Samuel Stevens <[email protected]>

samuelstevens · 2025-01-23T20:46:21Z

Yeah I actually think asking an LLM for a summary is a good idea. Fundamentally, BioCLIP embeddings are designed to be linearly separable. The SVM and ridge classifier are linear (unless you choose a nonlinear kernel in the SVM), while the KNN/Simpleshot is non-linear.

Another point is that SVM and ridge classifier are parametric ML methods that explicitly learn parameters, while KNN is non-parametric. But in practice, all three will produce similar results, and I'm not sure that I have any recommendations on what methods are likely to produce the best results based on different tasks.

BioBench would be a good way to test this but it's not set up to evaluate multiple ML methods on a given task right now---I am trying to write a short paper for CV4Animals about BioBench and some findings, but I won't get to that until February. If you're willing to wait, I am happy to share any insights I gain from that project.

johnbradley · 2025-01-27T13:17:36Z

Asking ChatGPT

Compare SVM, Ridge Classifier, and SimpleShot for fine tuning image embeddings 
that linearly separable for classification.

ChatGPT Output

Recommendations

SVM: Best suited for tasks where embeddings are high-quality and you need precise decision boundaries.
Ridge Classifier: Ideal for large datasets where computational efficiency is crucial and embeddings are linearly separable with noise.
SimpleShot: Excellent for few-shot learning scenarios, or when you want a fast, non-parametric approach without retraining.

If your embeddings are linearly separable and classification is the goal, SVM often provides the best performance in terms of accuracy, but Ridge Classifier may be preferred for faster computation. If simplicity or few-shot adaptability is key, go for SimpleShot.

johnbradley · 2025-01-27T14:31:37Z

Link to full details from ChatGPT

hlapp · 2025-01-27T14:47:47Z

Perhaps @samuelstevens can double check what is true and what might be hallucinations, but I'd say pretty good on the face of it.

samuelstevens · 2025-01-28T00:05:04Z

Perhaps the best strengths of each are:

SVMs can support linear and non-linear boundaries and are suitable for binary classification or fewer than ~20 classes (because you train a one-vs-rest for each class).
Ridge classifiers are best for linear classification tasks. They require training but are powerful classifiers for many, many tasks, especially with sufficient data.
SimpleShot is extremely data-efficient and works well for multiple (20+) classes.

This table summarizes this well I think:

Method	Maximum Classes	Minimum Training Data
SVM	~20	5+ examples
Ridge Classifier	No maximum	10+ examples per class
SimpleShot	No maximum	1+ example per class

So if you have 3 classes with 5 examples each, I would expect SVMs > SimpleShot > Ridge Classifier. If you have 40 classes with 50 examples each, I would expect Ridge Classifier > SimpleShot, and I would expect SVMs to take too long to train reliably.

Does that clarify things?

johnbradley mentioned this issue Dec 30, 2024

Add Fine-tune SVM example notebook #78

Merged

johnbradley mentioned this issue Jan 10, 2025

Add Fine-tune RidgeClassifier example notebook #79

Merged

johnbradley mentioned this issue Jan 23, 2025

Add Fine-tune Simpleshot example notebook #82

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fine-tuning documentation #76

Add fine-tuning documentation #76

johnbradley commented Dec 17, 2024

johnbradley commented Dec 17, 2024

johnbradley commented Dec 18, 2024 •

edited

Loading

samuelstevens commented Dec 19, 2024

johnbradley commented Jan 23, 2025

hlapp commented Jan 23, 2025 •

edited

Loading

samuelstevens commented Jan 23, 2025

johnbradley commented Jan 27, 2025

johnbradley commented Jan 27, 2025

hlapp commented Jan 27, 2025

samuelstevens commented Jan 28, 2025

Add fine-tuning documentation #76

Add fine-tuning documentation #76

Comments

johnbradley commented Dec 17, 2024

johnbradley commented Dec 17, 2024

johnbradley commented Dec 18, 2024 • edited Loading

samuelstevens commented Dec 19, 2024

johnbradley commented Jan 23, 2025

hlapp commented Jan 23, 2025 • edited Loading

samuelstevens commented Jan 23, 2025

johnbradley commented Jan 27, 2025

Asking ChatGPT

ChatGPT Output

Recommendations

johnbradley commented Jan 27, 2025

hlapp commented Jan 27, 2025

samuelstevens commented Jan 28, 2025

johnbradley commented Dec 18, 2024 •

edited

Loading

hlapp commented Jan 23, 2025 •

edited

Loading