A gradio application allowing users to detect potential logos given an input image. It performs zero-shot object detection using the OWLv2 Model.
-
Install Python 3.9+ on your machine.
-
Create and activate a python virtual enviornment (optional).
- Follow this link for instruction on how to create one.
-
Run
pip install -r requirements.txt
on the command line or powershell to install the necessary dependencies.- Note: If you have access to a GPU, please install the correct version of
pytorch
from this link according to your CUDA version.
- Note: If you have access to a GPU, please install the correct version of
-
Run
python app.py
on the command line or powershell to launch the GUI locally. Click on the given link to open the GUI in your browser.- Note: This will take some time to run the first time as it needs to download the OWLv2 model.
-
Follow the instructions given at the bottom of the application.
-
If you have a GPU and the correct version of pytorch installed, the app will automatically use the GPU for detection.
- With GPU, detection time per image is approximately 4 seconds.
- Without GPU, detection time per image is approximately 50 seconds.
-
Experiment with different images.
Credit goes to the authors of OWLv2 and Gradio.