The configuration of eval code for differenct scenes #43

ChengZY · 2024-05-14T08:17:42Z

I tested on 3D-OVS data. It can achieve similar results (IoU) on the sofa scene.
But the performance for other scenes are not good as expeted.

I noted the language feature images are well-trained, the reason should be the setting of the eval code, such as threshold and the kernal size. Does it mean we need to try the setting manually to achieve the best results?

Below is the sample of the bench scene, including language feature image, groundtruth and the predicted mask. Do you have any suggestion?

ChengZY · 2024-06-04T06:17:29Z

The predicted language feature image is the 512 dim language picture. To visualize the result, I selected 3 channels as RGB.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The configuration of eval code for differenct scenes #43

The configuration of eval code for differenct scenes #43

ChengZY commented May 14, 2024

ChengZY commented Jun 4, 2024

The configuration of eval code for differenct scenes #43

The configuration of eval code for differenct scenes #43

Comments

ChengZY commented May 14, 2024

ChengZY commented Jun 4, 2024