Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About preprocess code #2

Open
chengyj97 opened this issue Oct 14, 2020 · 10 comments
Open

About preprocess code #2

chengyj97 opened this issue Oct 14, 2020 · 10 comments

Comments

@chengyj97
Copy link

Is there any preprocess code(like sub-graph sampling or gt sub-graph extraction) that you can share with us?
Thanks a lot!!

@1216143369
Copy link

Can you share the code of sub-graph sampling or sub-graph extraction? Thanks a lot!

@YiwuZhong YiwuZhong reopened this Oct 31, 2020
@YiwuZhong
Copy link
Owner

Thanks for your interest in our work!

We'll try to get the preprocessing code released after CVPR.

@JingyuLi-code
Copy link

JingyuLi-code commented Dec 30, 2020

Thanks for your great work!
In your paper Table 3, Sub-GC-oracle is an upper bound by assuming that we have an oracle ranking function, i.e., always selecting the maximum scored sentence for each metric.
How do you solve it in your code? Thanks a lot!

@YiwuZhong
Copy link
Owner

Please refer to the section "Top-1 Accuracy Evaluation" in README, that is, "set --only_sent_eval to 1 and add --orcle_num 1000 in test.sh, and rerun the bash file".

@JingyuLi-code
Copy link

Yes, The upper bound is promising!!! Can you share the code of sub-graph sampling or sub-graph extraction now? Thanks a lot!

@Zora-zjj
Copy link

Zora-zjj commented Mar 9, 2021

hi, thanks for the sharing, I had some problems about generating caption with this code:

Traceback:
../ data Loaders/dataloader_test.py", line 198, in get_ batch
data['att_ masks'] = tmp_ att_ mask. view(-1,2, tmp_ this_ mini_batch, self .obj_ num)
RuntimeError: cannot reshape tensor of 0 elements into shape [-1, 2, 0, 37] because the unspecified dimension size -1 can be any value and is ambiguous

An error is generated every time when the caption for the NO.100432 image is generated.
Looking forward to your reply!

@SikandarBakht
Copy link

Hi, can we get the preprocessing code for sub-graph sampling? Also, I'd like some insights into getting custom images captioned with this code. I know there is an option for custom image captioning in the evaluation script but it does not have the accompanying code to produce it. Any help would be appreciated.

@zaryabmakram
Copy link

Hi @YiwuZhong ,
Really appreciate your work and making the code available. Can you kindly also make the preprocessing scripts available as well. It would be great if you could provide some insights on how we could re-train the model on a custom dataset.

@AleDella
Copy link

Hi, is available somewhere the code to produce scene graphs from a set of images? In case we want to use your model on other datasets that is not COCO or Flickr. If not yours, could you explain how you produced them?

Thank you very much in advance!

@YiwuZhong
Copy link
Owner

Hi, is available somewhere the code to produce scene graphs from a set of images? In case we want to use your model on other datasets that is not COCO or Flickr. If not yours, could you explain how you produced them?

Thank you very much in advance!

Hi @AleDella, thanks for your interests in our work.

As mentioned in the Implementation Details of paper, we first used Bottom-up object detector to detect objects from images and to extract region features. Using these region features as inputs, Motif-Net was trained to generate scene graphs from images. This is the model checkpoint of Motif-Net I trained and used to generate scene graphs. As a reference for saving scene graphs into local files, you might be able to use my script to replace the original file in Motif-Net codebase with some adaption as needed.

PS: There is another codebase for Bottom-up object detector to extract region features (bottom-up-attention.pytorch). This is my work for scene graph generation with image captions as only supervision (SGG_from_NLS).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants