Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement a simple multimodal RAG recipe with LLama Vision 11B model. #55

Open
silvererudite opened this issue Sep 30, 2024 · 1 comment

Comments

@silvererudite
Copy link

This issue is in response to the, call for contributions #43.

Hi @ariG23498, I would like to contribute a simple multimodal RAG recipe with the new LLama Vision models using pdf docs or any visual doc. If there's any feedback, suggestion or any conflict of interest pls let me know. Thanks!

@ariG23498
Copy link
Collaborator

Hey @silvererudite

I love the idea and it is very similar to #64. If you are free to collaborate with others, I would suggest you comment on the issue.

In the meantime I would keep this issue open till I hear back from you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants