Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add new paper: #38

Open
wyzh0912 opened this issue Jan 19, 2025 · 0 comments
Open

Add new paper: #38

wyzh0912 opened this issue Jan 19, 2025 · 0 comments

Comments

@wyzh0912
Copy link
Contributor

Title

Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models

Published Date

2025-1-14

Source

arxiv

Head Name

Retrieval Head

Summary

  • Innovation: Proposed retrieval hit rate, a more direct metric to evaluate attention heads.
  • Tasks: In long-context retrieval tasks, identify the retrieval head and utilize it to improve the model's retrieval performance.
  • Significant Result: Through Joint Retrieval Head Training, a new retrieval head can be obtained to predict the Top-K context for generating final responses, enhancing the model's in-context retrieval and reasoning capabilities.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant