-
-
Notifications
You must be signed in to change notification settings - Fork 169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reuse chunks from the same download if duplicated #683
base: master
Are you sure you want to change the base?
Conversation
Drastically decrease memory footprint
I've not done a full review of this but I think at the very least it should be an option like reordering (ideally also in the config so you don't have to specify it each time). |
It should be opt-in, not opt-out, like reordering. |
I've created an option to disable this behavior. This feature functions more like reusing chunks during patching (which is enabled by default) than reordering. It is simply an adaptation of the same algorithm - just scroll a few lines up in the analysis function to find it.
Reordering is likely disabled by default because it is very time-consuming with questionable benefits. However, this option is not as resource-intensive. I conducted measurements on an extremely large project (~860 GB), and it took about 1.5 seconds on my machine. For normal projects, it typically takes around 0.0-0.1 seconds. In contrast, reordering calculations take about 4 minutes.
|
35a261d
to
e1945f0
Compare
The entire system was designed around minismising disk I/O. I'm not going to merge this if it's on by default. |
7866399
to
652b78b
Compare
From my perspective, this results in minimal I/O and primarily affects highly duplicated projects. The bigger issue is the enormous memory consumption. With this option, chunks remain in shared memory if we reuse different parts of the same chunk. Additionally, I noticed that the actual memory consumption reported by the CLI during downloads is higher than the calculated values. This means that even if there seems to be enough memory based on calculations, the system could still hang. Therefore, keeping parts in memory for such edge cases seems like the wrong approach to me. As a compromise, I've added an auto-retry mechanism for memory errors. Does this work for you? |
It's only an issue for like 2 games where the devs did something stupid. It's not worth compromising for. According to the Steam hardware survey 16 and 32 GB are the most common configurations, so memory isn't really an issue.
The size calculated is only the minimum, not a limit. If the configured cache size is larger it'll still be used to the full extent if data isn't flushed to disk quickly enough.
No, but I do want this to be a config option. But I can do that whenever I get around to cleaning up the commits and merging the PR. |
652b78b
to
9887248
Compare
And I/O is, so badly? I guess people more eager to use such enormous amount of memory for something else. If we have really 2 games like this, fallback will almost never happen because it is enough shared memory. But it is not, because it became so annoying and even Heroic devs added fallback with re-running legendary cli with auto-increasing shared memory each time if we have memory error with default shared memory amount, which leads to manifests re-downloading and much slower.
Fallback will improve user experience because users mostly use legendary with frontends and even are not able to see such errors. But even CLI users could suffer. Are you sure we want users to figure out which workaround from 3 available (increasing max memory, re-using files, or reordering) they need to apply? If this one surely works fast, why it cannot do a fallback in such cases? Also, not having a fallback will put some work to frontend developers to implement workarounds on their side. Did I convince you to a fallback, or you still have arguments against it? If not, please provide requirements for error messages to help users with choosing workaround, for config options (per game/global/etc/naming), and for CLI options naming if you aren't good with mine. UPD: I've added global config option.
You mean squashing commits into the one? I can do this then we'll come to final decision, or GitHub can do this for your on merging. Just let me know. I want this to be finished and do not stuck in unmerged forever. |
…parameter and config option
9887248
to
455b6ff
Compare
@derrod Just a reminder. Cold you please review the last comment and let me know what is left to get this merged? BTW we got a report about another game with the same issue. Thanks! |
Just need to find some time to re-review it, last week's been busy. |
Drastically decrease memory usage
Usually we do not need even 1Gb of shared memory with this patch.
Closes #17