See if git can be used for collaborative tracking of Mastodon projects? #17
Comments
No, it will just complain about conflicting binary files.
Yes,
Git is known to perform poorly on large binary files. The biggest Mastodon file I have seen so far is 50 MB, with roughly 400,000 spots. Storing only 1000 versions of that file without delta compression would produce 50 GB. Making a copy after every 10 added spots would lead to a theoretical 1 TB. The recommended size for a git repo is less than 1 GB. Also, GitHub rejects files larger than 100 MB, so we will soon reach this limit.
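The estimates above can be reproduced with a few lines of Python. This is only a back-of-the-envelope sketch: it assumes decimal units (1 MB = 10^6 bytes) and, for the 1 TB figure, that the file grows linearly with the number of spots, so early snapshots are smaller than the final 50 MB. The function names are made up for illustration.

```python
MB = 10**6  # decimal megabyte (assumption)


def full_copy_storage(file_size_bytes, versions):
    """Total bytes if every version is stored as a complete, undeltified copy."""
    return file_size_bytes * versions


def growing_snapshots_storage(final_size_bytes, total_spots, spots_per_snapshot):
    """Total bytes if a full copy is saved every `spots_per_snapshot` spots.

    Assumes the file size grows linearly with the number of spots.
    """
    snapshots = total_spots // spots_per_snapshot
    bytes_per_spot = final_size_bytes / total_spots
    # Snapshot k contains k * spots_per_snapshot spots.
    return sum(k * spots_per_snapshot * bytes_per_spot
               for k in range(1, snapshots + 1))


# 1000 full copies of a 50 MB file, in GB
print(full_copy_storage(50 * MB, 1000) / 10**9)
# one copy every 10 spots up to 400,000 spots, in TB (about 1 TB)
print(growing_snapshots_storage(50 * MB, 400_000, 10) / 10**12)
```

If every snapshot were counted at the full 50 MB instead, the same 40,000 copies would add up to about 2 TB, so 1 TB is the more conservative reading.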
The Mastodon file format is not friendly with regard to delta compression. I did an experiment: I opened a large dataset, saved it to a.mastodon, added a spot, and saved it to b.mastodon. I then uncompressed both Mastodon files and compared the model.raw file between the two. More than 100,000 bytes differ between those two files. I would expect maybe 1000 bytes. My conclusion, and also verified using
Conclusion: The Mastodon storage file format could be hugely improved in terms of "delta compression friendliness". Splitting the files into blocks would further reduce the load on git. We wouldn't even need to use git LFS.
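A comparison like the one in this experiment can be scripted. The sketch below is not the tool used for the numbers above; it simply counts byte positions at which two files differ. The paths in the usage comment are hypothetical and assume both project files have already been uncompressed into separate directories.

```python
from pathlib import Path


def count_differing_bytes(a: bytes, b: bytes) -> int:
    """Count positions where a and b differ.

    Bytes beyond the end of the shorter input count as differing.
    """
    differing = sum(x != y for x, y in zip(a, b))
    return differing + abs(len(a) - len(b))


def compare_files(path_a, path_b) -> int:
    """Read two files completely and count their differing byte positions."""
    return count_differing_bytes(Path(path_a).read_bytes(),
                                 Path(path_b).read_bytes())


# Hypothetical usage after uncompressing both project files:
# print(compare_files("a/model.raw", "b/model.raw"))
```

Note that a position-by-position count overstates the change if bytes were inserted in the middle of the file, because everything after the insertion is shifted. A delta-based tool would handle that case better.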
A specialized file format would greatly reduce bandwidth and storage requirements, as well as the need for git LFS. It would probably also improve performance and offline availability. SVN is another alternative, but it has the drawbacks of being centralized and not offline-capable. It also takes a slightly different approach to branching.
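To illustrate why splitting the data into blocks could help, here is a minimal sketch that cuts a byte string into fixed-size blocks and counts how many blocks of a new version are not already present in the old one. The block size of 4096 bytes and the function names are assumptions made for this example and are not part of any existing Mastodon format.

```python
import hashlib


def block_hashes(data: bytes, block_size: int = 4096) -> list[str]:
    """SHA-256 hash of each fixed-size block of `data`."""
    return [hashlib.sha256(data[i:i + block_size]).hexdigest()
            for i in range(0, len(data), block_size)]


def changed_blocks(old: bytes, new: bytes, block_size: int = 4096) -> int:
    """Number of blocks in `new` whose content does not occur in `old`."""
    old_hashes = set(block_hashes(old, block_size))
    return sum(h not in old_hashes for h in block_hashes(new, block_size))


# Toy data: 25 distinct blocks, then "add a spot" by appending a few bytes.
old = b"".join(i.to_bytes(4, "big") for i in range(25_600))
new = old + b"spot"
print(changed_blocks(old, new))  # only the appended block is new
```

If each block were stored as its own file, only the changed blocks would produce new objects in git. This only works when the format appends data or updates it in place: inserting bytes near the start shifts every later block boundary, so all blocks would appear changed.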
Open question:
Is it possible to use git as a backend for mastodon-sc/mastodon-git#12
Clarify the following questions?
(To reduce the size of the git repository)
Refine ticket: Git Compatible Mastodon Storage Format mastodon-git#21