Move away from zip and use tar+gzip instead #343

leostera · 2015-12-04T23:29:18Z

Just about that. Refactor the build scripts so they use tar instead.

It'd require refactoring the node-client to use node-tar instead of unzip2. But that should be fairly straightforward since they share the same .Extract interface.

Permafacture · 2015-12-28T20:05:21Z

While someone is at it, xz (LZMA) compression is preferred to gzip.

igorshubovych · 2016-02-11T11:21:14Z

@Ostera do you think we need it?

IMO it does not solve any problem at the moment, but may create potential issues for the clients who rely on it.

leostera · 2016-02-11T12:06:08Z

Ultimately, using tar instead of zip gives you not only a choice between gzip and bzip2 compression, but also the ability to store UNIX metadata (uid, gid, permissions). Plain zip only stores MS-DOS attributes (hidden, system, other crap); UNIX metadata there depends on tool-specific extensions.
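To illustrate the metadata point, here is a minimal sketch using Python's standard `tarfile` module rather than the project's actual build scripts. It writes one in-memory file into a gzipped tarball and reads the permissions and owner IDs back. The path `pages/common/tar.md`, the placeholder content, and the helper name `roundtrip_metadata` are assumptions made up for the example.

```python
import io
import tarfile


def roundtrip_metadata(mode, uid, gid):
    """Write one in-memory file to a .tar.gz and read its metadata back."""
    data = b"# tar\n"  # placeholder content, not a real tldr page
    info = tarfile.TarInfo(name="pages/common/tar.md")  # hypothetical path
    info.size = len(data)
    info.mode = mode  # UNIX permission bits
    info.uid = uid    # numeric owner id
    info.gid = gid    # numeric group id

    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w:gz") as archive:
        archive.addfile(info, io.BytesIO(data))

    # Re-open the archive from memory and inspect the stored header fields.
    buf.seek(0)
    with tarfile.open(fileobj=buf, mode="r:gz") as archive:
        member = archive.getmember("pages/common/tar.md")
        return member.mode, member.uid, member.gid


if __name__ == "__main__":
    mode, uid, gid = roundtrip_metadata(0o644, 1000, 1000)
    print(oct(mode), uid, gid)
```

The values survive the round trip because they are stored in the tar header of each member, independently of the compression layer wrapped around the archive.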

Then there's another advantage, which is using gzip instead of zip (unless it's possible to use that compression algorithm with zip too).

gzip compresses one big blob (the tarball), which means that strings repeated across pages are compressed more efficiently, whereas zip compresses each file separately. So a zip of 100 identical small files can be close to 100 times bigger than a gzipped tarball of the same files.
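The size argument above can be checked with a rough sketch like the following, which builds both archive types in memory with Python's standard `zipfile` and `tarfile` modules. The page text and file names are invented for the example and are not taken from the repository; real pages differ from each other, so the actual gap will be smaller than with identical files.

```python
import io
import tarfile
import zipfile


def zip_size(files):
    """Size in bytes of a DEFLATE-compressed zip holding the given files."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)  # each entry is compressed on its own
    return len(buf.getvalue())


def targz_size(files):
    """Size in bytes of a gzipped tarball holding the given files."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w:gz") as tf:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tf.addfile(info, io.BytesIO(data))  # gzip sees one continuous stream
    return len(buf.getvalue())


# Hypothetical page content, repeated under 100 made-up file names.
page = (
    b"# example\n\n"
    b"> A made-up command used only for this size comparison.\n\n"
    b"- Run the command on a file:\n\n"
    b"`example {{path/to/file}}`\n\n"
    b"- Run the command recursively on a directory:\n\n"
    b"`example --recursive {{path/to/directory}}`\n"
)
files = {f"pages/common/example{i}.md": page for i in range(100)}

if __name__ == "__main__":
    print("zip:   ", zip_size(files), "bytes")
    print("tar.gz:", targz_size(files), "bytes")
```

Because the tarball is compressed as a single stream, repeated content in neighbouring files can be encoded as back-references, while the zip pays the full compressed cost plus per-entry headers for every file.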

igorshubovych · 2016-02-11T13:05:29Z

@Ostera
You are right, it is 63 KB vs 143 KB at the moment.
Probably we should do both to let people migrate.

leostera · 2016-02-11T14:33:23Z

Precious kilobytes! I'd introduce gzip in the next release and then drop zip support in the one after that, so people have time to migrate.

igorshubovych · 2016-02-11T14:45:37Z

They are precious.
Some people are using tldr on mobile clients. Now imagine they update the package every week.

agnivade · 2016-10-06T06:23:01Z

Closing it as discussed on tldr-pages/tldr-node-client#9. TLDR: Clients should prefer git over manually downloading .zip archives.

pepa65 · 2017-03-12T00:29:37Z

Since we're talking about roughly 100kB for the whole archive, smaller than most current webpages, I think downloading the whole archive shouldn't be such a problem every once in a while.

waldyrious · 2017-04-25T21:59:15Z

@pepa65 while that approach may work now, IMO it's neither scalable nor elegant. But in any case, this issue was created back when clients (especially the node one) were tightly coupled to the tldr-pages repo, and we want to move away from that, by providing a spec that any client can follow.

So, as long as the clients follow the spec's recommendations, they could still use the full zip download approach, since the archives (tldr.zip and index.json) are currently still being generated upon every commit to this repo. We won't invest time changing the format of the archive as suggested on this issue, since we don't recommend that method of updating clients' local cache of pages, and we won't commit to support the archive generation indefinitely (e.g. if some part of the pipeline breaks), but we won't go out of our way to deliberately curtail that service, either -- not without previous discussion with the clients' authors, at least :)

igorshubovych added the page edit Changes to an existing page(s). label Dec 5, 2015

igorshubovych self-assigned this Dec 5, 2015

This was referenced Dec 5, 2015

Does it really need index.json? #336

Closed

Error: invalid distance too far back (Zlib) #355

Closed

leostera changed the title ~~Move away from zip and use tar instead~~ Move away from zip and use tar+gzip instead Feb 11, 2016

waldyrious added architecture Organization of the pages per language, platform, etc. and removed page edit Changes to an existing page(s). labels Aug 31, 2016

agnivade mentioned this issue Oct 5, 2016

Perform a automatic cache refresh when a page isn't found tldr-pages/tldr-node-client#9

Closed

agnivade closed this as completed Oct 6, 2016

waldyrious mentioned this issue Mar 11, 2017

Define spec for officially sanctioned clients #1065

Closed

waldyrious mentioned this issue Apr 25, 2017

Investigate using Git LFS for storing the pages tarball #344

Closed

waldyrious mentioned this issue Dec 24, 2019

New pages structure #190

Closed

waldyrious mentioned this issue May 3, 2021

idea: difference reporting when running tldr --update tldr-pages/tldr-node-client#332

Open
