Why NDJSON instead of JSON? #9068

alexandernst · 2024-05-31T12:18:10Z

yarn info (and others?) output NDJSON instead of JSON and I can't find a reason why you'd have picked NDJSON instead of JSON.

JSON is already very easily parseable with jq and the size increase in the output is negligible compared to NDJSON. Maybe switch to JSON and make it easier to parse yarn's output?

The text was updated successfully, but these errors were encountered:

Daniel15 · 2024-07-15T17:16:18Z

jq should handle newline-delimited JSON fine.

Are you talking about Yarn 1.x or 4.x? This repo is for 1.x, which is frozen and not getting updates.

alexandernst · 2024-07-15T17:21:12Z

Sure, jq can handle NDJSON, but why pick NDJSON instead of plain JSON in the first place? It just seems a weird decision given the fact that NDJSON is not that common and, albeit jq handling it properly, there most probably are ton of other tools that wont handle NDJSON.

I believe the output of this particular command is the same for 1.x and 4.x.

snydergd · 2025-01-31T18:32:25Z

When there is a large amount of data, I've found that it can be a challenge to efficiently handle the JSON file in code, because the choices I've seen are either:

Load the entire file into memory as an object - example json.parse(f) in python. I probably have enough memory to do it a few times at least, but still doesn't scale very nicely. I also can't start processing the data until the file has finished writing.
Do some sort of event-based JSON parsing similar to what SAX/STAX are to XML in Java, which from my experience tends to be a lot more cumbersome.

NDJSON (or JSON Lines, as I think is now the new name), solves this problem by allowing me to easily take in a single object at a time by reading lines in the file. Parsing/splitting lines has been trivial in any development environment/stack that I've worked with.

Am trying to think of a scenario where you would be coordinating the run of yarn info and feeding of the data into another tool, where you wouldn't have the ability to do the transformation into a JSON array like you are talking about. It would be dead-simple to do it with a node.js script, or you could use JQ, or you could even do it with something as primitive as bash scripting.

I would love to discuss the scenarios where NDJSON is proving problematic for you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why NDJSON instead of JSON? #9068

Why NDJSON instead of JSON? #9068

alexandernst commented May 31, 2024

Daniel15 commented Jul 15, 2024 •

edited

Loading

alexandernst commented Jul 15, 2024

snydergd commented Jan 31, 2025

Why NDJSON instead of JSON? #9068

Why NDJSON instead of JSON? #9068

Comments

alexandernst commented May 31, 2024

Daniel15 commented Jul 15, 2024 • edited Loading

alexandernst commented Jul 15, 2024

snydergd commented Jan 31, 2025

Daniel15 commented Jul 15, 2024 •

edited

Loading