this post was submitted on 02 Aug 2024
340 points (97.5% liked)

Science Memes

11068 readers
2633 users here now

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.



Rules

  1. Don't throw mud. Behave like an intellectual and remember the human.
  2. Keep it rooted (on topic).
  3. No spam.
  4. Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.



Research Committee

Other Mander Communities

Science and Research

Biology and Life Sciences

Physical Sciences

Humanities and Social Sciences

Practical and Applied Sciences

Memes

Miscellaneous

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] mvirts 1 points 3 months ago (1 children)

It's more likely you'll eat up storage when you read a 600mb parquet and try to write it as CSV.

[–] [email protected] 1 points 3 months ago (1 children)

I mean, yeah, that's the point of compression. I don't quite get what you mean by that comment.

[–] mvirts 1 points 3 months ago (1 children)

Ah I was trying to point out that CSV is the inefficient format. Reading a large amount of data from a more efficient format like parquet is more likely to cause trouble because the memory required can be more than the file size. CSV is the opposite where it will almost always use more disk space than is required to represent the data in memory.

[–] [email protected] 1 points 3 months ago