this post was submitted on 08 Jul 2023
-11 points (34.3% liked)

Python

5878 readers
41 users here now

Welcome to the Python community on the programming.dev Lemmy instance!

πŸ“… Events

October 2023

November 2023

PastJuly 2023

August 2023

September 2023

🐍 Python project:
πŸ’“ Python Community:
✨ Python Ecosystem:
🌌 Fediverse
Communities
Projects
Feeds

founded 1 year ago
MODERATORS
 

Are you doing data science? Statistics? No?

Then for god's sake don't use pandas, you just look dumb af when you pull several MB of a package just to load csv. If you find yourself doing that, just stop programming and look for another job

Thanks for attention

all 8 comments
sorted by: hot top controversial new old
[–] _danny 27 points 1 year ago (1 children)

If you find yourself doing that, just stop programming and look for another job

I don't think that is an appropriate response to someone misunderstanding a package. Just educate them in a kind and respectful manner and they'll learn.

[–] [email protected] 3 points 1 year ago

fair enough.

[–] [email protected] 6 points 1 year ago (1 children)

For those that don't know, the standard library includes a csv package so you can just import csv.

Documented at https://docs.python.org/3/library/csv.html

[–] [email protected] 5 points 1 year ago

I use DictReader all the time, along with a library to hande the type detection. This is the way to go, especially if you need to process line by line or filter columns and rows out first.

Regardless, I'll avoid pandas wherever I can. It's not something I want in production level code if I can help it.

[–] [email protected] 2 points 1 year ago (1 children)

Wait what, people are loading data from csv in their websites?

[–] [email protected] 2 points 1 year ago

Makes sense if you want to give the user the ability to import data from csv (see django import export package). Beyond importing data from user or another service i dont see other uses, but they do exist.

[–] [email protected] 1 points 1 year ago (1 children)

Pandas is more efficient than Python at operating on large datasets. Can you suggest alternarives?