Learn Programming

1625 readers

66 users here now

Posting Etiquette

Ask the main part of your question in the title. This should be concise but informative.
Provide everything up front. Don't make people fish for more details in the comments. Provide background information and examples.
Be present for follow up questions. Don't ask for help and run away. Stick around to answer questions and provide more details.
Ask about the problem you're trying to solve. Don't focus too much on debugging your exact solution, as you may be going down the wrong path. Include as much information as you can about what you ultimately are trying to achieve. See more on this here: https://xyproblem.info/

Icon base by Delapouite under CC BY 3.0 with modifications to add a gradient

founded 2 years ago

MODERATORS

[email protected]

What is NoSQL good for? (programming.dev)

submitted 1 day ago by [email protected] to c/[email protected]

21 comments fedilink hide all child comments

I’m versed enough in SQL and RDBMS that I can put things in the third normal form with relative ease. But the meta seems to be NoSQL. Backends often don’t even provide a SQL interface.

So, as far as I know, NoSQL is essentially a collection of files, usually JSON, paired with some querying capacity.

What problem is it trying to solve?
What advantages over traditional RDBMS?
Where are its weaknesses?
Can I make queries with complex WHERE clauses?

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 9 points 1 day ago* (last edited 1 day ago) (3 children)

Let me see if I got it. It would be like a denormalized table with a flexible number of columns? So instead of multiple rows for a single primary key, you have one row (the file), whose structure is variable, so you don’t need to traverse other tables or rows to gather/change/delete the data.

The downsides are the usual downsides of a denormalized DB.

Am I close?

[–] [email protected] 8 points 1 day ago* (last edited 1 day ago)

Pretty much. The advantage is not really the unstructeredness per se, but simply the speed at which you can get a single record and the throughput in how much you can write. It's essentially sacrificing some of the guarantees of ACID in return for parallelization/speed.

Like when you have a million devices who each send you their GPS position once a second. Possible with RDBS but the larger your table gets, the harder it'll be to get good insertion/retrieval speeds, you'd need to do a lot of tuning and would essentially end up at something like a NoSQL database effectively.

[–] bahbah23 4 points 1 day ago (1 children)

Rather than try to relate it to an rdbms, think of it as a distributed hash map/associative array.

[–] [email protected] 3 points 1 day ago (1 children)

What I’m hearing is that they’re very different beasts for very different applications. A typical web app would likely need both.

[–] [email protected] 3 points 1 day ago (1 children)

Yup. And this right here is where I dismiss people that generally say you only need one or the other. Each has a specific advantage and use case and you’ll have the best performance when you choose the “right tool for the job” and don’t just attempt to shoehorn everything into a single solution

[–] [email protected] 1 points 1 day ago (1 children)

Hold a sec. Rolling your own RDBMS out of a NoSQL database is insane. But is the opposite feasible? Wouldn't it be a simple table with two columns: a key and a JSON blob?

[–] [email protected] 2 points 1 day ago (1 children)

Could you do it? Yes, but it’s not something that it’s optimized to do. NoSQL engines are designed to deal with key value pairs much better than an RDBMS. Again, best tool for the job.

[–] [email protected] 1 points 23 hours ago

Got it, thanks.

[–] [email protected] 4 points 1 day ago

Yes. You can also have fields that weren't defined when you created the "table".

With something like Elasticsearch you also have tokenisation of text which obviously compresses it. If it's logs (or similar) then you also only have a limited number of unique tokens which is nice. And you can do very fast text search. And everything is set up for other things like tf-idf.