this post was submitted on 26 Dec 2024
69 points (71.4% liked)
Technology
60548 readers
6858 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
The external storage data and shannon are both called bits, exactly because they’re both base 2. That does not mean they’re the same. As the article explains it, a shannon is like a question from 20 questions.
Wrong. They are called the same because they are fundamentally the same. That's how you measure information.
In some contexts, one wants to make a difference between the theoretical information content and what is actually stored on a technical device. But that's a fairly subtle thing.
A bit in the data sense is just an element of the set of booleans. A bit in the entropy sense is the amount of information revealed by an observation with two equally probable outcomes. These are not the same thing because the amount of information contained in a bit is not always equal to one bit of entropy. For example, if a boolean is known to be 0, then the amount of information it contains is 0 bits. If it is known that the boolean is equally 0 or 1, then the information content is 1 bit. It depends on the prior probability distribution.
In some contexts, a bit can refer to a boolean variable, a flag. In other contexts, it may refer to the voltage at a certain point, or any number of other things. But when you are talking about bits/s then it's a measure of information.
Yes, but as you know, this implies that the information is already available. You can use that knowledge to create a compression algorithm, or to define a less redundant file format. That's very practical.
We can also be a bit philosophical and ask: How much information does a backup contain? The answer could be: By definition, 0 bits. That's not a useful answer, which implies a problem with the application of the definition.
A more interesting question might be: How much information does a file contain, that stores the first 1 million digits of the number π?
I don't see how that can be a subtle difference. How is a bit of external storage data only subtly different from information content that tells the probability of the event occurring is ½?
It's a bit like asking what is the difference between the letter "A" and ink on a page in the shape of the letter "A". Of course, first one would have to explain how they are usually not different at all.
BTW, I don't know what you mean by "external storage data". The expression doesn't make sense.