Can someone help me understand Shannon Entropy?
So I get the basic concept: entropy is a measure of the uncertainty over all the possible outcomes of a message.
However, the books I've read then go into the surprise of a message. English doesn't have uniformly distributed letters, so an English-language message carries less surprise than 26^n equally likely possible messages would imply, since some letters tend to follow others.
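To put a number on that first part, here's a toy Python sketch (the skewed frequencies below are invented for illustration, not real English counts): just skewing the letter distribution already drops the per-letter entropy below log2(26).

```python
import math

def entropy_bits(dist):
    """Shannon entropy in bits: H = -sum(p * log2(p))."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

# Uniform over 26 letters: every letter equally likely.
uniform = {chr(ord('a') + i): 1 / 26 for i in range(26)}
print(entropy_bits(uniform))   # log2(26) ~= 4.70 bits per letter

# Made-up skewed distribution (NOT real English frequencies),
# just to show that skew alone lowers the per-letter entropy.
skewed = {'e': 0.30, 't': 0.20, 'a': 0.15, 'o': 0.10, 'n': 0.10, 's': 0.10, 'z': 0.05}
print(entropy_bits(skewed))    # ~2.61 bits, less than log2(7) ~= 2.81
```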
The problem I have conceptualizing this is that the text then went into how letters tend to appear in particular words, and words tend to appear together as well, so the surprise is actually even lower.
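I think that "letters follow letters" part is conditional entropy: once you condition on the previous symbol, the average per-symbol surprise can only drop. Another toy sketch, with a made-up two-symbol source:

```python
import math

def entropy_bits(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Toy two-symbol source; all numbers invented for illustration.
p_symbol = {'a': 0.5, 'b': 0.5}                 # marginal distribution: 1 bit/symbol
p_next_given = {'a': {'a': 0.9, 'b': 0.1},      # 'a' is usually followed by another 'a'
                'b': {'a': 0.1, 'b': 0.9}}      # 'b' is usually followed by another 'b'

h_marginal = entropy_bits(p_symbol.values())
# Conditional entropy: H(next | prev) = sum over prev of p(prev) * H(next given prev)
h_conditional = sum(p_symbol[prev] * entropy_bits(p_next_given[prev].values())
                    for prev in p_symbol)

print(h_marginal)     # 1.0 bit per symbol if you ignore context
print(h_conditional)  # ~0.47 bits per symbol once the previous letter is known
```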
This got me thinking: isn't surprise also totally reliant on what you know about whoever you're getting the message from?
>"Fuck, I just tapped to traps again"
Has little surprise value on /r9k/, but very high value when it is delivered by a pastor at church.
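The way I'd put numbers on that: the surprisal of one message is -log2 of the probability your model of the sender assigns it, so the same string scores completely differently under different models. (Both probabilities below are obviously invented.)

```python
import math

def surprisal_bits(p):
    """Surprisal of a single outcome: -log2 p(outcome)."""
    return -math.log2(p)

# Completely made-up probabilities for the same sentence under two source models.
p_on_r9k      = 0.05   # pretty typical post there
p_from_pastor = 1e-7   # wildly out of character

print(surprisal_bits(p_on_r9k))       # ~4.3 bits
print(surprisal_bits(p_from_pastor))  # ~23.3 bits
```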
In a deterministic universe, the God's eye view of things would see zero surprise in any message, and the whole idea of physics as information starts to seem silly.
The reason I think I might be getting it wrong is that a passage basically said you could get your distribution of letters wrong and end up with the wrong value for surprise. But to my mind, there is no single "right" value of surprise short of the God's-eye case where it's zero: if you know exactly who is talking to you and what they are about to say, that's a lot different from randomly generated text.
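If I'm reading it right, that "wrong distribution" passage sounds like cross-entropy: if the data really come from p but you model them with q, your average surprise is -sum p log2 q, which is never below the true entropy of p. Toy sketch with made-up numbers:

```python
import math

def cross_entropy_bits(p, q):
    """Average surprise in bits if data come from p but you model them with q."""
    return -sum(p[x] * math.log2(q[x]) for x in p)

# Made-up true source p, and a mistaken uniform model q, over three symbols.
p       = {'a': 0.7, 'b': 0.2, 'c': 0.1}
q_wrong = {'a': 1/3, 'b': 1/3, 'c': 1/3}

print(cross_entropy_bits(p, p))        # ~1.16 bits: the true entropy of p
print(cross_entropy_bits(p, q_wrong))  # ~1.58 bits: always >= the true entropy
```

So maybe the "right" value the book means is just relative to the true source distribution, not to any particular listener?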
Second, how does information science deal with synonyms? I can't find this anywhere. There are tons of ways to say the exact same thing in English with different words. Entropy can be a total measure of possible messages, but it can't be a total measure of possible information when strings of the same length can represent the exact same idea in multiple of their possible configurations, right? I still get that the concept is useful; it just doesn't seem like the hard limit it is described as.
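The closest I can get on my own is treating "meaning" as a many-to-one function of the string: if several strings collapse to the same idea, the entropy over meanings can't exceed the entropy over strings. Toy numbers, all invented:

```python
import math
from collections import defaultdict

def entropy_bits(dist):
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

# Made-up distribution over four strings, two of which mean the same thing.
p_string   = {'big': 0.25, 'large': 0.25, 'small': 0.30, 'red': 0.20}
meaning_of = {'big': 'BIG', 'large': 'BIG', 'small': 'SMALL', 'red': 'RED'}

# Collapse synonyms: "meaning" is a many-to-one function of the string.
p_meaning = defaultdict(float)
for s, p in p_string.items():
    p_meaning[meaning_of[s]] += p

print(entropy_bits(p_string))   # ~1.99 bits over strings
print(entropy_bits(p_meaning))  # ~1.49 bits over meanings: never more than the string entropy
```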