/sci/ - Science & Math » Thread #14376579

477KiB, 2593x1118, virginSTATSvsCHADML.png

View Same Google iqdb SauceNAO

Central Limit Theorem usages, doubts and misconceptions

Anonymous Fri 08 Apr 18:56:01 2022 No.14376579 View Reply Original Report

Quoted By: >>14376853

I am twisting my career towards Data Science and, after some time refreshing my statistics base, I realized that statistics is one of the fields in which people tend to missunderstand more the basis, even teachers and specialists.

For that reason I am now a bit skeptical about some widely used techniques. For a project, I need to contrast two means from two different populations. Tthe standard deviation from both populations is unknown and sample distributions are not strictly normal (they have belled shape but their skewness and kurtosis differ from normal values).

Under this circumstances, Internet suggests me different approaches:

1) To asume the distirbutions are normal (despite they are not) and to apply a welch-test to determine if means differ (welch-test is like t-test but it is applied when it is not known that the two samples share the same sd value). People who suggest it argue that welch test is pretty robust to some degree of non-normality.

2) Applying Central Limit Theorem to both samples and then applying the Welch test to the sample mean distributions.

3) Applying a non parametric test to compare the two means.

4) Transforming both sample distributions into normal distributions and then applying a test. (For some irrational reason and considering my data is not far away from being normally distributed, I don't like this approach very much).

What do you think it is the best approach? I have been googlering about it the whole day, but I have not found a solid response. Maybe the question is a bit silly, but Internet is full of bad answers.

Thank you very much.

Anonymous

Anonymous Fri 08 Apr 2022 20:12:48 No.14376818 Report

Quoted By: >>14376897

1) Fair enough with large sample size

2) I don't know what you mean by "Applying the Central Limit Theorem"

3) Kolmogorov-Smirnov would do the job

4) I know normalizing everything feels very bullshit social sciences, but you'll come around. As long as you can eyeball that your distribution is sigma sub-gaussian, you're pretty much fine

A small note; using the term "Gaussian Distribution" will yield better results when searching online than "Normal Distribution".

Another small note; when looking at work done by others, I thrust bayesian approaches way more than some questionably applied statistical tests

Anonymous

Anonymous Fri 08 Apr 2022 20:21:21 No.14376853 Report

Quoted By: >>14376897

>>14376579
Not a statistician, but wouldn't it be possible to express the means as percentiles of the other distribution? I think that'd give it the most anal precision that wouldn't neglect skew and kurtosis, whilst still solving the problem.

Anonymous

Anonymous Fri 08 Apr 2022 20:32:30 No.14376897 Report

Quoted By:

>>14376818
Wow. Thank you very much, based anon
>>14376853
I will consider also this. Thank you too.

Capcode	All Only User Posts Only Moderator Posts Only Admin Posts Only Developer Posts
Show Posts	All Only With Images Only Without Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

Your latest searches