this post was submitted on 16 Jan 2024
59 points (100.0% liked)
askchapo
22849 readers
299 users here now
Ask Hexbear is the place to ask and answer ~~thought-provoking~~ questions.
Rules:
-
Posts must ask a question.
-
If the question asked is serious, answer seriously.
-
Questions where you want to learn more about socialism are allowed, but questions in bad faith are not.
-
Try !feedback@hexbear.net if you're having questions about regarding moderation, site policy, the site itself, development, volunteering or the mod team.
founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
For basic stats you can ignore a bunch of the underlying theory and just memorize a couple of formulas (or how to apply those formulas if you get a notes sheet on the test) and get good at finding numbers in tables.
The normal distribution represents the idea that most sets of data tend to cluster around the mean (if you grab someone at random, you are more likely to find someone who's 5'8" than 6'7").
Degrees of freedom is the number of observations in your sample minus 1 and is used to look up a value in a table.
Thank you! Good advice! I know X^2 tests of goodness of fit and independence are (Ei-Oi)^2/Ei, but I'm having trouble understanding what i means, I've been told it means category, but the context is hard for me to grasp (which category? When? What counts as a category?)
(I've grasped what degress of freedom and normal distribution are, I was just using them as an example lol but thank you, your explanations made it clearer for me)
No prob!
The i denotes a category that's determined by the researcher. You can think of Chi squared tests as testing the deviation from an assumption (typically that whatever you're looking at is randomly distributed). So it can be something as simple as heads or tails in a bunch of coin flips - you could calculate your _E_xpected rate of heads - _O_bserved rate of heads / the _E_xpected rate of heads
Here's a really detailed explanation. In evolutionary bio you can use it to see if traits are undergoing selection. Suppose in tribbles allele P causes black fur, p causes white fur, and the heterozygous condition is gray. You know from previous sampling that the rate of appearance for P is 0.80 and p is 0.20. You could then estimate that you would expect to see 64% black Tribbles, 32% gray Tribbles, and 4% white Tribbles. If you go out and your sample only has, say 20% gray tribbles, the chi square can tell you (O_gray - E_gray, etc) if it's reasonable to think that was random sampling error or there's something else going on (possibly sexual selection or predation).
That helps a lot, thank you! Exellent use of Tribbles
Glad you found it useful! Feel free to ping me if there's anything else I can help with.
doesn't this lead to a lot of extraneous variables that are actually linearly dependent on a smaller set? or worse -- overconstraining?
This isn't a universal definition of degrees of freedom, it's just "degrees of freedom as it applies in an undergraduate level stats course," which is typically for the t distribution. It's n-1 because you assume all your observations are independent of one another. In other contexts (ANOVA, e.g.), the calculation is different.
I get that, just thought it was a strange definition given what degrees of freedom actually refers to