#1
In a normal distribution, what percentage of data falls within one standard deviation of the mean?
68%
ExplanationApproximately 68% of data points in a normal distribution lie within one standard deviation of the mean.
#2
What does the term 'p-value' stand for in statistics?
Probability-value
ExplanationThe p-value represents the probability of obtaining the observed results, or more extreme results, under the assumption that the null hypothesis is true.
#3
Which measure of central tendency is most affected by outliers?
Mean
ExplanationMean is sensitive to extreme values, making it highly affected by outliers.
#4
What does the standard deviation measure in a set of data?
Variability
ExplanationStandard deviation quantifies the spread or dispersion of data points from the mean, representing variability.
#5
Which graphical representation is suitable for displaying the distribution of a continuous variable?
Histogram
ExplanationHistograms provide a visual depiction of the frequency distribution of continuous data, showing the shape, center, and spread of the data.
#6
What is the purpose of a confidence interval in statistics?
To estimate the population parameter with a range of values
ExplanationConfidence intervals provide a range of values within which the true population parameter is estimated to lie with a certain level of confidence.
#7
Which statistical test is appropriate for comparing means of more than two independent groups?
Analysis of Variance (ANOVA)
ExplanationANOVA assesses whether the means of multiple independent groups are significantly different from each other.
#8
In a normal distribution, what is the relationship between the mean, median, and mode?
Mean = Median = Mode
ExplanationIn a perfectly symmetrical normal distribution, the mean, median, and mode are equal.
#9
Which measure of variability is resistant to outliers?
Interquartile Range (IQR)
ExplanationIQR is less influenced by extreme values, making it a robust measure of variability in the presence of outliers.
#10
What is the primary purpose of a scatter plot in statistical analysis?
To show the relationship between two continuous variables
ExplanationScatter plots visually represent the relationship between two continuous variables, indicating the strength and direction of association.
#11
Which type of sampling method involves selecting every kth individual from the population?
Systematic sampling
ExplanationSystematic sampling involves selecting every kth element from the population after a random starting point, providing a simple and systematic way to obtain a representative sample.
#12
When is a type I error committed in hypothesis testing?
Rejecting a true null hypothesis
ExplanationA type I error occurs when a true null hypothesis is incorrectly rejected, indicating a false positive result.
#13
Which statistical test is appropriate for comparing means of two related groups?
Paired t-test
ExplanationThe paired t-test compares the means of two related groups to assess whether there is a statistically significant difference between them.
#14
In regression analysis, what does a negative coefficient indicate?
A negative correlation between variables
ExplanationA negative coefficient in regression analysis suggests an inverse relationship between the independent and dependent variables, indicating that as one variable increases, the other decreases.
#15
Which statistical test is used to determine if there is a significant association between two categorical variables?
Chi-square test
ExplanationChi-square test assesses the independence between categorical variables by comparing observed and expected frequencies.
#16
What is the purpose of regression analysis in statistics?
To predict one variable based on another
ExplanationRegression analysis aims to establish the relationship between variables, allowing prediction based on known data.
#17
What does p-value represent in hypothesis testing?
Probability of obtaining the observed results if the null hypothesis is true
ExplanationP-value indicates the likelihood of observing the given results if the null hypothesis were correct, guiding decision-making in hypothesis testing.
#18
What is the primary advantage of using a box plot in data visualization?
Highlights outliers in the data
ExplanationBox plots visually represent the distribution of data, emphasizing outliers and providing insight into the data's spread and central tendency.
#19
What is the purpose of the coefficient of determination (R-squared) in regression analysis?
To evaluate the goodness of fit of the regression model
ExplanationR-squared measures the proportion of variance in the dependent variable explained by the independent variables, indicating the goodness of fit of the regression model.
#20
What does a two-tailed hypothesis test assess?
If there is a significant difference in both directions
ExplanationA two-tailed hypothesis test determines if there is a significant difference in either direction from the hypothesized value.
#21
What is the formula for calculating the z-score of a data point in a normal distribution?
(X - μ) / σ
ExplanationThe z-score formula standardizes a data point by subtracting the mean and dividing by the standard deviation, providing its position relative to the mean in standard deviation units.
#22
When is the Mann-Whitney U test appropriate in statistical analysis?
To compare medians of two independent groups
ExplanationThe Mann-Whitney U test is a non-parametric test used to determine if there is a significant difference between the medians of two independent groups.
#23
What is the purpose of a Q-Q plot in statistics?
To check the normality of a dataset
ExplanationQ-Q plots compare the distribution of a dataset to a theoretical normal distribution, helping to assess whether the data are normally distributed.
#24
What is the primary purpose of the chi-square goodness-of-fit test?
To test if observed frequencies match expected frequencies in a distribution
ExplanationThe chi-square goodness-of-fit test evaluates whether observed frequencies of categorical data match the expected frequencies under a specified distribution.
#25
What does the term 'p-hacking' refer to in the context of statistical analysis?
Manipulating data to achieve statistical significance
ExplanationP-hacking involves manipulating data, analysis, or study conditions to obtain statistically significant results, which may not reflect the true relationship or effect.