Reading 3

Quantitative Methods · Statistical Measures of Asset Returns

MODULE 3.1: CENTRAL TENDENCY AND DISPERSION

LOS 3.a

Calculate, interpret, and evaluate measures of central tendency and location to address an investment problem.

Measures of Central Tendency

Measures of central tendency identify the center, or average, of a dataset. This central point can then be used to represent the typical, or expected, value in the dataset.

The arithmetic mean is the sum of the observation values divided by the number of observations. It is the most widely used measure of central tendency. The sample mean is the sum of all values in a sample, $\sum X$, divided by the number of observations, $n$. It is used to make inferences about the population mean:

$\displaystyle \bar{X} = \frac{\sum_{i=1}^{n} X_i}{n}$

The median is the midpoint of a dataset, where the data are arranged in ascending or descending order. Half of the observations lie above the median and half below. The median is important because the arithmetic mean can be affected by outliers (extremely large or small values). When this occurs, the median is a better measure of central tendency than the mean.

Example: Median (odd number of observations)

What is the median return for five portfolio managers with 10-year annualized total returns of 30%, 15%, 25%, 21%, and 23%?

Answer:

Arrange in descending order: 30%, 25%, 23%, 21%, 15%. The middle observation, 23%, is the median.

Example: Median (even number of observations)

Add a sixth manager with a return of 28%. What is the median return?

Answer:

Sorted: 30%, 28%, 25%, 23%, 21%, 15%. With an even count, the median is the arithmetic mean of the two middle values: $\frac{25\% + 23\%}{2} = \mathbf{24\%}$.

The mode is the value that occurs most frequently in a dataset. A dataset may have more than one mode, or no mode. A distribution with one most-frequent value is unimodal; with two or three, bimodal or trimodal.

Example: Mode

Dataset: {30%, 28%, 25%, 23%, 28%, 15%, 5%}

Answer:

The mode is 28% because it appears most frequently.

For continuous data such as investment returns, we typically do not identify a single outcome as the mode. Instead, we divide the relevant range of outcomes into intervals and identify the modal interval as the one into which the largest number of observations fall.

中文翻譯

集中趨勢量數用來描述資料的中心或平均水準，可代表「典型」或「期望」的值。

算術平均數＝所有觀測值總和÷觀測個數。樣本平均數 $\bar{X}$ 用於推論母體平均數：$\bar{X}=\frac{\sum X_i}{n}$。

中位數：資料排序後位於中間的值；偶數筆時取中間兩個的平均。中位數不受極端值（outlier）影響，當資料含異常值時較平均數更具代表性。

眾數：出現次數最多的值。可能無眾數、單峰、雙峰或三峰。連續型資料（如報酬率）找出觀測落入最多的眾數區間（modal interval）。

例題重點：5位經理（30、15、25、21、23）中位數＝23%；加入第6位（28）後中位數＝(25+23)/2＝24%；資料集{30,28,25,23,28,15,5}的眾數＝28%。

Methods for Dealing With Outliers

In some cases, a researcher may decide that outliers should be excluded from a measure of central tendency.

A trimmed mean excludes a stated percentage of the most extreme observations. A 1% trimmed mean discards the lowest 0.5% and the highest 0.5% of observations.
A winsorized mean substitutes a value for the most extreme observations rather than discarding them. To compute a 90% winsorized mean: determine the 5th and 95th percentiles, replace any value below the 5th percentile with the 5th percentile and any value above the 95th percentile with the 95th percentile, then average the revised dataset.

Measures of Location

Quantile is the general term for a value at or below which a stated proportion of the data in a distribution lies. Examples include:

Quartile — distribution divided into quarters (4)
Quintile — divided into fifths (5)
Decile — divided into tenths (10)
Percentile — divided into hundredths (100)

Any quantile can be expressed as a percentile. The third quartile is the 75th percentile. The difference between the third quartile and the first quartile (25th percentile) is known as the interquartile range (IQR).

To visualize a dataset based on quantiles, we can create a box and whisker plot. The box represents the central portion of the data (interquartile range), while the vertical line ("whisker") represents the entire range. If the largest observation is farther from the center than the smallest observation, the data may include one or more outliers on the high side.

中文翻譯

處理異常值（outliers）的方法：

截尾均數（trimmed mean）：剔除最極端的某百分比觀測值。1%截尾均數＝剔除最低0.5%與最高0.5%後再求平均。
溫式均數（winsorized mean）：以特定百分位數值「替換」極端值而非剔除。90%溫式均數＝低於第5百分位者以第5百分位代之，高於第95百分位者以第95百分位代之，再取平均。

位置量數（Quantiles）：四分位（quartile）、五分位（quintile）、十分位（decile）、百分位（percentile）。第三四分位＝第75百分位；Q3 − Q1 = 四分位距（IQR）。

盒鬚圖（Box and Whisker Plot）：盒子代表IQR，鬚線代表全距。若最大觀測距中心比最小觀測更遠，資料可能含高側異常值。

LOS 3.b

Calculate, interpret, and evaluate measures of dispersion to address an investment problem.

Range and Mean Absolute Deviation

Dispersion is the variability around the central tendency. The common theme in finance is the tradeoff between reward (central tendency) and variability (risk).

The range is the distance between the largest and smallest values in the dataset:

range = maximum value − minimum value

Example: Range

5-year annualized total returns: 30%, 12%, 25%, 20%, 23%.

Answer:

range = 30% − 12% = 18%

The mean absolute deviation (MAD) is the average of the absolute values of the deviations of individual observations from the arithmetic mean:

$\displaystyle MAD = \frac{\sum_{i=1}^{n} |X_i - \bar{X}|}{n}$

The MAD uses absolute values because the sum of actual deviations from the arithmetic mean is zero.

Example: MAD

Returns: 30%, 12%, 25%, 20%, 23%.

Answer:

$\bar{X} = \frac{30+12+25+20+23}{5} = 22\%$

$MAD = \frac{|30-22|+|12-22|+|25-22|+|20-22|+|23-22|}{5} = \frac{8+10+3+2+1}{5} = \mathbf{4.8\%}$

Interpretation: on average, an individual return deviates ±4.8% from the mean of 22%.

中文翻譯

離散度（Dispersion）＝資料相對於集中趨勢的變動程度。投資領域：報酬（中心）VS 風險（離散）。

全距（Range）＝最大值 − 最小值。例：30%−12%＝18%。

平均絕對離差（MAD）＝個別觀測值與平均數差的「絕對值」之平均：$MAD=\frac{\sum|X_i-\bar{X}|}{n}$。使用絕對值因為原始離差之和為零。

例題：報酬 30、12、25、20、23 → $\bar{X}=22\%$；$MAD=(8+10+3+2+1)/5=\mathbf{4.8\%}$，意即每筆報酬平均偏離均值±4.8%。

Sample Variance and Standard Deviation

The sample variance, $s^2$, is the measure of dispersion that applies when we are evaluating a sample of $n$ observations from a population:

$\displaystyle s^2 = \frac{\sum_{i=1}^{n}(X_i - \bar{X})^2}{n - 1}$

The denominator is $n - 1$ (one less than $n$). Using $n$ would systematically underestimate the population variance — a biased estimator. Using $n - 1$ improves the statistical properties of $s^2$ as an unbiased estimator of the population variance.

Example: Sample Variance

Sample of returns: 30%, 12%, 25%, 20%, 23%. $\bar{X} = 22\%$.

Answer:

$s^2 = \frac{(30-22)^2+(12-22)^2+(25-22)^2+(20-22)^2+(23-22)^2}{5-1} = \frac{64+100+9+4+1}{4} = \mathbf{44.5(\%^2)}$

Note: 44.5 "percent squared" = 0.00445 in decimal form.

A major problem with variance is interpreting its squared units (squared percentages, squared dollars). The sample standard deviation, $s$, is the positive square root of $s^2$ — and is in the original units of the data:

$\displaystyle s = \sqrt{\frac{\sum_{i=1}^{n}(X_i - \bar{X})^2}{n-1}}$

Example: Sample Standard Deviation

From the previous example, $s^2 = 44.5(\%^2)$.

Answer:

$s = \sqrt{44.5} = \mathbf{6.67\%}$ (or $\sqrt{0.00445} = 0.0667$).

On average, an individual return deviates ±6.67% from the mean return of 22%. $s$ is an unbiased estimator of the population standard deviation.

中文翻譯

樣本變異數 $s^2$：$s^2=\frac{\sum(X_i-\bar{X})^2}{n-1}$。分母用 $n-1$（而非 $n$）的目的：使估計式不偏（unbiased），避免低估母體變異數。

例題：5個報酬，$s^2=(64+100+9+4+1)/4=\mathbf{44.5\%^2}$（＝0.00445）。

樣本標準差 $s$＝$\sqrt{s^2}$，與原資料同單位。例題 $s=\sqrt{44.5}=\mathbf{6.67\%}$，意即每筆報酬平均偏離均值±6.67%。

Coefficient of Variation

A direct comparison between two or more measures of dispersion may be difficult when the means differ greatly. To make a meaningful comparison, a relative measure of dispersion must be used. Relative dispersion is commonly measured with the coefficient of variation (CV):

$\displaystyle CV = \frac{s}{\bar{X}} = \frac{\text{standard deviation of }x}{\text{average value of }x}$

CV measures the amount of dispersion in a distribution relative to its mean. In an investments setting, CV measures the risk (variability) per unit of expected return — lower CV is better.

Example: Coefficient of Variation

T-bills: mean monthly return 0.25%, standard deviation 0.36%. S&P 500: mean 1.09%, standard deviation 7.30%.

Answer:

$CV_{\text{T-bills}} = \frac{0.36}{0.25} = \mathbf{1.44}$

$CV_{\text{S\&P 500}} = \frac{7.30}{1.09} = \mathbf{6.70}$

There is less dispersion (risk) per unit of monthly return for T-bills than for the S&P 500.

教授提醒

To remember the formula for CV, remember that the CV is a measure of variation, so the standard deviation goes in the numerator. CV is variation per unit of return.

中文翻譯

變異係數（CV）＝$\frac{s}{\bar{X}}$，是相對離散度量數，可比較不同均值的兩組資料。投資上：CV愈小愈好（每單位報酬承擔的風險較低）。

例：T-bills CV＝0.36/0.25＝1.44；S&P 500 CV＝7.30/1.09＝6.70。T-bills每單位月報酬承擔的風險較低。

教授提醒：記憶口訣——CV是「變異」的量數，標準差在分子；CV＝每單位報酬的變異。

Target Downside Deviation

Variance and standard deviation calculate risk based on outcomes both above and below the mean. In some situations it is more appropriate to consider only outcomes less than the mean (or some other specific value). In this case, we are measuring downside risk.

One measure of downside risk is target downside deviation (also known as target semideviation). We choose a target value $B$ and only include deviations from the target in the numerator if the outcomes are below $B$:

$\displaystyle s_{target} = \sqrt{\frac{\displaystyle\sum_{\text{all }X_i < B}(X_i - B)^2}{n - 1}}$

Note that the denominator remains $n - 1$ (the full sample size minus one), even though we are not using all observations in the numerator.

Example: Target Downside Deviation

Returns: 30%, 12%, 25%, 20%, 23%. Calculate target downside deviation for $B = 22\%$ (the mean) and $B = 24\%$.

Answer:

Return	Deviation from Mean (22%)	Deviation from Target 24%
30%	+8%	+6%
12%	−10%	−12%
25%	+3%	+1%
20%	−2%	−4%
23%	+1%	−1%

For $B = 22\%$ (only $X_i < 22\%$: 12% and 20%):

$s_{22\%} = \sqrt{\frac{(-10)^2 + (-2)^2}{5-1}} = \sqrt{\frac{104}{4}} = \mathbf{5.10\%}$

For $B = 24\%$ (only $X_i < 24\%$: 12%, 20%, 23%):

$s_{24\%} = \sqrt{\frac{(-12)^2 + (-4)^2 + (-1)^2}{5-1}} = \sqrt{\frac{161}{4}} = \mathbf{6.34\%}$

中文翻譯

下行風險（Downside Risk）：僅考慮低於均值（或某目標值）的結果。標準差同時考量上下兩側偏離；某些情境下，只衡量「低於目標」的偏離更合適。

目標下行偏差（Target Downside Deviation, 又稱 target semideviation）：選定目標值 $B$，僅取低於 $B$ 的觀測值平方和：

$s_{target}=\sqrt{\frac{\sum_{X_i

注意：分母仍為 $n-1$（樣本總數），即使分子只用部分觀測值。

例題：報酬 30,12,25,20,23：以22%為目標 → $s_{22\%}=\sqrt{104/4}=5.10\%$；以24%為目標 → $s_{24\%}=\sqrt{161/4}=6.34\%$。目標愈高，下行偏差愈大。

📝 Module Quiz 3.1

1. A dataset has 100 observations. Which of the following measures of central tendency will be calculated using a denominator of 100?

A. The winsorized mean, but not the trimmed mean.
B. Both the trimmed mean and the winsorized mean.
C. Neither the trimmed mean nor the winsorized mean.

A — The winsorized mean substitutes a value for some of the largest and smallest observations (so all 100 obs are still used in averaging). The trimmed mean removes some of the largest and smallest observations (denominator < 100). (LOS 3.a)

2. XYZ Corp. annual stock returns: 22% (20X1), 5% (20X2), −7% (20X3), 11% (20X4), 2% (20X5), 11% (20X6). What is the sample standard deviation?

A. 9.8%.
B. 72.4%.
C. 96.3%.

A — Sample mean = $(22+5-7+11+2+11)/6 = 7.3\%$. Sample variance = $\frac{(22-7.3)^2+(5-7.3)^2+(-7-7.3)^2+(11-7.3)^2+(2-7.3)^2+(11-7.3)^2}{6-1} = 96.3$. Sample std = $\sqrt{96.3} \approx \mathbf{9.8\%}$. (LOS 3.b)

3. Using the same XYZ returns, assume an investor has a target return of 11%. What is the stock's target downside deviation?

A. 9.39%.
B. 12.10%.
C. 14.80%.

A — Deviations from target 11%: 22−11=+11; 5−11=−6; −7−11=−18; 11−11=0; 2−11=−9; 11−11=0. Only the negatives count: $\sqrt{\frac{(-6)^2+(-18)^2+(-9)^2}{6-1}} = \sqrt{\frac{36+324+81}{5}} = \sqrt{88.2} \approx \mathbf{9.39\%}$. (LOS 3.b)

MODULE 3.2: SKEWNESS, KURTOSIS, AND CORRELATION

LOS 3.c

Interpret and evaluate measures of skewness and kurtosis to address an investment problem.

Skewness

A distribution is symmetrical if it is shaped identically on both sides of its mean. Symmetry implies that intervals of losses and gains will exhibit the same frequency. The extent to which a returns distribution is symmetrical is important because it tells analysts if deviations from the mean are more likely to be positive or negative.

Skewness (or skew) refers to the extent to which a distribution is not symmetrical. Nonsymmetrical distributions result from outliers — observations extraordinarily far from the mean:

A positively skewed distribution has outliers greater than the mean (in the upper, or right tail) — said to be skewed right.
A negatively skewed distribution has a disproportionately large amount of outliers less than the mean (in the lower, or left tail) — said to be skewed left.

Effect of skewness on mean, median, mode:

Symmetrical: mean = median = mode.
Positively skewed (unimodal): mode < median < mean. Large positive outliers pull the mean upward. Example: a neighborhood of 99 homes selling for $100,000 plus one selling for $1,000,000 — median & mode = $100,000; mean = $109,000.
Negatively skewed (unimodal): mean < median < mode. Large negative outliers pull the mean downward.

教授提醒

Skew affects the mean more than the median and mode, and the mean is pulled in the direction of the skew. The median falls between the other two measures for both positively and negatively skewed distributions.

Sample skewness is approximated for large samples as:

$\displaystyle \text{sample skewness} \approx \left(\frac{1}{n}\right) \frac{\sum_{i=1}^{n}(X_i - \bar{X})^3}{s^3}$

The denominator $s^3$ is always positive; the numerator's sign depends on whether observations above or below the mean tend to be farther from the mean. Right-skewed → positive skewness; left-skewed → negative skewness. Values of sample skewness in excess of 0.5 in absolute value are considered significant.

教授提醒

The LOS requires us to "interpret and evaluate" measures of skewness and kurtosis, but not to calculate them.

中文翻譯

對稱分配（Symmetrical）：左右兩側形狀相同；損失與獲利區間出現頻率相同。偏態（Skewness）＝資料不對稱的程度，由異常值所致。

右偏（正偏）：尾巴在右側（高處有異常大值）。
左偏（負偏）：尾巴在左側（低處有異常小值）。

對中心趨勢量數的影響：對稱：mean = median = mode；右偏（單峰）：mode < median < mean；左偏（單峰）：mean < median < mode。記憶口訣：偏態「拉動」mean往尾巴方向移動，mean受影響最大；median永遠在中間。

例：99間100萬美元的房子＋1間1000萬美元的房子 → mode=median=100萬，mean=109萬（被拉高 → 正偏）。

樣本偏態（大樣本近似）：$\approx\frac{1}{n}\frac{\sum(X_i-\bar{X})^3}{s^3}$。絕對值>0.5者視為顯著。

教授提醒：LOS 要求是「解讀」與「評估」偏態與峰度，不需計算。

Kurtosis

Kurtosis measures the degree to which a distribution is more or less peaked than a normal distribution.

Leptokurtic — more peaked than normal (fat tails). Greater % of small deviations from the mean AND greater % of extremely large deviations from the mean.
Platykurtic — less peaked, flatter than normal (thin tails).
Mesokurtic — same kurtosis as a normal distribution.

A leptokurtic distribution implies a relatively greater probability of observing values either close to the mean or far from the mean. For investment returns, a greater likelihood of large deviations from the mean is often perceived as an increase in risk.

A distribution exhibits excess kurtosis if it has either more or less kurtosis than the normal distribution. Computed kurtosis for a normal distribution is 3. Statisticians sometimes report excess kurtosis = kurtosis − 3:

Normal distribution: excess kurtosis = 0.
Leptokurtic: excess kurtosis > 0 (positive).
Platykurtic: excess kurtosis < 0 (negative).

Kurtosis is critical in risk management. Most research shows actual securities returns exhibit both skewness and kurtosis. When returns are modeled assuming a normal distribution, the predictions will not account for the potential for extremely large, negative outcomes. Risk managers focus on the tails of the distribution — that is where the risk is. Greater excess kurtosis and more negative skew indicate increased risk.

Sample kurtosis (large sample approximation):

$\displaystyle \text{sample kurtosis} \approx \left(\frac{1}{n}\right) \frac{\sum_{i=1}^{n}(X_i - \bar{X})^4}{s^4}$

中文翻譯

峰度（Kurtosis）＝衡量分配「尖峭」或「平坦」程度。

Leptokurtic（高峰／尖峰）：比常態分配更尖、尾巴更厚（fat tails）；極端事件機率更大。
Platykurtic（低峰／平峰）：比常態分配更扁、尾巴較薄。
Mesokurtic（正常峰）：與常態分配峰度相同。

超額峰度（Excess Kurtosis）＝峰度 − 3。常態分配峰度＝3 → 超額峰度＝0。Lepto>0；Platy<0。

風險管理意涵：實際證券報酬常具偏態與峰度，若以常態假設建模會低估極端負面結果的機率。風險管理者特別關注尾部。超額峰度愈大、負偏愈深，風險愈大。

樣本峰度（大樣本近似）：$\approx\frac{1}{n}\frac{\sum(X_i-\bar{X})^4}{s^4}$。

LOS 3.d

Interpret correlation between two variables to address an investment problem.

Scatter Plots and Covariance

Scatter plots display the relationship between two variables. With one variable on each axis, paired observations are plotted as single points. Common patterns include:

(a) No relationship — points scattered randomly with no clear trend.
(b) Strong linear relationship — points cluster around a line (high correlation coefficient).
(c) Non-linear relationship — clear pattern but not linear; correlation coefficient close to zero. This is a key advantage of scatter plots: they reveal nonlinear relationships that are not described by the correlation coefficient.

Covariance is a measure of how two variables move together:

$\displaystyle s_{XY} = \frac{\sum_{i=1}^{n}\left[(X_i - \bar{X})(Y_i - \bar{Y})\right]}{n - 1}$

Covariance is difficult to interpret because:

Its value depends on the units of the variables (e.g., yen vs. dollars).
Its units are the square of the units used for the data.
We cannot interpret the relative strength of the relationship — a covariance of 0.8756 only tells us X and Y tend to move together (positive sign).

中文翻譯

散布圖（Scatter Plot）：兩變數一軸一個，每對觀測畫一點。型態：(a)無關係；(b)強線性關係（相關係數高）；(c)非線性關係（相關係數接近0但有可預測模式）。

散布圖最大價值：可揭示「非線性關係」，這是相關係數無法捕捉的。

共變異數（Covariance）＝$\frac{\sum(X_i-\bar{X})(Y_i-\bar{Y})}{n-1}$。難以解讀的原因：①單位影響大（以日圓計算 vs 美元計算結果不同）；②單位是原資料單位的平方；③無法判斷關係強度，只能看正負方向。

Correlation Coefficient

A standardized measure of the linear relationship between two variables is the correlation coefficient, or simply correlation:

$\displaystyle \rho_{XY} = \frac{s_{XY}}{s_X \, s_Y}$ which implies $s_{XY} = \rho_{XY}\, s_X\, s_Y$

Properties of correlation:

Measures the strength of the linear relationship between two random variables.
Has no units.
Ranges from −1 to +1: $-1 \le \rho_{XY} \le +1$.
$\rho_{XY} = +1$ → perfect positive correlation (proportional positive movement).
$\rho_{XY} = -1$ → perfect negative correlation (exact opposite proportional movement).
$\rho_{XY} = 0$ → no linear relationship; prediction of $Y$ from $X$ via linear methods is not possible.

Example: Correlation

Variance of returns: Stock A = 0.0028; Stock B = 0.0124. Covariance of returns = 0.0058.

Answer:

Convert variances to standard deviations:

$s_A = \sqrt{0.0028} = 0.0529$

$s_B = \sqrt{0.0124} = 0.1114$

$\displaystyle \rho_{AB} = \frac{0.0058}{(0.0529)(0.1114)} = \mathbf{0.9842}$

Close to +1 → linear relationship is positive AND very strong.

中文翻譯

相關係數（Correlation Coefficient）$\rho_{XY}$＝$\frac{s_{XY}}{s_X s_Y}$，是共變異數的「標準化」版本。

性質：①衡量兩變數的線性關係強度；②無單位；③值域 $[-1,+1]$；④$\rho=+1$完全正相關；$\rho=-1$完全負相關；$\rho=0$無線性關係（不代表無關係，可能是非線性）。

例題：股票A變異數0.0028（$s_A=0.0529$）、股票B變異數0.0124（$s_B=0.1114$）、共變異數0.0058 → $\rho_{AB}=\frac{0.0058}{0.0529\times0.1114}=\mathbf{0.9842}$，接近+1，呈強正向線性關係。

Care should be taken when drawing conclusions based on correlation. Causation is not implied just from significant correlation. Even if causation were present, correlation does not reveal which variable is causing change in the other. It is more prudent to say two variables exhibit positive (or negative) association — the nature of any causal relationship must be separately investigated or grounded in theory subject to additional tests.

Outliers and correlation: If removing outliers significantly reduces the calculated correlation, further inquiry is necessary into whether the outliers provide information or are caused by noise (randomness) in the data.

Spurious correlation refers to correlation that is either:

The result of chance; or
Present due to changes in both variables over time caused by their association with a third variable.

Examples (Tyler Vigen, Spurious Correlations):

The correlation between the age of each year's Miss America and the number of films Nicolas Cage appeared in that year is 87% — random.
U.S. spending on science, space, and technology vs. suicides by hanging, strangulation, and suffocation (1999–2009): 99.87% — both variables increased approximately linearly over the period.

中文翻譯

重點警語：相關 ≠ 因果。顯著相關並不代表存在因果關係，也無法判斷誰因誰果。較嚴謹的說法是兩變數呈「正向（或負向）連結」，因果關係須另行調查或以理論為基礎進行檢定。

異常值（Outliers）：若刪除outliers後相關係數明顯下降，需進一步釐清outliers是有意義訊息還是雜訊。

偽相關（Spurious Correlation）有兩類：①純屬巧合；②兩變數同時隨「第三變數」（如時間趨勢、通膨等）變動而呈現高相關，但彼此並無因果關係。例：美國科研支出與某類自殺案件1999–2009年相關性99.87%，純屬同向時間趨勢。

📝 Module Quiz 3.2

1. Which of the following is most accurate regarding a distribution of returns that has a mean greater than its median?

A. It is positively skewed.
B. It is a symmetric distribution.
C. It has positive excess kurtosis.

A — A distribution with mean > median is positively skewed (skewed to the right). The skew "pulls" the mean. Kurtosis deals with the overall shape (peak/tails), not skewness. (LOS 3.c)

2. A distribution of returns that has a greater percentage of small deviations from the mean AND a greater percentage of extremely large deviations from the mean (compared with a normal distribution):

A. is positively skewed.
B. has positive excess kurtosis.
C. has negative excess kurtosis.

B — Such a distribution is leptokurtic and exhibits positive excess kurtosis. It is more peaked AND has fatter tails than a normal distribution. (LOS 3.c)

3. The correlation between two variables is +0.25. The most appropriate way to interpret this value is to say:

A. a scatter plot of the two variables is likely to show a strong linear relationship.
B. when one variable is above its mean, the other variable tends to be above its mean as well.
C. a change in one of the variables usually causes the other variable to change in the same direction.

B — A correlation of +0.25 indicates a positive linear relationship — one variable tends to be above its mean when the other is above its mean. The value 0.25 indicates the linear relationship is not particularly strong. Correlation does not imply causation. (LOS 3.d)

Key Concepts — Reading 3

LOS 3.a

The arithmetic mean is the average of observations: $\bar{X} = \frac{\sum X_i}{n}$. The median is the midpoint when data are arranged in order. The mode is the most frequent value (modal interval for continuous data). A trimmed mean omits outliers; a winsorized mean replaces outliers with given values, reducing the effect of outliers in both cases.

Quantiles are values at or below which a stated proportion of data lies — quartile (4), quintile (5), decile (10), percentile (100).

LOS 3.b

Range = max − min. MAD = $\frac{\sum|X_i-\bar{X}|}{n}$. Sample variance $s^2 = \frac{\sum(X_i-\bar{X})^2}{n-1}$ (use $n-1$ for unbiased estimator). Standard deviation $s = \sqrt{s^2}$ — frequently used as a quantitative measure of risk.

Coefficient of variation $CV = \frac{s}{\bar{X}}$ — risk per unit of return; lower is better.

Target downside deviation (semideviation) $s_{target} = \sqrt{\frac{\sum_{X_i

LOS 3.c

Skewness describes the degree to which a distribution is not symmetric. Right-skewed → positive skewness; left-skewed → negative skewness.

Positively skewed unimodal: mean > median > mode.
Negatively skewed unimodal: mean < median < mode.

Kurtosis measures peakedness and tail thickness. Excess kurtosis = kurtosis − 3 (normal = 3, so normal's excess = 0). Positive excess → leptokurtic (fat tails, more peaked, greater tail risk). Negative excess → platykurtic (thin tails, flatter).

LOS 3.d

Correlation is a standardized measure of association between two random variables: $\rho_{AB} = \frac{Cov_{AB}}{s_A \, s_B}$, range $[-1, +1]$.

Scatter plots are useful for revealing nonlinear relationships not captured by the correlation coefficient.

Correlation does not imply causation. Spurious correlation may result by chance, or from the relationships of two variables to a third variable (e.g., common time trend).

中文翻譯 — 重點整理

【LOS 3.a】算術平均、樣本平均 $\bar{X}=\sum X_i / n$；中位數（排序後中間值）；眾數（最常出現值，連續資料用眾數區間）。截尾均數＝剔除極端值；溫式均數＝替換極端值。Quantile：四分位、五分位、十分位、百分位。

【LOS 3.b】全距＝max−min；MAD＝$\frac{\sum|X_i-\bar{X}|}{n}$；樣本變異數 $s^2=\frac{\sum(X_i-\bar{X})^2}{n-1}$（用 $n-1$ 確保不偏）；樣本標準差 $s=\sqrt{s^2}$。CV＝$s/\bar{X}$，相對風險指標，愈小愈好。目標下行偏差$s_{target}=\sqrt{\frac{\sum_{X_i

【LOS 3.c】偏態：右偏 mean > median > mode；左偏 mean < median < mode。峰度衡量尖峭與尾部厚度；超額峰度＝峰度−3（常態=3，超額=0）。正超額峰度＝lepto（肥尾、尾部風險高）；負超額峰度＝platy（薄尾、低峰）。

【LOS 3.d】相關係數 $\rho_{AB}=\frac{Cov_{AB}}{s_A s_B}\in[-1,+1]$，標準化、無單位。散布圖可揭示「非線性關係」，這是相關係數的盲點。相關 ≠ 因果；偽相關可能源於巧合或第三變數（如共同時間趨勢）。