Scaling and memory in volatility return intervals in financial markets
- *Center for Polymer Studies and Department of Physics, Boston University, Boston, MA 02215; †Department of Environmental Sciences, Tokyo University of Information Sciences, Chiba 265-8501, Japan; ‡Minerva Center and Department of Physics, Bar-Ilan University, Ramat-Gan 52900, Israel; and ¶Institut für Theoretische Physik III, Justus-Liebig-Universität, D-35392 Giessen, Germany
-
Contributed by H. Eugene Stanley, April 1, 2005
Abstract
For both stock and currency markets, we study the return intervals τ between the daily volatilities of the price changes that
are above a certain threshold q. We find that the distribution function Pq(τ) scales with the mean return interval
as
. The scaling function f(x) is similar in form for all seven stocks and for all seven currency databases analyzed, and f(x) is consistent with a power-law form, f(x) ∼ x
-γ with γ ≈ 2. We also quantify how the conditional distribution Pq(τ|τ0) depends on the previous return interval τ0 and find that small (or large) return intervals are more likely to be followed by small (or large) return intervals. This
“clustering” of the volatility return intervals is a previously unrecognized phenomenon that we relate to the long-term correlations
known to be present in the volatility.
The statistical properties of stock and currency market fluctuations are of importance for modeling and understanding complex market dynamics. They are also relevant for practical applications such as risk estimation and portfolio optimization (1). In particular, understanding the volatility fluctuations of financial records is of particular importance, because they are the key input of option pricing models, including the classic Black and Scholes model and the Cox, Ross, and Rubinstein binomial models that are based on estimates of the asset's volatility during the residual time of the option (2–4). Although the changes from day i - 1 to day i, Δpi ≡ pi - pi -1, of both stock prices and currency rates are uncorrelated, their absolute values (one measure of volatility) are long-term power-law correlated (5–17). Moreover, the probability density function (pdf) of Δpi scales as a power law (18) Φ(Δp) ∼ (Δp)-(ζ + 1) with ζ ≈ 3 (5, 19–21). Also, within t days after a crash, nq(t), the number of times |Δpi| exceeds a threshold q, follows a power-law relation nq(t) ∼ t -p with p ≈ 1 (22), a behavior similar to the Omori earthquake law.
Here, we are interested in the statistical properties of large volatilities. A quantity that characterizes the occurrence of large volatilities is the return interval τ between two consecutive volatilities above some large threshold q (Fig. 1). We study return intervals because they are related to the rate of occurrence of volatilities that exceed a threshold q (22). Because extreme volatilities are rare, we consider also the return intervals between volatilities above intermediate thresholds. By doing this, we hope to gain insight also into the return intervals between very large volatilities that are too rare to obtain with reasonable statistics.
Schematic illustration of volatility return intervals. Shown are the return intervals τ3 and τ4 for two threshold values q = 3 and 4 (indicated by arrows) for the normalized volatility |Δpi|/(〈|Δpi|2〉 - 〈|Δpi|〉2)1/2 of USD/JPY currency exchange rates in the 3-year period 2000–2002.
We analyze the statistical properties of the daily return intervals of seven representative stocks and currencies obtained, respectively, from http://finance.yahoo.com and www.federalreserve.gov/releases/H10/hist. We choose to study daily data records because there are intraday trends in the volatility. We report two results:
(i) The pdf of the return intervals Pq(τ) is not a function of the two independent variables τ and q but depends only on the scaled parameter
, where the q dependence is contained in the mean return interval
. This scaling is important because it allows us to extrapolate large q values, corresponding to rare events, from the behavior at small q values and thereby collect good statistics. We find that the scaled pdf can be well approximated by the same power law for
all seven stocks and for all seven currencies studied. We also show that this power-law behavior results from the known long-term
correlations in the volatility records.
(ii) A long-term memory exists in the return intervals, such that short return intervals tend to be followed by short ones, and long return intervals tend to be followed by long ones. This “clustering” of the volatility return intervals is a previously unrecognized phenomenon that we relate to the long-term correlations known to be present in the volatility.
Methods
We normalize the volatility records by dividing each value of |Δpi| by the standard deviation (〈|Δpi|2〉 - 〈|Δpi|〉2)1/2. In this way, the thresholds q are in units of the standard deviation of the volatility. Because we analyze daily records, we restrict ourselves to those
q values for which
days to avoid spurious discreteness effects.
We begin by studying the behavior of the pdf pi(τ) and how it depends on the threshold parameter q (Fig. 2 a and b). For different q, the pdfs are different and cannot be described by a Poisson distribution as for uncorrelated data. Moreover, for larger q, the decay of pi(τ) becomes slower.
Distribution of volatility return intervals. (a and b) Distribution function Pq(τ) of the return intervals τ of the volatility records of a typical stock, General Electric (GE) (a), and a typical currency exchange database, USD vs. SEK (b), for five threshold values q. (c and d) Scaled plots where
is plotted vs.
. (e and f) Scaled plots for seven stocks (e) and currencies (f). Stocks are S&P 500, ▪; IBM, •; DuPont, ▴; AT&T, ▾ ; Kodak, ♦; GE, ◂; and Coca-Cola, ▸. Currencies are USD vs. JPY, ▪; British
pound vs. Swiss franc, •; USD vs. SEK, ▴; Danish krone vs. Australian dollar, ▾ ; Danish krone vs. Norwegian krone, ♦; USD
vs. Canadian dollar, ◂; and USD vs. South African dollar, ▸. The straight lines are for demonstration and have a slope of
-2. The lower plots (open symbols) represent the scaled
vs.
after shuffling the volatility records, thereby removing the long-term correlations. The lower plots were divided by a factor
of 10 [i.e.,
]. The open symbols represent the same stocks and the same currencies as the corresponding filled symbols in e and f.(g and h) To test whether after shuffling the volatility record Pq(τ) becomes a Poisson distribution, we plot on a semilogarithmic scale the shuffled stock (g) and currencies (h) data. The linear plot shows the exponential form of a Poisson distribution.
To understand the q dependence, we show, in Fig. 2 c and d, the scaled pdfs
as functions of the scaled return intervals
; the evident data collapse to a single curve is consistent with the scaling relation,
We see that the scaling function f(x) does not depend explicitly on q but only through
. Hence, if Pq(τ) is known for one value of q, Eq. 1 can make predictions for other values of q: in particular, for very large q (rare events), which are difficult to study because of the lack of data. The functional form of the scaled pdf appears to
be quite similar for all seven stock records studied (Fig. 2e) and all seven currency data sets studied (Fig. 2f). As suggested by the straight lines with slope -2 (Fig. 2 e and f), f(x) for x ≥ 1 is consistent with a power law f(x) ∼ x
-γ, where γ ≈ 2 for both stock and currency data. For very large x values, the data are also consistent with the possibility that f(x) is a stretched exponential (see also ref. 23). This result raises the possibility that there exists a “universal” scaling function for the return intervals of both stock
and currency volatility data.
For uncorrelated records, we expect that the return intervals follow a Poisson distribution, yielding log f(x) ∼ -x. To test this expectation, we remove the long-term memory by shuffling the volatility records, and we obtain a simple exponential (Fig. 2 g and h). The distinct difference between the distributions of the return intervals in the real data and in the uncorrelated shuffled data suggests that the power-law behavior of the scaling function f(x) must arise from the correlations in the volatility. Fig. 2 e and f also show that very small and very large return intervals are more frequent in the correlated records than in the shuffled ones.
Next, we address the question of whether the distribution Pq(τ) fully characterizes the sequence of the return intervals. The result depends on whether the return intervals are organized in a correlated fashion. If they are uncorrelated, subsequent return intervals are independent of each other and chosen randomly from Pq(τ). Accordingly, Pq(τ) would fully characterize the data. Fig. 3a shows a typical sequence of return intervals for IBM; Fig. 3b shows a typical shuffled sequence for IBM. One sees that Fig. 3 a and b look very different. Although in Fig. 3a there are “patches” of return intervals below and above their mean value, there are no such patches for the shuffled records. The patches are an indication of memory; i.e., short return intervals (below average) tend to follow short intervals, and long ones (above average) tend to follow long ones.
Visual demonstration of return interval clustering. (a) Sequence of 500 return intervals for IBM for years 1984–2004 (≈5,000 days), where q = 1.5, chosen so that the average return interval is 10 days (horizontal line). (b) Same as a, except that the original volatility returns are shuffled.
To quantify the effect of memory, we study the conditional pdf Pq(τ|τ0) of return intervals τ that immediately follow a return interval τ0. In records without memory, Pq(τ|τ0) does not depend on τ0 and is identical to Pq(τ). In records with long-term memory, we expect a pronounced dependence (23). To search for these kinds of memory effects in the (relatively) short data sets analyzed here, we study Pq(τ|τ0) not for a specific value of τ0 but for a range of τ0 values. For this purpose, we sort the full data set of N return intervals in increasing order and divide it into eight subsets, Q
1, Q
2,..., Q
8, such that each subset contains one-eighth of the total number of return intervals. By this definition, the N/8 lowest return intervals are in Q
1, whereas the largest N/8 intervals are in Q
8. Fig. 4 a and b show, for the Standard and Poor's 500 Index (S&P 500) and for IBM stocks, Pq(τ|τ0) for τ0 in Q
1 and Q
8. The same quantities are studied in Fig. 4 c and d for the Japanese yen (JPY)/U.S. dollar (USD) exchange rate and the Swedish krona (SEK)/USD exchange rate. The results show
that for τ0 in Q
1, the probability of finding τ below (or above)
is enhanced (or decreased) when compared with Pq(τ), whereas the opposite occurs for τ0 in Q
8. We note that, for the range of q studied, the data collapse onto a single scaling function for both Q
1 and Q
8.
Scaling and memory in distributions of volatility return intervals. Shown is the conditional distribution function Pq(τ|τ0) of the return intervals τ of the volatility records of the daily S&P 500 (a), IBM stock (b), USD/JPY exchange rate (c), and USD/SEK exchange rate (d) for τ0 in Q
1 (filled symbols) and Q
8 (open symbols) vs.
. The lines provide a guide for the eye.
Results and Discussion
This memory effect in the conditional distribution function Pq(τ|τ0) leads to a pronounced memory effect in the mean conditional return interval
, which is the mean of those return intervals that immediately follow τ0. By definition,
is the first moment of Pq(τ|τ0). Fig. 5 presents
as a function of
and shows clearly the effect of memory. Because small (or large) return intervals are more likely to be followed by small
(or large) intervals,
is well below (or above) unity for
well below (or above) unity.
Scaling and memory in mean return intervals. Shown is the mean conditional return interval
divided by
vs.
of the volatility records for the daily S&P 500 (a), IBM stock (b), USD/JPY exchange rate (c), and USD/SEK exchange rate (d). For records without memory, we expect
1, as supported by the open symbols obtained for volatility shuffled records.
Thus, the statistics of the return intervals τ strongly depend on the preceding return interval τ0. We ask whether this memory is limited only to nearest-neighbor return intervals or whether there exists long-term memory
in the return interval time series. To answer this question, we use detrended fluctuation analysis (DFA) to test for the presence
of long-term correlations in τ (24). DFA calculates the rms fluctuation
of a time series within a window of
days and determines the exponent α from the scaling relation
. When α > 1/2, the time series is long-term correlated; when α = 1/2, it is uncorrelated. Fig. 6a shows the values of α for the volatility data, in agreement with refs. 5–17; Fig. 6b shows the corresponding analysis for the return interval data. The exponents α obtained for the return intervals are usually
smaller than those for the corresponding volatilities. However, they are within the error bars of each other. Thus, we cannot
rule out that the α values are the same for both volatility and return intervals (see also ref. 23). We note that α > 1/2 for all 14 databases studied, indicating the existence of long-term correlations in the return interval
data. Further, we see that when the volatility records of each database are shuffled, the long-term correlation is removed,
and α = 0.5.
Long-term memory in volatility records. Shown is the detrended fluctuation analysis fluctuation exponent α for the seven stocks (databases 1–7) and seven currencies (databases 8–14) (in the same order as they appear in the legend of Fig. 3) analyzed for volatility (a) and return interval (b) records. The filled symbols represent α values for the actual data; the open symbols represent the α values for the corresponding shuffled volatility data.
These findings quantify a previously unrecognized characterization, return interval clustering, of the known effect of volatility clustering (7, 25–27). Lillo and Mantegna (22) recently found that after a crash, the return intervals tend to cluster and decay as a power law, similar to the Omori law for earthquakes. Our results suggest that there is an analogous clustering not limited to data recorded after a crash and that the return intervals exhibit a scaling form that appears to be universal.
In summary, we have studied scaling and memory effects in return intervals for stock and currency data. We find that the distribution
functions of return intervals can be well approximated by a single scaling function that depends only on the ratio
and differs from the Poisson distribution for uncorrelated events. We also find strong memory effects such that a small return
interval is more likely to be followed by a small interval, and a large interval is more likely to be followed by a large
interval, which is reflected by the fact that the mean conditional return interval
increases monotonically withτ0. We showed that the origin of these phenomena is the long-term memory of the volatility.
Footnotes
-
↵ § To whom correspondence should be addressed. E-mail: havlin{at}ophir.ph.biv.ac.il.
-
Author contributions: S.H. and H.E.S. designed research; K.Y., L.M., S.H., and H.E.S. performed research; A.B. contributed new reagents/analytic tools; A.B. analyzed data; and S.H. wrote the paper.
-
Abbreviations: pdf, probability density function; S&P 500, Standard and Poor's 500 Index; USD, U.S. dollar; JPY, Japanese yen; SEK, Swedish krona.
- Copyright © 2005, The National Academy of Sciences











