Presentation is loading. Please wait.

Presentation is loading. Please wait.

Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu February, 2010.

Similar presentations


Presentation on theme: "Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu February, 2010."— Presentation transcript:

1 Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu February, 2010

2 Outline Overview of the project Improvements in the last week Speedup the data access Improve the PCA algorithm Use adjusted price in PNL calculation Taking trading volume into account Future work

3 Framework Raw Historical Data From WRDS PCA Eigenportfolios PCA Eigenportfolios Residuals as increments of AR process Compute S-scores ETFs for industry sectors ETFs for industry sectors Signal trade orders Market model 60-day returns Residual process model Current stock prices Market model 252-day returns Adjusted Stock price Series + indices Data pre-processing (python scripts) Back-testing simulations (matlab scripts)

4 Code Speedup Data access Tradeoff Always read from disk: very slow Everything in memory: not robust, can be slow Cache parts of dataset in memory Fast code Same Total Speedup > 16 times BeforeAfter

5 PCA amelioration (1/4) Suppose X is a nxp matrix including n samples and p features; Original algorithm: Calculate the Eigen-decomposition of the correlation matrix: The matrix Q consists of the Eigen-vectors of the correlation matrix

6 PCA amelioration (2/4) Suppose X is a nxp matrix including n samples and p features; Substituted algorithm: We use singular value decomposition (SVD) to get the eigenvectors. Then V consists of Eigen-vectors of the correlation matrix. This will reduce the computational complexity by around 80%

7 PCA amelioration (3/4) Proof: Since U and V are orthogonal, V consists of the eigen-vectors of the correlation matrix And equals to diagonal matrix D

8 PCA amelioration (4/4) Notice: if p is one eigenvector of X, then –p is also its eigenvector Since if Then The effect of “negative” can be removed by the estimation.

9 Experiment result (Fig. 1) Top 50 eigenvalues of the correlation matrix of market returns computed on May 1 2007 estimated using a 1-year window and a universe of 1590 stocks

10 Experiment result 2 Value of the first eigenvector

11 Experiment result 2 Value of the second eigenvector

12 Experiment result 2 Value of the third eigenvector

13 Preliminary PNL Experiment Dec-13-1994 Feb-12-1998

14 After correction

15 Taking trading volume into account Problem mean-reversion strategies are sensitive to trading volume immediately before the signal was triggered. Modified returns is the average daily trading volume over a given trading window. Experiments PCA/ETF actual price vs. using trading volume

16 Top 50 eigenvalues of the correlation matrix—trading time Top 50 eigenvalues of the correlation matrix of market returns computed on May 1 2007 estimated using a 1-year window and a universe of 1590 stocks

17 Top 50 eigenvalues of the correlation matrix——calendar time Top 50 eigenvalues of the correlation matrix of market returns computed on May 1 2007 estimated using a 1-year window and a universe of 1590 stocks

18 Value of the first eigenvector

19 Future work Experiment on ETF Associate each stock with one ETF Compare ETF with PCA Take into account Transaction fee, interest, dividend Calculate PCA using trading-time modified return

20 THANK YOU


Download ppt "Statistical Arbitrage Ying Chen, Leonardo Bachega Yandong Guo, Xing Liu February, 2010."

Similar presentations


Ads by Google