For the hardest CMO tranche, generalized Faure achieves accuracy 10 ;2 with 170 points, while modied Sobol uses 600 points. On the other hand, the Mon

Similar documents
VARIANCE REDUCTION TECHNIQUES IN MONTE CARLO SIMULATIONS K. Ming Leung

Extra-High Speed Matrix Multiplication on the Cray-2. David H. Bailey. September 2, 1987

You ve already read basics of simulation now I will be taking up method of simulation, that is Random Number Generation

Quasi-Monte Carlo Methods Combating Complexity in Cost Risk Analysis

In a two-way contingency table, the null hypothesis of quasi-independence. (QI) usually arises for two main reasons: 1) some cells involve structural

Constructing low star discrepancy point sets with genetic algorithms

Probabilistic (Randomized) algorithms

Computer Experiments: Space Filling Design and Gaussian Process Modeling

Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1

Chapter 1. Introduction

Using Local Trajectory Optimizers To Speed Up Global. Christopher G. Atkeson. Department of Brain and Cognitive Sciences and

Probability and Statistics for Final Year Engineering Students

Computer Experiments. Designs

Testing unrolling optimization technique for quasi random numbers

Computational Methods in Statistics with Applications A Numerical Point of View. Large Data Sets. L. Eldén. March 2016

Neuro-Remodeling via Backpropagation of Utility. ABSTRACT Backpropagation of utility is one of the many methods for neuro-control.

Computational Methods. Randomness and Monte Carlo Methods

Cluster quality 15. Running time 0.7. Distance between estimated and true means Running time [s]

Formal Model. Figure 1: The target concept T is a subset of the concept S = [0, 1]. The search agent needs to search S for a point in T.

CSE 586 Final Programming Project Spring 2011 Due date: Tuesday, May 3

Review of the Robust K-means Algorithm and Comparison with Other Clustering Methods

BMVC 1996 doi: /c.10.41

COPULA MODELS FOR BIG DATA USING DATA SHUFFLING

A Random Number Based Method for Monte Carlo Integration

Andrew Davenport and Edward Tsang. fdaveat,edwardgessex.ac.uk. mostly soluble problems and regions of overconstrained, mostly insoluble problems as

The only known methods for solving this problem optimally are enumerative in nature, with branch-and-bound being the most ecient. However, such algori

Multi-Way Number Partitioning

Pre-control and Some Simple Alternatives

Stochastic Function Norm Regularization of DNNs

Combinatorial Search; Monte Carlo Methods

Chapter 4: Implicit Error Detection

Expectation and Maximization Algorithm for Estimating Parameters of a Simple Partial Erasure Model

Halftoning and quasi-monte Carlo

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

Bayesian Estimation for Skew Normal Distributions Using Data Augmentation

Implementations of Dijkstra's Algorithm. Based on Multi-Level Buckets. November Abstract

From Finance to Flip Flops:

ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT

A Comparative Study of Frequency-domain Finite Element Updating Approaches Using Different Optimization Procedures

Kalev Kask and Rina Dechter. Department of Information and Computer Science. University of California, Irvine, CA

Michael Mitzenmacher. Abstract. The eld of random graphs contains many surprising and interesting results. Here we

A Fast Estimation of SRAM Failure Rate Using Probability Collectives

GUIDELINES FOR MASTER OF SCIENCE INTERNSHIP THESIS

ACCURACY AND EFFICIENCY OF MONTE CARLO METHOD. Julius Goodman. Bechtel Power Corporation E. Imperial Hwy. Norwalk, CA 90650, U.S.A.

Generating random samples from user-defined distributions

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery

MCMC Methods for data modeling

UCLA UCLA Previously Published Works

Optimal Matrix Transposition and Bit Reversal on. Hypercubes: All{to{All Personalized Communication. Alan Edelman. University of California

Optimizing Pharmaceutical Production Processes Using Quality by Design Methods

Statistical analysis of nancial networks

The Item Log-Likelihood Surface for Two- and Three-Parameter Item Characteristic Curve Models

SIMULATION AND MONTE CARLO

Probability Models.S4 Simulating Random Variables

Recent Developments in the Design and Optimization of Constant Force Electrical Contacts

Metaheuristic Development Methodology. Fall 2009 Instructor: Dr. Masoud Yaghini

CHAPTER 1 INTRODUCTION

Short-Cut MCMC: An Alternative to Adaptation

Telecommunication and Informatics University of North Carolina, Technical University of Gdansk Charlotte, NC 28223, USA

Curve Correction in Atomic Absorption

IQC monitoring in laboratory networks

Modeling with Uncertainty Interval Computations Using Fuzzy Sets

Packet Routing in Networks with Long Wires

This chapter explains two techniques which are frequently used throughout

First Order Logic in Practice 1 First Order Logic in Practice John Harrison University of Cambridge Background: int

UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication

ME 555: Distributed Optimization

CHAPTER 2 WIRELESS SENSOR NETWORKS AND NEED OF TOPOLOGY CONTROL

A noninformative Bayesian approach to small area estimation

Two-dimensional Totalistic Code 52

of m clauses, each containing the disjunction of boolean variables from a nite set V = fv 1 ; : : : ; vng of size n [8]. Each variable occurrence with

Condence Intervals about a Single Parameter:

Carnegie Learning Math Series Course 2, A Florida Standards Program

Quasi-random maximum simulated likelihood estimation of the mixed multinomial logit model

PROJECTION MODELING SIMPLIFICATION MARKER EXTRACTION DECISION. Image #k Partition #k

Center-Based Sampling for Population-Based Algorithms

APPLICATION OF MULTIPLE RANDOM CENTROID (MRC) BASED K-MEANS CLUSTERING ALGORITHM IN INSURANCE A REVIEW ARTICLE

Lecturer 4: File Handling

Discovery of the Source of Contaminant Release

A taxonomy of race. D. P. Helmbold, C. E. McDowell. September 28, University of California, Santa Cruz. Santa Cruz, CA

How to Conduct a Heuristic Evaluation

Cluster Analysis for Microarray Data

On Using Machine Learning for Logic BIST

Physics 736. Experimental Methods in Nuclear-, Particle-, and Astrophysics. - Statistical Methods -

Integration. Volume Estimation

Spring 2017 FRE-GY 6883 Financial Computing Song Tang, Ph.D

UNSTRUCTURED GRIDS ON NURBS SURFACES. The surface grid can be generated either in a parameter. surfaces. Generating grids in a parameter space is

Rowena Cole and Luigi Barone. Department of Computer Science, The University of Western Australia, Western Australia, 6907

where is a constant, 0 < <. In other words, the ratio between the shortest and longest paths from a node to a leaf is at least. An BB-tree allows ecie

An experiment to show undergraduate students the impact of transient analysis decisions on parameter estimation for non-terminating simulated systems

Fast Generation of Nested Space-filling Latin Hypercube Sample Designs. Keith Dalbey, PhD

Scientific Computing: An Introductory Survey

The problem of minimizing the elimination tree height for general graphs is N P-hard. However, there exist classes of graphs for which the problem can

E±cient Detection Of Compromised Nodes In A Wireless Sensor Network

m Environment Output Activation 0.8 Output Activation Input Value

Simulation Calibration with Correlated Knowledge-Gradients

Adaptive Estimation of Distributions using Exponential Sub-Families Alan Gous Stanford University December 1996 Abstract: An algorithm is presented wh

coordinates ([6]); it turns out that this language is also useful for ecient scanning of the relevant part of the backtracking tree. The idea of learn

A Non-Iterative Approach to Frequency Estimation of a Complex Exponential in Noise by Interpolation of Fourier Coefficients

Simulation. Chapter Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall

Transcription:

New Results on Deterministic Pricing of Financial Derivatives A. Papageorgiou and J.F. Traub y Department of Computer Science Columbia University CUCS-028-96 Monte Carlo simulation is widely used to price complex nancial instruments. Recent theoretical results and extensive computer testing indicate that deterministic methods may be far superior in speed and condence. Simulations using the Sobol or Faure points are examples of deterministic methods. For the sake of brevity, we refer to a deterministic method using the name of the sequence of points which the method uses, e.g., Sobol method. In this paper we test the generalized Faure sequence due to Tezuka [T95]. We also test a modied Sobol method this includes further improvements from those in [PT95]. We compare these two low discrepancy deterministic methods with basic Monte Carlo. We summarize our conclusions regarding the valuation of a Collateralized Mortgage Obligation which we divide into three groups. instruments such as asian options. I. Deterministic and Monte Carlo Methods. Similar results hold for other nancial Deterministic methods beat Monte Carlo by a wide margin. In particular, Both the generalized Faure and modied Sobol methods converge signicantly faster than Monte Carlo. The generalized Faure method always converges at least as fast as the modied Sobol method and frequently faster. The Monte Carlo method is sensitive to the initial seed. II. Small Number of Sample Points. Deterministic methods outperform Monte Carlo for a small number of sample points. In particular, Deterministic methods attain small error with a small number of points. Presented at \Mathematical Problems in Finance,"Institute for Advanced Study, Princeton, NJ, April 15, 1996 y This work was supported in part by NSF and AFOSR.

For the hardest CMO tranche, generalized Faure achieves accuracy 10 ;2 with 170 points, while modied Sobol uses 600 points. On the other hand, the Monte Carlo method requires 2700 points for the same accuracy. Monte Carlo tends to waste points due to clustering, which severely compromises its performance when the sample size is small. III. Speedup. The advantage of deterministic methods over Monte Carlo is further amplied as the sample size and the accuracy demands grow. In particular, Deterministic methods are 20 to 50 times faster than Monte Carlo even with moderate sample sizes (2000 deterministic points or more). When high accuracy is desired, deterministic methods can be 1000 times faster than Monte Carlo. Valuing nancial derivatives may be formulated via paths or as a high dimensional integral. For simplicity we will restrict ourselves to the integral formulation and, without loss of generality (see Paskov [P96]), as an integral over the unit cube in d dimensions. For most nance problems the integral cannot be analytically computed we have to settle for a numerical approximation. The basic Monte Carlo method obtains this approximation by computing the arithmetic mean of the integrand evaluated at randomly chosen points. More precisely, only pseudo-random points can be generated on a digital computer and these are used in lieu of random points. There are sophisticated variations of this basic method whenever we refer to Monte Carlo in this paper we will always mean the basic version. If pseudo-random points from a at distribution are plotted in two dimensions (see Figure 1) there are regions where no sample points occur and regions where the points are more concentrated. This is clearly undesirable. Random point samples are wasted due to clustering. Indeed, Monte Carlo simulations with very small sample sizes cannot be trusted. It would be desirable to place our sample points as uniformly as possible, which is the idea behind low discrepancy sequences. Discrepancy is a measure of deviation from uniformity hence low discrepancy points are desirable. Figure 2 shows a plot of certain low discrepancy points in two dimensions. A low discrepancy method approximates the integral by computing the arithmetic mean of the integrand evaluated at low discrepancy points. Low discrepancy sequences have been extensively studied see Paskov [P96] for the formal denition of discrepancy and an extensive bibliography. In contrast to the Monte Carlo method which uses random (or 2

pseudo-random) points, low discrepancy methods use deterministic points. These methods are sometimes said to be quasi-random. In 1992 the conventional wisdom was that although theory suggested that low discrepancy methods were sometimes superior to Monte Carlo, this theoretical advantage was not seen for high dimensional problems. Traub and a Ph.D. student, Spassimir Paskov, decided to compare the ecacy of low discrepancy and Monte Carlo methods on the valuation of nancial derivatives. They used a Collateralized Mortgage Obligation (Fannie Mae REMIC Trust 1989-23) provided by Goldman Sachs with ten tranches requiring the evaluation of ten 360-dimensional integrals. The values of the tranches depend on the interest rate and prepayment models. Details can be found in Paskov [P96]. Paskov and Traub used a particular low discrepancy sequence due to Sobol. They made major improvements in the Sobol points and showed that the improved Sobol method consistently outperformed Monte Carlo see Paskov and Traub [PT95] andpaskov [P96] for details. Software Construction and testing of low discrepancy deterministic methods for pricing nancial derivatives was begun at Columbia University in the Fall of 1992. Preliminary results were shared with a number of New York City nancial houses in the Fall of 1993 and the Spring of 1994. The rst published announcement wasajanuary 1994 article in Scientic American Traub and Wozniakowski [TW94]. See Paskov andtraub [PT95] for a more detailed history. In September 1995 IBM announced a product called the Deterministic Simulation Blaster (see also [IBM95]) which usesalow discrepancy deterministic method. The company claimed a very large improvement over Monte Carlo. However, the method for chosing the sample points and the methodology for calculating the speedup have not been revealed by the time of our writing this article (March 1996). IBM has repeatedly acknowledged that the use of low discrepancy methods to price nancial derivatives was pioneered at Columbia University. We have built a software system called FINDER for computing high dimensional integrals. FINDER has modules for generating generalized Faure and modied Sobol points. We emphasize that major improvements in generalized Faure points and Sobol points, which cannot be found in the published literature, have been put in FINDER. Indeed, a number of nancial institutions have informed us that they could not replicate our results using, for example, the Sobol point generator found in Press et al. [Pr92]. As further improvements in low discrepancy methods are discovered they will be added to our software. FINDER may be obtained from Columbia University. 3

We used FINDER to price the CMO and to compare low discrepancy methods with Monte Carlo. Deterministic methods and Monte Carlo compute the arithmetic mean of the integrand evaluated at a number of points. Thus the dierence in performance depends on the number of points that each method uses for the same accuracy. Weobserve the least number of points that a method requires in order to achieve and maintain a relative error below a certain level, e.g., 10 ;2. The speedup of one method relative to another is the ratio of the least number of points required by the rst method divided by the least number of points required by the second method so that both methods maintain the same level of accuracy. This denition of speedup is new. We study the convergence and the error of a method throughout a simulation. We feel that this has advantages over speedup calculations based only on error values at the end of a simulation. Note that our denition of speedup is a more rigorous requirement than only computing the condence level of Monte Carlo. For xed accuracy, extensive testing has shown that dierent tranches require a dierent number of points. We emphasize that deterministic methods beat Monte Carlo for every tranche. We report results using the residual tranche of the CMO as reference point since it is the most dicult to price. The residual tranche depends on all the 360 monthly interest rates. If this tranche can be priced using a certain number of samples with a given accuracy, the same number of samples will yield at least the same accuracy for the rest of the tranches. Since pricing models for complicated derivatives are subject to uncertainty, nancial houses are often content with relative errors of one part in a hundred. Furthermore, if they wish to price a book of instruments it is critical to use a small number of samples. Deterministic methods achieve a relative error of one part in a hundred using a small number of points. In gures 3 and 4 we see that 170 generalized Faure points, 600 Sobol points, and 2700 Monte Carlo points are sucient for a relative error equal to 10 ;2.Thus, avery small number of generalized Faure points yields an accurate price 16 times faster than Monte Carlo. A further reduction of the error by a factor of 20 (equal to 10 ;3 =2) requires about 16000 generalized Faure points, while it may require up to 800000 random points and the speedup is up to 50. In general, samples using as few as 2000 generalized Faure points can price the CMO 20 to 50 times faster than Monte Carlo. We discuss the convergence rates. For n generalized Faure points, n 10 4, the error is proportional to n ;0:82. This error estimate is very conservative since frequentlyamuch higher convergence rate is attained. This is very superior to the n ;0:5 expected Monte 4

Carlo error. Monte Carlo exhibits a great sensitivity on the seed of the pseudo-random number generator. Unless we are dealing with the result of a fairly long simulation we cannot have much condence. Very long simulations, needed for high accuracy in the Monte Carlo estimate, yield a deterministic method speedup on the order of 1000. Generalized Faure points are better than Sobol points in the sense that they usually achieve the same accuracy 2:5 to6:5 times faster. Another important advantage of the generalized Faure points is that they can be easily produced for very high dimensional problems. It is much more complicated to obtain the improved Sobol points that we have been using, in very high dimensions. We summarize our conclusions. Among the deterministic methods we have tested, the one based on generalized Faure points is the method of choice. Generalized Faure points can be produced eciently, at a cost similar to that required for random points, and a small number of points suces to price the CMO. In contrast to some other deterministic sequences generalized Faure points can be easily produced in very high dimensions. FINDER contains features that further improve the quality of the approximation obtained by the generalized Faure method without any additional computational overhead. Finally, preliminary, but very encouraging results, indicate that generalized Faure points can eciently price nancial derivatives modeled in more than 1500 dimensions. In closing, we indicate our plans for future work. (1) Make further improvements in FINDER. (2) Compare the performance of low discrepancy and Monte Carlo methods on other nancial derivatives. (3) Investigate the of low discrepancy methods for risk management. (4) Design new low discrepancy methods which are especially good for nancial computations. References [IBM95] Truel P., From I.B.M., Help in Intricate Trading, The New York Times September 25 (1995). [PT95] Paskov, S.H. and Traub, J.F., Faster Valuation of Financial Derivatives, The Journal of Portfolio Management Fall (1995), 113{120. [P96] Paskov, S.H., New Methodologies for Valuing Derivatives, in \Mathematics of Derivative Securities," edited by S. Pliska and M. Dempster, Isaac Newton Institute, Cambridge University Press, Cambridge, UK, 1996 (to appear). [Pr92] Press, W., Teukolsky, S., Wetterling, W. and Flannery, B., \Numerical Recipes in C," Second Edition, Cambridge University Press,Cambridge, 1992. [T95] Tezuka, S., \Uniform Random Numbers: Theory and Practice," Kluwer Academic Publishers, Boston, 1995. 5

[TW94] Traub, J.F. and Wozniakowski, H., Breaking Intractability, Scientic American January (1994), 102{107. A. Papageorgiou is a researcher and J.F. Traub is the Edwin Howard Armstrong Professor of Computer Science at Columbia University. 6

1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure 1: 512 pseudo-random points

1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure 2: 512 low discrepancy points

0.03 GFaure Monte Carlo 0.02 0.01 0.005 0.0025 0 128 256 512 1024 2048 4096 Figure 3: Generalized Faure and Monte Carlo errors

0.03 GFaure Sobol 0.02 0.01 0.005 0.0025 0 128 256 512 1024 2048 4096 Figure 4: Generalized Faure and Sobol errors