Package ParetoPosStable

Similar documents
Package NormalLaplace

Package GWRM. R topics documented: July 31, Type Package

Package OLScurve. August 29, 2016

Package glogis. R topics documented: March 16, Version Date

Package libstabler. June 1, 2017

Package intccr. September 12, 2017

Package datasets.load

Package GLDreg. February 28, 2017

Package sure. September 19, 2017

Package extweibquant

Package fitur. March 11, 2018

Package CompGLM. April 29, 2018

Package bayesdp. July 10, 2018

Package ihs. February 25, 2015

Package glmnetutils. August 1, 2017

Package weco. May 4, 2018

Package strat. November 23, 2016

Package mlegp. April 15, 2018

Package VarianceGamma

Package RWiener. February 22, 2017

Package parmsurvfit. October 8, 2018

Package kdevine. May 19, 2017

Package SSLASSO. August 28, 2018

Package mixphm. July 23, 2015

Package stochprofml. February 20, 2015

Package logspline. February 3, 2016

Package StVAR. February 11, 2017

Package survivalmpl. December 11, 2017

Package ClusterBootstrap

Package simsurv. May 18, 2018

Package rereg. May 30, 2018

Package ADMM. May 29, 2018

Package rpst. June 6, 2017

Package gpdtest. February 19, 2015

Package attrcusum. December 28, 2016

Package OutlierDC. R topics documented: February 19, 2015

Package vinereg. August 10, 2018

Package gbts. February 27, 2017

Package TANDEM. R topics documented: June 15, Type Package

Package fbroc. August 29, 2016

Package ssmn. R topics documented: August 9, Type Package. Title Skew Scale Mixtures of Normal Distributions. Version 1.1.

Package MIICD. May 27, 2017

Package gains. September 12, 2017

Package FWDselect. December 19, 2015

Package PTE. October 10, 2017

Package merror. November 3, 2015

Package subplex. April 5, 2018

Package interplot. R topics documented: June 30, 2018

Package TVsMiss. April 5, 2018

Package sfadv. June 19, 2017

Package CMPControl. February 19, 2015

Package enpls. May 14, 2018

Package OrthoPanels. November 11, 2016

Package flexcwm. May 20, 2018

Package PedCNV. February 19, 2015

Package HMMCont. February 19, 2015

Package ClustGeo. R topics documented: July 14, Type Package

Package dyncorr. R topics documented: December 10, Title Dynamic Correlation Package Version Date

Package palmtree. January 16, 2018

Package pcapa. September 16, 2016

Package quickreg. R topics documented:

Package bife. May 7, 2017

Package gridgraphics

Package preprosim. July 26, 2016

Package sspse. August 26, 2018

Package hsiccca. February 20, 2015

Package catenary. May 4, 2018

The powerlaw package: a general overview

Package basictrendline

Package lvm4net. R topics documented: August 29, Title Latent Variable Models for Networks

Package coga. May 8, 2018

Package qvcalc. R topics documented: September 19, 2017

Package mtsdi. January 23, 2018

Package robets. March 6, Type Package

Package svmpath. R topics documented: August 30, Title The SVM Path Algorithm Date Version Author Trevor Hastie

Package clustvarsel. April 9, 2018

Package nplr. August 1, 2015

Package Combine. R topics documented: September 4, Type Package Title Game-Theoretic Probability Combination Version 1.

Package cwm. R topics documented: February 19, 2015

Package GUILDS. September 26, 2016

Package powerlaw. August 29, 2017

Package bootlr. July 13, 2015

Package cdfquantreg. August 28, 2017

Package queuecomputer

Package reval. May 26, 2015

Package truncgof. R topics documented: February 20, 2015

Lecture 11: Distributions as Models October 2014

Package mnlogit. November 8, 2016

Package RCA. R topics documented: February 29, 2016

Package ASICS. January 23, 2018

Package PrivateLR. March 20, 2018

Package mrm. December 27, 2016

Package gppm. July 5, 2018

Package treedater. R topics documented: May 4, Type Package

Package corclass. R topics documented: January 20, 2016

Package CINID. February 19, 2015

Package JOUSBoost. July 12, 2017

Package ordinalnet. December 5, 2017

Package extremevalues

Package sfa. R topics documented: February 20, 2015

Transcription:

Type Package Package ParetoPosStable September 2, 2015 Title Computing, Fitting and Validating the PPS Distribution Version 1.1 Date 2015-09-02 Maintainer Antonio Jose Saez-Castillo <ajsaez@ujaen.es> Depends R (>= 3.0.0) Imports graphics, grdevices, stats, ADGofTest, lmom, foreach, doparallel, parallel Statistical functions to describe a Pareto Positive Stable (PPS) distribution and fit it to real data. Graphical and statistical tools to validate the fits are included. License GPL-2 GPL-3 LazyData true NeedsCompilation no Author Antonio Jose Saez-Castillo [aut, cre], Faustino Prieto [aut], Jose Maria Sarabia [aut] Repository CRAN Date/Publication 2015-09-02 15:52:04 R topics documented: coef.ppsfit......................................... 2 forbes400.......................................... 3 GoF............................................. 3 loglik.ppsfit........................................ 4 pareto.fit........................................... 5 ParetoPosStable....................................... 6 plot.ppsfit.......................................... 6 PPS............................................. 7 PPS.fit............................................ 9 1

2 coef.ppsfit print.ppsfit......................................... 11 se.............................................. 11 turkey............................................ 12 Index 14 coef.ppsfit Parameter estimates in a PPSfit Object coef returns the parameter estimates in a PPSfit Object ## S3 method for class PPSfit coef(object,...) object A PPSfit object, typically from PPS.fit().... Other arguments. Value A list with the parameter estimates. PPS.fit. x <- rpps(50, 1.2, 100, 2.3) fit <- PPS.fit(x) coef(fit)

forbes400 3 forbes400 Forbes 400 list. The Richest People in America in 2012 The Forbes 400 or 400 Richest Americans is a list published by Forbes magazine of the wealthiest 400 Americans, ranked by net worth. The dataset presents results about 2012. Format A data frame with 400 observations on the following 2 variables. Namemembers of the list names. NetWorthnet worth in 2012 in $ billion. GoF Goodness of fit tests for the Pareto Positive Stable (PPS) distribution Kolmogorov-Smirnov, Anderson-Darling and PPS goodness of fit tests to validate a PPS fit (typically from PPS.fit()). GoF(PPSfit, k = 2000, parallel = TRUE, ncores = 2,...) PPSfit k Details parallel ncores A PPSfit Object.... Other arguments. The number of iterations in the bootstrap procedure to approximate the p-values. A logical argument specifying if parallelization is allowed in the bootstrap iteration procedure. is the number of cores that we use if parallel is TRUE. It returns the Kolmogorov-Smirnov, the Anderson-Darling tests and a specific test for PPS distributions. p-values are approximated by a bootstrap procedure. The specific goodness of fit test for PPS distributions is based on the linearity of the survival function vs. the scaled observations in a double log-log scale (see Sarabia and Prieto, 2009).

4 loglik.ppsfit Value A list with the values of the tests statistics and the approximated p-values. PPS.fit, plot.ppsfit x <- rpps(50, 1.2, 100, 2.3) fit <- PPS.fit(x) GoF(fit, k = 50) loglik.ppsfit Log-likelihood value of a PPSfit Object It returns the log-likelihood value of a PPSfit Object ## S3 method for class PPSfit loglik(object,...) Value object A PPSfit Object.... Other arguments. The log-likelihood. PPS.fit

pareto.fit 5 x <- rpps(50, 1.2, 100, 2.3) fit <- PPS.fit(x) loglik(fit) pareto.fit Fitting a Pareto distribution It is an auxiliar function for fitting a Pareto distribution as a particular case of a Pareto Positive Stable distribution, allowing the scale parameter to be held fixed if desired. pareto.fit(x, estim.method = "MLE", sigma = NULL, start,...) x estim.method sigma start The vector of observations. The estimation method, "MLE" or "OLS" The value of the scale parameter, if it is known; if the value is NULL, the parameter is estimated. Unused argument from PPS.fit.... Other arguments. Details This function is called by PPS.fit() when Pareto argument is TRUE. Value A PPSfit Object. PPS.fit, coef.ppsfit, print.ppsfit, plot.ppsfit, GoF

6 plot.ppsfit ParetoPosStable Computing, Fitting and Validating the PPS Distribution Statistical functions to describe a Pareto Positive Stable (PPS) distribution and fit it to real data. Graphical and statistical tools to validate the fits are included. Details The main function you re likely to need from ParetoPosStable is PPS.fit, in order to obtain a PPS fit from data. Validation can be obtain with GoF and plot. plot.ppsfit Plots to validate a Pareto Positive Stable (PPS) fit Plots to validate a PPS fit (typically from PPS.fit()) with different comparisons between empirical and theoretical functions. ## S3 method for class PPSfit plot(x, which = 1:4, ask = prod(par("mfcol")) < length(which) && dev.interactive(), ylim, breaks,...) x which ask ylim breaks a PPSfit Object.... other arguments. values from 1 to 4 indicating the type of plot. an argument to control the plot window. optional argument to control the y limits of the histogram. It is included to prevent non-desired scales on the y-axis. optional argument to control the breakpoints of the histogram. See hist help for the details. It is included to prevent non-desired scales on the y-axis.

PPS 7 Details The plots return: 1. The histogram of the observations and the fitted PPS density (which = 1). Optional ylim and breaks arguments are provided to prevent frequent imbalances between density and histogram scales in real data: they work as the analogue arguments of the default hist function. 2. The empirical distribution function of data and the cumulative distribution function of the fitted model (which = 2). 3. A rank-size plot in log-log scale to check the Pareto or power-law behaviour of data (which = 3). In the X-axis the log of the observations appears; in the Y-axis, the log of the empirical survival function. If the scatter-plot is around a straight line, then the observations exhibit a power law behaviour. The plot also includes the curve with the theoretical survival function of the model specified in the first argument class PPSfit: only when nu is 1, that curve is going to be a straight line. 4. A plot in a double log-log scale to check the adequacy of data to the PPS model (which = 4). On one hand, the X-axis shows the double log of the observations divided by the scale parameter, while the Y-axis shows the log of minus the log of the empirical survival function. On the other hand, the straight line determined by the linear relation between the survival function and the scaled data in a double log-log scale, in relation to the argument class PPSfit is added. The proximity of the points in the scatter-plot to that straight line is an evidence in favour of a PPS behaviour of data. PPS.fit data(forbes400) fit <- PPS.fit(forbes400$NetWorth) par(mfrow=c(2,2)) plot(fit) dev.off() plot(fit, which = 1, breaks = seq(0, 60, length.out = 60)) PPS The Pareto Positive Stable (PPS) distribution Density, distribution function, hazard function, quantile function and random generation for the Pareto Positive Stable (PPS) distribution with parameters lam, sc and v.

8 PPS dpps(x, lam, sc, v, log = FALSE) hpps(x, lam, sc, v) ppps(x, lam, sc, v, lower.tail = TRUE, log.p = FALSE) qpps(p, lam, sc, v, lower.tail = TRUE, log.p = FALSE) rpps(n, lam, sc, v) x lam sc v log lower.tail log.p p n vector of quantiles. vector of (non-negative) first shape parameters. vector of (non-negative) scale parameters. vector of (non-negative) second shape parameters. logical; if TRUE, probabilities/densities p are returned as log(p). logical; if TRUE (default), probabilities are P [X x], otherwise, P [X > x]. logical; if TRUE, probabilities/densities p are returned as log(p). vector of probabilities. number of random values to return. Details Value The PPS distribution has density cumulative distribution function quantile function and hazard function f(x) = λν[log(x/σ)] ( ν 1)exp( λ[log(x/σ)] ν )/x, F (x) = 1 exp( λ[log(x/σ) ν ]), Q(p) = σexp([ (1/λ)log(1 p)] ( 1/ν)) λν(log(x/σ)) ( ν 1)x ( 1). See Sarabia and Prieto (2009) for the details about the numbers random generation. dpps gives the (log) density, ppps gives the (log) distribution function, qpps gives the quantile function, and rpois generates random samples. Invalid parameters will result in return value NaN, with a warning. The length of the result is determined by n for rpps, and is the common length of the numerical arguments for the other functions.

PPS.fit 9 print(x <- sort(rpps(10, 1.2, 100, 2.3))) dpps(x, 1.2, 100, 2.3) ppps(x, 1.2, 100, 2.3) qpps(ppps(x, 1.2, 100, 2.3), 1.2, 100, 2.3) hpps(x, 1.2, 100, 2.3) PPS.fit Fitting the Pareto Positive Stable (PPS) distribution PPS.fit() returns the fit of a PPS distribution to real data, allowing the scale parameter to be held fixed if desired. PPS.fit(x, estim.method = "MLE", sigma = NULL, start, Pareto = FALSE,...) x Details estim.method sigma start Pareto a vector of observations... other arguments. the method of parameter estimation. It may be "MLE", "imle", "OLS", or "LMOM". the value of the scale parameter, if it is known; if the value is NULL, the parameter is estimated. a list with the initial values of the parameters for some of the estimation methods. a logical argument to constrain the PPS fit to a Pareto fit when the value is TRUE. The maximum likelihood method implemented by the direct optimization of the log-likelihood is given by estim.method = "MLE". The numerical algorithm to search the optimum is the Nelder- Mead method implemented in the optim function, considering as initial values those given in the start argument or, if it is missing, those provided by the OLS method. A different approximation of the maximum likelihood estimates is given by estim method = "imle"; it is an iterative methodology where optimize() function provides the optimum scale parameter value, while the uniroot() function solve normal equations for that given scale parameter.

10 PPS.fit The regression estimates ("OLS") searchs an optimum scale value (in a OLS criterion) by the optimize() function. Then the rest of the parameters are estimated also by OLS, as appears in Sarabia and Prieto (2009). In the L-moments method ("LMOM") estimates are obtained searching parameters that equal the first three sample and theoretical L-moments by means of the Nelder-Mead algorithm implemented in optim(); the initial values are given in the start argument or, if it is missing, provided by the "imle". Value A PPSfit Object, a list with estimateparameter estimates. loglikthe log-likelihood value. nthe number of observations. obsthe observations. obsnamethe name of the variable with the observations. estim.methodthe method of parameter estimation. When this last value is "LMOM" the function also returns details about the convergence of the numerical method involved (convergence value). Hosking, J. R. M. (1990). L-moments: analysis and estimation of distributions using linear combinations of order statistics. Journal of the Royal Statistical Society, Series B, 52, 105-124. coef.ppsfit, print.ppsfit, plot.ppsfit, GoF data(turkey) fit <- PPS.fit(turkey$Pop2000) print(fit) coef(fit) se(fit, k = 100, parallel = FALSE) loglik(fit) par(mfrow=c(2,2)) plot(fit) GoF(fit, k = 100, parallel = FALSE)

print.ppsfit 11 print.ppsfit Printing a PPSfit Object It prints its argument (typically from PPS.fit()), returning some of the most important aspects. ## S3 method for class PPSfit print(x, digits = max(3, getoption("digits") - 3),...) x digits a PPSfit object.... other arguments. the number of digits to be printed. PPS.fit x <- rpps(50, 1.2, 100, 2.3) fit <- PPS.fit(x) print(fit) se Approximated standard errors of Pareto Positive Stable (PPS) parameter estimates It approximates the stantard errors of PPS parameter estimates by bootstrapping. se(ppsfit, k = 2000, parallel = TRUE, ncores = 2,...)

12 turkey PPSfit k Details parallel ncores... other arguments. a PPSfit Object, typically from PPS.fit(). the number of steps in the bootstrapping procedure. A logical argument specifying if parallelization is allowed in the bootstrapping procedure. is the number of cores that we use if parallel is TRUE The function simulates k samples from the model given in the PPSfit argument, fits them with the same method of estimation and uses the parameter estimates to approximate the standard errors. Value A list with the standard errors. PPS.fit x <- rpps(50, 1.2, 100, 2.3) fit <- PPS.fit(x) coef(fit) se(fit, k = 50) turkey Population of Turkish cities and towns Census population of Turkish cities and towns of more than 20,000 inhabitants in 1990 and 2000. Format A data frame with 280 observations on the following 4 variables. Namecities and towns names. Adm.abbreviated province name of cities and towns. Pop1990the population in 1990. Pop2000the population in 2000. #

turkey 13 Source http://www.citypopulation.de/turkey-c20.html

Index coef.ppsfit, 2, 5, 10 dpps (PPS), 7 forbes400, 3 GoF, 3, 5, 6, 10 hpps (PPS), 7 loglik.ppsfit, 4 pareto.fit, 5 ParetoPosStable, 6 ParetoPosStable-package (ParetoPosStable), 6 plot, 6 plot.ppsfit, 4, 5, 6, 10 ppps (PPS), 7 PPS, 7 PPS.fit, 2, 4 7, 9, 11, 12 print.ppsfit, 5, 10, 11 print.se (se), 11 qpps (PPS), 7 rpps (PPS), 7 se, 11 turkey, 12 14