Time Series Data Analysis on Agriculture Food Production

Size: px
Start display at page:

Download "Time Series Data Analysis on Agriculture Food Production"

Transcription

1 , pp Time Series Data Analysis on Agriculture Food Production A.V.S. Pavan Kumar 1 and R. Bhramaramba 2 1 Research Scholar, Department of Computer Science and Engineering, GIT, GITAM University, Visakhapatnam avspavankumarmca@gmail.com 2 Associate Professor, Department of Information Technology, GIT, GITAM University, Visakhapatnam bhramarambaravi@gmail.com Abstract. Agriculture food production is the backbone of every economy. In a country like India, which has ever increasing demand of food due to rising population, advances in agriculture sector are required to meet the needs. Time series data is a special case of time stamped data. It is similar to a number line. The events are uniformly separated in time variety of domains like engineering, research, medicine and finance. This paper presents an identifying the underlying structure of the time series and fitting an appropriate Auto Regressive Integrated Moving Average model with a case study on agriculture food production time series data with R software. Keywords: Time Series Data, Agriculture food production data, Prediction, ARIMA Model, Augmented Dickey-Fuller Test, Kwiatkowski-Phillips- Schmidt-Shin test. 1 Introduction Agriculture food production is a large portion of the economic output. Together with the breeding industry, researchers try to identify the food production growth in a yearly manner. The application of machine learning to agriculture is an incredibly important advance in agriculture technologies because it allows these systems to leverage and combine historic information with real-time information such as agriculture crop yield prediction, food production, and weather information, that the farmer maintains. Machine learning techniques augment this heterogeneous data by combining it with the farmer s knowledge to provide coherence to the vast amounts of data being generated in today s farm environment. It allows these systems to learn about characteristics of each field and adapt systemic recommendations that get better over time. Machine learning allows farmers and agribusinesses alike to make better decisions, in real-time, even in the absence of complete information. Time Series data is a special case of time stamped data. It is similar to a number line. The events are uniformly separated in time in variety of domains like engineering, research, medicine and finance. Time series analysis attempts to model the underlying structure of observations taken over time. A time series, denoted Y=a +bx, is an ordered sequence ISSN: ASTL Copyright 2017 SERSC

2 of equally spaced values over time. Time series analysis has several applications in finance, economics, engineering and manufacturing. The important steps involved in time series analysis are a) Identify and model the structure of the time series b) Estimating the Parameters c) Diagnostic Checking with Tests and d) Forecast the future values in the time series. 2 Implementation The ARIMA model is also called as Box-Jenkins methodology (Box and Jenkins 1976). The Box-Jenkins procedure is concerned with fitting a mixed ARIMA model to a given set of data. The main objective in fitting ARIMA model is to identify the stochastic process of the time series and predict the future values accurately. This paper discusses the implementation of forecasting model on food production data with R Tool. The main requirements to implement this are packages named Forecast, fma, expsmooth, lmtest, zoo, tseries need to be loaded into the R studio. The yearly production of food grains in CSV form is the input file. The important steps involved are. 2.1 Examining and converting the continuous data into time series data The data can be understood by using descriptive statistical functions like mean, median, min, max, standard deviation, and summary data. Data in the dataset is in the normal numerical form (Continuous form). So, now data has to be converted into time series data. ts() function in R can be used for the conversion of data from continuous to time series and stored as an object. Once you have read the time series data into R, then store the time series object in R as foodgraints. > foodgraints=ts(foodgrains$food.grain.production,start=1951) To understand the time series data effectively we can use a graph. To draw a graph we use a command called plot function in R studio > plot(foodgraints). Copyright 2017 SERSC 521

3 Fig. 1. Continuous Data Sets to Time Lines 2.2 Model fitting for the analysis of the data ARIMA model needs the series data must be stationary. Stationary data consists of its mean, variance, and auto covariance which are time invariant. The ADF stands for Augmented Dickey-Fuller. This test is a formal stationary test for the data set. > adf.test(foodgraints) KPSS another unit root test called Kwiatkowski-Phillips-Schmidt-Shin test. > kpss.test(foodgraints) # Second Test for Stationary We are creating an ARIMA model. ARIMA stands for Auto-regressive Integrated Moving Average. To create a First ARIMA model, We have a special function called ARIMA function in R studio. This function will fit the data set into arima and creates a model. >fit=arima(foodgraints,order=c(0,1,1)) This created model called fit is added as an object in the Global Environment of the R studio. ACF stands for Auto Correlation Function. This is used to find whether the data is stationary or not. To draw a acf plot use a function called acf() in R studio. > acf(diff(foodgraints)) plot(forecast(fit)) # PACF plot of residuals 522 Copyright 2017 SERSC

4 Fig. 2. Forecast of food grains using ARIMA > Box.test(residuals(fit),type="Ljung") # Ljung Box Test Box-Ljung test data: residuals(fit) X-squared = , df = 1, p-value = Creating new models using ARIMA > fit1=auto.arima(diff(foodgraints)) # Auto Arima for finding the Arima(p,d,q), p,d,qprameters It creates a new model called fit1 and adds to the Global Environment window of the R studio. > plot(forecast(fit1)) # Forecast plot Copyright 2017 SERSC 523

5 Fig. 3. Forecast from ARIMA with non-zero mean > fit2=nnetar(diff(foodgraints),maxit=1000) # Implementing Neural Network on Timeseries data It creates a new object fit2 and adds to the Global Environment window of a R studio. Forecast plot to plot the historical data with forecasts and prediction of intervals. Drawing a Forecast plot on neural network model fit3 using plot() function. > plot(forecast(fit2)) Fig. 4. Forecast Plot on Neural Network 524 Copyright 2017 SERSC

6 To display the summary of the model 2 (Neural network model) > fit2 Series: diff(foodgraints) Model: NNAR(4,2) Call: nnetar(y = diff(foodgraints), maxit = 1000) Average of 20 networks, each of which is a network with 13 weights options were - linear output units. sigma^2 estimated as Conclusion In the study of Time series analysis on agriculture food production data, ARIMA model is used and predicted the values for the next four years. The validity of the predicted values can be checked when the data for the lead periods become available. The model can be used by researchers for forecasting food production in India. However the data need to be updated from time to time with incorporation of current values. References 1. Box, George EP, Gwilym M. Jenkins, Gregory C. Reinsel, and Greta M. Ljung.: Time series analysis: forecasting and control. John Wiley & Sons, Makridakis, Spyros, Michele Hibon, and Claus Moser.: Accuracy of forecasting: An empirical investigation. Journal of the Royal Statistical Society. Series A (General) (1979) 3. Prabakaran, K. and Sivapragasam, C.: Forecasting areas and production of rice in India using ARIMA model. International Journal of Farm Sciences. 4(1), pp (2014) 4. Box, George EP, Gwilym M. Jenkins, Gregory C. Reinsel, and Greta M. Ljung.: Time series analysis: forecasting and control. John Wiley & Sons, Pankratz, Alan.: Forecasting with univariate Box-Jenkins models. Concepts and cases. Vol John Wiley & Sons, 2009 Copyright 2017 SERSC 525

Network Bandwidth Utilization Prediction Based on Observed SNMP Data

Network Bandwidth Utilization Prediction Based on Observed SNMP Data 160 TUTA/IOE/PCU Journal of the Institute of Engineering, 2017, 13(1): 160-168 TUTA/IOE/PCU Printed in Nepal Network Bandwidth Utilization Prediction Based on Observed SNMP Data Nandalal Rana 1, Krishna

More information

Package mtsdi. January 23, 2018

Package mtsdi. January 23, 2018 Version 0.3.5 Date 2018-01-02 Package mtsdi January 23, 2018 Author Washington Junger and Antonio Ponce de Leon Maintainer Washington Junger

More information

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather

More information

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based

More information

The funitroots Package

The funitroots Package The funitroots Package October 8, 2007 Version 260.72 Date 1997-2007 Title Rmetrics - Trends and Unit Roots Author Diethelm Wuertz and many others, see the SOURCE file Depends R (>= 2.4.0), urca, fbasics

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Title. Description. time series Introduction to time-series commands

Title. Description. time series Introduction to time-series commands Title time series Introduction to time-series commands Description The Time-Series Reference Manual organizes the commands alphabetically, making it easy to find individual command entries if you know

More information

PSY 9556B (Feb 5) Latent Growth Modeling

PSY 9556B (Feb 5) Latent Growth Modeling PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth

More information

OpenBudgets.eu: Fighting Corruption with Fiscal Transparency. WP 2, Data Collection and Mining T2.4 Data Mining and Statistical Analytics Task

OpenBudgets.eu: Fighting Corruption with Fiscal Transparency. WP 2, Data Collection and Mining T2.4 Data Mining and Statistical Analytics Task OpenBudgets.eu: Fighting Corruption with Fiscal Transparency Project Number: 645833 Start Date of Project: 01.05.2015 Duration: 30 months Deliverable D2.4 Data Mining and Statistical Analytics Techniques

More information

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

Data Mining Technology Based on Bayesian Network Structure Applied in Learning , pp.67-71 http://dx.doi.org/10.14257/astl.2016.137.12 Data Mining Technology Based on Bayesian Network Structure Applied in Learning Chunhua Wang, Dong Han College of Information Engineering, Huanghuai

More information

National College of Ireland. Project Submission Sheet 2015/2016. School of Computing

National College of Ireland. Project Submission Sheet 2015/2016. School of Computing National College of Ireland Project Submission Sheet 2015/2016 School of Computing Student Name: Anicia Lafayette-Madden Student ID: 15006590 Programme: M.Sc Data Analytics Year: 2015-2016 Module: Configuration

More information

Fuzzy time series forecasting of wheat production

Fuzzy time series forecasting of wheat production Fuzzy time series forecasting of wheat production Narendra kumar Sr. lecturer: Computer Science, Galgotia college of engineering & Technology Sachin Ahuja Lecturer : IT Dept. Krishna Institute of Engineering

More information

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019 Intro to ARMA models FISH 507 Applied Time Series Analysis Mark Scheuerell 15 Jan 2019 Topics for today Review White noise Random walks Autoregressive (AR) models Moving average (MA) models Autoregressive

More information

Forecasting Video Analytics Sami Abu-El-Haija, Ooyala Inc

Forecasting Video Analytics Sami Abu-El-Haija, Ooyala Inc Forecasting Video Analytics Sami Abu-El-Haija, Ooyala Inc (haija@stanford.edu; sami@ooyala.com) 1. Introduction Ooyala Inc provides video Publishers an endto-end functionality for transcoding, storing,

More information

Time Series Analysis by State Space Methods

Time Series Analysis by State Space Methods Time Series Analysis by State Space Methods Second Edition J. Durbin London School of Economics and Political Science and University College London S. J. Koopman Vrije Universiteit Amsterdam OXFORD UNIVERSITY

More information

Package funitroots. November 16, 2017

Package funitroots. November 16, 2017 Title Rmetrics - Modelling Trends and Unit Roots Date 2017-11-12 Version 3042.79 Author Diethelm Wuertz [aut], Tobias Setz [cre], Yohan Chalabi [ctb] Package funitroots November 16, 2017 Maintainer Tobias

More information

[Mahajan*, 4.(7): July, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

[Mahajan*, 4.(7): July, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785 [Mahajan*, 4.(7): July, 05] ISSN: 77-9655 (IOR), Publication Impact Factor:.785 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY OPTIMIZATION OF SURFACE GRINDING PROCESS PARAMETERS

More information

SYS 6021 Linear Statistical Models

SYS 6021 Linear Statistical Models SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are

More information

Conditional Volatility Estimation by. Conditional Quantile Autoregression

Conditional Volatility Estimation by. Conditional Quantile Autoregression International Journal of Mathematical Analysis Vol. 8, 2014, no. 41, 2033-2046 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijma.2014.47210 Conditional Volatility Estimation by Conditional Quantile

More information

Volume 3, Issue 10, October 2015 International Journal of Advance Research in Computer Science and Management Studies

Volume 3, Issue 10, October 2015 International Journal of Advance Research in Computer Science and Management Studies Volume 3, Issue 10, October 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 13: The bootstrap (v3) Ramesh Johari ramesh.johari@stanford.edu 1 / 30 Resampling 2 / 30 Sampling distribution of a statistic For this lecture: There is a population model

More information

Annexes : Sorties SAS pour l'exercice 3. Code SAS. libname process 'G:\Enseignements\M2ISN-Series temp\sas\';

Annexes : Sorties SAS pour l'exercice 3. Code SAS. libname process 'G:\Enseignements\M2ISN-Series temp\sas\'; Annexes : Sorties SAS pour l'exercice 3 Code SAS libname process 'G:\Enseignements\M2ISN-Series temp\sas\'; /* Etape 1 - Création des données*/ proc iml; phi={1-1.583 0.667-0.083}; theta={1}; y=armasim(phi,

More information

Optimization of process parameter for maximizing Material removal rate in turning of EN8 (45C8) material on CNC Lathe machine using Taguchi method

Optimization of process parameter for maximizing Material removal rate in turning of EN8 (45C8) material on CNC Lathe machine using Taguchi method Optimization of process parameter for maximizing Material removal rate in turning of EN8 (45C8) material on CNC Lathe machine using Taguchi method Sachin goyal 1, Pavan Agrawal 2, Anurag Singh jadon 3,

More information

Model Diagnostic tests

Model Diagnostic tests Model Diagnostic tests 1. Multicollinearity a) Pairwise correlation test Quick/Group stats/ correlations b) VIF Step 1. Open the EViews workfile named Fish8.wk1. (FROM DATA FILES- TSIME) Step 2. Select

More information

Long Run Relationship between Global Electronic Cycle, Yen/Dollar Exchange Rate and Malaysia Export. - 國貿四乙 劉易宣

Long Run Relationship between Global Electronic Cycle, Yen/Dollar Exchange Rate and Malaysia Export. - 國貿四乙 劉易宣 Long Run Relationship between Global Electronic Cycle, Yen/Dollar Exchange Rate and Malaysia Export. - 國貿四乙 10342243 劉易宣 Abstract 1. Depreciations of the Japanese Yen vis-a-vis the US Dollar have been

More information

International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications

International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 5, Issue 01, January -2018 e-issn (O): 2348-4470 p-issn (P): 2348-6406 A Survey

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

A. Incorrect! This would be the negative of the range. B. Correct! The range is the maximum data value minus the minimum data value.

A. Incorrect! This would be the negative of the range. B. Correct! The range is the maximum data value minus the minimum data value. AP Statistics - Problem Drill 05: Measures of Variation No. 1 of 10 1. The range is calculated as. (A) The minimum data value minus the maximum data value. (B) The maximum data value minus the minimum

More information

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)

BUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office) SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

Measures of Dispersion

Measures of Dispersion Lesson 7.6 Objectives Find the variance of a set of data. Calculate standard deviation for a set of data. Read data from a normal curve. Estimate the area under a curve. Variance Measures of Dispersion

More information

Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION. Fall 2016

Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION. Fall 2016 Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION Fall 2016 Fall 2016 CS130 - Regression Analysis 2 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Prediction of Crop Yield using Machine Learning

Prediction of Crop Yield using Machine Learning International Research Journal of Engineering and Technology (IRJET) e-issn: 2395-0056 Volume: 05 Issue: 02 Feb-2018 www.irjet.net p-issn: 2395-0072 Prediction of Crop Yield using Machine Learning Rushika

More information

Institute for Statics und Dynamics of Structures Fuzzy Time Series

Institute for Statics und Dynamics of Structures Fuzzy Time Series Institute for Statics und Dynamics of Structures Fuzzy Time Series Bernd Möller 1 Description of fuzzy time series 2 3 4 5 Conclusions Folie 2 von slide422 1 Description of fuzzy time series 2 3 4 5 Conclusions

More information

Yield Estimation using faster R-CNN

Yield Estimation using faster R-CNN Yield Estimation using faster R-CNN 1 Vidhya Sagar, 2 Sailesh J.Jain and 2 Arjun P. 1 Assistant Professor, 2 UG Scholar, Department of Computer Engineering and Science SRM Institute of Science and Technology,Chennai,

More information

Leverage the power of SQL Analytical functions in Business Intelligence and Analytics. Viana Rumao, Asher Dmello

Leverage the power of SQL Analytical functions in Business Intelligence and Analytics. Viana Rumao, Asher Dmello International Journal of Scientific & Engineering Research Volume 9, Issue 7, July-2018 461 Leverage the power of SQL Analytical functions in Business Intelligence and Analytics Viana Rumao, Asher Dmello

More information

Section 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business

Section 4.1: Time Series I. Jared S. Murray The University of Texas at Austin McCombs School of Business Section 4.1: Time Series I Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Time Series Data and Dependence Time-series data are simply a collection of observations gathered

More information

Regression Analysis and Linear Regression Models

Regression Analysis and Linear Regression Models Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical

More information

Implementing Operational Analytics Using Big Data Technologies to Detect and Predict Sensor Anomalies

Implementing Operational Analytics Using Big Data Technologies to Detect and Predict Sensor Anomalies Implementing Operational Analytics Using Big Data Technologies to Detect and Predict Sensor Anomalies Joseph Coughlin, Rohit Mital, Shashi Nittur, Benjamin SanNicolas, Christian Wolf, Rinor Jusufi Stinger

More information

Real Coded Genetic Algorithm Particle Filter for Improved Performance

Real Coded Genetic Algorithm Particle Filter for Improved Performance Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Real Coded Genetic Algorithm Particle Filter for Improved Performance

More information

Statistical Quality Control Approach in Typical Garments Manufacturing Industry in Bangladesh: A Case Study

Statistical Quality Control Approach in Typical Garments Manufacturing Industry in Bangladesh: A Case Study Statistical Quality Control Approach in Typical Garments Manufacturing Industry in Bangladesh: A Case Study * Md. Mohibul Islam and ** Md. Mosharraf Hossain Garments industry is the most important economic

More information

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1 PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation Simple Linear Regression Software: Stata v 10.1 Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and

More information

Applying Residual Control Charts to Identify the False Alarms in a TFT-LCD Manufacturing Process

Applying Residual Control Charts to Identify the False Alarms in a TFT-LCD Manufacturing Process Appl. Math. Inf. Sci. 7, No. 4, 1459-1464 (2013) 1459 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.12785/amis/070426 Applying Residual Control Charts to Identify

More information

Time Series Forecasting Using a Two-Level Multi-Objective Genetic Algorithm: A Case Study of Maintenance Cost Data for Tunnel Fans

Time Series Forecasting Using a Two-Level Multi-Objective Genetic Algorithm: A Case Study of Maintenance Cost Data for Tunnel Fans algorithms Article Time Series Forecasting Using a Two-Level Multi-Objective Genetic Algorithm: A Case Study of Maintenance Cost Data Tunnel Fans Yamur K. Al-Douri 1, *, Hussan Hamodi 1,2 ID Jan Lundberg

More information

A Data-Mining Approach for Wind Turbine Power Generation Performance Monitoring Based on Power Curve

A Data-Mining Approach for Wind Turbine Power Generation Performance Monitoring Based on Power Curve , pp.456-46 http://dx.doi.org/1.1457/astl.16. A Data-Mining Approach for Wind Turbine Power Generation Performance Monitoring Based on Power Curve Jianlou Lou 1,1, Heng Lu 1, Jia Xu and Zhaoyang Qu 1,

More information

Introduction to R for Time Series Analysis Lecture Notes 2

Introduction to R for Time Series Analysis Lecture Notes 2 Introduction to R for Time Series Analysis Lecture Notes 2 1.0 OVERVIEW OF R R is a widely used environment for statistical analysis. The striking difference between R and most other statistical software

More information

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which

More information

Analysing Search Trends

Analysing Search Trends Data Mining in Business Intelligence 7 March 2013, Ben-Gurion University Analysing Search Trends Yair Shimshoni, Google R&D center, Tel-Aviv. shimsh@google.com Outline What are search trends? The Google

More information

Bootstrap and multiple imputation under missing data in AR(1) models

Bootstrap and multiple imputation under missing data in AR(1) models EUROPEAN ACADEMIC RESEARCH Vol. VI, Issue 7/ October 2018 ISSN 2286-4822 www.euacademic.org Impact Factor: 3.4546 (UIF) DRJI Value: 5.9 (B+) Bootstrap and multiple imputation under missing ELJONA MILO

More information

A Robust Optimum Response Surface Methodology based On MM-estimator

A Robust Optimum Response Surface Methodology based On MM-estimator A Robust Optimum Response Surface Methodology based On MM-estimator, 2 HABSHAH MIDI,, 2 MOHD SHAFIE MUSTAFA,, 2 ANWAR FITRIANTO Department of Mathematics, Faculty Science, University Putra Malaysia, 434,

More information

Readers will be provided a link to download the software and Excel files that are used in the book after payment. Please visit

Readers will be provided a link to download the software and Excel files that are used in the book after payment. Please visit Readers will be provided a link to download the software and Excel files that are used in the book after payment. Please visit http://www.xlpert.com for more information on the book. The Excel files are

More information

Fixed-point Simulink Designs for Automatic HDL Generation of Binary Dilation & Erosion

Fixed-point Simulink Designs for Automatic HDL Generation of Binary Dilation & Erosion Fixed-point Simulink Designs for Automatic HDL Generation of Binary Dilation & Erosion Gurpreet Kaur, Nancy Gupta, and Mandeep Singh Abstract Embedded Imaging is a technique used to develop image processing

More information

Modelling and simulation of seismic reflectivity

Modelling and simulation of seismic reflectivity Modelling reflectivity Modelling and simulation of seismic reflectivity Rita Aggarwala, Michael P. Lamoureux, and Gary F. Margrave ABSTRACT We decompose the reflectivity series obtained from a seismic

More information

PS 6: Regularization. PART A: (Source: HTF page 95) The Ridge regression problem is:

PS 6: Regularization. PART A: (Source: HTF page 95) The Ridge regression problem is: Economics 1660: Big Data PS 6: Regularization Prof. Daniel Björkegren PART A: (Source: HTF page 95) The Ridge regression problem is: : β "#$%& = argmin (y # β 2 x #4 β 4 ) 6 6 + λ β 4 #89 Consider the

More information

Multi-Criterion Optimal Design of Building Simulation Model using Chaos Particle Swarm Optimization

Multi-Criterion Optimal Design of Building Simulation Model using Chaos Particle Swarm Optimization , pp.21-25 http://dx.doi.org/10.14257/astl.2014.69.06 Multi-Criterion Optimal Design of Building Simulation Model using Chaos Particle Swarm Optimization Young-Jin, Kim 1,*, Hyun-Su Kim 1 1 Division of

More information

Data Mining on Agriculture Data using Neural Networks

Data Mining on Agriculture Data using Neural Networks Data Mining on Agriculture Data using Neural Networks June 26th, 28 Outline Data Details Data Overview precision farming cheap data collection GPS-based technology divide field into small-scale parts treat

More information

Offline Accessible System for Agricultural E-Commerce Using Unstructured Supplementary Services Data Application

Offline Accessible System for Agricultural E-Commerce Using Unstructured Supplementary Services Data Application International Journal of Computer Science and Telecommunications [Volume 9, Issue 6, November 2018] 5 ISSN 2047-3338 Offline Accessible System for Agricultural E-Commerce Using Unstructured Supplementary

More information

Analysis&Optimization of Design Parameters of Mechanisms Using Ga

Analysis&Optimization of Design Parameters of Mechanisms Using Ga International Journal of Computational Engineering Research Vol, 3 Issue, 7 Analysis&Optimization of Design Parameters of Mechanisms Using Ga B.Venu 1, Dr.M.nagaphani sastry 2 1 Student, M.Tech (CAD/CAM),

More information

AGRICULTURE BASED ANDROID APPLICATION

AGRICULTURE BASED ANDROID APPLICATION AGRICULTURE BASED ANDROID APPLICATION Prof.Aradhana D 1, Shiva Prasad K S 2, Shrivaishnavi J K 3, P. Sowmya 4, Tina Agarwal 5 1 Department of Computer Science & Engineering Ballari Institute of Technology

More information

Package analytics. June 14, 2017

Package analytics. June 14, 2017 Type Package Package analytics June 14, 2017 Title Regression Outlier Detection, Stationary Bootstrap, Testing Weak Stationarity, and Other Tools for Data Analysis Version 2.0 Date 2017-06-14 Author Maintainer

More information

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models

Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models Statistics Preprints Statistics 3-29-1999 Graphical Tools for Exploring and Analyzing Data From ARIMA Time Series Models William Q. Meeker Iowa State University, wqmeeker@iastate.edu Follow this and additional

More information

α - CUT FUZZY CONTROL CHARTS FOR BOTTLE BURSTING STRENGTH DATA

α - CUT FUZZY CONTROL CHARTS FOR BOTTLE BURSTING STRENGTH DATA International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD ISSN 2249-684X Vol. 2 Issue 4 Dec 2012 17-30 TJPRC Pvt. Ltd., α - CUT FUZZY CONTROL

More information

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening Variables Entered/Removed b Variables Entered GPA in other high school, test, Math test, GPA, High school math GPA a Variables Removed

More information

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series

More information

Vignette of the JoSAE package

Vignette of the JoSAE package Vignette of the JoSAE package Johannes Breidenbach 6 October 2011: JoSAE 0.2 1 Introduction The aim in the analysis of sample surveys is frequently to derive estimates of subpopulation characteristics.

More information

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

Parallel learning of content recommendations using map- reduce

Parallel learning of content recommendations using map- reduce Parallel learning of content recommendations using map- reduce Michael Percy Stanford University Abstract In this paper, machine learning within the map- reduce paradigm for ranking

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

CS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1

CS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1 CS130 Regression Winter 2014 Winter 2014 CS130 - Regression Analysis 1 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling is a form of statistical analysis

More information

Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function

Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function B.Moetakef-Imani, M.Pour Department of Mechanical Engineering, Faculty of Engineering, Ferdowsi University of

More information

A Monotonic Sequence and Subsequence Approach in Missing Data Statistical Analysis

A Monotonic Sequence and Subsequence Approach in Missing Data Statistical Analysis Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 12, Number 1 (2016), pp. 1131-1140 Research India Publications http://www.ripublication.com A Monotonic Sequence and Subsequence Approach

More information

July Issue. Volume 01 - Issue 01

July Issue. Volume 01 - Issue 01 IJICT International Journal of Intelligent Computing and Technology July Issue Volume 1 - Issue 1 Comparison of Statistical Approaches for Sentiment Analysis - Malayalam Film Review Authors : Deepu S.Nair,

More information

Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex

Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Keiko I. Powers, Ph.D., J. D. Power and Associates, Westlake Village, CA ABSTRACT Discrete time series

More information

SAS Econometrics and Time Series Analysis 1.1 for JMP

SAS Econometrics and Time Series Analysis 1.1 for JMP SAS Econometrics and Time Series Analysis 1.1 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2011. SAS Econometrics and Time Series Analysis

More information

An ICA based Approach for Complex Color Scene Text Binarization

An ICA based Approach for Complex Color Scene Text Binarization An ICA based Approach for Complex Color Scene Text Binarization Siddharth Kherada IIIT-Hyderabad, India siddharth.kherada@research.iiit.ac.in Anoop M. Namboodiri IIIT-Hyderabad, India anoop@iiit.ac.in

More information

International Journal of Advance Engineering and Research Development. Flow Control Loop Analysis for System Modeling & Identification

International Journal of Advance Engineering and Research Development. Flow Control Loop Analysis for System Modeling & Identification Scientific Journal of Impact Factor(SJIF): 3.134 e-issn(o): 2348-4470 p-issn(p): 2348-6406 International Journal of Advance Engineering and Research Development Volume 2,Issue 5, May -2015 Flow Control

More information

Rotation and Scaling Image Using PCA

Rotation and Scaling Image Using PCA wwwccsenetorg/cis Computer and Information Science Vol 5, No 1; January 12 Rotation and Scaling Image Using PCA Hawrra Hassan Abass Electrical & Electronics Dept, Engineering College Kerbela University,

More information

MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT

MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT Dr. G APPARAO 1*, Mr. A SRINIVAS 2* 1. Professor, Chairman-Board of Studies & Convener-IIIC, Department of Computer Science Engineering,

More information

Global Journal of Engineering Science and Research Management

Global Journal of Engineering Science and Research Management A NOVEL HYBRID APPROACH FOR PREDICTION OF MISSING VALUES IN NUMERIC DATASET V.B.Kamble* 1, S.N.Deshmukh 2 * 1 Department of Computer Science and Engineering, P.E.S. College of Engineering, Aurangabad.

More information

Bernt Arne Ødegaard. 15 November 2018

Bernt Arne Ødegaard. 15 November 2018 R Bernt Arne Ødegaard 15 November 2018 To R is Human 1 R R is a computing environment specially made for doing statistics/econometrics. It is becoming the standard for advanced dealing with empirical data,

More information

References R's single biggest strenght is it online community. There are tons of free tutorials on R.

References R's single biggest strenght is it online community. There are tons of free tutorials on R. Introduction to R Syllabus Instructor Grant Cavanaugh Department of Agricultural Economics University of Kentucky E-mail: gcavanugh@uky.edu Course description Introduction to R is a short course intended

More information

= 75. See Hamilton p. 111 on solving PACF given ACF.

= 75. See Hamilton p. 111 on solving PACF given ACF. Washington University Fall 2009 Department of Economics James Morley Economics 518B Homework #2 GDP and ARMA Models Due: Tuesday 9/15 As with the first homework assignment, try to conserve paper in presenting

More information

A Modified Weibull Distribution

A Modified Weibull Distribution IEEE TRANSACTIONS ON RELIABILITY, VOL. 52, NO. 1, MARCH 2003 33 A Modified Weibull Distribution C. D. Lai, Min Xie, Senior Member, IEEE, D. N. P. Murthy, Member, IEEE Abstract A new lifetime distribution

More information

Optimizing Number of Hidden Nodes for Artificial Neural Network using Competitive Learning Approach

Optimizing Number of Hidden Nodes for Artificial Neural Network using Competitive Learning Approach Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 5, May 2015, pg.358

More information

Differential Evolution Algorithm for Likelihood Estimation

Differential Evolution Algorithm for Likelihood Estimation International Conference on Control, Robotics Mechanical Engineering (ICCRME'2014 Jan. 15-16, 2014 Kuala Lumpur (Malaysia Differential Evolution Algorithm for Likelihood Estimation Mohd Sapiyan bin Baba

More information

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions) THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

Inclusion of Aleatory and Epistemic Uncertainty in Design Optimization

Inclusion of Aleatory and Epistemic Uncertainty in Design Optimization 10 th World Congress on Structural and Multidisciplinary Optimization May 19-24, 2013, Orlando, Florida, USA Inclusion of Aleatory and Epistemic Uncertainty in Design Optimization Sirisha Rangavajhala

More information

Study of microedm parameters of Stainless Steel 316L: Material Removal Rate Optimization using Genetic Algorithm

Study of microedm parameters of Stainless Steel 316L: Material Removal Rate Optimization using Genetic Algorithm Study of microedm parameters of Stainless Steel 316L: Material Removal Rate Optimization using Genetic Algorithm Suresh P #1, Venkatesan R #, Sekar T *3, Sathiyamoorthy V **4 # Professor, Department of

More information

Rural/Urban Divides in Mobile Coverage Expansion

Rural/Urban Divides in Mobile Coverage Expansion Rural/Urban Divides in Mobile Coverage Expansion Pierre Biscaye & C. Leigh Anderson Evans School Policy Analysis & Research Group (EPAR) Evans School of Public Policy & Governance, University of Washington,

More information

Package iclick. R topics documented: February 24, Type Package

Package iclick. R topics documented: February 24, Type Package Type Package Package iclick February 24, 2018 Title A Button-Based GUI for Financial and Economic Data Analysis Version 1.4 Date 2018-02-24 Author Ho Tsung-wu Maintainer A GUI designed to support the analysis

More information

An Efficient Clustering for Crime Analysis

An Efficient Clustering for Crime Analysis An Efficient Clustering for Crime Analysis Malarvizhi S 1, Siddique Ibrahim 2 1 UG Scholar, Department of Computer Science and Engineering, Kumaraguru College Of Technology, Coimbatore, Tamilnadu, India

More information

GEOP 505/MATH 587 Fall 02 Homework 6

GEOP 505/MATH 587 Fall 02 Homework 6 GEOP 55/MATH 587 Fall 2 Homework 6 In grading these homeworks, I found that the major problem that students had was a misunderstanding of the difference between the original time series z, the differenced

More information

2014 Stat-Ease, Inc. All Rights Reserved.

2014 Stat-Ease, Inc. All Rights Reserved. What s New in Design-Expert version 9 Factorial split plots (Two-Level, Multilevel, Optimal) Definitive Screening and Single Factor designs Journal Feature Design layout Graph Columns Design Evaluation

More information

Analysis and Optimization of Parameters Affecting Surface Roughness in Boring Process

Analysis and Optimization of Parameters Affecting Surface Roughness in Boring Process International Journal of Advanced Mechanical Engineering. ISSN 2250-3234 Volume 4, Number 6 (2014), pp. 647-655 Research India Publications http://www.ripublication.com Analysis and Optimization of Parameters

More information

ESTIMATING THE COST OF ENERGY USAGE IN SPORT CENTRES: A COMPARATIVE MODELLING APPROACH

ESTIMATING THE COST OF ENERGY USAGE IN SPORT CENTRES: A COMPARATIVE MODELLING APPROACH ESTIMATING THE COST OF ENERGY USAGE IN SPORT CENTRES: A COMPARATIVE MODELLING APPROACH A.H. Boussabaine, R.J. Kirkham and R.G. Grew Construction Cost Engineering Research Group, School of Architecture

More information

VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES

VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Libraries Conference on Applied Statistics in Agriculture 1996-8th Annual Conference Proceedings VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Stephen N. Sly Jeffrey S.

More information

Generalized least squares (GLS) estimates of the level-2 coefficients,

Generalized least squares (GLS) estimates of the level-2 coefficients, Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical

More information