DriveFaster: Optimizing a Traffic Light Grid System

Size: px
Start display at page:

Download "DriveFaster: Optimizing a Traffic Light Grid System"

Transcription

1 DriveFaster: Optimizing a Traffic Light Grid System Abstract CS221 Fall 2016: Final Report Team Members: Xiaofan Li, Ahmed Jaffery Traffic lights are the central point of control of traffic for cities and can be an effective tool for keeping traffic moving. The cheapest way to speed up traffic flow for a city is to optimize their traffic light system. This project aims to optimize traffic flow through a grid of street intersections by controlling only the traffic light grid. The goal is to minimize time from a random start point to a random end point for every car. Using an MDP model with Q-Learning will allow the system to adapt to a variety of situations and traffic patterns. This paper will give details into the MDP definition and the effect of various features upon the Q-Learning simulation. 1. Introduction This paper will investigate the use of a Markhov Decision Process (MDP) model to learn traffic patterns and apply Q-Learning to improve traffic flow. The model will control only the state of each traffic light (green/red) and base its decision making on traffic information. It will take into account the number of cars in the system and wait time at each traffic light. To simplify the system and reduce the number of variables, the cars will all have a constant speed of 1 and the intersection length will be 1. By keeping these values constant we can focus more on the features that focus on managing an increasing number of cars. Firstly we summarize a paper that gave us background information and define our problem in detail. We will then end by discussing our model and approach as well as analyzing the results of Q-Learning as more features are added. 2. Related Works Traffic has been increasing significantly over the past few decades as many people continue to migrate to a few key cities. Minimizing traffic delays and improving traffic flows is a problem that has been worked on by many municipal governments with varying results. A paper from the Deakin University in Australia conducted a study on traffic simulation comparing the use of Q-Learning and Neural Networks. Their goal was to reduce average delay time at an intersection. In the MDP model they chose actions that maximize short term reward rather than distant future rewards. They had a minimum time that a light 1

2 would be in the green state and have a maximum of 2 extensions that had a fixed time of 10 seconds each. The Neural Network they designed used a Genetic Algorithm and Simulated Annealing which improves the initial solution by selecting another neighbor solution and doing small changes by comparing them to the fitness function. There is no minimum fixed green time or fixed extension time. The advantage of this method is having fewer constraints than Q-Learning and being able to determine exact times for green/red light states. The downside they had was having a much larger state space due to more variability. 3. Content & Simulation a. Problem Definition This project aims to optimize traffic flow through a grid of intersections. The goal is minimize time from a random start point to a random end point for every car. For the scope of this project the model will have 2 major constraints in order to reduce the number of variables: Firstly, the road map will have an M*N grid pattern with fixed distances between each intersection. Secondly, all cars will travel at the same speed and take a time score of 1 to cross a road (from one intersection to the next). The cars will be randomly spawned at an entrance position and make their way over to a randomly chosen exit position. All edges are considered as spawn points and there will only be a green light and red light state for every traffic light (go/stop). The primary variable that will be adjusted is the duration of a green or red light state per traffic light. Initially the reward was calculated by summing the total wait time each car experienced. As the project progressed, the reward was changed to the total time it took for the system to solve (time it took for all cars to reach their destination). b. Car and Traffic Light Generation The MDP is initialized with a set number of cars, a set number of additional cars that will be periodically added, and a grid size of M*N. The number of traffic lights is determined by multiplying M and N. Each traffic light is 1 unit away from each other. The number of cars initialized was arbitrarily set to equal half the number of traffic lights. After each action cycle additional cars are generated at random start positions. Adding additional cars simulates varying traffic trends throughout the day and helps create a more realistic traffic model. These two functions compromise the setup of our simulation. 2

3 c. Approaches i. Oracle The oracle was designed to have a pre-determined traffic pattern and based on this it ran a search algorithm for the optimal action at a given state. A state will take into account traffic at all intersections and determine how to process based on the successorstate function. We used a 2x2 grid with 4 cars. The result was an reward of 0 here because the system knew the positions of each car and enabled the green light before the car arrived at the intersection. ii. Baseline The baseline algorithm takes a greedy approach and simply activates the greenlight for the direction in which there are maximum number of cars at that one intersection. We simulated a 2x2 grid with 4 cars. This is the approach that is used in most intelligent intersections currently. There is also an upper bound defined in terms of maximum time limit that a traffic light can stay Green in one direction per cycle. The result of the baseline was very low as expected. The reward here was 6 due to each car having a wait time of 1.5. iii. MDP a. State The MDP state is defined using a tuple of ([Car States], [Light States]) which describes the state space. The list [Car States] contains a list of car states for each car, each of which has information about the car position, direction and place in line (if waiting for a light). The list [Light States] contains a list of light states, each of which records the current light color for the up/down direction, and the number of cars waiting at each direction. b. Actions The available actions from a state will be defined as an M*N vector of traffic light colors in the next iteration. This will control the next state of each traffic light which will move the cars throughout the streets. c. Randomness In order to introduce randomness of car actions and simulate different driving patterns, cars can choose to make a turn with probabilities as follows: 3

4 P (turning) = dist(currentp os, endp os). 5(1[is currentdirection blocked by light]) d. Random Drivers Decisions P (go straight) = 1 P (turning) The decisions made by each car also adds to the randomness of the environment. The drivers follow the rules below: 1. Each driver always make progress towards the goal by always turning towards the goal positions. 2. If the goal is on the left side and in front of the current position: there s a 50% chance that the car will turn left and 50% that the car will go straight. In this case, the driver s decision is independent of the current signal color. 3. If the goal is on the right side and in front of the current position: 3.1. if the current light is red, then the car has a 80% chance of turning right and 20% chance of waiting for the light to turn green If the current light is green, then the car has 20% chance of turning right and 80% chance of going straight. 4. It is impossible for the goal to be behind the current position based on the above rules and the fact that we always initialize the starting positions on the borders of the grid. The intuition behind the rules is to simulate realistic decision making for the drivers. e. State Transition Complexity At each time step, we update State based on the current Action. First the light state is updated because it is most directly impacted by the current action vector; then the car state is updated based on direction, position as well as the updated light state. Light states and car states can each be updated in parallel because any light or car is independent of one another. The only dependency is that the car states have to be updated after light states. If there are L lights and N cars then at each time step: S equential update complexity : O(L + N ) 4

5 P arallel update complexity : O(1) iv. Learning We used a standard Q-learning algorithm in order to minimize global wait time, which is the sum of all wait times for all cars spawned. Q ˆ opt(s, a) (1 η)[qˆ opt(s, a) + η(r + γv ˆ opt(s 0 ))] V ˆ opt(s 0 ) = max [a Action(s )]Qˆ opt(s, a ) Beside from standard Q-learning algorithm, we also explored various feature extractors: 1. Traffic Light Information Features total_action Keep track of all possible actions at a given state. total_red Total number of lights that are red. total_green - Total number of lights that are green. total_cars Total number of cars on the grid currently. cars_per_light_per_direction Total number of cars at each traffic light per direction. Will be stored in a hash. This will help the computer learn that it is not good to have too many cars in the que. wait_time_per_light_per_direction How much total wait time is in the traffic light queue currently. This will help keep track of cars that have been on the grid for a long time and help prioritize those cars. 2. Spacial Features 2D_Neighborhood - This will extract features based on the 2D neighborhood of a given light. When we know that all the lights surrounding a particular intersection are green/red, we can assign more/less weight accordingly. The spacial feature will extract the geometric features in the intersection grid and use it as a heavily weighted feature. This will help optimize sub sections of the grid map rather than just looking at the whole grid map. 4. Results a. Evaluation Metric 5

6 To evaluate the effectiveness of our MDP model we ran Q-Learning for various grid sizes with 30 cars and 1000 iterations. We will also investigate the effect of each feature individually and plot how features affect the overall reward of the system. The graph will display a comparison of the number of features versus total reward. This will display the effectivity of our feature extractor and give a comparison of the model vs the optimal result as features are added. b. Analysis In order to better analyze the results, we introduce the concept of Accuracy, where it is defined as: Accuracy = -1 * 100 / score. With this, we can directly compare the results of the learning algorithm with different features. We also compare standard deviation for every 100 iterations to show how stable the predictions are. In the diagram, the three sets of features compared are the following: 1. Feature Set 1: All basic features such as number of cars at each light, number of current cars and current turning directions etc. 2. Feature Set 2: All temporal features such as cumulative wait time for each car, ticketing number etc. 3. Feature Set 3: All spatial features such as neighborhood features and lighting states in certain positions of the grid etc. In total, we have 7 features across all three feature groups. The results are as follows: 6

7 In the above diagram, we compare the prediction accuracy with the features we used to do the prediction across each grid size. As we can see, the spatial features really increased the accuracy of the prediction. Especially for small grids, it shows significant improvement with spatial features. This result makes sense because in smaller grids, spatial features such as the neighborhood feature essentially computes the whole state. It is also worth noting that with only temporal feature (with feature 1+2), we are seeing degradation in larger grids (16 and 25 lights). This is probably because the temporal features are most focused on localized states such as car wait time. Therefore, with larger grids, it actually hurts accuracy prediction. However, it does show that the spatial extractor performs with higher deviations with larger grids. We think it is because with larger grids, there are more information being incorporated in the neighborhood feature, thus creating more randomness, which makes the results less stable. Despite the stability issue, the results with the spatial feature extractor still consistently outperform the other features. Overall, we cannot see a trend in the standard deviations across grid sizes. In general, the deviation increases as the grid sizes increase but it is not clear if this observation will continue had we tried with larger sizes. 7

8 c. Challenges The primary challenges of this approach was keeping track of car wait time, taking into account neighboring local traffic lights, and managing the state size. Initially the car wait time was included in the state, however this was affecting the Q-learning as each additional second the car waited that was being considered as a new state. In reality the system state is the same regardless of the wait time of each individual car. This issue was solved by replacing the car wait time with overall simulation time. This approach also allowed the model to optimize for a global solution rather than only individual cars. For example, if all cars except 1 reach their goal in 5 time units and the last car reaches its goal in 100 time units, the average time for each car is low; however the full system time is 100. Optimizing for the global solution minimizes the full system time and ensures that all cars will reach their destination faster. This also reduced the state size considerably. One of the primary features that was decided upon was taking into account the local grid for each traffic light and the total number of cars in each direction at neighboring lights, rather than looking at each traffic light separately. Dividing the grid however prioritized moving cars through certain sectors in certain directions. If there were more cars traveling in the North/South direction then the cars going West/East would end up waiting until the North/South direction was cleared of cars. This was solved by adding in a maximum time that a traffic light could be green in one direction. After this maximum time was hit, the traffic light would switch directions regardless of the number of cars. 5. Conclusion This paper studied the effectivity of using the MDP model for improving traffic flow through traffic light control. The three main features that are investigated consist of basic traffic traffic light information, cumulative traffic wait time, and local neighborhood features. As seen in the graphs above, the local neighborhood feature was determined to be the most crucial as it alone improved the average score by about 24%. The primary challenge that was unable to be overcome was the large state size and scaling the simulation up. In the future, work will have to be done to decrease the state complexity and use a sampling method to look at every Nth traffic light rather 8

9 than every single one. In conclusion, applying Q-Learning with a few critical features on an MDP model can be a viable way to solve the traffic light control problem. 6. References

Welfare Navigation Using Genetic Algorithm

Welfare Navigation Using Genetic Algorithm Welfare Navigation Using Genetic Algorithm David Erukhimovich and Yoel Zeldes Hebrew University of Jerusalem AI course final project Abstract Using standard navigation algorithms and applications (such

More information

CSE151 Assignment 2 Markov Decision Processes in the Grid World

CSE151 Assignment 2 Markov Decision Processes in the Grid World CSE5 Assignment Markov Decision Processes in the Grid World Grace Lin A484 gclin@ucsd.edu Tom Maddock A55645 tmaddock@ucsd.edu Abstract Markov decision processes exemplify sequential problems, which are

More information

Homework 2: Search and Optimization

Homework 2: Search and Optimization Scott Chow ROB 537: Learning Based Control October 16, 2017 Homework 2: Search and Optimization 1 Introduction The Traveling Salesman Problem is a well-explored problem that has been shown to be NP-Complete.

More information

Solving Traveling Salesman Problem Using Parallel Genetic. Algorithm and Simulated Annealing

Solving Traveling Salesman Problem Using Parallel Genetic. Algorithm and Simulated Annealing Solving Traveling Salesman Problem Using Parallel Genetic Algorithm and Simulated Annealing Fan Yang May 18, 2010 Abstract The traveling salesman problem (TSP) is to find a tour of a given number of cities

More information

Controlling user groups in traffic

Controlling user groups in traffic Controlling user groups in traffic Jaap Vreeswijk 1, Luc Wismans 2, Bas Tutert 3 1. Imtech Traffic & Infra, Basicweg 16, 3821 BR Amersfoort, The Netherlands, Phone: +31 33 454 1724, Mail: jaap.vreeswijk@imtech.com

More information

Reinforcement Learning for Adaptive Routing of Autonomous Vehicles in Congested Networks

Reinforcement Learning for Adaptive Routing of Autonomous Vehicles in Congested Networks Reinforcement Learning for Adaptive Routing of Autonomous Vehicles in Congested Networks Jonathan Cox Aeronautics & Astronautics Brandon Jennings Mechanical Engineering Steven Krukowski Aeronautics & Astronautics

More information

GenStat for Schools. Disappearing Rock Wren in Fiordland

GenStat for Schools. Disappearing Rock Wren in Fiordland GenStat for Schools Disappearing Rock Wren in Fiordland A possible decrease in number of Rock Wren in the Fiordland area of New Zealand between 1985 and 2005 was investigated. Numbers were recorded in

More information

Lightweight Simulation of Air Traffic Control Using Simple Temporal Networks

Lightweight Simulation of Air Traffic Control Using Simple Temporal Networks Lightweight Simulation of Air Traffic Control Using Simple Temporal Networks Russell Knight Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive MS 126-347 Pasadena, CA 91109

More information

Traffic Light Control by Multiagent Reinforcement Learning Systems

Traffic Light Control by Multiagent Reinforcement Learning Systems Traffic Light Control by Multiagent Reinforcement Learning Systems Bram Bakker and Shimon Whiteson and Leon J.H.M. Kester and Frans C.A. Groen Abstract Traffic light control is one of the main means of

More information

Training Intelligent Stoplights

Training Intelligent Stoplights Training Intelligent Stoplights Thomas Davids, Michael Celentano, and Luke Knepper December 14, 2012 1 Introduction Traffic is a huge problem for the American economy. In 2010, the average American commuter

More information

Formations in flow fields

Formations in flow fields Formations in flow fields Rick van Meer June 11, 2015 1 Introduction Pathfinding and formations (i.e. an arrangement of agents) are both a part of artificial intelligence and can be used in games. However,

More information

Improving the Data Scheduling Efficiency of the IEEE (d) Mesh Network

Improving the Data Scheduling Efficiency of the IEEE (d) Mesh Network Improving the Data Scheduling Efficiency of the IEEE 802.16(d) Mesh Network Shie-Yuan Wang Email: shieyuan@csie.nctu.edu.tw Chih-Che Lin Email: jclin@csie.nctu.edu.tw Ku-Han Fang Email: khfang@csie.nctu.edu.tw

More information

SPEED SURVEY ANALYSIS SYSTEM

SPEED SURVEY ANALYSIS SYSTEM SPEED SURVEY ANALYSIS SYSTEM user s manual CROSSROADS SOFTWARE 210 W. Birch Street, Suite 207 Brea, CA 92821 (714) 990-6433 Fax: (714) 990-5628 www.crossroadssoftware.com Chapter One: Introducing the Speed

More information

Estimating Parking Spot Occupancy

Estimating Parking Spot Occupancy 1 Estimating Parking Spot Occupancy David M.W. Landry and Matthew R. Morin Abstract Whether or not a car occupies a given parking spot at a given time can be modeled as a random variable. By looking at

More information

Control Charts. An Introduction to Statistical Process Control

Control Charts. An Introduction to Statistical Process Control An Introduction to Statistical Process Control Course Content Prerequisites Course Objectives What is SPC? Control Chart Basics Out of Control Conditions SPC vs. SQC Individuals and Moving Range Chart

More information

DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs

DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs : A Distributed Adaptive-Learning Routing Method in VDTNs Bo Wu, Haiying Shen and Kang Chen Department of Electrical and Computer Engineering Clemson University, Clemson, South Carolina 29634 {bwu2, shenh,

More information

CS 229: Machine Learning Final Report Identifying Driving Behavior from Data

CS 229: Machine Learning Final Report Identifying Driving Behavior from Data CS 9: Machine Learning Final Report Identifying Driving Behavior from Data Robert F. Karol Project Suggester: Danny Goodman from MetroMile December 3th 3 Problem Description For my project, I am looking

More information

Review of the Robust K-means Algorithm and Comparison with Other Clustering Methods

Review of the Robust K-means Algorithm and Comparison with Other Clustering Methods Review of the Robust K-means Algorithm and Comparison with Other Clustering Methods Ben Karsin University of Hawaii at Manoa Information and Computer Science ICS 63 Machine Learning Fall 8 Introduction

More information

CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp

CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp Chris Guthrie Abstract In this paper I present my investigation of machine learning as

More information

Universiteit Leiden Computer Science

Universiteit Leiden Computer Science Universiteit Leiden Computer Science Optimizing octree updates for visibility determination on dynamic scenes Name: Hans Wortel Student-no: 0607940 Date: 28/07/2011 1st supervisor: Dr. Michael Lew 2nd

More information

A Joint Replication-Migration-based Routing in Delay Tolerant Networks

A Joint Replication-Migration-based Routing in Delay Tolerant Networks A Joint -Migration-based Routing in Delay Tolerant Networks Yunsheng Wang and Jie Wu Dept. of Computer and Info. Sciences Temple University Philadelphia, PA 19122 Zhen Jiang Dept. of Computer Science West

More information

Non-Homogeneous Swarms vs. MDP s A Comparison of Path Finding Under Uncertainty

Non-Homogeneous Swarms vs. MDP s A Comparison of Path Finding Under Uncertainty Non-Homogeneous Swarms vs. MDP s A Comparison of Path Finding Under Uncertainty Michael Comstock December 6, 2012 1 Introduction This paper presents a comparison of two different machine learning systems

More information

Stability of Marriage and Vehicular Parking

Stability of Marriage and Vehicular Parking Stability of Marriage and Vehicular Parking Daniel Ayala, Ouri Wolfson, Bo Xu, Bhaskar DasGupta, and Jie Lin University of Illinois at Chicago Abstract. The proliferation of mobile devices, location-based

More information

Rise-Time Enhancement Techniques for Resistive Array Infrared Scene Projectors

Rise-Time Enhancement Techniques for Resistive Array Infrared Scene Projectors Rise-Time Enhancement Techniques for Resistive Array Infrared Scene Projectors Greg Franks a, Joe LaVeigne a, Kevin Sparkman a, Jim Oleson a a Santa Barbara Infrared, Inc., 30 S. Calle Cesar Chavez, #D,

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

ˆ The exam is closed book, closed calculator, and closed notes except your one-page crib sheet.

ˆ The exam is closed book, closed calculator, and closed notes except your one-page crib sheet. CS Summer Introduction to Artificial Intelligence Midterm ˆ You have approximately minutes. ˆ The exam is closed book, closed calculator, and closed notes except your one-page crib sheet. ˆ Mark your answers

More information

Predicting Diabetes using Neural Networks and Randomized Optimization

Predicting Diabetes using Neural Networks and Randomized Optimization Predicting Diabetes using Neural Networks and Randomized Optimization Kunal Sharma GTID: ksharma74 CS 4641 Machine Learning Abstract This paper analysis the following randomized optimization techniques

More information

Archna Rani [1], Dr. Manu Pratap Singh [2] Research Scholar [1], Dr. B.R. Ambedkar University, Agra [2] India

Archna Rani [1], Dr. Manu Pratap Singh [2] Research Scholar [1], Dr. B.R. Ambedkar University, Agra [2] India Volume 4, Issue 3, March 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Performance Evaluation

More information

Optimal Detector Locations for OD Matrix Estimation

Optimal Detector Locations for OD Matrix Estimation Optimal Detector Locations for OD Matrix Estimation Ying Liu 1, Xiaorong Lai, Gang-len Chang 3 Abstract This paper has investigated critical issues associated with Optimal Detector Locations for OD matrix

More information

REINFORCEMENT LEARNING: MDP APPLIED TO AUTONOMOUS NAVIGATION

REINFORCEMENT LEARNING: MDP APPLIED TO AUTONOMOUS NAVIGATION REINFORCEMENT LEARNING: MDP APPLIED TO AUTONOMOUS NAVIGATION ABSTRACT Mark A. Mueller Georgia Institute of Technology, Computer Science, Atlanta, GA USA The problem of autonomous vehicle navigation between

More information

Using Traffic Models in Switch Scheduling

Using Traffic Models in Switch Scheduling I. Background Using Traffic Models in Switch Scheduling Hammad M. Saleem, Imran Q. Sayed {hsaleem, iqsayed}@stanford.edu Conventional scheduling algorithms use only the current virtual output queue (VOQ)

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

arxiv: v1 [cs.cv] 2 Sep 2018

arxiv: v1 [cs.cv] 2 Sep 2018 Natural Language Person Search Using Deep Reinforcement Learning Ankit Shah Language Technologies Institute Carnegie Mellon University aps1@andrew.cmu.edu Tyler Vuong Electrical and Computer Engineering

More information

A Routing Protocol for Utilizing Multiple Channels in Multi-Hop Wireless Networks with a Single Transceiver

A Routing Protocol for Utilizing Multiple Channels in Multi-Hop Wireless Networks with a Single Transceiver 1 A Routing Protocol for Utilizing Multiple Channels in Multi-Hop Wireless Networks with a Single Transceiver Jungmin So Dept. of Computer Science, and Coordinated Science Laboratory University of Illinois

More information

BUSNet: Model and Usage of Regular Traffic Patterns in Mobile Ad Hoc Networks for Inter-Vehicular Communications

BUSNet: Model and Usage of Regular Traffic Patterns in Mobile Ad Hoc Networks for Inter-Vehicular Communications BUSNet: Model and Usage of Regular Traffic Patterns in Mobile Ad Hoc Networks for Inter-Vehicular Communications Kai-Juan Wong, Bu-Sung Lee, Boon-Chong Seet, Genping Liu, Lijuan Zhu School of Computer

More information

Separating Speech From Noise Challenge

Separating Speech From Noise Challenge Separating Speech From Noise Challenge We have used the data from the PASCAL CHiME challenge with the goal of training a Support Vector Machine (SVM) to estimate a noise mask that labels time-frames/frequency-bins

More information

311 Predictions on Kaggle Austin Lee. Project Description

311 Predictions on Kaggle Austin Lee. Project Description 311 Predictions on Kaggle Austin Lee Project Description This project is an entry into the SeeClickFix contest on Kaggle. SeeClickFix is a system for reporting local civic issues on Open311. Each issue

More information

Improving the Hopfield Network through Beam Search

Improving the Hopfield Network through Beam Search Brigham Young University BYU ScholarsArchive All Faculty Publications 2001-07-19 Improving the Hopfield Network through Beam Search Tony R. Martinez martinez@cs.byu.edu Xinchuan Zeng Follow this and additional

More information

ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL

ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY BHARAT SIGINAM IN

More information

Object vs Image-based Testing Producing Automated GUI Tests to Withstand Change

Object vs Image-based Testing Producing Automated GUI Tests to Withstand Change Object vs Image-based Testing Producing Automated GUI Tests to Withstand Change Handling Application Change Script maintenance, and handling application change, is one of the highest impact factors when

More information

Study on Indoor and Outdoor environment for Mobile Ad Hoc Network: Random Way point Mobility Model and Manhattan Mobility Model

Study on Indoor and Outdoor environment for Mobile Ad Hoc Network: Random Way point Mobility Model and Manhattan Mobility Model Study on and Outdoor for Mobile Ad Hoc Network: Random Way point Mobility Model and Manhattan Mobility Model Ibrahim khider,prof.wangfurong.prof.yinweihua,sacko Ibrahim khider, Communication Software and

More information

Background subtraction in people detection framework for RGB-D cameras

Background subtraction in people detection framework for RGB-D cameras Background subtraction in people detection framework for RGB-D cameras Anh-Tuan Nghiem, Francois Bremond INRIA-Sophia Antipolis 2004 Route des Lucioles, 06902 Valbonne, France nghiemtuan@gmail.com, Francois.Bremond@inria.fr

More information

Streaming videos. Problem statement for Online Qualification Round, Hash Code 2017

Streaming videos. Problem statement for Online Qualification Round, Hash Code 2017 Streaming videos Problem statement for Online Qualification Round, Hash Code 2017 Introduction Have you ever wondered what happens behind the scenes when you watch a YouTube video? As more and more people

More information

Graph Structure Over Time

Graph Structure Over Time Graph Structure Over Time Observing how time alters the structure of the IEEE data set Priti Kumar Computer Science Rensselaer Polytechnic Institute Troy, NY Kumarp3@rpi.edu Abstract This paper examines

More information

Histogram-Based Density Discovery in Establishing Road Connectivity

Histogram-Based Density Discovery in Establishing Road Connectivity Histogram-Based Density Discovery in Establishing Road Connectivity Kevin C. Lee, Jiajie Zhu, Jih-Chung Fan, Mario Gerla Department of Computer Science University of California, Los Angeles Los Angeles,

More information

Naïve Bayes for text classification

Naïve Bayes for text classification Road Map Basic concepts Decision tree induction Evaluation of classifiers Rule induction Classification using association rules Naïve Bayesian classification Naïve Bayes for text classification Support

More information

Optimal Crane Scheduling

Optimal Crane Scheduling Optimal Crane Scheduling IonuŃ Aron Iiro Harjunkoski John Hooker Latife Genç Kaya March 2007 1 Problem Schedule 2 cranes to transfer material between locations in a manufacturing plant. For example, copper

More information

outline Sensor Network Navigation without Locations 11/5/2009 Introduction

outline Sensor Network Navigation without Locations 11/5/2009 Introduction Sensor Network Navigation without Locations Mo Li, Yunhao Liu, Jiliang Wang, and Zheng Yang Department of Computer Science and Engineering Hong Kong University of Science and Technology, outline Introduction

More information

Traffic Simulator. Revised Software Requirements Specification. Khalid AlHokail Luke Bay James Grady Michael Murphy

Traffic Simulator. Revised Software Requirements Specification. Khalid AlHokail Luke Bay James Grady Michael Murphy Traffic Simulator Revised Software Requirements Specification Khalid AlHokail Luke Bay James Grady Michael Murphy Version 2.0 03/07/2007 Table of Contents 1. Map Model...2 1.1. Graph...2 1.2. Rules for

More information

Selecting DaoOpt solver configuration for MPE problems

Selecting DaoOpt solver configuration for MPE problems Selecting DaoOpt solver configuration for MPE problems Summer project report Submitted by: Abhisaar Sharma Submitted to: Dr. Rina Dechter Abstract We address the problem of selecting the best configuration

More information

Cluster Analysis. Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX April 2008 April 2010

Cluster Analysis. Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX April 2008 April 2010 Cluster Analysis Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX 7575 April 008 April 010 Cluster Analysis, sometimes called data segmentation or customer segmentation,

More information

Reinforcement Learning of Traffic Light Controllers under Partial Observability

Reinforcement Learning of Traffic Light Controllers under Partial Observability Reinforcement Learning of Traffic Light Controllers under Partial Observability MSc Thesis of R. Schouten(0010774), M. Steingröver(0043826) Students of Artificial Intelligence on Faculty of Science University

More information

Predicting Bus Arrivals Using One Bus Away Real-Time Data

Predicting Bus Arrivals Using One Bus Away Real-Time Data Predicting Bus Arrivals Using One Bus Away Real-Time Data 1 2 3 4 5 Catherine M. Baker Alexander C. Nied Department of Computer Science Department of Computer Science University of Washington University

More information

DS595/CS525: Urban Network Analysis --Urban Mobility Prof. Yanhua Li

DS595/CS525: Urban Network Analysis --Urban Mobility Prof. Yanhua Li Welcome to DS595/CS525: Urban Network Analysis --Urban Mobility Prof. Yanhua Li Time: 6:00pm 8:50pm Wednesday Location: Fuller 320 Spring 2017 2 Team assignment Finalized. (Great!) Guest Speaker 2/22 A

More information

ADAPTIVE K MEANS CLUSTERING FOR HUMAN MOBILITY MODELING AND PREDICTION Anu Sharma( ) Advisor: Prof. Peizhao Hu

ADAPTIVE K MEANS CLUSTERING FOR HUMAN MOBILITY MODELING AND PREDICTION Anu Sharma( ) Advisor: Prof. Peizhao Hu ADAPTIVE K MEANS CLUSTERING FOR HUMAN MOBILITY MODELING AND PREDICTION Anu Sharma( axs3617@rit.edu ) Advisor: Prof. Peizhao Hu ABSTRACT Human movement follows repetitive trajectories. There has been extensive

More information

Quality of Service Mechanism for MANET using Linux Semra Gulder, Mathieu Déziel

Quality of Service Mechanism for MANET using Linux Semra Gulder, Mathieu Déziel Quality of Service Mechanism for MANET using Linux Semra Gulder, Mathieu Déziel Semra.gulder@crc.ca, mathieu.deziel@crc.ca Abstract: This paper describes a QoS mechanism suitable for Mobile Ad Hoc Networks

More information

Last topic: Summary; Heuristics and Approximation Algorithms Topics we studied so far:

Last topic: Summary; Heuristics and Approximation Algorithms Topics we studied so far: Last topic: Summary; Heuristics and Approximation Algorithms Topics we studied so far: I Strength of formulations; improving formulations by adding valid inequalities I Relaxations and dual problems; obtaining

More information

Traffic-Signal Control in Real-World Environments

Traffic-Signal Control in Real-World Environments Traffic-Signal Control in Real-World Environments Alex Mallery, Ben Cohen-Wang, Edward Lee Abstract Countless man-hours are spent waiting in traffic, due in part to suboptimal traffic light behavior. Extensive

More information

TEST EXAM PART 2 INTERMEDIATE LAND NAVIGATION

TEST EXAM PART 2 INTERMEDIATE LAND NAVIGATION NAME DATE TEST EXAM PART 2 INTERMEDIATE LAND NAVIGATION 1. Knowing these four basic skills, it is impossible to be totally lost; what are they? a. Track Present Location / Determine Distance / Sense of

More information

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems RSM Split-Plot Designs & Diagnostics Solve Real-World Problems Shari Kraber Pat Whitcomb Martin Bezener Stat-Ease, Inc. Stat-Ease, Inc. Stat-Ease, Inc. 221 E. Hennepin Ave. 221 E. Hennepin Ave. 221 E.

More information

Wireless Internet Routing. Learning from Deployments Link Metrics

Wireless Internet Routing. Learning from Deployments Link Metrics Wireless Internet Routing Learning from Deployments Link Metrics 1 Learning From Deployments Early worked focused traditional routing issues o Control plane: topology management, neighbor discovery o Data

More information

Replica Distribution Scheme for Location-Dependent Data in Vehicular Ad Hoc Networks using a Small Number of Fixed Nodes

Replica Distribution Scheme for Location-Dependent Data in Vehicular Ad Hoc Networks using a Small Number of Fixed Nodes Replica Distribution Scheme for Location-Dependent Data in Vehicular d Hoc Networks using a Small Number of Fixed Nodes Junichiro Okamoto and Susumu Ishihara Graduate School of Engineering, Shizuoka University,

More information

Announcements Wednesday, August 22

Announcements Wednesday, August 22 Announcements Wednesday, August 22 Everything you ll need to know is on the master website: http://people.math.gatech.edu/~cjankowski3/18f/m1553/webpage/ or on the website for this section: http://people.math.gatech.edu/~jrabinoff/1819f-1553/

More information

Chapter 6. Semi-Lagrangian Methods

Chapter 6. Semi-Lagrangian Methods Chapter 6. Semi-Lagrangian Methods References: Durran Chapter 6. Review article by Staniford and Cote (1991) MWR, 119, 2206-2223. 6.1. Introduction Semi-Lagrangian (S-L for short) methods, also called

More information

When Network Embedding meets Reinforcement Learning?

When Network Embedding meets Reinforcement Learning? When Network Embedding meets Reinforcement Learning? ---Learning Combinatorial Optimization Problems over Graphs Changjun Fan 1 1. An Introduction to (Deep) Reinforcement Learning 2. How to combine NE

More information

CS 331: Artificial Intelligence Local Search 1. Tough real-world problems

CS 331: Artificial Intelligence Local Search 1. Tough real-world problems CS 331: Artificial Intelligence Local Search 1 1 Tough real-world problems Suppose you had to solve VLSI layout problems (minimize distance between components, unused space, etc.) Or schedule airlines

More information

Information Brokerage

Information Brokerage Information Brokerage Sensing Networking Leonidas Guibas Stanford University Computation CS321 Information Brokerage Services in Dynamic Environments Information Brokerage Information providers (sources,

More information

Deep Q-Learning to play Snake

Deep Q-Learning to play Snake Deep Q-Learning to play Snake Daniele Grattarola August 1, 2016 Abstract This article describes the application of deep learning and Q-learning to play the famous 90s videogame Snake. I applied deep convolutional

More information

A STUDY OF THE PERFORMANCE TRADEOFFS OF A TRADE ARCHIVE

A STUDY OF THE PERFORMANCE TRADEOFFS OF A TRADE ARCHIVE A STUDY OF THE PERFORMANCE TRADEOFFS OF A TRADE ARCHIVE CS737 PROJECT REPORT Anurag Gupta David Goldman Han-Yin Chen {anurag, goldman, han-yin}@cs.wisc.edu Computer Sciences Department University of Wisconsin,

More information

Last Class: Processes

Last Class: Processes Last Class: Processes A process is the unit of execution. Processes are represented as Process Control Blocks in the OS PCBs contain process state, scheduling and memory management information, etc A process

More information

Local Search Methods. CS 188: Artificial Intelligence Fall Announcements. Hill Climbing. Hill Climbing Diagram. Today

Local Search Methods. CS 188: Artificial Intelligence Fall Announcements. Hill Climbing. Hill Climbing Diagram. Today CS 188: Artificial Intelligence Fall 2006 Lecture 5: Robot Motion Planning 9/14/2006 Local Search Methods Queue-based algorithms keep fallback options (backtracking) Local search: improve what you have

More information

ANALYZING AND COMPARING TRAFFIC NETWORK CONDITIONS WITH A QUALITY TOOL BASED ON FLOATING CAR AND STATIONARY DATA

ANALYZING AND COMPARING TRAFFIC NETWORK CONDITIONS WITH A QUALITY TOOL BASED ON FLOATING CAR AND STATIONARY DATA 15th World Congress on Intelligent Transport Systems ITS Connections: Saving Time, Saving Lives New York, November 16-20, 2008 ANALYZING AND COMPARING TRAFFIC NETWORK CONDITIONS WITH A QUALITY TOOL BASED

More information

Lesson 14: Graph of a Linear Equation Horizontal and Vertical Lines

Lesson 14: Graph of a Linear Equation Horizontal and Vertical Lines Lesson 14: Graph of a Linear Equation Horizontal and Vertical Lines Student Outcomes Students graph linear equations in standard form, 0), that produce a horizontal or a vertical line. Lesson Notes The

More information

Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report

Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report Abstract The goal of influence maximization has led to research into different

More information

CS 229 Final Report: Location Based Adaptive Routing Protocol(LBAR) using Reinforcement Learning

CS 229 Final Report: Location Based Adaptive Routing Protocol(LBAR) using Reinforcement Learning CS 229 Final Report: Location Based Adaptive Routing Protocol(LBAR) using Reinforcement Learning By: Eunjoon Cho and Kevin Wong Abstract In this paper we present an algorithm for a location based adaptive

More information

IMPROVING THE DATA COLLECTION RATE IN WIRELESS SENSOR NETWORKS BY USING THE MOBILE RELAYS

IMPROVING THE DATA COLLECTION RATE IN WIRELESS SENSOR NETWORKS BY USING THE MOBILE RELAYS IMPROVING THE DATA COLLECTION RATE IN WIRELESS SENSOR NETWORKS BY USING THE MOBILE RELAYS 1 K MADHURI, 2 J.KRISHNA, 3 C.SIVABALAJI II M.Tech CSE, AITS, Asst Professor CSE, AITS, Asst Professor CSE, NIST

More information

Random Search Report An objective look at random search performance for 4 problem sets

Random Search Report An objective look at random search performance for 4 problem sets Random Search Report An objective look at random search performance for 4 problem sets Dudon Wai Georgia Institute of Technology CS 7641: Machine Learning Atlanta, GA dwai3@gatech.edu Abstract: This report

More information

Programming Project. Remember the Titans

Programming Project. Remember the Titans Programming Project Remember the Titans Due: Data and reports due 12/10 & 12/11 (code due 12/7) In the paper Measured Capacity of an Ethernet: Myths and Reality, David Boggs, Jeff Mogul and Chris Kent

More information

Evaluating Robot Systems

Evaluating Robot Systems Evaluating Robot Systems November 6, 2008 There are two ways of constructing a software design. One way is to make it so simple that there are obviously no deficiencies. And the other way is to make it

More information

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Stanley Bak Abstract Network algorithms are deployed on large networks, and proper algorithm evaluation is necessary to avoid

More information

Introduction to Fall 2008 Artificial Intelligence Midterm Exam

Introduction to Fall 2008 Artificial Intelligence Midterm Exam CS 188 Introduction to Fall 2008 Artificial Intelligence Midterm Exam INSTRUCTIONS You have 80 minutes. 70 points total. Don t panic! The exam is closed book, closed notes except a one-page crib sheet,

More information

Final Exam. Introduction to Artificial Intelligence. CS 188 Spring 2010 INSTRUCTIONS. You have 3 hours.

Final Exam. Introduction to Artificial Intelligence. CS 188 Spring 2010 INSTRUCTIONS. You have 3 hours. CS 188 Spring 2010 Introduction to Artificial Intelligence Final Exam INSTRUCTIONS You have 3 hours. The exam is closed book, closed notes except a two-page crib sheet. Please use non-programmable calculators

More information

DYNAMIC SITE LAYOUT PLANNING USING MTPE PRINCIPLE FROM PHYSICS

DYNAMIC SITE LAYOUT PLANNING USING MTPE PRINCIPLE FROM PHYSICS DYNAMIC SITE LAYOUT PLANNING USING MTPE PRINCIPLE FROM PHYSICS Mohsen Andayesh* and Farnaz Sadeghpour Department of Civil Engineering, University of Calgary, Calgary, Canada * Corresponding author (m.andayesh@ucalgary.ca)

More information

SOLVING AN ACCESSIBILITY-MAXIMIZATION ROAD NETWORK DESIGN MODEL: A COMPARISON OF HEURISTICS

SOLVING AN ACCESSIBILITY-MAXIMIZATION ROAD NETWORK DESIGN MODEL: A COMPARISON OF HEURISTICS Advanced OR and AI Methods in Transportation SOLVING AN ACCESSIBILITY-MAXIMIZATION ROAD NETWORK DESIGN MODEL: A COMPARISON OF HEURISTICS Bruno SANTOS 1, António ANTUNES 2, Eric MILLER 3 Abstract. This

More information

Scalable Trigram Backoff Language Models

Scalable Trigram Backoff Language Models Scalable Trigram Backoff Language Models Kristie Seymore Ronald Rosenfeld May 1996 CMU-CS-96-139 School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 This material is based upon work

More information

Location Traceability of Users in Location-based Services

Location Traceability of Users in Location-based Services Location Traceability of Users in Location-based Services Yutaka Yanagisawa Hidetoshi Kido Tetsuji Satoh, NTT Communication Science Laboratories, NTT Corporation Graduate School of Information Science

More information

Travelling Salesman Problem: Tabu Search

Travelling Salesman Problem: Tabu Search Travelling Salesman Problem: Tabu Search (Anonymized) April 2017 Abstract The Tabu Search algorithm is a heuristic method to find optimal solutions to the Travelling Salesman Problem (TSP). It is a local

More information

Cable s Role in the 5G Evolution

Cable s Role in the 5G Evolution Cable s Role in the 5G Evolution A Technical Paper prepared for SCTE ISBE by Erik Gronvall VP Strategy and Market Development CommScope 501 Shenandoah Dr Shakopee, MN 55379 952.403.8691 erik.gronvall@commscope.com

More information

SILAB A Task Oriented Driving Simulation

SILAB A Task Oriented Driving Simulation SILAB A Task Oriented Driving Simulation Hans-Peter Krueger, Martin Grein, Armin Kaussner, Christian Mark Center for Traffic Sciences, University of Wuerzburg Roentgenring 11 D-97070 Wuerzburg, Germany

More information

Chapter 5 Hashing. Introduction. Hashing. Hashing Functions. hashing performs basic operations, such as insertion,

Chapter 5 Hashing. Introduction. Hashing. Hashing Functions. hashing performs basic operations, such as insertion, Introduction Chapter 5 Hashing hashing performs basic operations, such as insertion, deletion, and finds in average time 2 Hashing a hash table is merely an of some fixed size hashing converts into locations

More information

EXECUTION PLAN OPTIMIZATION TECHNIQUES

EXECUTION PLAN OPTIMIZATION TECHNIQUES EXECUTION PLAN OPTIMIZATION TECHNIQUES Július Štroffek Database Sustaining Engineer Sun Microsystems, Inc. Tomáš Kovařík Faculty of Mathematics and Physics, Charles University, Prague PostgreSQL Conference,

More information

4 INFORMED SEARCH AND EXPLORATION. 4.1 Heuristic Search Strategies

4 INFORMED SEARCH AND EXPLORATION. 4.1 Heuristic Search Strategies 55 4 INFORMED SEARCH AND EXPLORATION We now consider informed search that uses problem-specific knowledge beyond the definition of the problem itself This information helps to find solutions more efficiently

More information

Homework 1 Solutions:

Homework 1 Solutions: Homework 1 Solutions: If we expand the square in the statistic, we get three terms that have to be summed for each i: (ExpectedFrequency[i]), (2ObservedFrequency[i]) and (ObservedFrequency[i])2 / Expected

More information

3. Evaluation of Selected Tree and Mesh based Routing Protocols

3. Evaluation of Selected Tree and Mesh based Routing Protocols 33 3. Evaluation of Selected Tree and Mesh based Routing Protocols 3.1 Introduction Construction of best possible multicast trees and maintaining the group connections in sequence is challenging even in

More information

Fairness Example: high priority for nearby stations Optimality Efficiency overhead

Fairness Example: high priority for nearby stations Optimality Efficiency overhead Routing Requirements: Correctness Simplicity Robustness Under localized failures and overloads Stability React too slow or too fast Fairness Example: high priority for nearby stations Optimality Efficiency

More information

An Evaluation of Shared Multicast Trees with Multiple Active Cores

An Evaluation of Shared Multicast Trees with Multiple Active Cores Brigham Young University BYU ScholarsArchive All Faculty Publications 2001-07-01 An Evaluation of Shared Multicast Trees with Multiple Active Cores Daniel Zappala daniel_zappala@byu.edu Aaron Fabbri Follow

More information

Performance Comparison of Scalable Location Services for Geographic Ad Hoc Routing

Performance Comparison of Scalable Location Services for Geographic Ad Hoc Routing Performance Comparison of Scalable Location Services for Geographic Ad Hoc Routing Saumitra M. Das, Himabindu Pucha and Y. Charlie Hu School of Electrical and Computer Engineering Purdue University West

More information

Robotics. Lecture 5: Monte Carlo Localisation. See course website for up to date information.

Robotics. Lecture 5: Monte Carlo Localisation. See course website  for up to date information. Robotics Lecture 5: Monte Carlo Localisation See course website http://www.doc.ic.ac.uk/~ajd/robotics/ for up to date information. Andrew Davison Department of Computing Imperial College London Review:

More information

Confidence Based Dual Reinforcement Q-Routing: An adaptive online network routing algorithm

Confidence Based Dual Reinforcement Q-Routing: An adaptive online network routing algorithm Confidence Based Dual Reinforcement Q-Routing: An adaptive online network routing algorithm Shailesh Kumar Dept. of Elec. and Computer Engineering The University of Texas at Austin Austin, TX 78712 USA

More information

OPERATING SYSTEMS. Systems with Multi-programming. CS 3502 Spring Chapter 4

OPERATING SYSTEMS. Systems with Multi-programming. CS 3502 Spring Chapter 4 OPERATING SYSTEMS CS 3502 Spring 2018 Systems with Multi-programming Chapter 4 Multiprogramming - Review An operating system can support several processes in memory. While one process receives service

More information