BGP ANOMALY DETECTION USING DATA MINING TECHNIQUES. Iñigo Ortiz de Urbina

Similar documents
Classification of BGP Anomalies Using Decision Trees and Fuzzy Rough Sets

4-Byte AS Numbers. The view from the Old BGP world. Geoff Huston February 2007 APNIC

Global Routing Instabilities

Complex Networks: Traffic Data, Network Topologies, and Routing Anomalies

Machine Learning Models for Classification of BGP Anomalies

Internet Routing Protocols Lecture 01 & 02

Measuring BGP. Geoff Huston. CAIA SEMINAR 31 May

Happy Packets: Some Initial Results

the real-time Internet routing observatory

Other Developments: CIDR

Investigating occurrence of duplicate updates in BGP announcements

Examination. IP routning på Internet och andra sammansatta nät, DD2491 IP routing in the Internet and other complex networks, DD2491

More Specific Announcements in BGP. Geoff Huston APNIC

Inter-Domain Routing Trends

More Specific Announcements in BGP. Geoff Huston APNIC

Feature Selection for Classification of BGP Anomalies using Bayesian Models

Examination. ANSWERS IP routning på Internet och andra sammansatta nät, DD2491 IP routing in the Internet and other complex networks, DD2491

Inter-Domain Routing: BGP

INTERNET INEFFICIENCIES AND INSTABILITY

BGP Scaling Techniques

Resource Certification. Alex Band, Product Manager DENIC Technical Meeting

The BGP-Inspect Project. Manish Karir, Jon Oberheide (Merit) Dionysus Blazakis, John Baras (UMd)

BGP Issues. Geoff Huston

Tracking the Internet s BGP Table

Internet Routing Protocols Lecture 03 Inter-domain Routing

Securing BGP. Geoff Huston November 2007

CS 640: Introduction to Computer Networks. Intra-domain routing. Inter-domain Routing: Hierarchy. Aditya Akella

BGP. Autonomous system (AS) BGP version 4

The real-time Internet routing observatory. Luca Sani RIPE Meeting 77 Amsterdam, NL October 15 th, 2018

BGP. Autonomous system (AS) BGP version 4

Routing the Internet in Geoff Huston APNIC March 2007

Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network

BGP. Autonomous system (AS) BGP version 4

L11 : Inter-domain Routing with BGP Lecture14 Michaelmas, 2016

BGP Scaling Techniques

Inter-domain Routing(BGP) Security [IP Prefix Hijacking] Akmal Khan

Detecting Behavior Propagation in BGP Trace Data Brian J. Premore Michael Liljenstam David Nicol

RIPE Network Coordination Centre RIPE Labs Nathalie Trenaman UKNOF 17 - Sept

the real-time Internet routing observatory Alessandro Improta

CHAPTER 6 EXPERIMENTS

PTT Fórum 6. John Kemp

BGP. Autonomous system (AS) BGP version 4. Definition (AS Autonomous System)

Taming BGP. An incremental approach to improving the dynamic properties of BGP. Geoff Huston. CAIA Seminar 18 August

RIPE NCC Measurements And Tools

Traffic in Network /8. Background. Initial Experience. Geoff Huston George Michaelson APNIC R&D. April 2010

CS BGP v4. Fall 2014

RISng - Technical Overview

TELE 301 Network Management

BGP Route Stability. Alexander Asimov Highload Lab

the real-time Internet routing observatory Luca Sani

How BGP Routers Use the Multi Exit Discriminator for Best Path Selection

Towards a Logic for Wide-Area Internet Routing

Seminars of Software and Services for the Information Society

Quantifying Path Exploration in the Internet

Routing in Geoff Huston Chief Scientist, APNIC

BGP Configuration. BGP Overview. Introduction to BGP. Formats of BGP Messages. Header

RIPE75 - Network monitoring at scale. Louis Poinsignon

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani

Internet Engineering Task Force (IETF) Request for Comments: 8050 Category: Standards Track ISSN: May 2017

the real-time Internet routing observatory

Quantifying Path Exploration in the Internet

Quantifying Path Exploration in the Internet

Soundness of Chilean Internet Routes

CSCD 433/533 Network Programming Fall Lecture 14 Global Address Space Autonomous Systems, BGP Protocol Routing

Comparing IPv4 and IPv6 from the perspec7ve of BGP dynamic ac7vity. Geoff Huston APNIC February 2012

IBM Zürich Research Laboratory. Billy Goat Overview. James Riordan Diego Zamboni Yann Duponchel IBM Research Zurich Switzerland IBM Corporation

BGP. Autonomous system (AS) BGP version 4. Definition (AS Autonomous System)

Dynamics of Hot-Potato Routing in IP Networks

BGP Path Exploration Damping (PED)

CSCI-1680 Network Layer: Inter-domain Routing Rodrigo Fonseca

COMPARISON OF MACHINE LEARNING MODELS FOR CLASSIFICATION OF BGP ANOMALIES

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

32-bit ASNs. Greg Hankins Chris Malayter APRICOT 2009 APRICOT /02/25

BGP in the Internet Best Current Practices

Recommended IOS Releases. BGP in the Internet. Which IOS? Which IOS? 12.2 IOS release images IOS release images is the old mainline train

BGP-Inspect - Extracting Information from Raw BGP Data

BGP. Border Gateway Protocol A short introduction. Karst Koymans. Informatics Institute University of Amsterdam. (version 18.3, 2018/12/03 13:53:22)

A Day in the Life of an Address. Bill Fenner AT&T Labs - Research IETF Routing Area Director

BGP Routing and BGP Policy. BGP Routing. Agenda. BGP Routing Information Base. L47 - BGP Routing. L47 - BGP Routing

internet technologies and standards

TDC 375 Network Protocols TDC 563 P&T for Data Networks

RIPE Labs. Operators Tools, Ideas and Analysis. Mirjam Kühne, RIPE NCC. RIPE Network Coordination Centre. IETF 78 - Juy 2010.

Towards Localizing Root Causes of BGP Dynamics

Internet Protocols Fall Lectures Inter-domain routing, mobility support, multicast routing Andreas Terzis

A framework for BGP data analysis

Interactive Collector Engine. Luca Sani

from BGPlay to ibgplay

Measurement: Techniques, Strategies, and Pitfalls. David Andersen CMU

History. Configuration and Policy. Assigning IP address and AS numbers (Ideally) Addressing Schemes. Problem #1: Lifetime of Address Space

Internet Routing Dynamics

RIB Size Estimation for BGPSEC

BGP Additional Paths. Finding Feature Information. Information About BGP Additional Paths. Problem That Additional Paths Can Solve

Hebei University of Technology A Text-Mining-based Patent Analysis in Product Innovative Process

CSCI-1680 Network Layer: Inter-domain Routing Rodrigo Fonseca


Rapid Detection of BGP Anomalies

BGP Security. Kevin s Attic for Security Research

BSC Smart Cities Initiative

The Implementation of BGP Monitoring, Alarming, and Protecting System by a BGP-UPDATE-Based Method using ECOMMUNITY in Real Time

Routing part 2. Electrical and Information Technology

Transcription:

BGP ANOMALY DETECTION USING DATA MINING TECHNIQUES Iñigo Ortiz de Urbina

Introduction Goal Apply machine learning algorithms to mine network data and produce a preliminary offline anomaly detection system Keywords BGP, network security, data mining, anomaly detection, Internet Routing Forensics, perl

Abnormal BGP Events Events can spread either Globally or locally Sustained or short period Serious economical and social impact Common ABEs Sea cable cuts Prefix hijacks Power blackouts Worms Routing table leaks

What is Data Mining Data mining is the process of extracting patterns from data Becoming an increasingly important tool to transform data into knowledge Implies Data preprocessing Data mining Result validation

Anomaly Detection Assumption Trends sufficiently different from normal behavior are potentially dangerous Anomaly detection systems model normal behavior System spots anomalies matching events against the model It is a classification problem

The dataset Update messages from independent Route Information Servers A collection of Remote Route Collectors distributed globally MRT format Protocol Time Type PeerIP PeerAS Prefix <update dependant information> ASPATH Origin NextHop Local_Pref MED Community Sample BGP 884831401 A 144.228.107.1 1239 205.113.0.0/16 1239 64535769 IGP 192.41.177.241 0 91 BGP 884831402 W 204.70.7.53 3561 198.163.111.0/24 Millions of these updates

Attributes and descriptions NumAnnounce: number of announcements seen in a given time window NumWithd: number of withdrawals seen NumUpdate: Linear combination of the number of announcements and withdrawals. It represents the main volume of updates exchanged

SQL/Slammer January 2003

Attributes and descriptions AnnouncedPrefixes: The total number of announced prefixes in a given period. This is an implicitly important feature in BGP WithdrawnPrefixes: The total number of withdrawn prefixes in the bin MaxAnnouncementsPerPrefixes: Maximum announcements per prefix

Attributes and descriptions MaxASPL: Maximum AS Path length. Instability periods tend to show longer AS Paths maxuaspn: Maximum unique AS numbers in the AS Path. Instability periods introduce new AS numbers announcetolongerpath: Total number of updates announcing a longer path for any given prefix. Measures how far we are from convergence

Attributes and descriptions FirstOrderRatio: Division ratio between the first most active announced prefix and the total number of announcements concentrationratio: Division ratio between the three most active announced prefixes and the total number of announcements

Attributes and descriptions Updates are not uniformly distributed along prefixes! Source: Geoff Huston

Attributes and descriptions Extracted attributes also include average values, per significant attribute computed Certain attributes ommited and discarded after feature selection ThirdOrderRatio: not much info left after FirstOrderRatio and ConcentrationRatio Minimum values: do not provide any meaningful information. Typically, always 0, 1 or similar

Researched events Update: RIPE/Duke University experimental optional attribute

Experiment results

Experiment results 11 out of 14 events are classified with more than 90% accuracy Poor detection for certain events Data cleaning and normalization Information gain based feature selection Further test and tuning needed Add more depth and semantics to our feature vector Root-cause AS, inter-as correlation... Problem handling multivariate data in Weka Use R to mine data instead

Conclusions Encouraging first results Proof that we are in the right road Modern worms are easier to detect in network and transport layers using netflow Both offline and online frameworks can be achieved Alarm reporting, automatic classification Data normalization and appropriate feature selection is a must

Interesting Resources on Internet Intelligence BGPMon RIPE RIS Raw Data Renesys blog Team Cymru Caida Rotueviews HE BGP Toolkit Internet Measurement Data Catalog The Data Repository RIPE Labs Datasets BGP Potaroo Reports

Thanks for your attention @ioc32 inigo@infornografia.net