On the Parikh-de-Bruijn grid

Similar documents
Data Compression Fundamentals

CHAPTER TWO LANGUAGES. Dr Zalmiyah Zakaria

Prefix normal words, binary jumbled pattern matching, and bubble languages

Section 1.7 Sequences, Summations Cardinality of Infinite Sets

Paths, Circuits, and Connected Graphs

Section 2.4 Sequences and Summations

Compiler Construction

Mathematics and Statistics, Part A: Graph Theory Problem Sheet 1, lectures 1-4

MAT 145: PROBLEM SET 4

Chapter 2: Graph Theory

Chapter 4: Regular Expressions

Lecture 5: Graphs. Rajat Mittal. IIT Kanpur

QUESTION BANK. Unit 1. Introduction to Finite Automata

Regular Expressions. Lecture 10 Sections Robb T. Koether. Hampden-Sydney College. Wed, Sep 14, 2016

Basic Combinatorics. Math 40210, Section 01 Fall Homework 4 Solutions

Automata Theory TEST 1 Answers Max points: 156 Grade basis: 150 Median grade: 81%

Glynda, the good witch of the North

Chapter 18: Decidability

Lec-5-HW-1, TM basics

Discrete Mathematics for CS Spring 2008 David Wagner Note 13. An Introduction to Graphs

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao,David Tse Note 8

Algorithms on Words Introduction CHAPTER 1

On Combinatorial Generation of Prefix Normal Words

XVIII Open Cup named after E.V. Pankratiev Stage 10, Grand Prix of Korea, Sunday, February 4, 2018

Counting Faces of Polytopes

Models and Algorithms for Complex Networks

Bayesian Networks and Decision Graphs

SCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics. An Introduction to Graph Theory

Graph Theory Questions from Past Papers

Proof Techniques Alphabets, Strings, and Languages. Foundations of Computer Science Theory

V :non-empty vertex ornode set E V V :edge set G (V, E) :directed graph on V, or digraph on V

Graphs (MTAT , 6 EAP) Lectures: Mon 14-16, hall 404 Exercises: Wed 14-16, hall 402

Assignment No.4 solution. Pumping Lemma Version I and II. Where m = n! (n-factorial) and n = 1, 2, 3

JNTUWORLD. Code No: R

CS402 Theory of Automata Solved Subjective From Midterm Papers. MIDTERM SPRING 2012 CS402 Theory of Automata

Discrete Mathematics and Probability Theory Fall 2013 Vazirani Note 7

PACKING DIGRAPHS WITH DIRECTED CLOSED TRAILS

Notes for Comp 454 Week 2

Combinatorics Summary Sheet for Exam 1 Material 2019

Majority and Friendship Paradoxes

DEFINITIONS. Perpendicular Two lines are called perpendicular if they form a right angle.

Characterization of Graphs with Eulerian Circuits

Math 776 Graph Theory Lecture Note 1 Basic concepts

Chapter 18 out of 37 from Discrete Mathematics for Neophytes: Number Theory, Probability, Algorithms, and Other Stuff by J. M. Cargal.

ITEC2620 Introduction to Data Structures

Adjacent: Two distinct vertices u, v are adjacent if there is an edge with ends u, v. In this case we let uv denote such an edge.

If three points A (h, 0), P (a, b) and B (0, k) lie on a line, show that: a b 1.

CONSTRUCTIONS Introduction Division of a Line Segment

More NP-complete Problems. CS255 Chris Pollett May 3, 2006.

Skyup's Media. PART-B 2) Construct a Mealy machine which is equivalent to the Moore machine given in table.

Phil 320 Chapter 1: Sets, Functions and Enumerability I. Sets Informally: a set is a collection of objects. The objects are called members or

Exercise set 2 Solutions

Interleaving Schemes on Circulant Graphs with Two Offsets

Eulerian Tours and Fleury s Algorithm

Assignment 4 Solutions of graph problems

Polygon Triangulation

On vertex-coloring edge-weighting of graphs

Topic 10 Part 2 [474 marks]

Planar Graphs. 1 Graphs and maps. 1.1 Planarity and duality

Some Upper Bounds for Signed Star Domination Number of Graphs. S. Akbari, A. Norouzi-Fard, A. Rezaei, R. Rotabi, S. Sabour.

1. Which of the following regular expressions over {0, 1} denotes the set of all strings not containing 100 as a sub-string?

THE LABELLED PEER CODE FOR KNOT AND LINK DIAGRAMS 26th February, 2015

Computer Science 280 Fall 2002 Homework 10 Solutions

Characterizing Graphs (3) Characterizing Graphs (1) Characterizing Graphs (2) Characterizing Graphs (4)

Structured System Theory

Math 443/543 Graph Theory Notes 2: Transportation problems

Theorem 2.9: nearest addition algorithm

Graph Theory. Connectivity, Coloring, Matching. Arjun Suresh 1. 1 GATE Overflow

Power Set of a set and Relations

Graph Theory CS/Math231 Discrete Mathematics Spring2015

Automata Theory CS S-FR Final Review

Institute of Operating Systems and Computer Networks Algorithms Group. Network Algorithms. Tutorial 4: Matching and other stuff

An Introduction to Graph Theory

Chapter 1. Essentials of Geometry

6.2 Initial Problem. Section 6.2 Network Problems. 6.2 Initial Problem, cont d. Weighted Graphs. Weighted Graphs, cont d. Weighted Graphs, cont d

Math.3336: Discrete Mathematics. Chapter 10 Graph Theory

How can we lay cable at minimum cost to make every telephone reachable from every other? What is the fastest route between two given cities?

Conway s Tiling Groups

Graph Theory. Chapter 4.

Fundamental Properties of Graphs

Lecture 8: The Traveling Salesman Problem

Matching Algorithms. Proof. If a bipartite graph has a perfect matching, then it is easy to see that the right hand side is a necessary condition.

Eulerian tours. Russell Impagliazzo and Miles Jones Thanks to Janine Tiefenbruck. April 20, 2016

Chapter 3: Paths and Cycles

Zhibin Huang 07. Juni Zufällige Graphen

Geometry Review for Test 3 January 13, 2016

The Geodesic Integral on Medial Graphs

A Theorem of Ramsey- Ramsey s Number

Computing minimum distortion embeddings into a path for bipartite permutation graphs and threshold graphs

RAMSEY NUMBERS IN SIERPINSKI TRIANGLE. Vels University, Pallavaram Chennai , Tamil Nadu, INDIA

14 More Graphs: Euler Tours and Hamilton Cycles

UNIT I PART A PART B

Approximation Algorithms

2 Solution of Homework

Graph Algorithms. Tours in Graphs. Graph Algorithms

The University of Sydney MATH2969/2069. Graph Theory Tutorial 2 (Week 9) 2008

Math 55 - Spring Lecture notes # 14 - March 9 (Tuesday)

Genus Ranges of 4-Regular Rigid Vertex Graphs

TWO-PLAYER ENVY-FREE MULTI-CAKE DIVISION

Partha Sarathi Mandal

Transcription:

On the Parikh-de-Bruijn grid Péter Burcsi Zsuzsanna Lipták W. F. Smyth ELTE Budapest (Hungary), U of Verona (Italy), McMaster U (Canada) & Murdoch U (Australia) Words & Complexity 2018 Lyon, 19-23 Feb. 2018

Alphabet Σ: finite, ordered, constant size σ Given string s, the Parikh vector of s pv(s) is vector of multiplicities of characters Given a Pv p, its order is the sum of its entries = length of a string with Pv p Ex. s = aabaccba over Σ = {a, b, c}, then pv(s) = (4, 2, 2), order 8. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 2 / 32

Alphabet Σ: finite, ordered, constant size σ Given string s, the Parikh vector of s pv(s) is vector of multiplicities of characters Given a Pv p, its order is the sum of its entries = length of a string with Pv p Ex. s = aabaccba over Σ = {a, b, c}, then pv(s) = (4, 2, 2), order 8. Two strings over the same alphabet are Parikh equivalent (a.k.a. abelian equivalent) if they have the same Parikh vector (i.e. if they are permutations of one another) Ex. aaaabbcc and aabcaabc are both Parikh equivalent to s. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 2 / 32

Alphabet Σ: finite, ordered, constant size σ Given string s, the Parikh vector of s pv(s) is vector of multiplicities of characters Given a Pv p, its order is the sum of its entries = length of a string with Pv p Ex. s = aabaccba over Σ = {a, b, c}, then pv(s) = (4, 2, 2), order 8. Two strings over the same alphabet are Parikh equivalent (a.k.a. abelian equivalent) if they have the same Parikh vector (i.e. if they are permutations of one another) Ex. aaaabbcc and aabcaabc are both Parikh equivalent to s. In Abelian stringology, equality is replaced by Parikh equivalence. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 2 / 32

Abelian stringology In Abelian stringology, equality is replaced by Parikh equivalence. Jumbled Pattern Matching abelian borders abelian periods abelian squares, repetitions, runs abelian pattern avoidance abelian reconstruction abelian problems on run-length encoded strings abelian complexity... Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 3 / 32

Abelian stringology In this talk, we introduce a new tool for attacking abelian problems. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 4 / 32

Abelian stringology In this talk, we introduce a new tool for attacking abelian problems. But first: What s so different about abelian problems? Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 4 / 32

An example problem Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 5 / 32

Parikh-de-Bruijn strings Recall: A de Bruijn sequence of order k over alphabet Σ is a string over Σ which contains every u Σ k exactly once as a substring. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 6 / 32

Parikh-de-Bruijn strings Recall: A de Bruijn sequence of order k over alphabet Σ is a string over Σ which contains every u Σ k exactly once as a substring. Ex. Σ = {a, b}, order 2: aabba, order 3: aaababbbaa Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 6 / 32

Parikh-de-Bruijn strings Recall: A de Bruijn sequence of order k over alphabet Σ is a string over Σ which contains every u Σ k exactly once as a substring. Ex. Σ = {a, b}, order 2: aabba, order 3: aaababbbaa We define the abelian analogue: Def. A Parikh-de-Bruijn string of order k (a (k, σ)-pdb-string) is a string s over an alphabet of size σ s.t. p Parikh vector of order k!(i, j) s.t. pv(s i s j ) = p Ex. aabbcca is a ( k 2, σ 3)-PdB-string Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 6 / 32

Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. correspond to Hamiltonian paths in the de Bruijn graph of order k Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. correspond to Hamiltonian paths in the de Bruijn graph of order k and to Euler-paths in the de Bruijn graph of order k 1 Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

bab Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. correspond to Hamiltonian paths in the de Bruijn graph of order k and to Euler-paths in the de Bruijn graph of order k 1 aab ab abb aaa aa aba bb bbb baa ba bba Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

bab Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. correspond to Hamiltonian paths in the de Bruijn graph of order k and to Euler-paths in the de Bruijn graph of order k 1 aab ab abb aaa aa aba bb bbb baa ba bba order 2: aabba Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

bab Parikh-de-Bruijn strings Classical case: De Bruijn sequences exist for every Σ and k. correspond to Hamiltonian paths in the de Bruijn graph of order k and to Euler-paths in the de Bruijn graph of order k 1 aab ab abb aaa aa aba bb bbb baa ba bba order 2: aabba order 3: aaababbbaa Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 7 / 32

Parikh-de-Bruijn strings Abelian case: aabbcca is a ( k 2, σ 3)-PdB-string Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 8 / 32

Parikh-de-Bruijn strings Abelian case: aabbcca is a ( k 2, σ 3)-PdB-string abbbcccaaabc is a (3, 3)-PdB-string Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 8 / 32

Parikh-de-Bruijn strings Abelian case: aabbcca is a ( 2, k 3)-PdB-string σ abbbcccaaabc is a (3, 3)-PdB-string but no (4, 3)-PdB-string exists Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 8 / 32

Parikh-de-Bruijn strings Abelian case: aabbcca is a ( 2, k 3)-PdB-string σ abbbcccaaabc is a (3, 3)-PdB-string but no (4, 3)-PdB-string exists and no (2, 4)-PdB-string exists Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 8 / 32

The Parikh-de-Bruijn grid Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 9 / 32

bab Recall: de Bruijn graph db k = (V, E), where V = Σ k and (xu, uy) E for all x, y Σ and u Σ k 1 N.B. edges ˆ= (k + 1)-length strings aab ab abb aaa aa aba bb bbb edges ˆ= one-character shifts baa ba bba Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 10 / 32

bab Recall: de Bruijn graph db k = (V, E), where V = Σ k and (xu, uy) E for all x, y Σ and u Σ k 1 N.B. edges ˆ= (k + 1)-length strings aab ab abb aaa aa aba bb bbb edges ˆ= one-character shifts baa ba bba A straightforward generalization to Pv s gives: 21 12 21 12 30 20 11 02 03 21 12 Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 10 / 32

bab Recall: de Bruijn graph db k = (V, E), where V = Σ k and (xu, uy) E for all x, y Σ and u Σ k 1 N.B. edges ˆ= (k + 1)-length strings aab ab abb aaa aa aba bb bbb edges ˆ= one-character shifts baa ba bba A straightforward generalization to Pv s gives: 21 12 21 12 30 20 11 02 03 21 12 edges ˆ= one-character shifts, Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 10 / 32

bab Recall: de Bruijn graph db k = (V, E), where V = Σ k and (xu, uy) E for all x, y Σ and u Σ k 1 N.B. edges ˆ= (k + 1)-length strings aab ab abb aaa aa aba bb bbb edges ˆ= one-character shifts baa ba bba A straightforward generalization to Pv s gives: 21 12 21 12 30 20 11 02 03 edges ˆ= one-character shifts, undirected edges Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 10 / 32

bab Recall: de Bruijn graph db k = (V, E), where V = Σ k and (xu, uy) E for all x, y Σ and u Σ k 1 N.B. edges ˆ= (k + 1)-length strings aab ab abb aaa aa aba bb bbb edges ˆ= one-character shifts baa ba bba A straightforward generalization to Pv s gives: NO: edges ˆ= (k + 1)-order Pv s! 21 12 21 12 30 20 11 02 03 edges ˆ= one-character shifts, undirected edges Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 10 / 32

Let s look at another example: Here, σ = 3, k = 2. 201 102 ca 300 201 101 102 003 ab bc 200 111 111 002 aa bb cc 210 012 ba cb 120 110 111 011 021 ac 210 120 020 021 012 030 In the abelian version, several edges have the same label (i.e. here: 3-order Pv). Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 11 / 32

Turns out the right way to generalize de Bruijn graphs is the Parikh-de-Bruijn grid: 201 102 300 201 101 102 003 200 111 111 002 210 012 120 110 111 011 021 210 120 020 021 012 030 Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 12 / 32

For σ = 4, k = 5, the Parikh-de-Bruijn grid looks like this: Zs. Lipta k, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 13 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s pq E iff exist x, y Σ s.t. p = q x + y (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s pq E iff exist x, y Σ s.t. p = q x + y undirected edges (or: bidirectional edges) (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s pq E iff exist x, y Σ s.t. p = q x + y undirected edges (or: bidirectional edges) loops at every node, different one for each non-zero character (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s pq E iff exist x, y Σ s.t. p = q x + y undirected edges (or: bidirectional edges) loops at every node, different one for each non-zero character (k + 1)- and (k 1)-order Pv s ˆ= (σ 1)-simplices (triangles for σ = 3, tetrahedra for σ = 4 etc.) (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid PdB-grid: V = k-order Pv s pq E iff exist x, y Σ s.t. p = q x + y undirected edges (or: bidirectional edges) loops at every node, different one for each non-zero character (k + 1)- and (k 1)-order Pv s ˆ= (σ 1)-simplices (triangles for σ = 3, tetrahedra for σ = 4 etc.) (4, 3)-grid; loops not included in figure Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 14 / 32

The Parikh-de-Bruijn grid The (4, 3)-PdB-grid The (4, 4)-PdB-grid vertices: k-order Pv s (vertices), downward triangles/tetrahedra: (k + 1)-order Pv s, (upward triangles/tetrahedra: (k 1)-order Pv s. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 15 / 32

point(l3, color = 'orange', size=9) The Parikh-de-Bruijn grid The diagonal section of the integer grid with the hyperplanes H k (green), H k+1 (blue), and H k 1 (yellow), for k = 6 and σ = 3. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 16 / 32

Back to the example problem Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 17 / 32

More differences Classical case: (db k ) One-to-one correspondence: walks and strings. Abelian case: Every string corresponds to a walk in the PdB-grid, but not every walk corresponds to a string. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 18 / 32

More differences Classical case: (db k ) One-to-one correspondence: walks and strings. Abelian case: Every string corresponds to a walk in the PdB-grid, but not every walk corresponds to a string. Classical case: De Bruijn sequences exist for every k and σ. Abelian case: PdB-strings do not exist for every k and σ. (N.B. Not all PdB-strings come from circular strings = universal cycles!) Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 18 / 32

Back to Parikh-de-Bruijn strings Theorem 1 No (k, 3)-PdB strings exist for k 4. Theorem 2 A (2, σ)-pdb string exists if and only if σ is odd. Theorem 3 A (3, σ)-pdb string exists if and only if σ = 3 or σ not a multiple of 3. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 19 / 32

Theorem 1 No (k, 3)-PdB strings exist for k 4. Lemma 1 If the walk induced by string w does not use any loops, then for all i: w i w i+k. Proof Otherwise we would have two consecutive occurrences of the same Pv p, thus using a loop at p. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 20 / 32

Theorem 1: No (k, 3)-PdB strings exists for k 4. (Proof uses Lemma 1.) p x = (k, 0, 0) p y k = (, 0, ) p y+k = (, 0, ) p y = (0, k, 0) p z = (0, 0, k) p x+k = (0,, ) w = aaa }{{ a} bbb }{{ b} ccc }{{ c} k k k position x position y position z Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 21 / 32

Example 2: Covering strings Classical case: If s is a (classical) de Bruijn sequence of order k, then it also contains all (k 1)-length strings as substrings. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 22 / 32

Example 2: Covering strings Classical case: If s is a (classical) de Bruijn sequence of order k, then it also contains all (k 1)-length strings as substrings. Abelian case: not always true, e.g. aaaaabbbbbcaaaadbbbcccccdddddaaaccdbcbaccaccddbddbadacddbbbb is a (5, 4)-PdB-string but is not (4, 4)-covering: no substring with Pv (1, 1, 1, 1). Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 22 / 32

Example 2: Covering strings Classical case: If s is a (classical) de Bruijn sequence of order k, then it also contains all (k 1)-length strings as substrings. Abelian case: not always true, e.g. aaaaabbbbbcaaaadbbbcccccdddddaaaccdbcbaccaccddbddbadacddbbbb is a (5, 4)-PdB-string but is not (4, 4)-covering: no substring with Pv (1, 1, 1, 1). Theorem 4 For every σ 3 and k 4, there exist (k, σ)-covering strings which are not (k 1, σ)-covering. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 22 / 32

Experimental results k σ string length (excess) 2 3 aabbcca 7 (0) 3 3 abbbcccaaabc 12 (0) 4 3 aaaabbbbccccaacabcb 19 (1) 5 3 aaaaabbbacccccbbbbbaacaaccb 27 (2) 6 3 aaaabccccccaaaaaabbbbbbcccbbcabbaca 35 (2) 7 3 aabbbccbbcccabacaaabcbbbbbbbaaaaaaacccccccba 44 (2) 2 4 aabbcadbccdd 12 (1) 3 4 aaabbbcaadbdbccadddccc 22 (0) 4 4 aabbbbcaacadbddbccacddddaaaabdbbccccdd 38 (0) 5 4 aaaaabbbbbcaaaadbbbcccccdddddaaaccdbcbaccaccddbddbadacddbbbb 60 (0) 2 5 aabbcadbeccddeea 16 (0) 3 5 aaabbbcaadbbeaccbdddcccebededadceeeaa 37 (0) 4 5 aaaabbbbcaaadbbbeaaccbbddaaeaebcccadbeeeadddcccceeeedddd... 73 (0) Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 23 / 32

Conclusion and open problems new tool for modeling and solving abelian problems find good characterization for walks which correspond to strings many open problems on PdB- and covering strings, e.g. For which σ and k do PdB-strings exist? (We answered this question only for some special cases.) What is the length of a shortest covering string when no PdB-string exists, e.g. k = 3, σ = 6? What is the minimum proportion of (k 1)-order Pv s covered by a k-covering string? apply PdB-grid to other abelian problems paper on Arxiv Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 24 / 32

Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 25 / 32

Appendix Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 26 / 32

example (marked in blue, resp. brown). If the edge connects a vertex p with itself, then the induced (k + 1)- and (k 1)-order Pv s are given by triangles at the opposite direction with respect to p. This is the case in our example in the second (red) and third The (green) Parikh-de-Bruijn steps. grid k = 4, σ = 3 220 310 301 311 210 a c b 201 211 202 b c 221 a 212 111 121 112 (k +1) a 3 3 2 2 b 1 1 2 2 c 1 1 1 1 aabacabb k a 32221 b 11112 c 01111 (k 1) a 2 1 2 1 b 1 1 0 1 c 0 1 1 1 Walk corresponding to aabacabb. (k + 1)- and (k 1)-order Pv s: triangles Fig. 3. Left: The part of the (4, 3)-PdB-grid being visited by the string aabacabb. We marked incidenteach to the stepedges of thetraversed walk in aby di erent the walk. colour The (blue-red-green-brown). (k + 1) and (k 1)-order Right: Pv s We list for loops vertically (same thek-order corresponding Pv twice) (k + lie1)-order in opposite Pv s (top), direction, k-order hence Pv s the (center), name bow. and (k 1)-order Pv s, each at its beginning position. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 27 / 32

Parikh-de-Bruijn and covering strings Theorem 2 A (2, σ)-pdb string exists if and only if σ is odd. Proof Pv s of order 2 either form (0...0, i 2, 0..0) or (0...0, i 1, 0...0, j 1, 0..0). So we need exactly one substring aa for all a Σ, and either ab or ba for all a, b Σ. Consider the undirected complete graph G = (V, E) with loops where V = Σ (N.B.: not the PdB-grid!): an Euler path exists iff σ is odd. 20000 11000 a 10001 02000 b e 00002 01100 00011 c 00110 00200 00020 Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 28 / 32 d

Next best thing: covering strings. Def. Ex. Example 2: Covering strings We call a string s (k, σ)-covering if p Parikh vector of order k (i, j) s.t. pv(s i s j ) = p (There is at least one substring in s which has Pv p.) The excess of s is: s ( ) σ+k 1 k + k 1. }{{} length of a PdB-string aaaabbbbccccaacabcb is a shortest (4, 3)-covering string, with excess 1. aabbcadbccdd is a shortest (2, 4)-covering string, with excess 1. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 29 / 32

Example 2: Covering strings Classical case: If s is a (classical) de Bruijn sequence of order k, then it also contains all (k 1)-length strings as substrings. For PdB-strings, this is not always true, e.g. aaaaabbbbbcaaaadbbbcccccdddddaaaccdbcbaccaccddbddbadacddbbbb is a (5, 4)-PdB-string but is not (4, 4)-covering: no substring with Pv (1, 1, 1, 1). Theorem 4 For every σ 3 and k 4, there exist (k, σ)-covering strings which are not (k 1, σ)-covering. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 30 / 32

The Parikh-de-Bruijn grid Lemma 2 A set of k-order Parikh vectors is realizable if and only if the induced subgraph in the k-pdb-grid is connected. realizable = exists string with exactly these k-order sub-pv s. Proof sketch : clear. : Use loops until undesired character x exits, replace by new character y. Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 31 / 32

Parikh-de-Bruijn and covering strings Theorem 4 For every σ 3 and k 4, there exist (k, σ)-covering strings which are not (k 1, σ)-covering. Proof w = aaaaabbbbbcabbaaacacbbcbccacaccccbccccc General construction: remove (k 1)-order Pv p = (k 3, 1, 1, 0,..., 0) with incident edges and vertices the rest is connected, hence a string exists (Lemma 2) add vertices of p without traversing edges incident to p can be done by detours from corners of PdB-grid Zs. Lipták, P. Burcsi, W.F. Smyth On the Parikh-de-Bruijn grid Words & Complexity 2018 32 / 32