MuBu for Max/MSP. IMTR IRCAM Centre Pompidou. Norbert Schnell Riccardo Borghesi 20/10/2010

Similar documents
Multimedia Database Systems. Retrieval by Content

Automatic Classification of Audio Data

SmsPerformer: A Real-Time Synthesis Interface for SMS

USING THE SDIF SOUND DESCRIPTION INTERCHANGE FORMAT FOR AUDIO FEATURES

Introduction. What is Max?

Chronotron - Change log

The Visual SDIF interface in PWGL

A voice interface for sound generators: adaptive and automatic mapping of gestures to sound

Kristian Nymoen Musical Gestures Group, Department of Musicology, University of Oslo

Implementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi

Figure 1. Overview of Musink's user interface

Hello, I am from the State University of Library Studies and Information Technologies, Bulgaria

Graduate School for Integrative Sciences & Engineering 2 Arts and Creativity Laboratory, Interactive and Digital Media Institute

MP3 Speech and Speaker Recognition with Nearest Neighbor. ECE417 Multimedia Signal Processing Fall 2017

Modeling the Spectral Envelope of Musical Instruments

Qtractor. An Audio/MIDI multi-track sequencer. Rui Nuno Capela rncbc.org.

REAL-TIME CORPUS-BASED CONCATENATIVE SYNTHESIS WITH CATART

Getting Started with Crazy Talk 6

SOUND BROWSER SYSTEM PROTOTYPES SOUND FILE WINDOW. Click to update the list when files or directories have been added or removed by other programs.

Editing & Color Grading 101 in DaVinci Resolve 15

USING GARAGE BAND. Creating a new project

What s New in Finale 2008a for Windows

LabROSA Research Overview

PiPo, A Plugin Interface for Afferent Data Stream Processing Modules

CHROMA AND MFCC BASED PATTERN RECOGNITION IN AUDIO FILES UTILIZING HIDDEN MARKOV MODELS AND DYNAMIC PROGRAMMING. Alexander Wankhammer Peter Sciri

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

OpenMusic - Linux-port

PT210M: Contents. Preface... xi Lesson 1. Preparing The Session... 1

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

A TIMBRE ANALYSIS AND CLASSIFICATION TOOLKIT FOR PURE DATA

Meta-Gest. A flexible and adaptable performance environment created in Max/MSP for real-time audio processing.

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

Quick Guide to Getting Started with:

Elixir Ad-hoc Report. Release Elixir Technology Pte Ltd

Experiment 3. Getting Start with Simulink

Speech Synthesis. Simon King University of Edinburgh

Principles of Audio Coding

Installing energyxt2.5. Mac. Double-click energyxt2.5 disk image file. Drag the energyxt2.5 folder to your "Applica- tions" folder / your desktop.

FROM 4D WRITE TO 4D WRITE PRO INTRODUCTION. Presented by: Achim W. Peschke

Release notes for version 3.1

twisted wave twisted wave [an introduction]

Sequencing Traditional Lights in ShowTime 5 Originally presented at Christmas Expo 2017

over Multi Label Images

MOSIEVIUS: FEATURE DRIVEN INTERACTIVE AUDIO MOSAICING. Ari Lazier, Perry Cook

WaveLab Pro 9.5 WaveLab Elements 9.5

Sakai Forums.

Spectral modeling of musical sounds

utworks MidSequer Version , 04 July 2016 Documentation updated 2 October 2016 MidSequer by utworks is a 16-track MIDI sequencer for Android.

Music Technology Configuring MIDI and /audio

9/8/2016. Characteristics of multimedia Various media types

Voice Threads. How to make a Voice Thread. Click Create from the top left corner. Click Upload. Click From My Computer

User Manual Version

EasySites Quickstart Guide. Table Of Contents

WaveLab Pro 9.5 WaveLab Elements 9.5

Editing and Effects in DaVinci Resolve 12.5

Portable Music Studio

Implementing Symmetric Multiprocessing in LispWorks

Sequence Clip Reporter

CHAPTER 8 Multimedia Information Retrieval

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.

ACID PRO 4.0. An overview of new features By Gary Rebholz and Michael Bryant

Sending image(s) to report Click Send Snapshot on any screen in Dolphin3D and choose the Send to Report option

4D Write. User Reference Mac OS and Windows Versions. 4D Write D SA/4D, Inc. All Rights reserved.

Julius rev LEE Akinobu, and Julius Development Team 2007/12/19. 1 Introduction 2

ELEC Dr Reji Mathew Electrical Engineering UNSW

GPU-based Distributed Behavior Models with CUDA

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

International Music Information Retrieval Systems Evaluation Laboratory (IMIRSEL): Introducing D2K and M2K

Final Cut Pro X (FCP X) Chapter Notes

Maestro Frankenstein 0.4 beta

Please note that the following issues have been fixed in the 7.1 update.

Elixir Ad-hoc Report. Release Elixir Technology Pte Ltd

ROW.mp3. Colin Raffel, Jieun Oh, Isaac Wang Music 422 Final Project 3/12/2010

Slow Motion Video. MediaTek Technology White Paper

Eventrons: A Safe Programming Construct for High-Frequency Hard Real-Time Applications

Engineering Data Management (EDM) Software

dyn dynamic object management

New Features in Primavera P6 EPPM 17

Norbert Schnell, Victor Saiz, Karim Barkati, Samuel Goldszmidt. HAL Id: hal

Cubase 9. Version History. February 2018 Steinberg Media Technologies GmbH

OpenEx Supplement for Synapse Users

Adobe Audition CC Detailed Video Training Outline Chapter 1 - Overview

2017 HSC Information and Digital Technology Web and Software Applications Marking Guidelines

ASSESSMENT & FEEDBACK

Administrator Quick Guide

Breeze - Segmentation guide

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

ELAN Linguistic Annotator

Disclaimer: this pdf is just the content of the sliders with no format at all. I'm using the moin-wiki slides plugin for the real slides.

How You Use the Timeline

User Manual. for ArcTrax UWP version

Rhythmic constant pitch time stretching for digital audio

C++ Memory Model Tutorial

WaveLab 8 WaveLab Elements 8

MPEG-4 AUTHORING TOOL FOR THE COMPOSITION OF 3D AUDIOVISUAL SCENES

QuickStart Guide MindManager 7 MAC

Video Summarization Using MPEG-7 Motion Activity and Audio Descriptors

by AssistiveWare Quick Start

Key Features. Let s Get Started!

Digital Audio Basics

Transcription:

MuBu for Max/MSP IMTR IRCAM Centre Pompidou Norbert Schnell Riccardo Borghesi 20/10/2010

Motivation Create a solid and open framework for the experimentation with recorded data streams of multiple representations in Max/MSP - audio samples - audio descriptors - gesture and motion capture data - spectral audio representations - symbolic representations - segmentation and annotations... what else?

Converging Applications Analysis/re-synthesis (sound, music and speech) Corpus based granular synthesis (textures) Following and recognition (gestures/sounds and scores) Computer aided improvisation...

Context Créations IRCAM & Forum Recherche musicale Prototyping of interactive real-time applications

Relaxed Real-Time Analysis/Synthesis Real-time analysis, recognition and classification - structured sound and gesture data - emerging symbolics Content based audio processing and rendering - synthesis based on offline or real-time analysis - post-sampling and re-performance

MuBu Container Functionalities Store heterogeneous synchronous data streams File import/export Visualization and editing Optimized congruent access to shared data Handling of dynamic configurations (data and references)

A real-time/offline analysis file import graphical editing F MuBu Container G Data Models S real-time synthesis

MuBu Container Design Simple and easy to maintain Minimizing dependencies on other libraries Thread-safe and lock-free congruent data access

Container Data Structure Array of buffers being arrays of tracks track 1 track 2 track 3 buffer 1 buffer 2 buffer 3 buffer B... track T...

Track Data Structure Array of 2-dim matrix data (optional time-tags and matrix rows) Array of (non-matrix) extra data (currently labels only) Track meta-data parameters time-tags (64-bit float)... matrix column names N x M matrix data (32-bit float)... info dictionary variable number of matrix rows (int) extra data (symbols or other)......

Modularity and Thread-Safeness Distinction of configuration and real-time access - thread-locked access to container (using pthread r/w locks) - lock-free access to track data (using compare-and-swap) Track implemented as immutable data structure fixed capacity re-configuration by copying data into a new track Container manages dynamic track configuration - locked callbacks to modules that refer to a container - simple garbage collector (by reference count)

File Format Support SDIF import and export* Plain text file import and export MIDI standard file import MusicXML import Save/load data with Max patcher (for small amounts of data) *... what else? * features to be implemented

Visualization and Editing Based on the Juce framework Abstract data interface (also implemented for FTM data structures) Set of editor/visualization components - waveform (single or multi-channel) - break-point function (single or also multi-channel) - sonogram (of sampled or time-tagged data) - markers (with duration and offset) - textual tables/matrices - piano roll (or simplified staffs) - traces (waveform or bpf with color and thickness) Control components (scroll bar, rulers, toolbar, tabs, buffer chooser, etc.)

Editor Control Components (1/2)

Editor Control Components (2/2)

Generic Data Access Generic data container mubu... basic multi-buffer container imubu... container with graphical user interface Reference to a specific track mubu.track... optimized access to track data * * Generic record and play externals JavaScript interface

Analysis FTM & Co prototype patch - loundness - power spectrum and spectral moments - MEL bands & MFCC coefs - YIN - loudness based segmentation * * PSOLA markers Partials and harmonics * SuperVP

Synthesis ZsaZsa overlap-add engine - concatenative synthesis - granular synthesis * * PSOLA FFT-1 based additive synthesis * SuperVP spectral/hybrid concatenative synthesis

Data Models KD-tree based k-nearest neighbors * N-Grams? * Factor oracle? *... what else?

Current Release Beta version of first set of externals - mubu... container - imubu... container with graphical interface - mubu.track... optimized access to track data - mubu.knn... k-nearest neighbors - mubu.concat~... concatenative synthesis - mubu.granular~... granular synthesis

Future Developments Finalize SDIF export Integrate analysis and segmentation (real-time/offline) Integrate spectral analysis/synthesis (real-time/offline) - PSOLA (ZsaZsa synthesis) - additive (FFT-1 synthesis) - phase vocoder (SuperVP and simple) Implement further data models

Links MuBu on the IMTR web pages http://imtr.ircam.fr/imtr/mubu MuBu beta-test mailing list (subscription required) http://listes.ircam.fr/wws/info/mubu-beta