DANCEREPRODUCER: AN AUTOMATIC MASHUP MUSIC VIDEO GENERATION SYSTEM BY REUSING DANCE VIDEO CLIPS ON THE WEB

Size: px
Start display at page:

Download "DANCEREPRODUCER: AN AUTOMATIC MASHUP MUSIC VIDEO GENERATION SYSTEM BY REUSING DANCE VIDEO CLIPS ON THE WEB"

Transcription

1 DANCEREPRODUCER: AN AUTOMATIC MASHUP MUSIC VIDEO GENERATION SYSTEM BY REUSING DANCE VIDEO CLIPS ON THE WEB Tomoyasu Nakano 1 Sora Murofushi 3 Masataka Goto 2 Shigeo Morishima 3 1 t.nakano[at]aist.go.jp 2 m.goto[at]aist.go.jp 3 shigeo[at]waseda.jp ABSTRACT DanceReProducer 1. INTRODUCTION MAD movies mashup videos 1st generation (primary or original) content Copyright: 2011 Tomoyasu Nakano et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Original content (1st generation) Mashup music videos (User-generated video clips) 2nd generation new new 3rd generation new Nth generation... Figure 1 2nd generation (secondary or derivative) content DanceReProducer

2 e.g. N 2. RELATED WORK 3. SYSTEM DESIGN 3.1 Criteria of natural/skillful relationships between an image sequence and music Local relationships Rhythm e.g. Automatic mashup music video generation system Web Segment N Output Input Estimated music structure Stretch and Concatenate Chorus A A B B B A A B B B C C C Figure 2 DanceReProducer Impression Context relationships Music structure e.g. Temporal continuity 3.2 Image sequence generation Automatic image sequence generation

3 Figure 3 visual unit Rhythmic synchronization Impression synchronization Music structure Temporal continuity Interface Figure 4 Interactive re-selection of a generated image sequence e.g. Jumping to the beginning of sections 4. INTERNAL MECHANISM OF DANCEREPRODUCER

4 Database construction Web A Gather videos Resampling (30 fps, 44.1kHz) B Extract frame feature 30 fps 44.1kHz 1 bar Beat tracking 1 frame C Extract bar-level feature 1 bar Resampling (16 points) D DCT (3rd order with DC) 1 feature vector ( = + ) E Construct Database Video generation Input music F Extract bar-level feature DCT input tempo G Reconstruct H Train mapping model I Select visual unit Output Database Viterbi search under the Euclidean distance < 20% > 20% PCA weighting calculation clustering mapping linear regression A A A A B B Music structure Stretch and concatenate of the unit Figure Database construction frame-time frame features bar-level features Beat tracking e.g Frame feature extraction (Music) Frame feature extraction (Image sequence)

5 Bar-level feature extraction bar-level feature e.g. 4.2 Video generation 20% N N 95% Linear regression models for multiple clusters k Image sequence selection under the criteria for natural/skillful relationships d(n, k m ) n(1 n N) m k d(n, k m ) ch(n) =1 c l (n, k m ) = ch(k m )=1, p c d(n, k m ) c a (n, k m ) =min τ,μ c l (n, k m ) (μ = m κ = k 1) +c a (n 1,κ μ ) st(n) st(n 1). p t c l (n, k m ) +c a (n 1,κ μ ) ch(n) n st(n) p c p t N

6 d min d min =argmin c a (N,k m ). k,m 4.3 Model training weighted according to view counts ω V c w = max (α log 10 (V c ) β,0). α β , 000 ω =1 100, 000 ω =3 ω ω =2 5. IMPLEMENTATION OF DANCEREPRODUCER 5.1 Dataset MikuMikuDance (MMD) NicoNicoDouga 5.2 Trial usage and introspective comments 6. CONCLUSION DanceReProducer n

7 Acknowledgments IPSJ SIG Technical Reports 2007-MUS-069 IEEE Trans. on Speech and Audio Processing 7. REFERENCES Journal of Information Processing Society of Japan Proc. of the 2008 Computers in Music Modeling and Retrieval Conference Journal of New Music Research IPSJ Transactions on Computer Vision and Image Media Proc. of the 12th annual ACM international conference on Multimedia Proc. of the 32nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2007) Proc. of the tenth ACM international conference on Multimedia Proc. of the 12th annual ACM international conference on Multimedia IEEE Trans. on Audio, Speech, and Language Processing IEEE Trans. on Circuits and Systems for Video Technology

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Takuichi Nishimura Real World Computing Partnership / National Institute of Advanced

More information

VRMixer: Mixing Video and Real World via Video Segmentation

VRMixer: Mixing Video and Real World via Video Segmentation 情報処理学会インタラクション 2015 IPSJ Interaction 2015 C73 2015/3/7 VRMixer 1,a) 2,3 3 1,3 VRMixer 3 2.5 3 2.5 Video Reality VRMixer: Mixing Video and Real World via Video Segmentation Hirai Tatsunori 1,a) Nakamura

More information

Rhythmic constant pitch time stretching for digital audio

Rhythmic constant pitch time stretching for digital audio Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,

More information

Scalable Video Coding

Scalable Video Coding Introduction to Multimedia Computing Scalable Video Coding 1 Topics Video On Demand Requirements Video Transcoding Scalable Video Coding Spatial Scalability Temporal Scalability Signal to Noise Scalability

More information

Frame Wise Video Editing based on Audio-Visual Continuity

Frame Wise Video Editing based on Audio-Visual Continuity Frame Wise Video Editing based on Audio-Visual Continuity Tatsunori Hirai Abstract In this paper, we describe a method for freely changing the length of a video clip, leaving its content almost unchanged,

More information

MULTIPLE HYPOTHESES AT MULTIPLE SCALES FOR AUDIO NOVELTY COMPUTATION WITHIN MUSIC. Florian Kaiser and Geoffroy Peeters

MULTIPLE HYPOTHESES AT MULTIPLE SCALES FOR AUDIO NOVELTY COMPUTATION WITHIN MUSIC. Florian Kaiser and Geoffroy Peeters MULTIPLE HYPOTHESES AT MULTIPLE SCALES FOR AUDIO NOVELTY COMPUTATION WITHIN MUSIC Florian Kaiser and Geoffroy Peeters STMS IRCAM-CNRS-UPMC 1 Place Igor Stravinsky 75004 Paris florian.kaiser@ircam.fr ABSTRACT

More information

Improved Integral Histogram Algorithm. for Big Sized Images in CUDA Environment

Improved Integral Histogram Algorithm. for Big Sized Images in CUDA Environment Contemporary Engineering Sciences, Vol. 7, 2014, no. 24, 1415-1423 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.49174 Improved Integral Histogram Algorithm for Big Sized Images in CUDA

More information

Video shot segmentation using late fusion technique

Video shot segmentation using late fusion technique Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,

More information

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content Masataka Goto Jun Ogata Kazuyoshi Yoshii Hiromasa Fujihara Matthias Mauch Tomoyasu Nakano National

More information

An Introduction to Pattern Recognition

An Introduction to Pattern Recognition An Introduction to Pattern Recognition Speaker : Wei lun Chao Advisor : Prof. Jian-jiun Ding DISP Lab Graduate Institute of Communication Engineering 1 Abstract Not a new research field Wide range included

More information

(JBE Vol. 23, No. 6, November 2018) Detection of Frame Deletion Using Convolutional Neural Network. Abstract

(JBE Vol. 23, No. 6, November 2018) Detection of Frame Deletion Using Convolutional Neural Network. Abstract (JBE Vol. 23, No. 6, November 2018) (Regular Paper) 23 6, 2018 11 (JBE Vol. 23, No. 6, November 2018) https://doi.org/10.5909/jbe.2018.23.6.886 ISSN 2287-9137 (Online) ISSN 1226-7953 (Print) CNN a), a),

More information

Fast Spectral Reflectance Recovery using DLP Projector

Fast Spectral Reflectance Recovery using DLP Projector Fast Spectral Reflectance Recovery using DLP Projector Shuai Han 1, Imari Sato 2, Takahiro Okabe 1, and Yoichi Sato 1 1 Institute of Industrial Science, The University of Tokyo, Japan 2 National Institute

More information

1 Introduction. 3 Data Preprocessing. 2 Literature Review

1 Introduction. 3 Data Preprocessing. 2 Literature Review Rock or not? This sure does. [Category] Audio & Music CS 229 Project Report Anand Venkatesan(anand95), Arjun Parthipan(arjun777), Lakshmi Manoharan(mlakshmi) 1 Introduction Music Genre Classification continues

More information

Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues

Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues 3rd International Symposium on Ambisonics & Spherical Acoustics@Lexington, Kentucky, USA, 2nd June 2011 Evaluation of a new Ambisonic decoder for irregular loudspeaker arrays using interaural cues J. Treviño

More information

Second author Retain these fake authors in submission to preserve the formatting

Second author Retain these fake authors in submission to preserve the formatting l 1 -GRAPH BASED MUSIC STRUCTURE ANALYSIS First author Affiliation1 author1@ismir.edu Second author Retain these fake authors in submission to preserve the formatting Third author Affiliation3 author3@ismir.edu

More information

A Study on Clustering Method by Self-Organizing Map and Information Criteria

A Study on Clustering Method by Self-Organizing Map and Information Criteria A Study on Clustering Method by Self-Organizing Map and Information Criteria Satoru Kato, Tadashi Horiuchi,andYoshioItoh Matsue College of Technology, 4-4 Nishi-ikuma, Matsue, Shimane 90-88, JAPAN, kato@matsue-ct.ac.jp

More information

Baseball Game Highlight & Event Detection

Baseball Game Highlight & Event Detection Baseball Game Highlight & Event Detection Student: Harry Chao Course Adviser: Winston Hu 1 Outline 1. Goal 2. Previous methods 3. My flowchart 4. My methods 5. Experimental result 6. Conclusion & Future

More information

9/8/2016. Characteristics of multimedia Various media types

9/8/2016. Characteristics of multimedia Various media types Chapter 1 Introduction to Multimedia Networking CLO1: Define fundamentals of multimedia networking Upon completion of this chapter students should be able to define: 1- Multimedia 2- Multimedia types and

More information

A Singing Robot Realized by a Collaboration of VOCALOID and Cybernetic Human HRP-4C

A Singing Robot Realized by a Collaboration of VOCALOID and Cybernetic Human HRP-4C ISCA Archive http://www.isca-speech.org/archive InterSinging 2010 - First Interdisciplinary Workshop on Singing Voice Tokyo, Japan October1-2,2010 A Singing Robot Realized by a Collaboration of VOCALOID

More information

TWO-STEP SEMI-SUPERVISED APPROACH FOR MUSIC STRUCTURAL CLASSIFICATION. Prateek Verma, Yang-Kai Lin, Li-Fan Yu. Stanford University

TWO-STEP SEMI-SUPERVISED APPROACH FOR MUSIC STRUCTURAL CLASSIFICATION. Prateek Verma, Yang-Kai Lin, Li-Fan Yu. Stanford University TWO-STEP SEMI-SUPERVISED APPROACH FOR MUSIC STRUCTURAL CLASSIFICATION Prateek Verma, Yang-Kai Lin, Li-Fan Yu Stanford University ABSTRACT Structural segmentation involves finding hoogeneous sections appearing

More information

Motion Interpretation and Synthesis by ICA

Motion Interpretation and Synthesis by ICA Motion Interpretation and Synthesis by ICA Renqiang Min Department of Computer Science, University of Toronto, 1 King s College Road, Toronto, ON M5S3G4, Canada Abstract. It is known that high-dimensional

More information

Speech Synthesis. Simon King University of Edinburgh

Speech Synthesis. Simon King University of Edinburgh Speech Synthesis Simon King University of Edinburgh Hybrid speech synthesis Partial synthesis Case study: Trajectory Tiling Orientation SPSS (with HMMs or DNNs) flexible, robust to labelling errors but

More information

MUSIC STRUCTURE ANALYSIS BY RIDGE REGRESSION OF BEAT-SYNCHRONOUS AUDIO FEATURES

MUSIC STRUCTURE ANALYSIS BY RIDGE REGRESSION OF BEAT-SYNCHRONOUS AUDIO FEATURES MUSIC STRUCTURE ANALYSIS BY RIDGE REGRESSION OF BEAT-SYNCHRONOUS AUDIO FEATURES Yannis Panagakis and Constantine Kotropoulos Department of Informatics Aristotle University of Thessaloniki Box 451 Thessaloniki,

More information

User Interfaces for Live Programming

User Interfaces for Live Programming User Interfaces for Live Programming Jun Kato https://junkato.jp Researcher, LIVE 2017 Keynote, 10/24/2017 Jun Kato junkato https://junkato.jp Research Topic Computer Science (Human-Computer Interaction,

More information

MUSIC STRUCTURE BOUNDARY DETECTION AND LABELLING BY A DECONVOLUTION OF PATH-ENHANCED SELF-SIMILARITY MATRIX

MUSIC STRUCTURE BOUNDARY DETECTION AND LABELLING BY A DECONVOLUTION OF PATH-ENHANCED SELF-SIMILARITY MATRIX MUSIC STRUCTURE BOUNDARY DETECTION AND LABELLING BY A DECONVOLUTION OF PATH-ENHANCED SELF-SIMILARITY MATRIX Tian Cheng, Jordan B. L. Smith, Masataka Goto National Institute of Advanced Industrial Science

More information

Random Swap algorithm

Random Swap algorithm Random Swap algorithm Pasi Fränti 24.4.2018 Definitions and data Set of N data points: X={x 1, x 2,, x N } Partition of the data: P={p 1, p 2,,p k }, Set of k cluster prototypes (centroids): C={c 1, c

More information

A Novel Approach for Reduction of Huffman Cost Table in Image Compression

A Novel Approach for Reduction of Huffman Cost Table in Image Compression Global Journal of Computer Science and Technology Volume 11 Issue 9 Version 1.0 May 2011 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc. (USA) ISSN: 0975-4172

More information

Convergence Point Adjustment Methods for Minimizing Visual Discomfort Due to a Stereoscopic Camera

Convergence Point Adjustment Methods for Minimizing Visual Discomfort Due to a Stereoscopic Camera J. lnf. Commun. Converg. Eng. 1(4): 46-51, Dec. 014 Regular paper Convergence Point Adjustment Methods for Minimizing Visual Discomfort Due to a Stereoscopic Camera Jong-Soo Ha 1, Dae-Woong Kim, and Dong

More information

AUDIO ANALOGIES: CREATING NEW MUSIC FROM AN EXISTING PERFORMANCE BY CONCATENATIVE SYNTHESIS

AUDIO ANALOGIES: CREATING NEW MUSIC FROM AN EXISTING PERFORMANCE BY CONCATENATIVE SYNTHESIS AUDIO ANALOGIES: CREATING NEW MUSIC FROM AN EXISTING PERFORMANCE BY CONCATENATIVE SYNTHESIS Ian Simon University of Washington Sumit Basu Microsoft Research David Salesin Microsoft Research Maneesh Agrawala

More information

Open Access Self-Growing RBF Neural Network Approach for Semantic Image Retrieval

Open Access Self-Growing RBF Neural Network Approach for Semantic Image Retrieval Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1505-1509 1505 Open Access Self-Growing RBF Neural Networ Approach for Semantic Image Retrieval

More information

mira: musically interactive rendered animation Ning Hu Philo Chua Kevin AuYoung

mira: musically interactive rendered animation Ning Hu Philo Chua Kevin AuYoung mira: musically interactive rendered animation Ning Hu (ninghu@cs.cmu.edu) Philo Chua (pchua@andrew.cmu.edu) Kevin AuYoung (auyoung@cmu.edu) Goal of the project The goal is to build an animated figure

More information

MULTIMEDIA APPLICATIONS BS (1 st or 2 nd Semester)

MULTIMEDIA APPLICATIONS BS (1 st or 2 nd Semester) MULTIMEDIA APPLICATIONS BS000013 (1 st or 2 nd Semester) Grades 9, 10, 11, 12 Prerequisite: None ½ Unit If you are interested in computers and are a creative person, this is where you want to start. Students

More information

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Video Editing. Syllabus

EUROPEAN COMPUTER DRIVING LICENCE. Multimedia Video Editing. Syllabus EUROPEAN COMPUTER DRIVING LICENCE Multimedia Video Editing Syllabus Purpose This document details the syllabus for ECDL Multimedia Module 2 Video Editing. The syllabus describes, through learning outcomes,

More information

Performance Evaluation Metrics and Statistics for Positional Tracker Evaluation

Performance Evaluation Metrics and Statistics for Positional Tracker Evaluation Performance Evaluation Metrics and Statistics for Positional Tracker Evaluation Chris J. Needham and Roger D. Boyle School of Computing, The University of Leeds, Leeds, LS2 9JT, UK {chrisn,roger}@comp.leeds.ac.uk

More information

A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain

A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain Ruili Zhou, Yuesheng Zhu Abstract In this paper, a new robust audio fingerprinting algorithm in MP3 compressed domain is proposed with high

More information

Design of Multichannel AP-DCD Algorithm using Matlab

Design of Multichannel AP-DCD Algorithm using Matlab , October 2-22, 21, San Francisco, USA Design of Multichannel AP-DCD using Matlab Sasmita Deo Abstract This paper presented design of a low complexity multichannel affine projection (AP) algorithm using

More information

Face Recognition from Images with High Pose Variations by Transform Vector Quantization

Face Recognition from Images with High Pose Variations by Transform Vector Quantization Face Recognition from Images with High Pose Variations by Transform Vector Quantization Amitava Das, Manoj Balwani 1, Rahul Thota 1, and Prasanta Ghosh 2 Microsoft Research India. Bangalore, India amitavd@microsoft.com

More information

BMC Evolutionary Biology 2008, 8:89

BMC Evolutionary Biology 2008, 8:89 BMC Evolutionary Biology This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. Selection against

More information

The Automatic Musicologist

The Automatic Musicologist The Automatic Musicologist Douglas Turnbull Department of Computer Science and Engineering University of California, San Diego UCSD AI Seminar April 12, 2004 Based on the paper: Fast Recognition of Musical

More information

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines The Scientific World Journal Volume 2013, Article ID 596724, 6 pages http://dx.doi.org/10.1155/2013/596724 Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines Weizhe

More information

Data Hiding in Video

Data Hiding in Video Data Hiding in Video J. J. Chae and B. S. Manjunath Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 9316-956 Email: chaejj, manj@iplab.ece.ucsb.edu Abstract

More information

Understanding Rule Behavior through Apriori Algorithm over Social Network Data

Understanding Rule Behavior through Apriori Algorithm over Social Network Data Global Journal of Computer Science and Technology Volume 12 Issue 10 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc. (USA) Online ISSN: 0975-4172

More information

PHASE AND LEVEL DIFFERENCE FUSION FOR ROBUST MULTICHANNEL SOURCE SEPARATION

PHASE AND LEVEL DIFFERENCE FUSION FOR ROBUST MULTICHANNEL SOURCE SEPARATION PHASE AND LEVEL DIFFERENCE FUSION FOR ROBUST MULTICHANNEL SOURCE SEPARATION Johannes Traa 1 Minje Kim 2 Paris Smaragdis 1,2,3 1 University of Illinois at Urbana-Champaign, Department of Electrical and

More information

Authoring System for Choreography Using Dance Motion Retrieval and Synthesis

Authoring System for Choreography Using Dance Motion Retrieval and Synthesis Authoring System for Choreography Using Dance Motion Retrieval and Synthesis Ryo Kakitsuka 1 Kosetsu Tsukuda 2 Satoru Fukayama 2 Naoya Iwamoto 1 Masataka Goto 2 Shigeo Morishima 1 1 Waseda University 2

More information

Audio & Music Research at LabROSA

Audio & Music Research at LabROSA Audio & Music Research at LabROSA Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA dpwe@ee.columbia.edu http://labrosa.ee.columbia.edu/

More information

Latent Topic Model Based on Gaussian-LDA for Audio Retrieval

Latent Topic Model Based on Gaussian-LDA for Audio Retrieval Latent Topic Model Based on Gaussian-LDA for Audio Retrieval Pengfei Hu, Wenju Liu, Wei Jiang, and Zhanlei Yang National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy

More information

ALIGNED HIERARCHIES: A MULTI-SCALE STRUCTURE-BASED REPRESENTATION FOR MUSIC-BASED DATA STREAMS

ALIGNED HIERARCHIES: A MULTI-SCALE STRUCTURE-BASED REPRESENTATION FOR MUSIC-BASED DATA STREAMS ALIGNED HIERARCHIES: A MULTI-SCALE STRUCTURE-BASED REPRESENTATION FOR MUSIC-BASED DATA STREAMS Katherine M. Kinnaird Department of Mathematics, Statistics, and Computer Science Macalester College, Saint

More information

Audacity tutorial. 1. Look for the Audacity icon on your computer desktop. 2. Open the program. You get the basic screen.

Audacity tutorial. 1. Look for the Audacity icon on your computer desktop. 2. Open the program. You get the basic screen. Audacity tutorial What does Audacity do? It helps you record and edit audio files. You can record a speech through a microphone into your computer, into the Audacity program, then fix up the bits that

More information

CHAPTER 8 Multimedia Information Retrieval

CHAPTER 8 Multimedia Information Retrieval CHAPTER 8 Multimedia Information Retrieval Introduction Text has been the predominant medium for the communication of information. With the availability of better computing capabilities such as availability

More information

08 Sound. Multimedia Systems. Nature of Sound, Store Audio, Sound Editing, MIDI

08 Sound. Multimedia Systems. Nature of Sound, Store Audio, Sound Editing, MIDI Multimedia Systems 08 Sound Nature of Sound, Store Audio, Sound Editing, MIDI Imran Ihsan Assistant Professor, Department of Computer Science Air University, Islamabad, Pakistan www.imranihsan.com Lectures

More information

ScienceDirect. Reducing Semantic Gap in Video Retrieval with Fusion: A survey

ScienceDirect. Reducing Semantic Gap in Video Retrieval with Fusion: A survey Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 50 (2015 ) 496 502 Reducing Semantic Gap in Video Retrieval with Fusion: A survey D.Sudha a, J.Priyadarshini b * a School

More information

Abstract. 1 Introduction. Information and Communication Engineering July 2nd Fast Spectral Reflectance Recovery using DLP Projector

Abstract. 1 Introduction. Information and Communication Engineering July 2nd Fast Spectral Reflectance Recovery using DLP Projector Information and Communication Engineering July 2nd 2010 Fast Spectral Reflectance Recovery using DLP Projector Sato Laboratory D2, 48-097403, Shuai Han Abstract We present a practical approach to fast

More information

Automatic Classification of Audio Data

Automatic Classification of Audio Data Automatic Classification of Audio Data Carlos H. C. Lopes, Jaime D. Valle Jr. & Alessandro L. Koerich IEEE International Conference on Systems, Man and Cybernetics The Hague, The Netherlands October 2004

More information

EAR RECOGNITION AND OCCLUSION

EAR RECOGNITION AND OCCLUSION EAR RECOGNITION AND OCCLUSION B. S. El-Desouky 1, M. El-Kady 2, M. Z. Rashad 3, Mahmoud M. Eid 4 1 Mathematics Department, Faculty of Science, Mansoura University, Egypt b_desouky@yahoo.com 2 Mathematics

More information

Research Article Data Visualization Using Rational Trigonometric Spline

Research Article Data Visualization Using Rational Trigonometric Spline Applied Mathematics Volume Article ID 97 pages http://dx.doi.org/.//97 Research Article Data Visualization Using Rational Trigonometric Spline Uzma Bashir and Jamaludin Md. Ali School of Mathematical Sciences

More information

D 5.2 Time stretching modules with synchronized multimedia prototype

D 5.2 Time stretching modules with synchronized multimedia prototype The time-stretching factor is sent to the audio processing engine in order to change the analysis hop size, and the audio output frame timestamp is calculated accordingly. However, this timestamp is not

More information

MPEG-4 ALS International Standard for Lossless Audio Coding

MPEG-4 ALS International Standard for Lossless Audio Coding MPEG-4 ALS International Standard for Lossless Audio Coding Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Hiroshi Sekigawa Abstract This article explains the technologies and applications of lossless

More information

Multimedia Event Detection for Large Scale Video. Benjamin Elizalde

Multimedia Event Detection for Large Scale Video. Benjamin Elizalde Multimedia Event Detection for Large Scale Video Benjamin Elizalde Outline Motivation TrecVID task Related work Our approach (System, TF/IDF) Results & Processing time Conclusion & Future work Agenda 2

More information

Efficient Acquisition of Human Existence Priors from Motion Trajectories

Efficient Acquisition of Human Existence Priors from Motion Trajectories Efficient Acquisition of Human Existence Priors from Motion Trajectories Hitoshi Habe Hidehito Nakagawa Masatsugu Kidode Graduate School of Information Science, Nara Institute of Science and Technology

More information

USABILITY EVALUATION OF VISUALIZATION INTERFACES FOR CONTENT-BASED MUSIC RETRIEVAL SYSTEMS

USABILITY EVALUATION OF VISUALIZATION INTERFACES FOR CONTENT-BASED MUSIC RETRIEVAL SYSTEMS 1th International Society for Music Information Retrieval Conference (ISMIR 29) USABILITY EVALUATION OF VISUALIZATION INTERFACES FOR CONTENT-BASED MUSIC RETRIEVAL SYSTEMS Keiichiro Hoashi Shuhei Hamawaki

More information

An efficient access control method for composite multimedia content

An efficient access control method for composite multimedia content IEICE Electronics Express, Vol.7, o.0, 534 538 An efficient access control method for composite multimedia content Shoko Imaizumi,a), Masaaki Fujiyoshi,andHitoshiKiya Industrial Research Institute of iigata

More information

Sales Manual Part II

Sales Manual Part II Sales Manual Part II In this sales manual, you ll be able to show how to make a song and create a WAV file of the song. Table of Contents Page 1. Main Features of the Sequencer 2 2. How to Demo the Sequencer

More information

A REGULARITY-CONSTRAINED VITERBI ALGORITHM AND ITS APPLICATION TO THE STRUCTURAL SEGMENTATION OF SONGS

A REGULARITY-CONSTRAINED VITERBI ALGORITHM AND ITS APPLICATION TO THE STRUCTURAL SEGMENTATION OF SONGS Author manuscript, published in "International Society for Music Information Retrieval Conference (ISMIR) (2011)" A REGULARITY-CONSTRAINED VITERBI ALGORITHM AND ITS APPLICATION TO THE STRUCTURAL SEGMENTATION

More information

FSRM Feedback Algorithm based on Learning Theory

FSRM Feedback Algorithm based on Learning Theory Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 2015, 9, 699-703 699 FSRM Feedback Algorithm based on Learning Theory Open Access Zhang Shui-Li *, Dong

More information

Multimedia Integration for Cooking Video Indexing

Multimedia Integration for Cooking Video Indexing Multimedia Integration for Cooking Video Indexing Reiko Hamada 1, Koichi Miura 1, Ichiro Ide 2, Shin ichi Satoh 3, Shuichi Sakai 1, and Hidehiko Tanaka 4 1 The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku,

More information

Locating 1-D Bar Codes in DCT-Domain

Locating 1-D Bar Codes in DCT-Domain Edith Cowan University Research Online ECU Publications Pre. 2011 2006 Locating 1-D Bar Codes in DCT-Domain Alexander Tropf Edith Cowan University Douglas Chai Edith Cowan University 10.1109/ICASSP.2006.1660449

More information

Developing a Promo Video in Premiere Elements: a Day in Dublin

Developing a Promo Video in Premiere Elements: a Day in Dublin Dublin Institute of Technology ARROW@DIT Instructional Guides School of Multidisciplinary Technologies 2012 Developing a Promo Video in Premiere Elements: a Day in Dublin Jerome Casey jerome.casey@dit.ie

More information

Erratum to: Exploring functional data analysis and wavelet principal component analysis on ecstasy (MDMA) wastewater data

Erratum to: Exploring functional data analysis and wavelet principal component analysis on ecstasy (MDMA) wastewater data Salvatore et al. BMC Medical Research Methodology (2017) 17:34 DOI 10.1186/s12874-017-0311-y ERRATUM Erratum to: Exploring functional data analysis and wavelet principal component analysis on ecstasy (MDMA)

More information

Evaluation of traffic dispersion methods for synchronous distributed multimedia data transmission on multiple links for group of mobile hosts

Evaluation of traffic dispersion methods for synchronous distributed multimedia data transmission on multiple links for group of mobile hosts Int. J. Applied Systemic Studies, Vol. 3, No. 1, 2010 89 Evaluation of traffic dispersion methods for synchronous distributed multimedia data transmission on multiple links for group of mobile hosts Yoshia

More information

Signature Amortization Using Multiple Connected Chains

Signature Amortization Using Multiple Connected Chains Signature Amortization Using Multiple Connected Chains Qusai Abuein 1 and Susumu Shibusawa 2 1 Graduate School of Science and Engineering 2 Department of Computer and Information Sciences Ibaraki University,

More information

User manual of LED Dance Floor

User manual of LED Dance Floor User manual of LED Dance Floor Product name: Magic LED Dance Floor Item#: DF500-4 1. Features: Unit Dimension (mm): 500*500*100 Light source: super bright LEDs LEDs quantity: 192pcs (64R/64G/64B)

More information

Convention Paper Presented at the 120th Convention 2006 May Paris, France

Convention Paper Presented at the 120th Convention 2006 May Paris, France Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,

More information

Sampling & Sequencing. Combining MIDI and audio

Sampling & Sequencing. Combining MIDI and audio Sampling & Sequencing Combining MIDI and audio 1 2 Introduction To create audio assets we often want to combine MIDI data with audio How to combine multiple audio assets to create a single artefact? How

More information

Epitomic Analysis of Human Motion

Epitomic Analysis of Human Motion Epitomic Analysis of Human Motion Wooyoung Kim James M. Rehg Department of Computer Science Georgia Institute of Technology Atlanta, GA 30332 {wooyoung, rehg}@cc.gatech.edu Abstract Epitomic analysis is

More information

K-Means Clustering. Sargur Srihari

K-Means Clustering. Sargur Srihari K-Means Clustering Sargur srihari@cedar.buffalo.edu 1 Topics in Mixture Models and EM Mixture models K-means Clustering Mixtures of Gaussians Maximum Likelihood EM for Gaussian mistures EM Algorithm Gaussian

More information

Using IKAROS as a data transfer and management utility within the KM3NeT computing model

Using IKAROS as a data transfer and management utility within the KM3NeT computing model EPJ Web of Conferences 116, 07001 (2016) DOI: 10.1051/epjconf/201611607001 C Owned by the authors, published by EDP Sciences, 2016 Using IKAROS as a data transfer and management utility within the KM3NeT

More information

Detection of Acoustic Events in Meeting-Room Environment

Detection of Acoustic Events in Meeting-Room Environment 11/Dec/2008 Detection of Acoustic Events in Meeting-Room Environment Presented by Andriy Temko Department of Electrical and Electronic Engineering Page 2 of 34 Content Introduction State of the Art Acoustic

More information

SOUND EVENT DETECTION AND CONTEXT RECOGNITION 1 INTRODUCTION. Toni Heittola 1, Annamaria Mesaros 1, Tuomas Virtanen 1, Antti Eronen 2

SOUND EVENT DETECTION AND CONTEXT RECOGNITION 1 INTRODUCTION. Toni Heittola 1, Annamaria Mesaros 1, Tuomas Virtanen 1, Antti Eronen 2 Toni Heittola 1, Annamaria Mesaros 1, Tuomas Virtanen 1, Antti Eronen 2 1 Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 33720, Tampere, Finland toni.heittola@tut.fi,

More information

Available online at ScienceDirect. Procedia Technology 17 (2014 )

Available online at  ScienceDirect. Procedia Technology 17 (2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia Technology 17 (2014 ) 528 533 Conference on Electronics, Telecommunications and Computers CETC 2013 Social Network and Device Aware Personalized

More information

Research Article QOS Based Web Service Ranking Using Fuzzy C-means Clusters

Research Article QOS Based Web Service Ranking Using Fuzzy C-means Clusters Research Journal of Applied Sciences, Engineering and Technology 10(9): 1045-1050, 2015 DOI: 10.19026/rjaset.10.1873 ISSN: 2040-7459; e-issn: 2040-7467 2015 Maxwell Scientific Publication Corp. Submitted:

More information

Towards an Integrated Approach to Music Retrieval

Towards an Integrated Approach to Music Retrieval Towards an Integrated Approach to Music Retrieval Emanuele Di Buccio 1, Ivano Masiero 1, Yosi Mass 2, Massimo Melucci 1, Riccardo Miotto 1, Nicola Orio 1, and Benjamin Sznajder 2 1 Department of Information

More information

Mixtures of Gaussians and Advanced Feature Encoding

Mixtures of Gaussians and Advanced Feature Encoding Mixtures of Gaussians and Advanced Feature Encoding Computer Vision Ali Borji UWM Many slides from James Hayes, Derek Hoiem, Florent Perronnin, and Hervé Why do good recognition systems go bad? E.g. Why

More information

FUSION OF MULTITEMPORAL AND MULTIRESOLUTION REMOTE SENSING DATA AND APPLICATION TO NATURAL DISASTERS

FUSION OF MULTITEMPORAL AND MULTIRESOLUTION REMOTE SENSING DATA AND APPLICATION TO NATURAL DISASTERS FUSION OF MULTITEMPORAL AND MULTIRESOLUTION REMOTE SENSING DATA AND APPLICATION TO NATURAL DISASTERS Ihsen HEDHLI, Josiane ZERUBIA INRIA Sophia Antipolis Méditerranée (France), Ayin team, in collaboration

More information

Modeling Dynamic Structure of Human Verbal and Nonverbal Communication

Modeling Dynamic Structure of Human Verbal and Nonverbal Communication Modeling Dynamic Structure of Human Verbal and Nonverbal Communication Takashi Matsuyama and Hiroaki Kawashima Graduate School of Informatics, Kyoto University Yoshida-Honmachi, Sakyo, Kyoto 66-851, Japan

More information

YST-M40/M45D POWERED SPEAKER SERVICE MANUAL CONTENTS SPECIFICATIONS YST-M40/M45D

YST-M40/M45D POWERED SPEAKER SERVICE MANUAL CONTENTS SPECIFICATIONS YST-M40/M45D POWERED YST-M40/M45D SERVICE MANUAL CONTENTS FRONT PANELS... 1 REAR PANELS... 1 INSTALLING AND VERIFYING THE YST-M45D DEVICE DRIVERS... 2 EXPLODED VIEW (YST-M40)... 4 EXPLODED VIEW (YST-M45D)... 5 SPECIFICATIONS

More information

Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu

Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu 1 Introduction Yelp Dataset Challenge provides a large number of user, business and review data which can be used for

More information

Reverberation design based on acoustic parameters for reflective audio-spot system with parametric and dynamic loudspeaker

Reverberation design based on acoustic parameters for reflective audio-spot system with parametric and dynamic loudspeaker PROCEEDINGS of the 22 nd International Congress on Acoustics Signal Processing Acoustics: Paper ICA 2016-310 Reverberation design based on acoustic parameters for reflective audio-spot system with parametric

More information

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting

More information

Remarks. Marks. DVIDA Scale: High Honors: Honors: Pass: Fail:

Remarks. Marks. DVIDA Scale: High Honors: Honors: Pass: Fail: Professional Examination verification form for NDCA Judging application. Given by Examiner: Exam Start Time: Exam Finish TIme: Date: Name: Address: Remarks Part A (Dancing Demonstration) 35% Partnered

More information

Autonomous Robot Dancing Driven by Beats and Emotions of Music

Autonomous Robot Dancing Driven by Beats and Emotions of Music Autonomous Robot Dancing Driven by Beats and Emotions of Music ABSTRACT Guangyu Xia Computer Science Department Carnegie Mellon University Pittsburgh, PA 523, USA gxia@andrew.cmu.edu Roger Dannenberg Computer

More information

COMPRESSING MUSIC RECORDINGS INTO AUDIO SUMMARIES

COMPRESSING MUSIC RECORDINGS INTO AUDIO SUMMARIES COMPRESSING MUSIC RECORDINGS INTO AUDIO SUMMARIES Oriol Nieto New York University oriol@nyu.edu Eric J. Humphrey New York University ejhumphrey@nyu.edu Juan Pablo Bello New York University jpbello@nyu.edu

More information

Audio Watermarking using Empirical Mode Decomposition

Audio Watermarking using Empirical Mode Decomposition Audio Watermarking using Empirical Mode Decomposition Charulata P. Talele 1, Dr A. M. Patil 2 1ME Student, Electronics and Telecommunication Department J. T. Mahajan College of Engineering, Faizpur, Maharashtra,

More information

Gonzalo R. Arce Dept. of Electrical & Computer Engineering University of Delaware Newark, DE , U.S.A.

Gonzalo R. Arce Dept. of Electrical & Computer Engineering University of Delaware Newark, DE , U.S.A. 12th International Society for Music Information Retrieval Conference (ISMIR 2011) l 1 -GRAPH BASED MUSIC STRUCTURE ANALYSIS Yannis Panagakis Constantine Kotropoulos Dept. of Informatics Aristotle University

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017 RESEARCH ARTICLE Gesture Based Musical Metronome Implementation Strategy Mr. Mahesha Padyana [1], Dr. Bindu A Thomas [2] Department of Electronics and Communication Vidya Vikas Institute of Eng and Technology

More information

Tendency Mining in Dynamic Association Rules Based on SVM Classifier

Tendency Mining in Dynamic Association Rules Based on SVM Classifier Send Orders for Reprints to reprints@benthamscienceae The Open Mechanical Engineering Journal, 2014, 8, 303-307 303 Open Access Tendency Mining in Dynamic Association Rules Based on SVM Classifier Zhonglin

More information

Copyright Notice. Springer papers: Springer. Pre-prints are provided only for personal use. The final publication is available at link.springer.

Copyright Notice. Springer papers: Springer. Pre-prints are provided only for personal use. The final publication is available at link.springer. Copyright Notice The document is provided by the contributing author(s) as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. This is the author s version

More information

Role of big data in classification and novel class detection in data streams

Role of big data in classification and novel class detection in data streams DOI 10.1186/s40537-016-0040-9 METHODOLOGY Open Access Role of big data in classification and novel class detection in data streams M. B. Chandak * *Correspondence: hodcs@rknec.edu; chandakmb@gmail.com

More information

k-means Clustering David S. Rosenberg April 24, 2018 New York University

k-means Clustering David S. Rosenberg April 24, 2018 New York University k-means Clustering David S. Rosenberg New York University April 24, 2018 David S. Rosenberg (New York University) DS-GA 1003 / CSCI-GA 2567 April 24, 2018 1 / 19 Contents 1 k-means Clustering 2 k-means:

More information

Drawing Bipartite Graphs as Anchored Maps

Drawing Bipartite Graphs as Anchored Maps Drawing Bipartite Graphs as Anchored Maps Kazuo Misue Graduate School of Systems and Information Engineering University of Tsukuba 1-1-1 Tennoudai, Tsukuba, 305-8573 Japan misue@cs.tsukuba.ac.jp Abstract

More information

Machine Learning for Music Discovery

Machine Learning for Music Discovery Talk Machine Learning for Music Discovery Joan Serrà Artificial Intelligence Research Institute (IIIA-CSIC) Spanish National Research Council jserra@iiia.csic.es http://joanserra.weebly.com Joan Serrà

More information