D 5.2 Time stretching modules with synchronized multimedia prototype

The time-stretching factor is sent to the audio processing engine in order to change the analysis hop size, and the audio output frame timestamp is calculated accordingly. However, this timestamp is not sufficient for proper A/V synchronisation, since it represents the time at which the audio frame is sent to the audio hardware buffer. For example, if an audio frame is 1024 samples and the sample rate is Hz, the time resolution will be 23.2 ms. For normal playback speed this may be sufficient, but when the playback speed is doubled the time span between two audio sample points on the media timeline becomes 46.4 ms. Hence, some measure of the fullness of the audio hardware buffer needs to be introduced for precise timing of the output audio samples.

The fullness of the hardware audio buffer is hardware dependent and measuring it is often a complex task, so we propose to approximate the timing of the audio sample by measuring the time difference (Δt) between the moment the audio frame is sent to the hardware buffer and the current time. This value is added to the timestamp of the audio frame that was sent to the audio buffer (T_audio), and the sum is compared with the video frame timestamp (T_video). The display is refreshed with this frame when the video frame timestamp is smaller than or equal to the calculated audio time:

$$T_{video} \le T_{audio} + \Delta t \qquad (9)$$

Another issue is timer precision for measuring Δt. In Windows OS, the maximal precision that can be achieved with the standard timer is 15 ms, which is hardly enough for a synchronisation application. Hence, Δt is obtained by counting CPU counts from the moment the frame is sent to the hardware buffer and dividing by the CPU count frequency. Since this gives a value on the real playback timeline, it is transposed to the media timeline by dividing it by the time-stretching factor α:

$$\Delta t = \frac{1}{\alpha} \cdot \frac{\Delta CNT_{cpu}}{f_{cnt}} \qquad (10)$$
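The timing rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than the EASAIER implementation: the names `frame_span_ms`, `AudioClock` and `should_refresh` are hypothetical, `time.perf_counter_ns` stands in for the CPU counter, and the 44100 Hz sample rate used in the usage note is an assumption chosen because it is consistent with the 23.2 ms figure. The sketch also assumes that α is the ratio of output to input duration, so that α = 0.5 corresponds to doubled playback speed, which is the reading implied by dividing Δt by α.

```python
import time


def frame_span_ms(frame_size, sample_rate_hz, alpha=1.0):
    """Media-timeline span covered by one output audio frame, in ms.

    alpha is the time-stretching factor (assumed: output duration divided
    by input duration, so alpha = 0.5 means doubled playback speed).
    """
    return 1000.0 * frame_size / (sample_rate_hz * alpha)


class AudioClock:
    """Estimates the media time of the sample currently being played."""

    def __init__(self, alpha=1.0, counter=time.perf_counter_ns,
                 counter_hz=1_000_000_000):
        self.alpha = alpha
        self._counter = counter          # high-resolution tick source
        self._counter_hz = counter_hz    # ticks per second of that source
        self._t_audio = 0.0              # timestamp of the last frame sent (s)
        self._cnt_at_send = counter()    # tick count when that frame was sent

    def frame_sent(self, t_audio):
        """Call when a frame with media timestamp t_audio (s) is queued."""
        self._t_audio = t_audio
        self._cnt_at_send = self._counter()

    def delta_t(self):
        """Elapsed time since the last frame was sent, on the media timeline."""
        elapsed_counts = self._counter() - self._cnt_at_send
        return elapsed_counts / self._counter_hz / self.alpha

    def now(self):
        """Estimated audio media time: T_audio + delta_t."""
        return self._t_audio + self.delta_t()


def should_refresh(t_video, clock):
    """Refresh the display when T_video <= T_audio + delta_t."""
    return t_video <= clock.now()
```

With these assumptions, `frame_span_ms(1024, 44100)` gives about 23.2 ms and `frame_span_ms(1024, 44100, alpha=0.5)` about 46.4 ms, matching the figures in the text. Passing the counter in as a parameter keeps the clock testable and lets a platform-specific counter be substituted.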

5. A/V Synchronisation Evaluation

To measure the quality of the A/V synchronisation algorithm, we compared it with our integration of time-stretching in ffplay on the Linux platform and with the MPlayer implementation on Linux. MPlayer is a robust, open source video player for Linux based on the ffmpeg libraries. One of the many features of MPlayer is the possibility to change playback speed, but without independent pitch-shifting. Nevertheless, this feature, its robust implementation and the possibility to extract A/V synchronisation information make MPlayer useful for evaluation and comparison with our algorithm.

We compared the video players on the Casino Royale trailer sequence coded in MPEG1 format with video frame dimensions 640x352 at frames per second and an audio sample rate of Hz. The video frame lag with respect to audio is presented for 100 video frames from the middle of the sequence, for playback at half the original speed (Figure 15) and at double the original speed (Figure 16). It can be seen that our adaptive video refresh rate algorithm (marked Easaier on the figures, after the name of the project it was implemented for) clearly outperforms the other two because of the precise matching of the video timestamp to the audio clock. The video lag of the Easaier time-stretching algorithm is also well below the ITU lip sync error recommendation, with a maximal video lag of 14 ms and a maximal video advance of 13 ms in the case of doubled playback speed. Moreover, the standard deviation of video lag is ms, showing the stability of this solution.

Figure 15. Comparison of video lag for three video player implementations when playback speed is half of original.

Figure 16. Video lag when playback speed is doubled.
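The summary figures used in this evaluation (maximal lag, maximal advance and standard deviation of the lag) could be computed from logged timestamps roughly as follows. This is a hedged sketch, not the measurement code used for the figures: the function names are hypothetical, lag is assumed to be the audio clock value at display time minus the video frame timestamp (positive when video is late), and the limits passed to `within_limits` are left as parameters to be taken from the ITU recommendation.

```python
from statistics import pstdev


def lag_summary(video_ts_ms, audio_clock_ms):
    """Summarise per-frame video lag relative to the audio clock.

    video_ts_ms    -- timestamp of each displayed video frame, in ms
    audio_clock_ms -- estimated audio media time when that frame was shown
    """
    if not video_ts_ms or len(video_ts_ms) != len(audio_clock_ms):
        raise ValueError("need two non-empty sequences of equal length")
    # Positive lag: video shown late; negative lag: video shown early.
    lags = [a - v for v, a in zip(video_ts_ms, audio_clock_ms)]
    return {
        "max_lag_ms": max(0.0, max(lags)),
        "max_advance_ms": max(0.0, -min(lags)),
        "std_ms": pstdev(lags),
    }


def within_limits(summary, lag_limit_ms, advance_limit_ms):
    """Check the extremes against caller-supplied lip sync limits."""
    return (summary["max_lag_ms"] <= lag_limit_ms
            and summary["max_advance_ms"] <= advance_limit_ms)
```

The population standard deviation (`pstdev`) is used here on the assumption that the logged frames are treated as the whole measurement window; `statistics.stdev` would be the alternative if they are regarded as a sample.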

6. Conclusions

A framework for real-time video/audio synchronised time scaling and pitch shifting was developed for EASAIER. Careful consideration was given to the problems which arise in a real-time context, and novel solutions to these issues have been provided. It was shown how time-scale changes can be achieved in real-time with almost imperceptible latency and no transitional artefacts. The approach is based on a modified phase vocoder with optional phase locking and an integrated transient detector which enables high quality transient preservation in real-time. The framework presented is the basis for the development of applications which allow a seamless real-time transition between continually varying, independent video/audio time-scale and pitch-scale parameters.

A novel solution for audio/visual synchronisation, called adaptive video refresh rate, has also been developed. Because synchronisation errors will be easier to detect in the foreseen applications, special focus was given to minimising video lags and advances, resulting in an algorithm that significantly outperforms existing algorithms. This work has also been submitted for review to the IEEE Transactions on Multimedia [23].

The framework and the described algorithms have been integrated successfully into the EASAIER client application, as shown in Figure 17.

Figure 17. The EASAIER client application, showing the time scale modification tool along with synchronised video playback. Also shown is the freehand EQ with synchronised spectral display. All dynamic screen objects are synchronised to the time-scaled time-base.


7. References

[1] LaBarbera P, and MacLachlan J, Time-Compressed Speech in Radio Advertising, Journal of Marketing, vol. 43, no. 1, January 1979, pp.
[2] Landone C, Harrop J, Reiss J, Enabling Access to Sound Archives through Integration, Enrichment and Retrieval: the EASAIER Project, 8th ISMIR Conference, Vienna, 2007.
[3] Barrett S, Duffy C, and Marshalsay K, HOTBED (Handing On Tradition By Electronic Dissemination), Royal Scottish Academy of Music and Drama, Glasgow, Report, March
[4] Harrigan K, The SPECIAL system: Self-paced education with compressed interactive audio learning, Journal of Research on Computing in Education, vol. 27, no. 3, 1995, pp.
[5] Harrigan K, The SPECIAL system: Searching time-compressed digital video lectures, Journal of Research on Computing in Education, vol. 33, no. 1, 2000, pp.
[6] King P. E, and Behnke R. R, The Effect of Time-Compressed Speech on Comprehension, Interpretive and Short-Term Listening, Human Communication Research, vol. 15, no. 3,
[7] Olson J. S, A Study of the relative effectiveness of verbal and visual augmentation of rate-modified speech in the presentation of technical material, Annual Conference of the Association for Educational Communications and Technology (AECT), Anaheim, CA,
[8] Orr D. B, Friedman H. L, and Williams J. C, Trainability of listening comprehension of speeded discourse, Journal of Educational Psychology, vol. 56, 1965, pp.
[9] Short S, A Comparison of Variable Time-Compressed Speech and Normal Rate Speech Based on Time Spent and Performance in a Course Taught with Self-Instructional Methods, British Journal of Educational Technology, vol. 8, no. 2, 1977, pp.
[10] Li F. C, Gupta A, Sanocki E, He L, and Rui Y, Browsing digital video, ACM CHI 2000, The Hague, Netherlands, April 2000, pp.
[11] Flanagan J. L, and Golden R. M, Phase Vocoder, Bell System Technical Journal, vol. 45, pp.
[12] Dolson M, The phase vocoder: A tutorial, Computer Music Journal, vol. 10, 1986, pp.
[13] Portnoff M, Implementation of the digital phase vocoder using the fast Fourier transform, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 3, June 1976, pp.
[14] Laroche J, and Dolson M, Improved phase vocoder, IEEE Trans. Speech and Audio Processing, vol. 7, no. 3, May 1999, p.
[15] Bonada J, Automatic technique in frequency domain for near-lossless time-scale modification of audio, Proceedings of International Computer Music Conference, Berlin, Germany, 2000.
[16] McAulay R. J, and Quatieri T. F, Speech Transformations Based on a Sinusoidal Representation, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-34, no. 6, pp., August 1986.
[17] Laroche J, Autocorrelation method for high quality time/pitch scaling, IEEE WASPAA, Mohonk, NY,
[18] Verma T. S, and Meng T. H. Y, An analysis/synthesis tool for transient signals, in Proc. 16th International Congress on Acoustics/135th Meeting of the Acoustical Society of America, June 1998, vol. 1, pp.
[19] Duxbury C, Davies M, and Sandler M, Improved time-scaling of musical audio using phase locking at transients, 112th AES Convention, Convention Paper 5530, 2002.
[20] Barry D, FitzGerald D, and Coyle E, Drum Source Separation using Percussive Feature Detection and Spectral Modulation, IEE Irish Signals and Systems Conference, Dublin, Ireland, 2005.
[21] International Telecommunication Union, Document 11A/47-E, 13 October 1993.
[22] International Telecommunication Union, Relative Timing of Sound and Vision for Broadcasting, Recommendation ITU-R BT,
[23] Damnjanovic I, Barry D, Dorran D, Reiss J, Real-time Synchronised Audio/Video Time and Pitch Scale Modification, submitted to IEEE Transactions on Multimedia, September

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

Spectral modeling of musical sounds

Spectral modeling of musical sounds Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

Rhythmic constant pitch time stretching for digital audio

Rhythmic constant pitch time stretching for digital audio Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,

More information

USER-GUIDED VARIABLE-RATE TIME-STRETCHING VIA STIFFNESS CONTROL

USER-GUIDED VARIABLE-RATE TIME-STRETCHING VIA STIFFNESS CONTROL Proc. of the 5 th Int. Conference on Digital Audio Effects (DAFx-), York, UK, September 7-, USER-GUIDED VARIABLE-RATE TIME-STRETCHING VIA STIFFNESS CONTROL Nicholas J. Bryan, Jorge Herrera, and Ge Wang

More information

Audio Watermarking Based on PCM Technique

Audio Watermarking Based on PCM Technique Audio Watermarking Based on PCM Technique Ranjeeta Yadav Department of ECE SGIT, Ghaziabad, INDIA Sachin Yadav Department of CSE SGIT, Ghaziabad, INDIA Jyotsna Singh Department of ECE NSIT, New Delhi,

More information

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.

A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D. Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier

More information

D5.1 Prototype of Looping and Marking Modules

D5.1 Prototype of Looping and Marking Modules D5.1 Prototype of Looping and Marking Modules Abstract The EASAIER system provides the end user with the ability to query large multimedia archives and access the retrieved content directly. Upon retrieval

More information

MPEG-4 ALS International Standard for Lossless Audio Coding

MPEG-4 ALS International Standard for Lossless Audio Coding MPEG-4 ALS International Standard for Lossless Audio Coding Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Hiroshi Sekigawa Abstract This article explains the technologies and applications of lossless

More information

PERIODIC ACTIVITY REPORT

PERIODIC ACTIVITY REPORT Page 1 Project Number: 033902 Project Acronym: EASAIER Project Title Enabling Access to Sound Archives through Integration, Enrichment and Retrieval SPECIFIC TARGETED RESEACH OR INNOVATION PROJECT ACCESS

More information

MCompressor. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button

MCompressor. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button MCompressor Presets button Presets button shows a window with all available presets. A preset can be loaded from the preset window by double-clicking on it, using the arrow buttons or by using a combination

More information

Wavetable Matching of Pitched Inharmonic Instrument Tones

Wavetable Matching of Pitched Inharmonic Instrument Tones Wavetable Matching of Pitched Inharmonic Instrument Tones Clifford So and Andrew Horner Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong

More information

Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications

Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information

Open Binding Of IDs To Media

Open Binding Of IDs To Media REQUEST FOR PROPOSALS Open Binding Of IDs To Media SMPTE RFP An online/teleconference meeting will take place on April 20, 2015, to answer questions. Non-SMPTE members, please RSVP to the Drafting Group

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes: Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution

More information

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp

More information

WATERMARKING FOR LIGHT FIELD RENDERING 1

WATERMARKING FOR LIGHT FIELD RENDERING 1 ATERMARKING FOR LIGHT FIELD RENDERING 1 Alper Koz, Cevahir Çığla and A. Aydın Alatan Department of Electrical and Electronics Engineering, METU Balgat, 06531, Ankara, TURKEY. e-mail: koz@metu.edu.tr, cevahir@eee.metu.edu.tr,

More information

FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM

FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM Heiko Hansen, Stefan Strahl Carl von Ossietzky University Oldenburg Department of Physics D-6111 Oldenburg, Germany

More information

C H A P T E R Introduction

C H A P T E R Introduction C H A P T E R 1 Introduction M ultimedia is probably one of the most overused terms of the 90s (for example, see [Sch97]). The field is at the crossroads of several major industries: computing, telecommunications,

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

Audio Streams Merging Over ALMI

Audio Streams Merging Over ALMI Audio Streams Merging Over ALMI Christopher J. Dunkle, Zhen Zhang, Sherlia Y. Shi, Zongming Fei Department of Computer Science University of Kentucky 301 Rose Street, 2 nd floor Lexington, KY 40506-0495,

More information

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS Digitization Best Practices for Audio This document sets forth guidelines for digitizing audio materials for CARLI Digital Collections. The issues described

More information

Automatic Enhancement of Correspondence Detection in an Object Tracking System

Automatic Enhancement of Correspondence Detection in an Object Tracking System Automatic Enhancement of Correspondence Detection in an Object Tracking System Denis Schulze 1, Sven Wachsmuth 1 and Katharina J. Rohlfing 2 1- University of Bielefeld - Applied Informatics Universitätsstr.

More information

Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms

Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms 26 IEEE 24th Convention of Electrical and Electronics Engineers in Israel Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms Hadas Ofir and David Malah Department of Electrical

More information

MPEG-4 Structured Audio Systems

MPEG-4 Structured Audio Systems MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content

More information

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems Rec. ITU-R BS.1693 1 RECOMMENDATION ITU-R BS.1693 Procedure for the performance test of automated query-by-humming systems (Question ITU-R 8/6) (2004) The ITU Radiocommunication Assembly, considering a)

More information

Podcasting: How to Create Your Own in 30-Minutes

Podcasting: How to Create Your Own in 30-Minutes Podcasting: How to Create Your Own in 30-Minutes Podcasts Included in this Tutorial: o What is a Podcast? o What are the Learning Benefits of Podcasts? o Creating a Podcast with Audacity o Creating a Podcast

More information

An Adaptive Scene Compositor Model in MPEG-4 Player for Mobile Device

An Adaptive Scene Compositor Model in MPEG-4 Player for Mobile Device An Adaptive Scene Compositor Model in MPEG-4 Player for Mobile Device Hyunju Lee and Sangwook Kim Computer Science Department, Kyungpook National University 1370 Sankyuk-dong Buk-gu, Daegu, 702-701, Korea

More information

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.

More information

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming

Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Takuichi Nishimura Real World Computing Partnership / National Institute of Advanced

More information

A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio

A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio Haakon Lund 1, Mette Skov 2, Birger Larsen 2 and Marianne Lykke 2 1 Royal School of Library and

More information

MPEG-7. Multimedia Content Description Standard

MPEG-7. Multimedia Content Description Standard MPEG-7 Multimedia Content Description Standard Abstract The purpose of this presentation is to provide a better understanding of the objectives & components of the MPEG-7, "Multimedia Content Description

More information

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric

More information

Robustness of Multiplexing Protocols for Audio-Visual Services over Wireless Networks

Robustness of Multiplexing Protocols for Audio-Visual Services over Wireless Networks Robustness of Multiplexing Protocols for Audio-Visual Services over Wireless Networks W. S. Lee, M. R. Frater, M. R. Pickering and J. F. Arnold School of Electrical Engineering University College UNSW

More information

Content Based Classification of Audio Using MPEG-7 Features

Content Based Classification of Audio Using MPEG-7 Features Content Based Classification of Audio Using MPEG-7 Features ManasiChoche, Dr.SatishkumarVarma Abstract The segmentation plays important role in audio classification. The audio data can be divided into

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

Version (build 46h) released October 31, 2016: Minor changes to common codebase. Withdrawn November 1 because of bug in batch processing.

Version (build 46h) released October 31, 2016: Minor changes to common codebase. Withdrawn November 1 because of bug in batch processing. ClickRepair version history Version 3.9.9 (build 46j) released June 17, 2017: Improved repair of 192kHz files. Version 3.9.8 (build 46i) released November 14, 2016: Fixed bugs in batch processing. Version

More information

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7) EE799 -- Multimedia Signal Processing Multimedia Signal Compression VI (MPEG-4, 7) References: 1. http://www.mpeg.org 2. http://drogo.cselt.stet.it/mpeg/ 3. T. Berahimi and M.Kunt, Visual data compression

More information

Rich Recording Technology Technical overall description

Rich Recording Technology Technical overall description Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology

More information

GETTING STARTED WITH DJCONTROL INSTINCT AND DJUCED UK US

GETTING STARTED WITH DJCONTROL INSTINCT AND DJUCED UK US GETTING STARTED WITH DJCONTROL INSTINCT AND DJUCED INSTALLATION Insert the CD-ROM. Run the installer program. Follow the instructions. 6 1 2 7 3 4 5 1- Channels 1-2 (mix output) balance 2- Volume on channels

More information

PUBLICATIONS. Journal Papers

PUBLICATIONS. Journal Papers PUBLICATIONS Journal Papers [J1] X. Wu and L.-L. Xie, Asymptotic equipartition property of output when rate is above capacity, submitted to IEEE Transactions on Information Theory, August 2009. [J2] A.

More information

ITU-T. FG AVA TR Version 1.0 (10/2013) Part 16: Interworking and digital audiovisual media accessibility

ITU-T. FG AVA TR Version 1.0 (10/2013) Part 16: Interworking and digital audiovisual media accessibility International Telecommunication Union ITU-T TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU FG AVA TR Version 1.0 (10/2013) Focus Group on Audiovisual Media Accessibility Technical Report Part 16: Interworking

More information

Sounding Better Than Ever: High Quality Audio. Simon Forrest Connected Home Marketing

Sounding Better Than Ever: High Quality Audio. Simon Forrest Connected Home Marketing Sounding Better Than Ever: High Quality Audio Simon Forrest Connected Home Marketing www.imgtec.com A brief look at the numbers Market trends Worldwide audio market 2014 67.9m units shipped 16% increase

More information

Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks

Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks Abstract J.P.Ouedraogo, L.Sun and I.H.Mkwawa Signal Processing and Multimedia Communications, University of Plymouth, Plymouth,

More information

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future

More information

Unit Title: Capture Pictures and Sound for Non-Linear Editing

Unit Title: Capture Pictures and Sound for Non-Linear Editing Unit Credit Value: 8 Unit Level: Three Unit Guided Learning Hours: 50 Ofqual Unit Reference Number: D/600/8457 Unit Review Date: 31/12/2016 Unit Sector: 9.3 Media and Communications Unit Summary In this

More information

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer

More information

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Outline Overview of Dirac Overview of AVS-china Overview

More information

ITEC310 Computer Networks II

ITEC310 Computer Networks II ITEC310 Computer Networks II Chapter 29 Multimedia Department of Information Technology Eastern Mediterranean University 2/75 Objectives After completing this chapter you should be able to do the following:

More information

DESCRIPTION FEATURES MULTISTREAM PCI SOUND CARDS 26 DECEMBER 2007 ASI6514, ASI6518

DESCRIPTION FEATURES MULTISTREAM PCI SOUND CARDS 26 DECEMBER 2007 ASI6514, ASI6518 26 DECEMBER 2007 ASI6514, ASI6518 MULTISTREAM PCI SOUND CARDS DESCRIPTION The ASI6514 and ASI6518 are professional PCI sound cards designed for use in radio broadcast automation. Providing up to 16 play

More information

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement

Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Colin Raffel AES 129th Convention San Francisco, CA February 16, 2011 Outline Introduction and Motivation Coding Error Analysis

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

Chapter 28. Multimedia

Chapter 28. Multimedia Chapter 28. Multimedia 28-1 Internet Audio/Video Streaming stored audio/video refers to on-demand requests for compressed audio/video files Streaming live audio/video refers to the broadcasting of radio

More information

The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1

The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1 The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1 N. Adami, A. Bugatti, A. Corghi, R. Leonardi, P. Migliorati, Lorenzo A. Rossi, C. Saraceno 2 Department of Electronics

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

Audio-Visual Content Indexing, Filtering, and Adaptation

Audio-Visual Content Indexing, Filtering, and Adaptation Audio-Visual Content Indexing, Filtering, and Adaptation Shih-Fu Chang Digital Video and Multimedia Group ADVENT University-Industry Consortium Columbia University 10/12/2001 http://www.ee.columbia.edu/dvmm

More information

Audio-Visual Content Indexing, Filtering, and Adaptation

Audio-Visual Content Indexing, Filtering, and Adaptation Audio-Visual Content Indexing, Filtering, and Adaptation Shih-Fu Chang Digital Video and Multimedia Group ADVENT University-Industry Consortium Columbia University 10/12/2001 http://www.ee.columbia.edu/dvmm

More information
