D 5.2 Time stretching modules with synchronized multimedia prototype
|
|
- Karin Wilkins
- 6 years ago
- Views:
Transcription
1 The time-stretching factor is sent to the audio processing engine in order to change the analysis hop size, and the audio output frame timestamp is calculated accordingly. However, this timestamp is not sufficient for proper A/V synchronisation, since it represents the time when the audio frame is sent to audio hardware buffer. For example, if an audio frame is 1024 samples and the sample rate is Hz, the time resolution will be 23.2 ms. For the normal playback speed, this may be sufficient, but in the case of doubling the playback speed the time span between two audio sample points on the media timeline becomes 46.4 ms. Hence, some measure of fullness of the audio hardware buffer needs to be introduced for precise timing of outputted audio samples. The fullness of the hardware audio buffer is hardware dependent and measuring it is often a complex task, so we propose to find approximate timing of the audio sample by measuring the time difference (Δt) between the moment the audio frame is sent to the hardware buffer and the current time. This value is then added to the timestamp of the audio frame that was sent to the audio buffer (T audio ), and is then compared with the video frame timestamp (T video ). The display is refreshed with this frame when the video frame time code is smaller than or equal to the calculated audio time: T! T + " t video audio Another issue is timer precision for measuring Δt. In Windows OS, the maximal precision that can be achieved with the standard timer is 15ms, which is hardly enough for a synchronisation application. Hence, Δt is measured by measuring CPU counts from the moment the frame is sent to the hardware buffer and then dividing by the CPU count frequency. Since Δt gives a value related to the real playback time-line, it is transposed to the media time line by dividing it by the time-stretching factor α: " CNT " t = #! f 1 cpu cnt (95) (10) 32
2 5. A/V Synchronisation Evaluation To measure the quality of the A/V synchronisation algorithm, we compared it with our integration of time-stretching in ffplay on the Linux platform and also with the MPlayer implementation in LinuxOS. MPlayer is a robust, open source video player in Linux based on ffmpeg libraries. One of the many features of MPlayer is the possibility to change playback speed, but without independent pitch-shifting. Nevertheless, this feature, robust implementation and the possibility to extract A/V synchronisation information make MPlayer useful for evaluation and comparison with our algorithm. We compared video players on the Casino Royale trailer sequence coded in MPEG1 format with video frame dimension 640x352 at frames per second and an audio sample rate of Hz. The video frame lag with respect to audio is presented for 100 video frames from the middle of the sequence in the case of playing the video at half of the original speed (Figure 15) and with double the original speed (Figure 16). It can be seen that our adaptive video refresh rate algorithm (marked Easaier on the figures after the name of the project it was implemented for) clearly outperforms the other two, because of the precise matching of the video timestamp to the audio clock. The video lag of the Easaier time-stretching algorithm is also well below the ITU lip sync error recommendation with maximal video lag being 14 ms and maximal video advance being 13 ms in the case of doubled playback speed. Moreover, the standard deviation of video lag is ms, showing stability of this solution. Figure 15. Comparison of video lag for three video player implementations when playback speed is half of original. 33
3 Figure 16. Video lag when playback speed is doubled. 34
4 6. Conclusions A framework for real-time video/audio synchronised time scaling and pitch shifting was developed for EASAIER. Careful consideration was given to the problems which arise in a real-time context and novel solutions to these issues have been provided. It was shown how time-scale changes can be achieved in real-time with almost imperceptible latency and no transitional artefacts. The approach is based on a modified phase vocoder with optional phase locking and an integrated transient detector which enables high quality transient preservation in real-time. The framework presented is the basis for the developments of applications which allow for a seamless real-time transition between continually varying, independent video/audio time-scale and pitch-scale parameters. A novel solution for audio/visual synchronisation called adaptive video refresh rate has also been developed. Due to the fact that synchronisation errors in the foreseen applications will be easier to detect, special focus was given to minimizing video lags and advances, resulting in algorithm that significantly outperforms existing algorithms. This work has also been presented for review in the IEEE Transactions on Multimedia [23] The framework and described algorithms have been integrated into the EASAIER client application successfully as shown in Figure 17. Figure 17. The EASAIER client application, showing the time scale modification tool along with synchronised video playback. Also shown, the freehand EQ with synchronised spectral display. All dynamic screen objects are synchronised to the time scaled time-base 35
5 36
6 7. References [1] LaBarbera P, and MacLachlan J, Time-Compressed Speech in Radio Advertising, Journal of Marketing, v. 43, n. 1, January 1979, pp [2] Landone C, Harrop J, Reiss J, Enabling Access to Sound Archives through Integration, Enrichment and Retrieval: the EASAIER Project, 8th ISMIR Conference, Vienna, 2007 [3] Barrett S, Duffy C, and Marshalsay K, HOTBED (Handing On Tradition By Electronic Dissemination), Royal Scottish Academy of Music and Drama, Glasgow, Report March [4] Harrigan K, The SPECIAL system: Self-paced education with compressed interactive audio learning, Journal of Research on Computing in Education,vol. 27, no. 3, 1995, pp [5] Harrigan K., The SPECIAL system: Searching time-compressed digital video lectures, Journal of Research on Computing in Education, vol. 33, no. 1, 2000, pp [6] King P. E, and Behnke R. R, The Effect of Time-Compressed Speech on Comprehension, Interpretive and Short-Term Listening, Human Communication Research, vol. 15, no. 3, [7] Olson J. S, A Study of the relative effectiveness of verbal and visual augmentation of rate-modified speech in the presentation of technical material, Annual Conference of the Association or Educational Communications and Technology (AECT), Anaheim, Ca, [8] Orr D. B, Friedman H. L, and Williams J. C, Trainability of listening comprehension of speeded discourse, Journal of Educational Psychology, vol. 56, 1965, pp [9] Short S, A Comparison of Variable Time-Compressed Speech and Normal Rate Speech Based on Time Spent and Performance in a Course Taught with Self- Instructional Methods, British Journal of Educational Technology,vol. 8, no. 2, 1977, pp [10] Li F. C, Gupta A, Sanocki E, He L, and Rui Y, Browsing digital video,. ACM CHI 2000, Hague, Netherlands, April 2000, pp [11] Flanagan J.L.,and Golden R.M, Phase Vocoder, Bell System Technical Journal vol. 45:, pp [12] Dolson M, The phase vocoder: A tutorial, Computer Music Journal, vol. 10, 1986, pp [13] Portnoff M, Implementation of the digital phase vocoder using the fast Fourier transform in IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 3O,Jun 1976, pp [14] Laroche J; and Dolson M, Improved phase vocoder, In Proc. IEEE Trans. Speech and Audio Processing, v. 7, n. 3, May 1999, p
7 [15] Bonada J, Automatic technique in frequency domain for near-lossless time-scale modification of audio, 'Proceedings of International Computer Music Conference, Berlin, Germany 2000 [16] McAulay, R. J. and Quatieri, T. F. Speech Transformations Based on a Sinusoidal Representation. IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. ASSP-34:6, pp , August 1986 [17] Laroche J, Autocorrelation method for high quality time/pitch scaling, IEEE WASPAA, Mohonk, NY, [18] Tony S. Verma and Teresa H. Y. Meng, "An analysis /synthesis tool for transient signals," in Proc. 16th International Congress on Acoustics/135th Meeting of the Acoustical Society of America, June 1998, vol. 1, pp [19] Duxbury, C., M. Davies, and M. Sandler. Improved time-scaling of musical audio using phase locking at transientsm, 112th AES Convention. Convention Paper5530, 2002 [20] Barry D; FitzGerald D; and Coyle E, Drum Source Separation using Percussive Feature Detection and Spectral Modulation, IEE Irish Signals and Systems Conference, Dublin, Ireland., 2005 [21] International Telecommunication Union Document 11A/47-E, 13 October 1993 [22] International Telecommunication Union, Relative Timing of Sound and Vision for Broadcasting. Recommendation, ITU-R BT , [23] Damnjanovic I, Barry D, Dorran D, Reiss J, Real-time Synchronised Audio/Video Time and Pitch Scale Modification, Submitted to IEEE Transactions on Multimedia, September
1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends
Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques
More informationSpectral modeling of musical sounds
Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective
More informationRhythmic constant pitch time stretching for digital audio
Rhythmic constant pitch time stretching for digital audio Brendan TREVORROW ; University of Southern Queensland, Australia ABSTRACT Constant pitch time stretching is not uncommon in audio editing software,
More informationUSER-GUIDED VARIABLE-RATE TIME-STRETCHING VIA STIFFNESS CONTROL
Proc. of the 5 th Int. Conference on Digital Audio Effects (DAFx-), York, UK, September 7-, USER-GUIDED VARIABLE-RATE TIME-STRETCHING VIA STIFFNESS CONTROL Nicholas J. Bryan, Jorge Herrera, and Ge Wang
More informationAudio Watermarking Based on PCM Technique
Audio Watermarking Based on PCM Technique Ranjeeta Yadav Department of ECE SGIT, Ghaziabad, INDIA Sachin Yadav Department of CSE SGIT, Ghaziabad, INDIA Jyotsna Singh Department of ECE NSIT, New Delhi,
More informationA-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS. Verfaille V., Arfib D.
Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, A-DAFX: ADAPTIVE DIGITAL AUDIO EFFECTS Verfaille V., Arfib D. CNRS - LMA 3, chemin Joseph Aiguier
More informationD5.1 Prototype of Looping and Marking Modules
D5.1 Prototype of Looping and Marking Modules Abstract The EASAIER system provides the end user with the ability to query large multimedia archives and access the retrieved content directly. Upon retrieval
More informationMPEG-4 ALS International Standard for Lossless Audio Coding
MPEG-4 ALS International Standard for Lossless Audio Coding Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Hiroshi Sekigawa Abstract This article explains the technologies and applications of lossless
More informationPERIODIC ACTIVITY REPORT
Page 1 Project Number: 033902 Project Acronym: EASAIER Project Title Enabling Access to Sound Archives through Integration, Enrichment and Retrieval SPECIFIC TARGETED RESEACH OR INNOVATION PROJECT ACCESS
More informationMCompressor. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button
MCompressor Presets button Presets button shows a window with all available presets. A preset can be loaded from the preset window by double-clicking on it, using the arrow buttons or by using a combination
More informationWavetable Matching of Pitched Inharmonic Instrument Tones
Wavetable Matching of Pitched Inharmonic Instrument Tones Clifford So and Andrew Horner Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong
More informationSubjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications
Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications
More informationNetworking Applications
Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services
More informationOpen Binding Of IDs To Media
REQUEST FOR PROPOSALS Open Binding Of IDs To Media SMPTE RFP An online/teleconference meeting will take place on April 20, 2015, to answer questions. Non-SMPTE members, please RSVP to the Drafting Group
More informationNew Results in Low Bit Rate Speech Coding and Bandwidth Extension
Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without
More informationThe following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:
Page 1 of 8 1. SCOPE This Operational Practice sets out guidelines for minimising the various artefacts that may distort audio signals when low bit-rate coding schemes are employed to convey contribution
More informationPerspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony
Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp
More informationWATERMARKING FOR LIGHT FIELD RENDERING 1
ATERMARKING FOR LIGHT FIELD RENDERING 1 Alper Koz, Cevahir Çığla and A. Aydın Alatan Department of Electrical and Electronics Engineering, METU Balgat, 06531, Ankara, TURKEY. e-mail: koz@metu.edu.tr, cevahir@eee.metu.edu.tr,
More informationFINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM
FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM Heiko Hansen, Stefan Strahl Carl von Ossietzky University Oldenburg Department of Physics D-6111 Oldenburg, Germany
More informationC H A P T E R Introduction
C H A P T E R 1 Introduction M ultimedia is probably one of the most overused terms of the 90s (for example, see [Sch97]). The field is at the crossroads of several major industries: computing, telecommunications,
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek
More informationAudio Streams Merging Over ALMI
Audio Streams Merging Over ALMI Christopher J. Dunkle, Zhen Zhang, Sherlia Y. Shi, Zongming Fei Department of Computer Science University of Kentucky 301 Rose Street, 2 nd floor Lexington, KY 40506-0495,
More informationGUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS
GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS Digitization Best Practices for Audio This document sets forth guidelines for digitizing audio materials for CARLI Digital Collections. The issues described
More informationAutomatic Enhancement of Correspondence Detection in an Object Tracking System
Automatic Enhancement of Correspondence Detection in an Object Tracking System Denis Schulze 1, Sven Wachsmuth 1 and Katharina J. Rohlfing 2 1- University of Bielefeld - Applied Informatics Universitätsstr.
More informationPacket Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms
26 IEEE 24th Convention of Electrical and Electronics Engineers in Israel Packet Loss Concealment for Audio Streaming based on the GAPES and MAPES Algorithms Hadas Ofir and David Malah Department of Electrical
More informationMPEG-4 Structured Audio Systems
MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content
More informationRECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems
Rec. ITU-R BS.1693 1 RECOMMENDATION ITU-R BS.1693 Procedure for the performance test of automated query-by-humming systems (Question ITU-R 8/6) (2004) The ITU Radiocommunication Assembly, considering a)
More informationPodcasting: How to Create Your Own in 30-Minutes
Podcasting: How to Create Your Own in 30-Minutes Podcasts Included in this Tutorial: o What is a Podcast? o What are the Learning Benefits of Podcasts? o Creating a Podcast with Audacity o Creating a Podcast
More informationAn Adaptive Scene Compositor Model in MPEG-4 Player for Mobile Device
An Adaptive Scene Compositor Model in MPEG-4 Player for Mobile Device Hyunju Lee and Sangwook Kim Computer Science Department, Kyungpook National University 1370 Sankyuk-dong Buk-gu, Daegu, 702-701, Korea
More informationModeling of an MPEG Audio Layer-3 Encoder in Ptolemy
Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy Patrick Brown EE382C Embedded Software Systems May 10, 2000 $EVWUDFW MPEG Audio Layer-3 is a standard for the compression of high-quality digital audio.
More informationMusic Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming
Music Signal Spotting Retrieval by a Humming Query Using Start Frame Feature Dependent Continuous Dynamic Programming Takuichi Nishimura Real World Computing Partnership / National Institute of Advanced
More informationA Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio
A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio Haakon Lund 1, Mette Skov 2, Birger Larsen 2 and Marianne Lykke 2 1 Royal School of Library and
More informationMPEG-7. Multimedia Content Description Standard
MPEG-7 Multimedia Content Description Standard Abstract The purpose of this presentation is to provide a better understanding of the objectives & components of the MPEG-7, "Multimedia Content Description
More informationPerceptual Audio Coders What to listen for: Artifacts of Parametric Coding
Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler University of AES 109th Convention, Los Angeles, September 22-25, 2000 1 Introduction: Parametric
More informationRobustness of Multiplexing Protocols for Audio-Visual Services over Wireless Networks
Robustness of Multiplexing Protocols for Audio-Visual Services over Wireless Networks W. S. Lee, M. R. Frater, M. R. Pickering and J. F. Arnold School of Electrical Engineering University College UNSW
More informationContent Based Classification of Audio Using MPEG-7 Features
Content Based Classification of Audio Using MPEG-7 Features ManasiChoche, Dr.SatishkumarVarma Abstract The segmentation plays important role in audio classification. The audio data can be divided into
More informationParametric Coding of Spatial Audio
Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial
More informationVersion (build 46h) released October 31, 2016: Minor changes to common codebase. Withdrawn November 1 because of bug in batch processing.
ClickRepair version history Version 3.9.9 (build 46j) released June 17, 2017: Improved repair of 192kHz files. Version 3.9.8 (build 46i) released November 14, 2016: Fixed bugs in batch processing. Version
More informationEE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)
EE799 -- Multimedia Signal Processing Multimedia Signal Compression VI (MPEG-4, 7) References: 1. http://www.mpeg.org 2. http://drogo.cselt.stet.it/mpeg/ 3. T. Berahimi and M.Kunt, Visual data compression
More informationRich Recording Technology Technical overall description
Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology
More informationGETTING STARTED WITH DJCONTROL INSTINCT AND DJUCED UK US
GETTING STARTED WITH DJCONTROL INSTINCT AND DJUCED INSTALLATION Insert the CD-ROM. Run the installer program. Follow the instructions. 6 1 2 7 3 4 5 1- Channels 1-2 (mix output) balance 2- Volume on channels
More informationPUBLICATIONS. Journal Papers
PUBLICATIONS Journal Papers [J1] X. Wu and L.-L. Xie, Asymptotic equipartition property of output when rate is above capacity, submitted to IEEE Transactions on Information Theory, August 2009. [J2] A.
More informationITU-T. FG AVA TR Version 1.0 (10/2013) Part 16: Interworking and digital audiovisual media accessibility
International Telecommunication Union ITU-T TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU FG AVA TR Version 1.0 (10/2013) Focus Group on Audiovisual Media Accessibility Technical Report Part 16: Interworking
More informationSounding Better Than Ever: High Quality Audio. Simon Forrest Connected Home Marketing
Sounding Better Than Ever: High Quality Audio Simon Forrest Connected Home Marketing www.imgtec.com A brief look at the numbers Market trends Worldwide audio market 2014 67.9m units shipped 16% increase
More informationExperimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks
Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks Abstract J.P.Ouedraogo, L.Sun and I.H.Mkwawa Signal Processing and Multimedia Communications, University of Plymouth, Plymouth,
More informationSqueeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft
Squeeze Play: The State of Ady0 Cmprshn Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft Agenda Why compress? The tools at present Measuring success A glimpse of the future
More informationUnit Title: Capture Pictures and Sound for Non-Linear Editing
Unit Credit Value: 8 Unit Level: Three Unit Guided Learning Hours: 50 Ofqual Unit Reference Number: D/600/8457 Unit Review Date: 31/12/2016 Unit Sector: 9.3 Media and Communications Unit Summary In this
More informationScalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC
Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC Ralf Geiger 1, Gerald Schuller 1, Jürgen Herre 2, Ralph Sperschneider 2, Thomas Sporer 1 1 Fraunhofer IIS AEMT, Ilmenau, Germany 2 Fraunhofer
More informationPerformance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS
Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS Outline Overview of Dirac Overview of AVS-china Overview
More informationITEC310 Computer Networks II
ITEC310 Computer Networks II Chapter 29 Multimedia Department of Information Technology Eastern Mediterranean University 2/75 Objectives After completing this chapter you should be able to do the following:
More informationDESCRIPTION FEATURES MULTISTREAM PCI SOUND CARDS 26 DECEMBER 2007 ASI6514, ASI6518
26 DECEMBER 2007 ASI6514, ASI6518 MULTISTREAM PCI SOUND CARDS DESCRIPTION The ASI6514 and ASI6518 are professional PCI sound cards designed for use in radio broadcast automation. Providing up to 16 play
More informationUsing Noise Substitution for Backwards-Compatible Audio Codec Improvement
Using Noise Substitution for Backwards-Compatible Audio Codec Improvement Colin Raffel AES 129th Convention San Francisco, CA February 16, 2011 Outline Introduction and Motivation Coding Error Analysis
More informationParametric Coding of High-Quality Audio
Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits
More informationChapter 28. Multimedia
Chapter 28. Multimedia 28-1 Internet Audio/Video Streaming stored audio/video refers to on-demand requests for compressed audio/video files Streaming live audio/video refers to the broadcasting of radio
More informationThe ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1
The ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents 1 N. Adami, A. Bugatti, A. Corghi, R. Leonardi, P. Migliorati, Lorenzo A. Rossi, C. Saraceno 2 Department of Electronics
More informationAudio and video compression
Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and
More informationAudio-Visual Content Indexing, Filtering, and Adaptation
Audio-Visual Content Indexing, Filtering, and Adaptation Shih-Fu Chang Digital Video and Multimedia Group ADVENT University-Industry Consortium Columbia University 10/12/2001 http://www.ee.columbia.edu/dvmm
More informationAudio-Visual Content Indexing, Filtering, and Adaptation
Audio-Visual Content Indexing, Filtering, and Adaptation Shih-Fu Chang Digital Video and Multimedia Group ADVENT University-Industry Consortium Columbia University 10/12/2001 http://www.ee.columbia.edu/dvmm
More informationMODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.
MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,
More informationFrame Wise Video Editing based on Audio-Visual Continuity
Frame Wise Video Editing based on Audio-Visual Continuity Tatsunori Hirai Abstract In this paper, we describe a method for freely changing the length of a video clip, leaving its content almost unchanged,
More informationGated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver
Gated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver E.Kanniga 1, N. Imocha Singh 2,K.Selva Rama Rathnam 3 Professor Department of Electronics and Telecommunication, Bharath
More informationGETTING STARTED WITH DJCONTROL COMPACT AND DJUCED 18
GETTING STARTED WITH DJCONTROL COMPACT AND DJUCED 18 INSTALLATION Connect the DJControl Compact to your computer Install the DJUCED 18 software Launch the DJUCED 18 software More information (forums, tutorials,
More informationBrian F. Cooper. Distributed systems, digital libraries, and database systems
Brian F. Cooper Home Office Internet 2240 Homestead Ct. #206 Stanford University cooperb@stanford.edu Los Altos, CA 94024 Gates 424 http://www.stanford.edu/~cooperb/app/ (408) 730-5543 Stanford, CA 94305
More informationResearch on Construction of Road Network Database Based on Video Retrieval Technology
Research on Construction of Road Network Database Based on Video Retrieval Technology Fengling Wang 1 1 Hezhou University, School of Mathematics and Computer Hezhou Guangxi 542899, China Abstract. Based
More informationOptical Storage Technology. MPEG Data Compression
Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the
More informationDRA AUDIO CODING STANDARD
Applied Mechanics and Materials Online: 2013-06-27 ISSN: 1662-7482, Vol. 330, pp 981-984 doi:10.4028/www.scientific.net/amm.330.981 2013 Trans Tech Publications, Switzerland DRA AUDIO CODING STANDARD Wenhua
More informationCompleting the Multimedia Architecture
Copyright Khronos Group, 2011 - Page 1 Completing the Multimedia Architecture Erik Noreke Chair of OpenSL ES Working Group Chair of OpenMAX AL Working Group Copyright Khronos Group, 2011 - Page 2 Today
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationInteractive Progressive Encoding System For Transmission of Complex Images
Interactive Progressive Encoding System For Transmission of Complex Images Borko Furht 1, Yingli Wang 1, and Joe Celli 2 1 NSF Multimedia Laboratory Florida Atlantic University, Boca Raton, Florida 33431
More informationAudio, IEC, and the AES. Audio Engineering Society Standards Bruce C. Olson, AESSC SC Dr. Richard Cabot, AESSC SM
Audio Engineering Society Standards Bruce C. Olson, AESSC SC Dr. Richard Cabot, AESSC SM AESSC A bit of terminology Audio Engineering Society Standards Committee AESSC SC AESSC Standards Chair AESSC SM
More informationBecause of the good performance of vocoder and the potential
FINAL REVIEW ABOUT APPLIED FFT APPROACH IN PHASE VOCODER TO ACHIEVE TIME/PITCH SCALING Digital Audio Systems, DESC9115, 2018 Graduate Program in Audio and Acoustics Sydney School of Architecture, Design
More informationMultimedia Data and Its Encoding
Lecture 13 Multimedia Data and Its Encoding M. Adnan Quaium Assistant Professor Department of Electrical and Electronic Engineering Ahsanullah University of Science and Technology Room 4A07 Email adnan.eee@aust.edu
More informationExplicit consistency constraints for STFT spectrograms and their application to phase reconstruction
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction Jonathan Le Roux, obutaka Ono and Shigeki Sagayama Graduate School of Information Science and Technology,
More informationCar Information Systems for ITS
Car Information Systems for ITS 102 Car Information Systems for ITS Kozo Nakamura Ichiro Hondo Nobuo Hataoka, Ph.D. Shiro Horii OVERVIEW: For ITS (intelligent transport systems) car information systems,
More informationBaseball Game Highlight & Event Detection
Baseball Game Highlight & Event Detection Student: Harry Chao Course Adviser: Winston Hu 1 Outline 1. Goal 2. Previous methods 3. My flowchart 4. My methods 5. Experimental result 6. Conclusion & Future
More informationSpeech Synthesis. Simon King University of Edinburgh
Speech Synthesis Simon King University of Edinburgh Hybrid speech synthesis Partial synthesis Case study: Trajectory Tiling Orientation SPSS (with HMMs or DNNs) flexible, robust to labelling errors but
More informationExperiments in computer-assisted annotation of audio
Experiments in computer-assisted annotation of audio George Tzanetakis Computer Science Dept. Princeton University en St. Princeton, NJ 844 USA +1 69 8 491 gtzan@cs.princeton.edu Perry R. Cook Computer
More informationStreaming Media. Advanced Audio. Erik Noreke Standardization Consultant Chair, OpenSL ES. Copyright Khronos Group, Page 1
Streaming Media Advanced Audio Erik Noreke Standardization Consultant Chair, OpenSL ES Copyright Khronos Group, 2010 - Page 1 Today s Consumer Requirements Rich media applications and UI - Consumer decisions
More informationLock vs. Lock-free Memory Project proposal
Lock vs. Lock-free Memory Project proposal Fahad Alduraibi Aws Ahmad Eman Elrifaei Electrical and Computer Engineering Southern Illinois University 1. Introduction The CPU performance development history
More informationENHANCED GENERIC FOURIER DESCRIPTORS FOR OBJECT-BASED IMAGE RETRIEVAL
ENHANCED GENERIC FOURIER DESCRIPTORS FOR OBJECT-BASED IMAGE RETRIEVAL Dengsheng Zhang and Guojun Lu Gippsland School of Computing and Info Tech Monash University Churchill, Victoria 3842 dengsheng.zhang,
More informationWorkshops. 1. SIGMM Workshop on Social Media. 2. ACM Workshop on Multimedia and Security
1. SIGMM Workshop on Social Media SIGMM Workshop on Social Media is a workshop in conjunction with ACM Multimedia 2009. With the growing of user-centric multimedia applications in the recent years, this
More informationV. Zetterberg Amlab Elektronik AB Nettovaegen 11, Jaerfaella, Sweden Phone:
Comparison between whitened generalized cross correlation and adaptive filter for time delay estimation with scattered arrays for passive positioning of moving targets in Baltic Sea shallow waters V. Zetterberg
More informationDESIGN AND IMPLEMENTATION OF A REAL TIME HIGH QUALITY DV DIGITAL VIDEO SOFTWARE ENCODER
EC-VIP-MC 2003.4th EURASIP Conference focused on Video I Image Processing and Multimedia Communications. 2-5 July 2003, Zagreb, Croatia DESIGN AND IMPLEMENTATION OF A REAL TIME HIGH QUALITY DV DIGITAL
More informationOn Performance Evaluation of Reliable Topology Control Algorithms in Mobile Ad Hoc Networks (Invited Paper)
On Performance Evaluation of Reliable Topology Control Algorithms in Mobile Ad Hoc Networks (Invited Paper) Ngo Duc Thuan 1,, Hiroki Nishiyama 1, Nirwan Ansari 2,andNeiKato 1 1 Graduate School of Information
More informationPCIe/104 or PCI/104-Express 4-Channel Audio/Video Codec Model 953 User's Manual Rev.C September 2017
PCIe/104 or PCI/104-Express 4-Channel Audio/Video Codec Model 953 User's Manual Rev.C September 2017 Table of Contents LIMITED WARRANTY...3 SPECIAL HANDLING INSTRUCTIONS...4 INTRODUCTION...5 SYSTEM REQUIREMENTS...5
More informationIMAGE COMPRESSION USING HYBRID TRANSFORM TECHNIQUE
Volume 4, No. 1, January 2013 Journal of Global Research in Computer Science RESEARCH PAPER Available Online at www.jgrcs.info IMAGE COMPRESSION USING HYBRID TRANSFORM TECHNIQUE Nikita Bansal *1, Sanjay
More informationA NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO
International journal of computer science & information Technology (IJCSIT) Vol., No.5, October A NEW DCT-BASED WATERMARKING METHOD FOR COPYRIGHT PROTECTION OF DIGITAL AUDIO Pranab Kumar Dhar *, Mohammad
More informationIMPLEMENTATION OF A FAST MPEG-2 COMPLIANT HUFFMAN DECODER
IMPLEMENTATION OF A FAST MPEG-2 COMPLIANT HUFFMAN ECOER Mikael Karlsson Rudberg (mikaelr@isy.liu.se) and Lars Wanhammar (larsw@isy.liu.se) epartment of Electrical Engineering, Linköping University, S-581
More informationMPEG-1 Bitstreams Processing for Audio Content Analysis
ISSC, Cork. June 5- MPEG- Bitstreams Processing for Audio Content Analysis Roman Jarina, Orla Duffner, Seán Marlow, Noel O Connor, and Noel Murphy Visual Media Processing Group Dublin City University Glasnevin,
More informationMobile Operating Systems Lesson 01 Operating System
Mobile Operating Systems Lesson 01 Operating System Oxford University Press 2007. All rights reserved. 1 Operating system (OS) The master control program Manages all software and hardware resources Controls,
More informationAn Improved Image Resizing Approach with Protection of Main Objects
An Improved Image Resizing Approach with Protection of Main Objects Chin-Chen Chang National United University, Miaoli 360, Taiwan. *Corresponding Author: Chun-Ju Chen National United University, Miaoli
More informationPhysical Modeling Synthesis of Sound. Adapted from Perry R. Cook Princeton Computer Science (also Music)
Physical Modeling Synthesis of Sound Adapted from Perry R. Cook Princeton Computer Science (also Music) prc@cs.princeton.edu www.cs.princeton.edu/~prc 1 One View of Sound Sound is a waveform, it, store
More informationSPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL
SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,
More informationDIGITIZING ANALOG AUDIO SOURCES USING AUDACITY
DIGITIZING ANALOG AUDIO SOURCES USING AUDACITY INTRODUCTION There are many ways to digitize and edit audio, all of which are dependant on the hardware and software used. This workflow provides instructions
More information14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP
TRADEOFF BETWEEN COMPLEXITY AND MEMORY SIZE IN THE 3GPP ENHANCED PLUS DECODER: SPEED-CONSCIOUS AND MEMORY- CONSCIOUS DECODERS ON A 16-BIT FIXED-POINT DSP Osamu Shimada, Toshiyuki Nomura, Akihiko Sugiyama
More informationAdaptive Methods for Distributed Video Presentation. Oregon Graduate Institute of Science and Technology. fcrispin, scen, walpole,
Adaptive Methods for Distributed Video Presentation Crispin Cowan, Shanwei Cen, Jonathan Walpole, and Calton Pu Department of Computer Science and Engineering Oregon Graduate Institute of Science and Technology
More informationAudio-coding standards
Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.
More informationChapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution
Chapter 2 Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution Sangita Roy, Dola B. Gupta, Sheli Sinha Chaudhuri and P. K. Banerjee Abstract In the last
More informationSTUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)
STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC) EE 5359-Multimedia Processing Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) OBJECTIVE A study, implementation and comparison
More information