Algorithms for audio watermarking and steganography

Feb 14, 2017






    Department of Electrical and Information Engineering,

    Information Processing Laboratory, University of Oulu

    OULU 2004



    Academic Dissertation to be presented with the assent of the Faculty of Technology, University of Oulu, for public discussion in Kuusamonsali (Auditorium YB210), Linnanmaa, on June 29th, 2004, at 12 noon.


  • Copyright © 2004 University of Oulu, 2004

    Supervised by Professor Tapio Seppänen

    Reviewed by Professor Aarne Mämmelä and Professor Min Wu

    ISBN 951-42-7383-4 (nid.)

    ISBN 951-42-7384-2 (PDF)

    ISSN 0355-3213


  • Cvejic, Nedeljko, Algorithms for audio watermarking and steganography. Department of Electrical and Information Engineering, Information Processing Laboratory, University of Oulu, P.O. Box 4500, FIN-90014 University of Oulu, Finland. 2004. Oulu, Finland

    Abstract

    Broadband communication networks and multimedia data available in a digital format have opened many challenges and opportunities for innovation. Versatile and simple-to-use software and decreasing prices of digital devices have made it possible for consumers from all around the world to create and exchange multimedia data. Broadband Internet connections and near error-free transmission of data enable people to distribute large multimedia files and make identical digital copies of them. Because perfect reproduction is possible in the digital domain, the protection of intellectual ownership and the prevention of unauthorized tampering with multimedia data have become an important technological and research issue.

    Digital watermarking has been proposed as a new, alternative method to enforce intellectual property rights and protect digital media from tampering. Digital watermarking is defined as imperceptible, robust and secure communication of data related to the host signal, which includes embedding into and extraction from the host signal. The main challenge in digital audio watermarking and steganography is that if the perceptual transparency parameter is fixed, the design of a watermark system cannot obtain high robustness and a high watermark data rate at the same time. In this thesis, we address three research problems on audio watermarking: First, what is the highest watermark bit rate obtainable under the perceptual transparency constraint, and how can the limit be approached? Second, how can the detection performance of a watermarking system be improved using algorithms based on communications models for that system? Third, how can overall robustness to attacks on a watermark system be increased using attack characterization at the embedding side? An approach that combines theoretical consideration and experimental validation, including digital signal processing, psychoacoustic modeling and communications theory, is used in developing algorithms for audio watermarking and steganography.

    The main results of this study are the development of novel audio watermarking algorithms with state-of-the-art performance and an acceptable increase in computational complexity. The algorithms' performance is validated in the presence of the standard watermarking attacks. The main technical solutions include algorithms for embedding high data rate watermarks into the host audio signal, using channel models derived from communications theory for watermark transmission, and the detection and modeling of attacks using an attack characterization procedure. The thesis also includes a thorough review of the state-of-the-art literature in digital audio watermarking.

    Keywords: audio watermarking, digital rights management, information hiding, steganography

  • To my family

  • Preface

    The research related to this thesis has been carried out at the MediaTeam Oulu Group (MT) and the Information Processing Laboratory (IPL), University of Oulu, Finland. I joined the MediaTeam in December 2000 and started my postgraduate studies, leading to the thesis, at the Department of Electrical and Information Engineering in April 2001. Professor Jaakko Sauvola, the director of the MT, docent Timo Ojala, the associate director of the MT, and professor Tapio Seppänen, the MT's scientific director, are acknowledged for creating the inspiring research environment of the MT.

    I was fortunate to have professor Tapio Seppänen, who was at the time the head of the IPL, as my thesis supervisor. His pursuit of the highest standards in research was a great source of my motivation. I wish to thank him for his guidance and encouragement, especially during the starting period of my postgraduate study.

    I am grateful to the reviewers of the thesis, professor Min Wu from the University of Maryland, College Park, USA, and professor Aarne Mämmelä from the Technical Research Centre of Finland (VTT), Oulu, Finland. Their feedback improved the quality of the thesis significantly. I am also thankful to Lic. Phil. Pertti Väyrynen for proofreading the manuscript.

    I am thankful to my project managers and team leaders Jani Korhonen, Anja Keskinarkaus and Mikko Löytynoja for knowing how to distribute my workload related to the projects and for letting me carry out research and study that was not always in the narrow scope of the project. I would especially like to thank Timo Ojala for his trust and support throughout these years. He invested a lot of time and patience in solving numerous practical problems and in making my life in Oulu more pleasant. He would always find time for my dilemmas and our discussions that ranged from research issues to the latest happenings in the Premier League.

    My special thanks are due to my friends with whom I spent my spare time in Oulu. My first neighbors Ilijana and Djordje Tujkovic were a great source of support and happiness for me. Ilijana was my closest friend, who had enough patience to help with all the issues emerging from my immature personality. Djordje, being himself a researcher, was not only a friend to me; he also gave me much advice that had a positive impact on the length of my PhD studies. Anita and Dejan Danilovic, although working hard 12 hours a day, would always find some extra time to hang out with me. I thank them for all the great late night hours we spent together, their sincere friendship and enormous moral support throughout my studies. The largest part of this thesis was made using the PC that I borrowed from them. Dejan Drajic and Zoran Vukcevic, besides being my friends, had a specific role of familiarizing me with Finland and giving me advice that helped me a lot in everyday life. Dejan Drajic and Jonne Miettunen were my favorite pub mates and "football experts" that I liked to argue with. I thank Sharat Khungar for all the late lunches we had together in Aularavintola and all the new things I learned about the culture of the Indian subcontinent.

    I also wish to thank the Protic family, my first cousins Nemanja and Aleksandar and my aunt Jelena and uncle Zivadin. Thank you for your love and support, not only during my PhD studies, but also throughout the hard times my family went through.

    The financial support provided by Infotech Oulu Graduate School, Nokia, Sonera, Yomi, the National Technology Agency of Finland (TEKES), the Nokia Foundation, and the Tauno Tönning Foundation is gratefully acknowledged.

    It is hard to find words to express my gratitude to my loving parents, Bogdanka and Slavko, for everything they have done for me. Thank you for your love, guidance, and the encouragement that you have unquestioningly given to me. I sincerely thank my brother Dejan for standing by my side during all the ups and downs in my life, and for his immense support, love and trust. My dedication to hard work and my vigor to face all the good and less pleasant things that life brings, I draw from the love and support you have given me.

    Oulu, May 2004 Nedeljko Cvejic

  • List of Contributions

    This thesis is based on the ten original papers (Appendices I–X), which are referred to in the text by Roman numerals. All analysis and simulation results presented in the publications or this thesis have been produced solely by the author. Professor Tapio Seppänen gave guidance and needed expertise in general signal processing methods. He had an important role in the development of the initial ideas and the shaping of the final outline of the publications.

    I Cvejic N, Keskinarkaus A & Seppänen T (2001) Audio watermarking using m-sequences and temporal masking. In Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New York, NY, October 2001, p. 227–230.

    II Cvejic N & Seppänen T (2001) Improving audio watermarking performance with HAS-based shaping of pseudo-noise. In Proc. IEEE International Symposium on Signal Processing and Information Technology, Cairo, Egypt, December 2001, p. 163–168.

    III Cvejic N & Seppänen T (2002) Audio prewhitening based on polynomial filtering for optimal watermark detection. In Proc. European Signal Processing Conference, Toulouse, France, September 2002, p. 69–72.

    IV Cvejic N & Seppänen T (2002) A wavelet domain LSB insertion algorithm for high capacity audio steganography. In Proc. IEEE Digital Signal Processing Workshop, Callaway Gardens, GA, October 2002, p. 53–55.

    V Cvejic N & Seppänen T (2002) Increasing the capacity of LSB-based audio steganography. In Proc. IEEE International Workshop on Multimedia Signal Processing, St. Thomas, VI, December 2002, p. 336–338.

    VI Cvejic N & Seppänen T (2003) Audio watermarking using attack characterization. Electronics Letters 13(39): p. 1020–1021.

    VII Cvejic N, Tujkovic D & Seppänen T (2003) Increasi