Blind watermark estimation attack for spread spectrum watermarking.This paper presents an efficient scheme for blind watermark watermark: see paper. See digital watermark. estimation embedded using additive watermark embedding 1. (mathematics) embedding - One instance of some mathematical object contained with in another instance, e.g. a group which is a subgroup. 2. (theory) embedding - (domain theory) A complete partial order F in [X -> Y] is an embedding if methods. The scheme exploits mutual independence between the host media and the embedded watermark and non-Gaussianity of the host media for watermark estimation. The proposed scheme employs the framework of independent component analysis (ICA Ica (ē`kä), city (1993 pop. 108,724), capital of Ica dept., SW Peru, on the Pan-American Highway. It is a commercial center for the cotton, wool, and wine produced in the region. There are several summer resorts nearby. ) and poses the problem of watermark estimation as a blind source separation (BSS See 802.11. BSS - Block Started by Symbol ) problem. Analysis of the scheme shows that the proposed detector significantly outperforms existing correlation-based blind detectors traditionally used for SS-based watermarking. The proposed ICA-based blind detection/decoding scheme has been simulated using real-world audio dips. The simulation results show that the proposed ICA-based method can detect and decode (1) To convert coded data back into its original form. Contrast with encode. (2) Same as decrypt. See cryptography. (cryptography) decode - To apply decryption. watermark with extremely low decoding de·code tr.v. de·cod·ed, de·cod·ing, de·codes 1. To convert from code into plain text. 2. To convert from a scrambled electronic signal into an interpretable one. 3. bit error probability (less than 0.01) against common watermarking attacks and benchmark degradations. Keywords: Spread-spectrum watermarking, independent component analysis, blind source separation, watermark estimation, detection, decoding Povzetek: Opisana je metoda odkrivanja vodnega tiska. 1 Introduction Digital forgeries and unauthorized sharing of digital media have emerged as a growing concern over the last decade. The widespread use of multimedia information is aided by factors such as the growth of the Internet, the Internet, the, international computer network linking together thousands of individual networks at military and government agencies, educational institutions, nonprofit organizations, industrial and financial corporations of all sizes, and commercial enterprises proliferation proliferation /pro·lif·er·a·tion/ (pro-lif?er-a´shun) the reproduction or multiplication of similar forms, especially of cells.prolif´erativeprolif´erous pro·lif·er·a·tion n. of low-cost and reliable storage devices, the deployment of seamless broadband networks, the availability of state-of-the-art digital media production and editing technologies, and the development of efficient multimedia compression algorithms. Multimedia piracy has subjected the entertainment industry to enormous annual revenue losses. For example, music industry alone claims multi-million illegal music downloads on the Internet every week. It is therefore imperative to have robust technologies to protect copyrighted digital media from illegal sharing and tampering tampering The adulteration of a thing. See Drug tampering. . Traditional digital data protection techniques, such as encryption The reversible transformation of data from the original (the plaintext) to a difficult-to-interpret format (the ciphertext) as a mechanism for protecting its confidentiality, integrity and sometimes its authenticity. Encryption uses an encryption algorithm and one or more encryption keys. and scrambling, alone cannot provide adequate protection as these technologies are unable to protect digital content once they are decrypted or unscrambled. Digital watermarking Digital watermarking is a technique which allows an individual to add hidden copyright notices or other verification messages to digital audio, video, or image signals and documents. technology complements cryptography for protecting digital content even after it is deciphered [1]. Digital watermarking refers to the process of imperceptible im·per·cep·ti·ble adj. 1. Impossible or difficult to perceive by the mind or senses: an imperceptible drop in temperature. 2. embedding information (watermark) into the digital object (or the host object). Existing watermarking schemes based on the watermark embedding method used can be classified into two major categories: 1. blind embedding, in which the watermark embedder does not exploit the host signal information during watermark embedding process. Watermarking schemes based on spread-spectrum (SS) [1, 2, 3, 4, 5] fall into this category. 2. informed embedding, in which the watermark embedder exploits knowledge of the host signal during watermark embedding process. Watermarking schemes based on quantization (1) The division of a range of values into a single number, code or classification. For example, class A is 0 to 999, class B is 1000 to 9999 and class C is 10000 and above. (2) In analog to digital conversion, the assignment of a number to the amplitude of a wave. index modulation [1, 6] belong to this category. Similarity, existing watermarking schemes based on the detection method used cab be classified into two major categories: 1. informed detector, which assume that the host signal is available at the detector during watermark detection process, and 2. blind detector, which assume that the host signal is not available at the detector for watermark detection. Although the performance expected from a given watermarking system depends on the target application area [1], but robustness of the embedded watermark and efficient detection are desirable features of a give watermarking scheme. In addition, fidelity (or imperception) of the embedded watermark is additional requirement of perception based watermarking schemes [1]. To meet fidelity requirement, the power of the embedded watermark (watermark strength) is generally kept much lower than the host signal power. In this paper we consider additive watermark embedding model, e.g. SS-based watermarking, where the watermark signal is added to the host signal in the marking space to obtain the watermarked signal. Existing watermark detection schemes for SS-based watermarking generally employ statistical characterization of the host signal to develop an optimal or suboptimal Suboptimal A solution is called suboptimal if a part of the solution has been optimized without regards to the overall objective. watermark detector [6, 7, 8]. It is important to mention that blind watermark detectors for SS-based watermarking perform poorly as the host-signal acts as interference at the blind decoder. Therefore, nonzero non·ze·ro adj. Not equal to zero. nonzero Not equal to zero. decoding error probability at the blind watermark decoder even in the absence of attack-channel distortion is one of the limitations of existing blind watermark detectors for SS-based watermarking schemes. This paper presents a novel blind watermark detection method for the blind additive watermark embedding schemes[l, 2, 3, 4, 5]. The main motivation of this paper is to design a blind detector for SS-based watermarking schemes capable of suppressing host-signal interference (or improving watermark-to-host ratio) at the detector, hence improving decoding as well as detection performance. Towards this end, the proposed detector uses ICA framework by posing watermark detection problem as a blind source separation (BSS) problem. The proposed detector models the received watermarked signal as a linear mixture of underlying independent components (the host signal and the watermark). It also assumes non-Gaussianity of the host signal. Recently, we have shown in [15, 16, 17] that the watermark estimation problem for SS-based watermarking can be modeled as that of BSS of underdetermined mixture of independent sources. Therefore, the ICA framework could be used to estimate the watermark from the watermarked signals obtained using additive embedding model. The proposed ICA-based detector first estimates the hidden independent components (i.e., the watermark and the host signal) from the received watermarked signal using the ICA framework, and then these estimated components are used to detect the embedded watermark. We present theoretical analysis to show that the proposed ICA-based detector performs significantly better than the existing watermark detectors operating without canceling the host signal interference at the watermark detector for watermark detection [6, 7]. Simulation results also show that the proposed detector in estimation-correlation based detection settings also outperforms the normalised normalised - normalisation correlation based detector (commonly used for watermark detection in SS-based watermarking community [1, 2, 3]) operating without host interference suppression. Simulation results presented in this paper are evaluated against variety of signal manipulations and degradations applied to the watermarked media. These signal degradations include addition of colored and white noise, resampling, requantization, lossy compression A compression technique that does not decompress data back to 100% of the original. Lossy methods provide high degrees of compression and result in very small compressed files, but there is a certain amount of loss when they are restored. , filtering, time- and frequency-scaling, and StirMark for audio benchmark attacks [20, 19, 18]. The proposed ICA-based watermark detector is applicable to SS-based watermarking of all media types, i.e. audio, video and images. However, in this paper the proposed detector is tested for digital audio (which includes music and voiced speech signals only) as the host media for watermark embedding, detection, and performance analysis. In the past ICA-based framework has been used for multimedia watermarking [9, 10, 11, 13, 14, 12]. However, existing ICA-based data-hiding schemes are either not applicable to SS-based watermarking [9, 10, 11, 13] or use an informed detection framework for watermark extraction/extraction [14, 12] therefore are not discussed in this manuscript. For example, Yu et al in [14] have proposed ICA-based watermark detector that can be used for SS-based watermarking but their detector uses the embedded watermark and a private data during watermark extraction process. Similarly, Sener et al's proposed ICA based watermark detector in [12] is also applicable to SS-based watermark detection, but their proposed detector also also requires the original watermark during watermark detection process; therefore, cannot be used for blind watermark detection/extraction applications. Rest of the paper is organized as follow: basics of SS-based watermarking are discussed in Section 2; a brief overview of the independent component analysis theory is provided in Section 3. The proposed ICA-based watermark detector along with its decoding, detection, and maximum watermarking-rate performance analysis are described in Section 4. Simulation results for decoding bit error probability performance of the proposed ICA-based watermark detector and a correlation-based detector against different attacks and signal degradations are described in Section 5. Finally the concluding remarks along with future research directions are presented in Section 6. 2 Basics of SS-based watermarking The SS based watermarking system can be modeled using a classical secure communication model [1], as shown in Fig. 1. In Fig. 1, S [member of] [R.sup.n] is a vector containing coefficients of the host signal in marking space. It is assumed that the coefficients, [S.sub.i] : i = 0, 1, ..., n - 1, are independent and identically distributed (i.i.d.) random variables (r.v.) with zero mean and variance [[sigma].sup.2.sub.s]. A watermark, V, is generated using: (1) a message bit, b [member] {[+ or -] 1}, to be embedded into n coefficients of the host signal, (2) a key-dependent pseudo-random sequence W [member of] [{[+ or -] 1}.sup.n], and (3) a perceptual mask, [alpha] [member of] [R.sup.n], estimated based on the human auditory system Noun 1. auditory system - the sensory system for hearing auditory apparatus - all of the components of the organ of hearing including the outer and middle and inner ears ear - the sense organ for hearing and equilibrium (HAS) and the host signal S, i.e. [alpha] = f(S, HAS). We further assume that the watermark sequence W and the host signal coefficients S are mutually independent. The amplitude-modulated watermark is spectrally shaped according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. perceptual mask [alpha] to meet the fidelity requirement of the perception based watermarking. The watermarked signal X is obtained by adding an amplitude-modulated watermark V = [alpha] [dot encircle en·cir·cle tr.v. en·cir·cled, en·cir·cling, en·cir·cles 1. To form a circle around; surround. See Synonyms at surround. 2. To move or go around completely; make a circuit of. ] Wb, here [dot encircle] denotes element-wise product of the two vectors, to the host signal S. The watermarked signal X can be expressed as X = S + V, (1) The embedding distortion, De can be expressed as, [D.sub.e] = X - S. (2) [FIGURE 1 OMITTED] The mean-squared embedding distortion, [d.sub.e] is expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ] (3) where || x || represents the Euclidian norm, E{x} denotes represents variance of the expected value Expected value The weighted average of a probability distribution. Also known as the mean value. of a r.v., and [[sigma].sup.2.sub.v] watermark V. The signal distortion due to an active adversary adversary traditional appellation of Satan [O.T.: Job 1:6; N.T.: I Peter 5:8] See : Devil attack can be viewed as channel noise, N, as shown in Fig. 1. The received watermarked signal at the detector, [??], [??] = X + N, (4) is processed for watermark detection. The watermarking schemes based on blind additive embedding model generally use probabilistic characterization of the host signal to develop an optimal or suboptimal watermark detector (in ML sense). The statistical characterizations of real-world host signal are available in spatial domain as well as in the transform domain. For example, stationary speech samples/coefficients both in the time domain and in the DWT DWT abbr. 1. deadweight tonnage 2. deadweight tons domain can be approximated by Laplacian distribution [21] (see Appendix A for the probability distribution Probability distribution A function that describes all the values a random variable can take and the probability associated with each. Also called a probability function. probability distribution function (pdf) of DWT coefficients) i.e., [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (5) where [beta] = [square root of (2)]/[[sigma].sub.s] The average decoding bit error probability, [P.sub.e], under zero-channel distortion scenario, i.e. [N.sub.i] = 0, can be calculated by assuming that 1. the watermarked sample [X.sub.i] is obtained by adding a binary amplitude-modulated watermark [V.sub.i], i.e. [[alpha].sub.i][W.sub.i]b, 2. the detector is based on Neyman-Pearson criterion, 3. no pre-processing is applied to the watermarked audio to suppress host interference, 4. [W.sub.i] takes values [+ or -] 1 with probability 1/2, In addition, for performance analysis we will consider two information embedding scenarios: (1) one bit b [member of] {[+ or -] 1} of information is embedded in each coefficient of the host signal, [S.sub.i], and (2) one bit b [member of] {[+ or -] 1} of information is embedding in [absolute value of [zeta]] coefficients of the host signal S, where [absolute value of [zeta]] denotes the cardinality A quantity relationship between elements. For example, one-to-one, one-to-many and many-to-one express cardinality. See cardinal number. (mathematics) cardinality - The number of elements in a set. If two sets have the same number of elements (i.e. of the selected coefficient indices set [zeta]. Consider one bit embedding per coefficient, i.e. n = 1, case first. It has been shown in [7] that the ML decoder estimates[??] = 1 if [[??].sub.0][W.sub.0] > 0 and an error will occur when [[??].sub.0][W.sub.0] < 0. The average [P.sub.e] is given by [P.sub.e] = Pr{[[??].sub.0][W.sub.0] < 0|b = 1} = [[integral].sup.0.sub.-[infinity]] [f.sub.s]([tau] - [alpha])d[tau]. (6) Assuming the Laplacian distribution model for the host, it can be shown [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (7) where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] which is generally referred as signal to-watermark ratio (SWR SWR Standing Wave Ratio (radio term) SWR Südwestrundfunk (Southern German Radio Station) SWR Shoreham Wading River SWR Short-Wave Radio SWR Software Requirements ), when expressed in dB i.e. SWR = 20[log.sub.10][lambda]. It can be observed from Eq. (7) that non-zero [P.sub.e] is not achievable even in the absence of attack-channel distortion, and [P.sub.e] = f([lambda]). In addition, the value of the parameter A determines the tradeoff between fidelity of the embedded watermark and [P.sub.e]. Consider second embedding scenario, i.e., one bit information is embedded in [absolute value of [zeta]] = n coefficients of the host. In this case the watermarked audio is given by, [X.sub.i] = [S.sub.i] + [[alpha].sub.i][W.sub.i]b, i [member of] [zeta]. (8) Let us assume that the watermarked signal used for detection is free of attack-channel distortion, and message symbols are equally probable. In this case, the ML decoder that minimizes the decoding error probability will assign decision regions [D.sub.-] and [D.sub.+] as follow, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (9) where [b.sub.+] (resp.[b.sub.-]) represent the event that binary information b = +1(resp.b = -1) is embedded in the selected indices and [??] is the masking threshold The masking threshold is the sound pressure level (SPL) of a sound you need to make hearing another in presence of a masker signal. This threshold depends on the frequency and the kind of the masker and maskee. This effect normally appears between two sounds close in frequency. estimated from watermarked audio. It is shown in Section 4 that the estimated of masking threshold from the unwatermarked and watermarked audio clip are very close given that attack-channel distortion induced into the watermarked audio is below certain threshold. It is therefore reasonable to assume that [??] [approximately equal to] [alpha]. The ML sufficient statistic, T, assuming Laplacian pdf for the host coefficients [S.sub.i], can be written as, T(x|s, [??]) = [summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument) over (i [member of] [zeta])][beta]([absolute value of [X.sub.i] + [[??].sub.i][W.sub.i]] - [absolute value of [X.sub.i] - [[??].sub.i][W.sub.i]]). (10) If b = 1 was embedded, then the sufficient statistics T can be expressed as, T(x| s, [??]) = [summation over (i [member of] [zeta])][beta]([absolute value of [S.sub.i] + 2[[??].sub.i][W.sub.i]] - [absolute value of [S.sub.i]]). (11) Here the ML detector is a bit-by-bit hard decoder, i.e., [??] = sgn(T). (12) To determine the bit error probability for this ML decoder, a statistical characterization of T is required. Here T is sum of [absolute value of [zeta]] i.i.d, random variables. Therefore, by applying the central limit theorem central limit theorem In statistics, any of several fundamental theorems in probability. Originally known as the law of errors, in its classic form it states that the sum of a set of independent random variables will approach a normal distribution regardless of the (CLT CLT total lung-thorax compliance. ), T can be approximated by the Gaussian random variable. Mean of T, E{T} can be calculated as, E{T(x| s, [??])} = [summation over (i [member of] [zeta])][beta]([E.sub.s,w]([absolute value of [S.sub.i] + 2[[??].sub.i][W.sub.i]] - [absolute value of [S.sub.i]])), (13) and variance, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (14) E{T (x| s, [??])} = [summation over (i [member of] [zeta])][beta]([Var.sub.s,w]([absolute value of [S.sub.i] + 2[[??].sub.i][W.sub.i]] - [absolute value of [S.sub.i]])), (15) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. (16) In this case, the [P.sub.e] is given as, [P.sub.e] = Q([absolute value of E{T}]/[square root of (Var{T}]), (17) where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] Eq. (17) shows that the decoding error probability [P.sub.e] is non-zero even in the absence of attack-channel distortion, and [P.sub.e] is a function of [lambda]. The above analysis also shows that the detection/decoding performance of a blind detector for additive embedding schemes is inherently bounded below by the host-signal interference at the detector. The main motivation behind this paper is to design a watermark detector for additive embedding schemes with an improved watermark detection, decoding, and maximum watermarking-rate performances by suppressing the host-signal interference at the blind detector. Towards this end, theory of ICA is used by posing watermark estimation for additive embedding as a BSS problem. The proposed framework first estimates embedding watermark using BSS based on ICA which is then used for detection and decoding. The fundamentals of the ICA theory are briefly outlined in the following section followed by the details of the proposed ICA-based detector. 3 Independent Component Analysis Independent component analysis (ICA) is a statistical framework for estimating underlying hidden factors or components of multivariate statistical data. In the ICA model, the data variables are assumed to be linear or nonlinear A system in which the output is not a uniform relationship to the input. nonlinear - (Scientific computation) A property of a system whose output is not proportional to its input. mixtures of some unknown latent variables, and the mixing system is also unknown [23, 22]. The hidden variables Hidden variables Additional variables or parameters that would supplement quantum mechanics so as to make it like classical mechanics. Hidden variables would make it possible to unambiguously predict (as in classical mechanics) the result of a specific are also assumed to be non-Gaussian and mutually independent. The ICA model can be considered as an extension of the principal component analysis (PCA) and factor analysis [23, 22]. In fact, ICA can be treated as non-Gaussian factor analysis, since data is modeled as a linear mixture of underlying non-Gaussian factors. The ICA framework has been used in diverse application scenarios including blind source separation (BSS), feature extraction In pattern recognition and in image processing, Feature extraction is a special form of dimensionality reduction. When the input data to an algorithm is too large to be processed and it is suspected to be notoriously redundant (much data, but not much information) then the , telecommunication, and economics [23, 22]. In the following we will review only the linear ICA framework since only that is relevant to the SS-based watermarking model. In general, the linear ICA model can be defined for noise-free as well as noisy scenarios as follows. Noise-free ICA model: ICA of a random vector X [member of] [R.sup.m] consists of estimating the following generative model A generative model is a model for randomly generating observed data, typically given some hidden parameters. It defines a joint probability distribution over observation and label sequences. Generative models are used in machine learning for either modeling data directly (i.e. of the data: X = AS, (18) where X represents n-realizations of the observed m-dimensional random vector, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the hidden random variables and A [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is mixing matrix. The hidden variables, [S.sup.(i),] in the vector [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] are assumed statistically independent. Noisy ICA model: ICA of a random vector X consists of estimating the following generative model of the data: X = AS + N, (19) where N is n-realizations of an m-dimensional random noise, while X, S, and A are the same as in the noise-free model in Eq. (18). In this paper, we use the noisy ICA generative model to design an ICA-based watermark detector for SS-based watermarking schemes. The proposed ICA-based watermark detector attempts to estimate the embedded watermark from the watermarked signal while reducing the host-signal interference at the watermark detector. Before estimating the underlying independent components from observed data using ICA framework, the generative model should meet certain conditions to ensure the identifiability of the ICA model. The identifiability constraints defined in [22, 24, 25, 29, 26, 27] underdetermined ICA (UICA UICA Urban Institute for Contemporary Arts (Grand Rapids, Michigan) UICA Underground Injection Control Act ) model are outlined below: 1. Statistical independence: The bidden (latent) variables/sources are statistically independent. 2. Non-Gaussianity: At most one of the underlying independent components S(i), i = 1, 2. ... [n.sub.1], is normally distributed. Therefore, independence and maximum non-Gaussianity are two fundamental ingredients of the UICA framework. Independence of the underlying components is one of the assumptions that is made to estimate components from the linear mixture. Note that independence of the underlying components is a stronger condition than uncorrelatedness, e.g., for the BSS problem, there might be many dependent but uncorrelated representations of the observed signals and these uncorrelated but dependent representations of the observed signals cannot separate the mixed sources [22]. Therefore, uncorrelatedness itself is insufficient to solve the BSS problem. In fact, independence implies nonlinear uncorrelatedness [22], that is, if [S.sup.(1)] and [S.sup.(2)] are two independent components then any nonlinear transformations of these components, say, [[phi].sub.1]([S.sup.(1)]) and [[phi].sub.2]([S.sup.(2)]), are uncorrelated as well (i.e. their covariance Covariance A measure of the degree to which returns on two risky assets move in tandem. A positive covariance means that asset returns move together. A negative covariance means returns vary inversely. is zero). On the other hand, if [S.sup.(1)] and [S.sup.(2)] are assumed to be just uncorrelated then in general, the corresponding nonlinear transformations do not necessarily have zero covariance. Thus to perform ICA, a stronger form of decorrelation of the underlying components is required, that is, nonlinear decorrelation. A suitable selection of nonlinearities, i.e. [[phi].sub.1](x) and [[phi].sub.2](x), can be achieved by using tools like maximum likelihood and mutual information from estimation theory Estimation theory is a branch of statistics and signal processing that deals with estimating the values of parameters based on measured/empirical data. The parameters describe the physical scenario or object that answers a question posed by the estimator. and information theory [22]. Maximum non-Gaussianity is another important requirement of ICA-based hidden components estimation [23, 22, 38, 30]. A quantity kurtosis Kurtosis A statistical measure used to describe the distribution of observed data around the mean. Notes: Used generally in the statistical field, it describes trends in charts. defined in terms of the fourth-order central moment [kappa Kappa Used in regression analysis, Kappa represents the ratio of the dollar price change in the price of an option to a 1% change in the expected price volatility. Notes: Remember, the price of the option increases simultaneously with the volatility. ] is generally used as a measure of non-Gaussianity of a random variable. Kurtosis of a real random variable S can be defined as, [kappa] = (E{[(S - E(S)).sup.4]}/[E.sup.2]{[(S - E(S)).sup.2]}) - 3. (20) A normal random variable Normal random variable A random variable that has a normal probability distribution. has zero kurtosis; therefore, kurtosis is a measure of the distance of a random variable from a Gaussian distribution. Distributions that are peakier (flatter) about the mean than a Gaussian distribution generally have positive (negative) kurtosis. Random variables with positive kurtosis, i.e. [kappa] > 0, are generally called super-Gaussian. The Laplacian distribution is a typical example of this case. Random variables with negative kurtosis value, i.e., [kappa] < 0 are called sub-Gaussian, e.g., the uniform distribution. The BSS is one of the most widely explored applications of the ICA model [23, 22]. In case of BSS using ICA framework, the recovery of the underlying sources relies on the assumption that the constituent sources are mutually independent. The cocktail party problem is a classical example of BSS, where several people are simultaneously speaking in the same room and objective is to separate voices of different speakers using microphone recordings (in the room). In order to illustrate the idea [n.sub.1] speakers (sources) are considered here. The observation X [member of] [R.sup.m x n] is generated by mixing sources [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] by a mixing matrix [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. The static linear mixing model can be expresses as, [X.sub.i] = [AS.sub.i] + [N.sub.i], i = 1,2, ..., n (21) The aim of BSS is the recover the underlying sources [S.sup.(1)], l = 1, 2, ... , [n.sub.1] from the observation X only. The ICA achieves the separation relying on the assumption that the underlying sources are mutually independent. To this end the ICA framework finds a linear representation in which the underlying components are statistically independent. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke" put differently , BSS using ICA tries to estimate the demixing (separating) matrix, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], from the observed data X. The estimated demixing matrix is the inverse (or generalized inverse In mathematics, a generalized inverse or pseudoinverse of a matrix A is a matrix that has some properties of the inverse matrix of A but not necessarily all of them. The term "the pseudoinverse" commonly means the Moore-Penrose pseudoinverse. ) of mixing matrix A, i.e., [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. Most of existing BSS schemes using ICA model are based on the information-theoretic framework. For example, Bell et al's [21] ICA scheme is based on the idea of information maximization, or infomax among the estimated independent components. P. Comon in [23] has used higher-order cumulants whereas, Gaeta et al in [28] used ML estimation framework for BSS. Many existing BSS methods are extensions of infomax, higher-order cumulants, and ML method [23, 22]. 4 Proposed ICA Based Watermark Detector The proposed ICA-based watermark detector consists of two stages: 1) watermark estimation stage, and, 2) watermark decoding and/or detection stage. The watermark estimation stage estimates watermark V from the received watermarked audio [??] using ICA framework, whereas, the watermark decoding (resp. detection) stage decodes (resp. detects) the embedded watermark using the ML approach. The block diagram A chart that contains squares and rectangles connected with arrows to depict hardware and software interconnections. For program flow charts, information system flow charts, circuit diagrams and communications networks, more elaborate graphical representations are usually used. of the proposed watermark detector is given in the Fig. 2. [FIGURE 2 OMITTED] In general the ICA model for BSS estimates the demixing matrix [??] from the observed data X, and hence the underlying independent components [[??].sup.(i)]. This model is extendable to the watermark estimation problem for the watermarked signal, assuming identifiability conditions of the UICA model are satisfied. To verify whether the additive embedding model (Eq. (1)) satisfies the identifiability constraints of an ICA model, rewrite Eq. (1) with b = 1, i.e., X = S + [alpha][dot encircle]W. The non-Gaussianity of the host signal and the watermark is the only requirement to satisfy constraints of UICA. As mentioned in Section (2) that real-world audio samples/coefficients in the time domain as well as in the DWT domain can be approximated by the Laplacian distribution (see Appendix A) and therefore if the watermark, W, is generated based on some non-Gaussian distribution then non-Gaussianity constraint of UICA model is satisfied as well. Once the identifiability conditions of the UICA model are satisfied, the noisy ICA model can be extended to estimate the watermark from the watermarked audio generated using Eq. (1). 4.1 Watermark Estimation For watermark estimation, the proposed watermark detector first estimates the watermark-mixing matrix [??] which is then to estimate the underlying independent components (i.e., the host signal S and the watermark W). An estimate of the watermark-mixing matrix, [??], is usually obtained by optimizing a highly nonlinear function of the hidden sources also known as contrast function [23, 22]. The pseudo-inverse of the estimated watermark-mixing matrix [[??].sup.[dagger]] is applied to the observed mixture to estimate the host signal [??] and the watermark [??]. However, as noted earlier, in the case of blind detectors for SS-based watermarking schemes, watermark estimation using ICA framework is a degenerate case, i.e., m < [[eta].sub.1]. Therefore, just the estimation of watermark-mixing matrix is insufficient to separate the underlying independent components perfectly. In the case of additive embedding, the equation X = [??]S has an affine af·fine adj. Mathematics 1. Of or relating to a transformation of coordinates that is equivalent to a linear transformation followed by a translation. 2. Of or relating to the geometry of affine transformations. set of solutions [34]. A preferred solution in this affine set is generally selected using probabilistic prior model of the independent components [39]. The performance of the proposed ICA-based watermark estimator depends on the separation quality of the separated (estimated) watermark. The separation quality of the separated source is generally measured in terms of, 1) source-to-interference ratio (watermark-to-interference ratio (WIR WIR Wilhelm Imaging Research, Inc. WIR When It's Ready (Borland) WIR Walk in Robe (real estate ads, Australia) WIR World In Review (news magazine) WIR Weekly Intelligence Review ), in case of watermark estimation), source-to-noise ratio, and 2) source-to-artifact ratio (for further details on these separation quality measures please see [35] and references therein). For performance analysis of the proposed ICA detector, only WIR distortion measure is considered here; therefore, the estimated watermark can be expressed as [[??].sub.i] = [[eta].sub.1i][[alpha].sub.i][W.sub.i]b + [S.sup.interf.sub.i], (22) where [[eta].sub.1i] [member of] R, 0 < [[eta].sub.1i] [less than or equal to] 1 and [S.sup.interf.sub.i] is interference due to the host signal. Let [S.sup.interf.sub.i] = [[eta].sub.2i][S.sub.i], [[eta].sub.2i] [member of] R, and 0 < [[eta].sub.2i] [less than or equal to] 1 then Eq. (22) can be rewritten as, [[??].sub.i] = [[eta].sub.1i][[alpha].sub.i][W.sub.i]b + [[eta].sub.2i][S.sub.i]. (23) The relative distortion due to interference in the estimated watermark is defined as, [D.sup.interf] = [([[eta].sub.1]/[[eta].sub.2]).sup.2], (24) where WIR = 10 [log.sub.10] ([D.sup.interf]) dB. In general, [D.sup.interf] > 0 dB for most of existing BSS schemes based on ICA framework [34, 35]. Several researchers have proposed elegant BSS algorithms based on ICA model for noisy data [38, 36, 31], these algorithms can be used for watermark estimation from the watermark audio. Among these, the FastICA for noisy data [38] is used in this paper due to its better computational and separation quality performance over existing algorithms [34]. It can be observed from Eq. (23) that the ICA stage acts as a pre-processing stage that suppresses the host interference or improves watermark-to-host ratio. Once estimated watermark [??] is available, an optimal detector can be designed based on the statistics of V for watermark detection (resp. decoding). It is important to notice that ICA based pre-processing stage uses constraints like mutual independence of the underlying sources, non-Gaussianity, and multichannel observation i.e. m [greater than or equal to] 2. A constrained optimization of highly nonlinear cost function e.g. tanh tanh abbr. hyperbolic tangent tanh Abbreviation of hyperbolic tangent (x), x exp exp abbr. 1. exponent 2. exponential ([-x.sup.2]), etc. is used to suppress the host interference in the estimated watermark [22, 23]. In addition, under practical scenarios, BSS using ICA also requires reasonably large number of data samples n to separate the underlying sources. Therefore, ICA based pre-processing to suppress host interference is inherently different from filtering based pre-processing schemes i.e., optimal linear filtering [44], wiener filtering, non-linear filtering, etc. The ICA-based pre-processing stage is to improve watermark-to-host ratio hence expected to improve the detection performance [41, 42]. It is however important to mention that improvement comes at the cost of higher computational power. In the following subsections we analyze the performance of the proposed ICA-based detector in terms of three parameters: (1) detection rate in terms of false positives and true positives 4.2, (2) decoding error probability 4.3, and (3) maximum watermarking rate 4.4. 4.2 Watermark Detection: Performance Analysis A watermark detector is generally characterized by two performance measures: the probability of false alarm [P.sub.F] and the probability of detection The Probability of Detection is a term used in Radar sets. The radar system must detect, with greater than or equal to 80% probability at a definied range, a one square meter radar cross section. The received and demodulated echo signal is processed by a threshold logic. [P.sub.D]. The probability of detection represents the probability of deciding on the presence of a watermark when the received audio indeed contains a watermark. The probability of false alarm represents chances of deciding the presence of a watermark when in fact the received audio does not contain a watermark. The watermark detection process can be treated as a binary decision problem in the presence or absence of attack-channel distortions. We first consider the case where the received watermarked audio has not suffered attack-channel distortion. The estimated watermark is given by, [[??].sub.i] = [[eta].sub.1i][[alpha].sub.i][W.sub.i]b + [[eta].sub.2i][S.sub.i], i [member of] [zeta]. (25) In this scenario, the watermark detection can be formulated as a binary hypotheses test, [H.sub.1] : [[??].sub.i] = [[eta].sub.1i][[??].sub.i][W.sub.i]b + [[eta].sub.2i][S.sub.i] [H.sub.0] : [V.sub.i] = [[eta].sub.2i][S.sub.i], i [member of] [zeta]. (26) In this detection problem, the watermark W is the target signal and host interference, [[eta].sub.2]S acts as additive noise. The goal of watermark detector is to determine presence or absence of the watermark in the estimated watermark [??] based on the statistics of S and W. Let us assume that statistics of unwatermarked and watermarked audio are same [43], therefore pdfs under each hypothesis are known. The decision rule, in this scenario is based on likelihood ratio which is given as: [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (27) where A([??]) is likelihood ratio and [??] is decision threshold. The log-likelihood is defined as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (28) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (29) where, l [member of] {[+ or -] 1}, [xi] = ln([??]) and r.v. [[??].sub.i] is defined as [[??].sub.i] = [n.sub.2i][S.sub.i]. In the above test, the decision threshold [xi] can be minimized based on Neyman-Pearson rule, that is, maximize the [P.sub.D] for a given value of [P.sub.F] [41, 42]. Assuming Laplacian distribution for the host audio, the L([??]) can be written as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (30) where [[??].sub.i] = [square root of (2)]/[[??].sub.2i][[sigma].sub.s], [[??].sub.1], and [[??].sub.2] are estimates of scaling coefficients of V and S in [??]. Estimation details of [[??].sub.1], and [[??].sub.2] are discussed in Section 4.5. The statistical characterization of L([??]) under hypothesis [H.sub.0] can be determined as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (31) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (32) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (33) Here [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the sum of [absolute value of [zeta]] statistically independent random variables that can be approximated by the Gaussian random variable based on the CLT, mean, m0 and variance, [[sigma].sup.2.sub.0] of [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is calculated as follows, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (34) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (35) Averaging Eq. (36) over W, we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (36) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (37) equation can be rewritten as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (38) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (39) Averaging it over [??] we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (40) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (41) Substituting E{[Z.sub.i]}, and Var{[Z.sub.i]} in Eq. (36), we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (42) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (43) Similarly, L([??]) under hypothesis [H.sub.i] can be written as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (44) Here L([??]) can be approximated by a Gaussian random variable with the same set of assumptions as under hypothesis [H.sub.0]. In addition, the distribution of L([??]) under hypothesis [H.sub.1] is symmetrical to the distribution of under [H.sub.0] with respect to the origin. Therefore, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (45) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (46) Now the probability of false alarm [P.sub.F] and the probability detection [P.sub.D] are given as, [P.sub.F] = Q ([xi] + [m.sub.1] / [[sigma].sup.1]), (47) [P.sub.D] = Q ([xi] + [m.sub.1] / [[sigma].sup.1]). (48) Lets define the watermark-to-noise ratio (WNR WNR Weapons Neutron Research WNR Wireless Network Router WNR Wide to Narrow Ratio 1) as [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (49) If we denote de·note tr.v. de·not·ed, de·not·ing, de·notes 1. To mark; indicate: a frown that denoted increasing impatience. 2. by [Q.sup.-1]([P.sub.F]) the value x [member of] R such that Q(x) = [P.sub.F] then receiver operating characteristics (ROC) of the proposed detector can be expressed as: [P.sub.D] = Q ([Q.sup.-1]([P.sub.F]) - 2 [square root of (WNR1)]). (50) It can be observed from Eq. (50) that the detection performance of the proposed detector is a function of WNR1. Since the proposed ICA-based detector is designed to reduce the host-signal interference before detection, therefore, the ICA-based detector is expected to perform better than the existing blind detectors [6, 3, 1] operating without reducing host signal interference. The detection performance improvement can be attributed to its host interference suppression or watermark-to-host ratio improving capability. To illustrate this notion the theoretical ROC performance of the proposed detector based on Eq. (50) for different values of host interference suppression values (or WIR) is given in Fig. 3. It can be observed from Fig. 3 that the proposed detector performs superior that the detector operating without host interference canceling. 4.3 Watermark Decoding: Performance Analysis To evaluate performance of the proposed detector in terms of decoding error probability, let us consider watermark embedding model given in Eq. (1) and decoding framework discussed in Section 2. Consider one bit per coefficient embedding case first, that is, [X.sub.0] = [S.sub.0] + [[alpha].sub.0][W.sub.0]. The [P.sub.e] in this case for [[??].sub.0] can be expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (51) [FIGURE 3 OMITTED] Here Eq. (51) shows that ICA-based detector does improve decoding error performance. The decoding error performance gain for the proposed ICA-based detector over the traditional detector can expressed, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (52) It is important to mention that in general BSS using ICA have relatively small interference distortion, i.e., [[??].sub.10]/[[??].sub.20] [34], therefore, G [greater than or equal to] 0 for WIR > 0dB. The performance gain ot the proposed ICA-based detector over that of the decoder given by Eq. (7) is plotted in Fig. 4. It can be observed from Fig. 4 that for a fixed value of SWR, decoding error probability of the proposed detector improves with the increase in the separation quality of the ICA scheme used for watermark estimation. [FIGURE 4 OMITTED] Now consider second embedding scenario, that is, one bit is embedded into [absolute value of [zeta]] coefficients of the host signal S, i.e., [X.sub.i] = [S.sub.i] + [[??].sub.i][W.sub.i]b, i [member of] [zeta]. (53) In this case, the estimated watermark [??] using proposed ICA-based watermark detector, under zero attack-channel distortion, can be expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (54) For equally probable message symbols the ML decoder that minimizes the Pc will satisfy the following condition, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (55) The ML sufficient statistics for Laplacian [??] can be written as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (56) Assuming [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] can be expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (57) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (58) Where, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] The ML decoder is a bit-by-bit hard decoder [??] = sgn(T) (59) To determine the [P.sub.e] for the ML decoder, a statistical characterization of T([??]) is required. As T([??]) is sum of [absolute value of [zeta]]] i.i.d, random variables, therefore, using CLT, T([??]) can be approximated by the Ganssian random variable, the mean and variance of T can be computed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (60) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (61) In [Z.sub.i], W and [??] are the only r.v.s, so averaging [Z.sub.i] over r.v. W condition to the selected host indices [??] and [W.sub.i] [member of] {[+ or -] 1} with probability 1/2 we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (62) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (63) rewriting the above equations, we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (64) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (65) Now averaging over r.v. [[??].sub.i], we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (66) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (67) Substituting E{[Z.sub.i]}, and Var{[Z.sub.i]} in Eq. (61) and Eq. (61), we have, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (68) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (69) Therefore, [P.sub.e] for an ICA-based detector is given as, [P.sub.e_ICA] = Q([absolute value of E{T}]/[square root of (Var{T})]) (70) It can be observed from Eq. (70) that the decoding error probability of the ML decoder applied to the estimated watermark is a function of WIR and SWR. The performance of the proposed ICA-based detector given by Eq. (70) for different values of WIR and SWR is plotted in Fig. 5. It can observed from Fig. 5 that the proposed ICA-based detector perform superior than the detector operating without host-interference cancelation can·ce·la·tion n. Variant of cancellation. . [FIGURE 5 OMITTED] 4.4 Maximum Watermarking-Rate: Performance Analysis Maximum watermarking-rate (MWR MWR Morale, Welfare and Recreation MWR Ministry of Water Resources (China) MWR Monthly Weather Review MWR Microwave Radiometer MWR Multiple Worksite Report (US Department of Labor) MWR Microwave Radiometry ) is another watermarking performance measure which indirectly depends on the detector structure. Researchers in data hiding (1) Secretly embedding data in graphics images and other file types. See steganography. (2) The result of encapsulation in object-oriented programming. See encapsulation. community have proposed various host interference suppression methods based on linear as well as non-liner filtering to improve MWR performance of a blind detector. For example, Suet al in [44] used optimal linear filtering to suppress host interference at the blind detector to improve MWR. The MWR performance of the proposed ICA-based watermark detector is evaluated for one bit per coefficient embedding case, i.e, [X.sub.0] = [S.sub.0] + [[alpha].sub.0][W.sub.0]b . Let us assume the the received watermarked sample is corrupted by independent additive white Gaussian noise, with mean zero and variance [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. Here using CLT [[??].sub.0] can be approximated by a Gaussian r.v. with mean zero and variance [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (71) In this case, the estimated watermark sample, [[??].sub.0] can be expressed as, [[??].sub.0] = [[eta].sub.10][[alpha].sub.0][W.sub.0] + [[eta].sub.20][S.sub.0] + [N.sub.0] (72) Again [[??].sub.0] can also be approximated by Gaussian r.v. with mean zero and variance [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (73) The MWR of watermarking schemes based on additive embedding using blind correlation-based watermark detector can be approximated by the capacity of an additive white Gaussian noise channel, i.e., [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (74) Similarly, MWR using an informed detector can be expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (75) And, MWR of the proposed ICA-based watermark detector is given as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (76) Since the ICA scheme used for watermark estimation has reasonably good source separation performance [34, 35], therefore following inequality will hold, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (77) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (78) It can be observed from Eq. (78) that the proposed ICA-based detector performs better than the blind detector operating without suppressing the host signal interference. In addition, MWR performance of ICA-based detector is bounded below by the blind detector (0% suppression) and bounded above by an informed detector (100% suppression). Performance analysis of the proposed ICA-based watermark detector indicates that it performs better than existing blind watermark detectors [1, 2, 3, 4, 5] operating without reducing host signal interference. This improved detection performance of ICA-based detector can be attributed to its host signal interference suppression at the detector. 4.5 Estimation of Masking Threshold, Distribution Parameter and WIR factor This section provides details on how to estimate masking threshold, [??], host distribution parameter, [??], and [[??].sub.1], [[??].sub.2] at the blind detector. The [??] is analyzed at the blind detector to estimate [??] based on HAS. It is reasonable to assume that [??] estimated from watermarked audio is similar to the [??] from the corresponding an unwatermarked audio clip given that embedding and attack-channel distortion are imperceptible. To validate this assumption, we estimated [??] from both the unwatermarked and corresponding watermarked music clips. To this end four music clips (Pos1, Pop2, Classic, and Vocal) listed in Table 1) were used. Here music clips Pop1 and Classic were watermarked using FSSS FSSS Fédération Suisse de Sports Subaquatiques FSSS Free Super Saver Shipping (Amazon.com) FSSS Fixed-Step Serial Search FSSS Flight Software Support System FSSS Field Software Service Support based watermarking scheme proposed in [5] and Pop2 and Vocal, were watermarking using audio watermarking scheme presented in [3]. Plots of the [??] estimated from the each watermarked music clip, [[??].sub.W] and corresponding unwatermarked music clip [[??].sub.UW] are given in Fig. 6. It can be observed from Fig. 6 that for both embedding schemes [[??].sub.W] [approximately equal to] [[??].sub.UW]. Similarity between [[??].sub.W] and [??]UW, for the music clips listed in Table 1, in terms of mean squared error In statistics, the mean squared error or MSE of an estimator is the expected value of the square of the "error." The error is the amount by which the estimator differs from the quantity to be estimated. (MSE MSE Mouse (computer) MSE Materials Science & Engineering MSE Mean Squared Error MSE Mean Square Error MSE Master of Science in Engineering MSE Manufacturing Systems Engineering MSE Mechanically Stabilized Earth ) (in dB) is {Pos1, Melodic me·lod·ic adj. Of, relating to, or containing melody. me·lod i·cal·ly adv. , Pop2, Classic,
Vocal} = {0.21566, 1.7321, 2.4507, 1.7716, 0.21566}. Here watermarked
music clips were generated using FSSS-based watermarking. These results
shows that it is reasonable to estimated masking threshold from the
watermarked audio at the blind detector.
Distribution parameter, [beta], can be estimated from the estimated variance [[??].sup.2.sub.s] of the host audio, which can be estimated from the watermarked audio available at the detector [[??].sup.2.sub.s] = [[??].sup.2.sub.x] - 1/M [summation over (j)] [[??].sup.2.sub.j] (79) where [[??].sup.2.sub.m] is the variance of the watermark sequence for [m.sup.th] audio segment and M is total number of watermarked segments. Here [[??].sup.2.sub.x] is estimated using sample variance, i.e., [[??].sup.2.sub.x] = 1/M [summation over (j)][X.sup.2.sub.j] - 1/[M.sup.2] [([summation over (j)] [X.sub.j]).sup.2] (80) It is important to mentions that if this estimate is used to calculate sufficient statistics, this will introduce additional dependence between watermark and sufficient statistics which is hard to analyze theoretically. Due to this added dependence, slight variation between theoretical approximation approximation /ap·prox·i·ma·tion/ (ah-prok?si-ma´shun) 1. the act or process of bringing into proximity or apposition. 2. a numerical value of limited accuracy. and experimental results is expected. [FIGURE 6 OMITTED] The problem of estimating [[eta].sub.1] and [[eta].sub.2] is bit hard due to ambiguity in the scale and sign of the estimated sources using ICA. However, if we assume that scale and sign ambiguity of the separated sources is resolved and WIR factor [D.sup.interf] = [[eta].sub.1] / [[eta].sub.2] is known then, using Eq. (23) and (24), [[eta].sub.1] and [[eta].sub.2] can be estimated by simultaneously solving the following expressions, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (81) Here [D.sup.interf] can calculated using separation quality measure of the ICA method used as discussed in [34, 35]. 5 Simulation Results This section provides detection performance of the proposed ICA-based watermark detector (ICAWD) and its comparison with the conventional normalized correlation watermark detector (NCWD NCWD North Carolina Western District (US federal court system) NCWD National Center for Women Development (Nigeria) ) [1]. The proposed ICAWD can be used to detect watermark for almost all existing SS-based watermark embedding schemes [1, 2, 3, 5, 4]. However detection performance of the proposed detector is compared with Swanson et al's SS-based audio watermarking scheme [3]. Swanson et al's [3] proposed scheme used correlation based detector for watermark detection. To provide a fair performance comparison of both the proposed ICAWD and the NCWD, the proposed ICAWD is used in the estimation-correlation-based detection settings. The simulation results presented based on FSSS-based audio watermark A watermark embedded within an audio stream to identify its origination. See digital watermark. embedding scheme presented in [5]. Detains of watermark embedding using FSSS [5] outlined here. 5.1 FSSS-based Watermark Embedding The block diagram of the FSSS-based watermark generation and embedding used for simulations is illustrated in Fig. 7. The watermark is generated using a pseudo-random noise generator obeying non-Gaussian distribution to satisfy the non-Gaussianity requirement of the ICA model. A secret key [K.sub.w] is used as a 'seed' for the pseudo-random noise generator for watermark generation. In addition, same watermark is embedded in two consecutive audio segments, i.e., if watermark V is embedded into [i.sup.th] audio segment then same watermark is also embedded into [(i + 1).sup.th] segment. Repeated embedding is a necessary condition of the proposed detector to separate hidden signals obeying heavy-tail distribution, especially for BSS from underdetermined linear mixtures [40, 16]. For audio watermarking using FSSS, a secret key, [K.sub.sb] is used to select subband from watermark embedding. 5.2 Watermark Detection The proposed modified ICA-based detector has access to the secret key K only, which is combination of [K.sub.sb] and [K.sub.w], i.e., K = [K.sub.sb]|[K.sub.w]. The watermark detection process for FSSS-based audio watermarking under proposed detection scheme consists of watermark estimation using ICA framework followed by correlation based detection. The main steps of the detection process are outlined below: --Sync Point Extraction: The received audio signal is analyzed first to extract the set of sync points (SP) [4, 5] used to combat desynchronization n. 1. a process causing an absence of synchronization; the relation that exists when things occur at unrelated times; as, the stimulus produced a desynchronizing of the brain waves s>. Noun 1. attacks. --Segmentation: An audio frame consisting of n-samples is selected around each [SP.sub.i] : i = 1, 2 ... M. Where M is cardinality of SP set. --Frame Decomposition decomposition /de·com·po·si·tion/ (de-kom?pah-zish´un) the separation of compound bodies into their constituent principles. de·com·po·si·tion n. 1. : Each frame is then decomposed de·com·pose v. de·com·posed, de·com·pos·ing, de·com·pos·es v.tr. 1. To separate into components or basic elements. 2. To cause to rot. v.intr. 1. into p-subband signals using l-level analysis filter bank described in [5]. --Subband Selection: A secret key [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], is used to select a subband from lower (p - 1)--subbands of [i.sup.th] and [(i + 1).sup.th] frame i.e. [[??].sup.j.sub.i], and [[??].sup.j.sub.i + 1]. --Watermark Estimation: The selected subband signals, i.e. [[??].sup.j.sub.i, and [[??].sup.j.sub.i + 1] are used to estimated the embedded watermark, V Here, observation matrix, X, can be expressed as, X = [Sbi,Sbi+l] [[[??].sup.j.sub.i], [[??].sup.j.sub.i + 1].sup.T] [FIGURE 7 OMITTED] Existing BSS schemes for underdetermined mixtures based on ICA model [27, 39] to estimate watermark from the watermarked image for the proposed detector. However, in this paper, the proposed ICAWD uses the statistical ICA using mean-field approaches presented in [39] for watermark estimation from the watermarked audio. The watermark detection stage uses the correlation based similarity measure to determine the presence or the absence of the embedded watermark from the estimated sources. It is important to mention that permutation One possible combination of items out of a larger set of items. For example, with the set of numbers 1, 2 and 3, there are six possible permutations: 12, 21, 13, 31, 23 and 32. (mathematics) permutation - 1. ambiguity in the estimated sources using ICA will contribute nonzero [P.sub.e] due to incorrect source decoding. The error due to ambiguity in the permutation of the estimated sources is reduced by adding correlation based watermark detection (resp. decoding). However for the sake of simplicity, during analysis part in Section 4, error due to incorrect source decoding is neglected here. --Information Decoding: A binary hypothesis test is used to determine the presence or the absence of the embedded watermark in the estimated signal. For fast and reliable information decoding, normalized correlation between the estimated watermark and the key dependent watermark generated at the watermark detector are used. The normalized correlation is then compared against decision threshold, Th, to determine the presence or the absence of watermark. Following binary hypothesis test is used to decode binary information, [H.sub.1] : max [absolute value of ncor ([[??].sup.(r)],[w.sup.(q)])] [greater than or equal to] Th Decode q [H.sub.0] : otherwise no watermark where ncor(.,.) is the normalized correlation function The introduction to this article provides insufficient context for those unfamiliar with the subject matter. Please help [ improve the introduction] to meet Wikipedia's layout standards. You can discuss the issue on the talk page. defined as: [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (82) where [[??].sup.(r)] is the estimated signals using ICA, Th is the decision threshold (for our simulation results Th was set to 0.15, which corresponds to false positive rate, [P.sub.fp] = [10.sup.-4]), r = 1, 2, 3, and q [member of] {0, 1}. 5.3 Experimental Results To evaluate the robustness performance of the proposed ICAWD, several experimental tests were performed in which the watermarked audio is subjected to commonly encountered degradations. These degradations include addition of white and colored noise, resampling, lossy compression (MP3 Audio compression Encoding digital audio data to take up less storage space and transmission bandwidth. Audio compression typically uses lossy methods, which eliminate bits that are not restored at the other end. ADPCM and MP3 are examples of audio compression methods. See audio codec and data compression. ), filtering, time- and frequency-scaling, and stirmark benchmark attacks for audio [18, 20]. Decoding error probability, [Pb.sub.e], at the watermark detector is used for performance evaluation Performance evaluation The assessment of a manager's results, which involves, first, determining whether the money manager added value by outperforming the established benchmark (performance measurement) and, second, determining how the money manager achieved the calculated return . Here [Pb.sub.e] is defined [Pb.sub.e] = (1 - [N.sub.d] / [N.sub.e]) (83) where [N.sub.d] is number of bits correctly detected and [N.sub.e] number of bits embedded into the audio clip. Block diagram of the proposed ICAWD and the traditional correlation based detector e.g. NCWD used for FSSS audio watermark detection process is given in Fig. 8. The watermark detector given in Fig. 8 acts as ICAWD when switch S is connected to terminal 1 and NCWD when S is connected to 2. [FIGURE 8 OMITTED] 5.4 Robustness Performance To evaluate the robustness performance of the proposed watermarking scheme we have performed several experimental tests in which the watermarked audio is subjected to commonly encountered degradations. These degradations include addition of white and colored noise, resampling, lossy compression (MPEG (Moving Pictures Experts Group) An ISO/ITU standard for compressing digital video. Pronounced "em-peg," it is the universal standard for digital terrestrial, cable and satellite TV, DVDs and digital video recorders (DVRs). audio compression), filtering, time- and frequency-scaling, multiple watermarking, and StirMark benchmark attacks for audio. The robustness performance of the proposed scheme against common degradations for the above settings is discussed next. 5.5 Data Set Experimental results presented here are based on the data set consisting of the sound quality assessment material (SQAM SQAM Software Quality Assurance & Measurement SQAM Superposed Quadrature Amplitude Modulation SQAM Software Quality Assurance Manual SQAM Supplier and Quality Assurance Management SQAM Staggered Quadrature Amplitude Modulation ) audio database downloaded from [45] and five audio clips listed in Table 1. All audio clips used for the performance evaluation here are based on mono (1) See monochrome and monophonic. (2) (Mono) An open source implementation of the .NET environment for Linux, Unix and Windows platforms, sponsored by Novell. Mono includes a C# compiler and a Common Language Infrastructure (CLI) runtime engine. audio channel sampled at 44.1 kHz with 16 bits resolution. In our experiments, the watermarks are generated and embedded using FSSS-based audio watermarking scheme presented in [5]. A perceptual mask is estimated using method discussed in [5]. This mask is then multiplied by 200 independently generated pseudo-random sequences W, with zero-mean and unit variance, to generate 200 independent watermarks. In case of ICAWD, the pseudorandom pseu·do·ran·dom adj. Of, relating to, or being random numbers generated by a definite, nonrandom computational process. sequences, W, follow Laplacian distribution, i.e., fw([tau]) = [beta] / 2 [e.sup.-[beta][absolute value of [tau]], [absolute value of [tau]] < [infinity] (84) where [beta] = [square root of (2)] / [sigma]w, and for the NCWD W follows normal distribution. These 200 random watermarks are embedded in each audio clip according to Eq. (1) that resulted 4000 watermarked audio clips. Experimental results presented in the following sections are averaged over 4000 watermarked audio clips. 5.6 Parameter Settings Simulation results presented in this section are based on the following system settings: --Salient point list (SP) was assumed to be available at the detector, therefore decoding bit error probability [P.sub.e] presented here is due decoding bit error only. --Audio frame size ([2.sup.l][N.sub.1]) was set to [2.sup.13] for [f.sub.s] = 44.1 kHz. --Five-level wavelet (mathematics) wavelet - A waveform that is bounded in both frequency and duration. Wavelet tranforms provide an alternative to more traditional Fourier transforms used for analysing waveforms, e.g. sound. decomposition was used, i.e. l = 5, therefore eight target subbands were available for watermark embedding. --Only one subband was selected at random from eight target subbands for watermark embedding (except multiple watermark embedding case). --Target false positive rate [P.sub.fp] was set to 3.5 x [10.sup.-4] which corresponds to decoding threshold Th = 0.15 (using Eq. (42)). --False positive bit rate, [P.sub.fp], was calculated by applying original (unwatermarked) music clip the proposed detector, and average false positive for the the 20 audio clips used for performance evaluation was calculated to be 2.9 x [10.sup.-4]. --Robustness performance in terms of average decoding bit error rate was calculated without channel coding A way of encoding data in a communications channel that adds patterns of redundancy into the transmission path in order to lower the error rate. Such methods are widely used in wireless communications. See convolutional code and Viterbi decoder. . --In case of ICAWD, watermark repeating factor of two was used during watermark embedding process, i.e., two consecutive audio frames were watermarked with same watermark w. The above settings for watermark embedding using FSSS-based audio watermarking yielded per sample embedding capacity of 1 bit per 512 sample. Fidelity (or transparency) performance of the embedded watermark is evaluated based on the objective degradation measure. Signal-to-watermark ratio (SWR) is used for the objective degradation here which is calculated as, SWR = 10 [log.sub.10] ([[sigma].sup.2.sub.s] / [[sigma].sup.2.sub.v]) (85) where [[sigma].sup.2.sub.v] is calculated using Eq. (3). The average SWR the watermark audio clips used for simulation was [Ave.sub.SWR] = 42.7 (dB), [[sigma].sub.SWR] = 9.17, [max.sub.SWR] = 74.5 (dB), and [min.sub.SWR] = 21.5 (dB). Calculated SWR from watermarked audio clips indicates that on the embedded watermark is very weak compared to the original audio. 5.7 Detection Performance Detection performance of the proposed detector is evaluated for various audio degradations. Detection of the proposed ICAWD and its comparison with NCWD for each degradation is provided next. 5.7.1 Addition of White Noise : White Gaussian noise ranging from zero to 200 % of the power of the audio signal was added to the corresponding watermarked audio clips. The [P.sub.e] average over 4000 watermarked audio clips for ICAWD and NCWD for different SNR See signal-to-noise ratio. SNR - signal-to-noise ratio values are plotted in Fig. 9 which shows that the ICAWD performs better than the NCWD. Superior detection performance of ICAWD than the NCWD can be attributed to its host signal interference cancellation capability. It can be observed from Fig. 9 that for SS-based watermarking very low decoding bit error probability is achievable even in the presence of noise with 60-70% power of the audio signal. 5.7.2 Resampling To simulate resampling attack, a watermarked audio signal was down-sampled at a sampling rate of [[f.sub.s] / [r.sub.f]] (where [r.sub.f] denotes resampling factor) and then interpolated back to [f.sub.s]. The watermark detection was then applied to the resulting watermarked audio clips. Average [P.sub.e] for [r.sub.f] = 2, ... 10, is given in Fig. 10 which shows that the proposed watermarking scheme (using ICAWD) can withstand resampling attacks with [r.sub.f] value up to 5 for each watermarked audio clip, similar decoding performance is achievable for NCWD by using channel coding. Again ICAWD performs better than the NCWD and its superior detection performance can be attributed to its host signal interference suppression capability. [FIGURE 9 OMITTED] 5.7.3 Lossy Compression Lossy compression for audio (e.g. MP3) is generally applied to the digital audio for multimedia applications like transmission and storage to reduce the bit rate. To test the survivability sur·viv·a·ble adj. 1. Capable of surviving: survivable organisms in a hostile environment. 2. That can be survived: a survivable, but very serious, illness. of the watermark, audio encoding/decoding was applied to the watermarked audio using ISO/MPEG-1 Audio Layer III [47] coder at bit rates 32, 64, 96, 112, 128, 192, 256, and 320 k bits/s (kbps). The average [P.sub.e] for lossy compression attacks for bit rates rates 32, 64, 96, 112, 128, 192, 256, and 320 (kbps) is given in Fig. 11. It has been observed from Fig. 11 that the detection performance for both detectors deteriorates as the bit rate of the encoder/decoder decreases; this is due to the stronger distortion introduced by the encoder for lower bit rates. In addition, the ICAWD performs better than the NCWD. [FIGURE 10 OMITTED] [FIGURE 11 OMITTED] 5.7.4 Addition of Colored Noise To simulate an attack with colored noise, white Gaussian noise was spectrally shaped according to the estimated masking threshold using corresponding watermarked audio clip based on the HAS model [46, 47]. This just audible colored noise was then added to the watermarked audio signal. Average [P.sub.e] for the resulting watermarked audio clips is presented in Fig. 12. It has been observed from Fig. 12 that NCWD performs poorly, this is due to increase in interference level, as the colored noised is generated with a process almost identically to that of the watermark generation. Therefore, additive colored noise acts as a second watermark interfering with the watermark to be detected. On the other hand, ICAWD is efficient in handling such attacks due to its interference cancellation ability. 5.7.5 Rescaling Rescaling attacks include time- and frequency-scaling. Time-scaling attacks can be used to desynchronize v. t. 1. to cause a process to occur at times or in cycles independent of another process. Verb 1. desynchronize - cause to become desynchronized; cause to occur at unrelated times desynchronise a watermark detector for SS-based watermarking systems. To test the robustness of the proposed scheme against time-scaling attacks, the watermarked audio clips were time-scaled with time-scaling factor, TSp(n) = +(-) 1%. The detection performance for time-scaling attack using both detection schemes, e.g., ICAWD and NCWD is given in Fig. 12. The frequency-scaling attacks are generally used to deteriorate the detection performance of the frequency domain watermarking schemes. As the proposed watermarking scheme is also a frequency domain watermarking scheme; therefore, it is reasonable to test the robustness performance of the proposed scheme against frequency-scaling attacks as well. To simulate frequency-scaling attack, the watermarked audio clips were frequency-scaled using frequency-scaling factor, FSp(n) = +(-) 1%. The detection performance for the resulting audio clips for both detection schemes, e.g., ICAWD and NCWD is presented in Fig. 12. It can be observed from Fig. 12 that the proposed scheme can withstand rescaling attack of TS [less than or equal to] [+ or -] 1% and FS [less than or equal to] [+ or -] 1% (especially for ICAWD). 5.7.6 Filtering To test the robustness of the proposed watermarking scheme against filtering attacks, the watermarked audio signals were subjected to lowpass filtering (LPF LPF - League for Programming Freedom ), highpass filtering (HPF HPF - High Performance Fortran ), and bandpass filtering (BPF BPF Berkeley Packet Filter BPF British Property Federation (UK) BPF Bonnes Pratiques de Fabrication (Good Manufacturing Practice) BPF British Plastics Federation BPF Band-Pass Filter ) attacks. The specification of filters used for the filtering attacks are, 1. Lowpass Filter: cut-off frequency: [f.sub.c] = 5 kHz with 12 dB/octave roll-off 2. Highpass Filter: cut-off frequency: [f.sub.c] = 1000 Hz with 12 dB/octave roll-off 3. Bandpass Filter An electronic circuit that accepts a signal and filters out unwanted frequencies, allowing only a particular frequency or frequency range (band of frequencies) to reach the output side. : cut-off frequencies: [f.sub.clow] = 50 Hz, and [f.sub.cup] = 5.5 kHz with 12 dB/octave roll-off Detection performance comparison of the ICAWD and the NCWD for LPF, HPF, and BPF attacks is given in Fig. 12. 5.7.7 StirMark Audio Benchmark Attacks For StirMark for audio benchmark attack, watermarked audio clips were subjected to StirMark audio benchmark attacks. The StirMark audio benchmark software, available at [20], was used in the default parameters settings. The decoding bit error probability, [P.sub.e], averaged over 100 watermarked audio clips with the ICAWD and the NCWD, is given in Table 2. It can be observed from Table 2 that the proposed ICAWD based scheme using exhibits superior detection performance than the NCWD. Better performance of ICAWD can be attributed to its better host signal suppression capability. [FIGURE 12 OMITTED] 6 Conclusion An improved watermark detector for additive embedding is presented here. The proposed watermark detector is capable of canceling the host-signal interference at the watermark detector. Bind watermark detection, lower host-signal interference at the detector, improved decoding, detections and watermarking-rate performances are the salient features of the proposed ICAWD. The proposed ICAWD can be used for SS-based watermarking for all types of multimedia data, e.g., audio, video, images, etc. The theoretical results show that the proposed detector performs significantly better than existing blind detectors. Simulation results for real-world data show that the proposed ICAWD performs much better than the traditional NCWD. Moreover, the detection performance of the proposed detector can be improved further by employing channel coding. It is important to mention that better detection performance of ICAWD comes at the cost of security, as ICAWD requires repeated embedding (at least twice) which makes embedded watermark more vulnerable to watermark estimation attacks than without repeated embedding. 7 Appendix A: Statistical characterization of the wavelet coefficients of audio signals To determine the statistical characterization of the subband coefficients of the real-world speech samples, the speech samples, [Y.sub.i], i = 0, 1 ... n - 1 are assumed to be i.i.d. Laplacian random variable with mean zero and variance [[sigma].sup.2.sub.y]. The one-dimensional discrete wavelet transform In numerical analysis and functional analysis, a discrete wavelet transform (DWT) is any wavelet transform for which the wavelets are discretely sampled. The first DWT was invented by the Hungarian mathematician Alfréd Haar. (DWT) of audio signal, Y, can be calculated using Mallat's algorithm [49]. The DWT coefficients using Mallat's algorithm [49], e.g., approximate coefficients [a.sub.k] and detailed coefficients [d.sub.k], at different scales can be expressed as, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (86) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (87) where j denotes the resolution and i is the index. Eq. (86) and (87) describe linear filtering operation fil·ter·ing operation n. The surgical creation of a fistula between the anterior chamber of the eye and the subconjunctival space, as for glaucoma. using filters h and g followed by down-sampling. Here h and g are finite impulse response (electronics, DSP) Finite Impulse Response - (FIR) A type of digital signal filter, in which every sample of output is the weighted sum of past and current samples of input, using only some finite number of past samples. (FIR fir, any tree of the genus Abies of the family Pinaceae (pine family), tall pyramidal evergreen conifers characterized by short, flat, stemless needles and erect cylindrical cones that shed their scales rather than dropping off the tree whole. ) quadrature-mirror filters, also known as the scaling and the wavelet filters, respectively. The scaling filter is a lowpass filter, while the wavelet filter is a highpass filter. Moreover, the top-level coefficients [a.sup.J] represent the original signal y. Eq. (86) and (87) can be expressed using a single equation, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (88) where [[delta].sub.i] is the weighting factor depending on the filter coefficients [h.sub.i] and [g.sub.i], i.e. approximate coefficients or detailed coefficients, and [S.sup.j.sub.i] is the wavelet coefficient at [j.sup.th]-level. Here Eq. (88) states that a wavelet coefficient at an arbitrary level j - 1, is a weighted sum of [N.sub.1] wavelet coefficients from [j.sup.th]-level wavelet. The wavelet coefficients at j - 1 level can be expressed as [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (89) According to Eq. (88), each wavelet coefficient an arbitrary level j : 1 [less than or equal to] j [less than or equal to] J - 1 is a weighted sum of i.i.d. r.v. (e.g. audio samples in our case), therefore, the pdf of a wavelet coefficient [S.sup.j.sub.i] at [j.sup.th]-level, can be determined using joint characteristic function [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]. If we assume that audio sample [Y.sub.i], is a Laplacian r.v., then pdf of [Y.sub.i] can be expressed as, [f.sub.y]([TAU]) = [gamma] / 2 [e.sup.-[gamma][absolute value of [TAU]], [absolute value of [TAU]] < [infinity] (90) where [gamma] = [square root of (2)] / [[omega].sup.2.sub.y] Here characteristic function of r.v. [Y.sub.i], [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], can be expressed as [48], [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (91) Let us consider a r.v. Z which is obtained by magnitude scaling of a r.v. Y i.e., Z = [delta]Y, the characteristic function of Z, [[PHI].sub.z]([omega]), in terms of [[PHI].sub.y]([omega]) can be expressed as [48], [[PHI].sub.z]([omega]) = [[PHI].sub.z] ([delta][omega]) (92) Therefore, the characteristic function of r.v. [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII], can be expressed as [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (93) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (94) [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (95) where [[??].sub.i] = [[gamma].sub.i]/[[gamma].sub.k]-[2.sub.i] and [N.sub.1] is the length of the wavelet filter. In order to determine the pdf of wavelet coefficients [S.sup.J-1.sub.i], [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] characteristic function [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (given by Eq. (95)) is used. The pdf of a r.v. can be determined either using the uniqueness theorem theorem, in mathematics and logic, statement in words or symbols that can be established by means of deductive logic; it differs from an axiom in that a proof is required for its acceptance. or the convolution theorem In mathematics, the convolution theorem states that under suitable conditions the Fourier transform of a convolution is the point-wise product of Fourier transforms. In other words, convolution in one domain (e.g. [48]. The pdf of wavelet coefficients [S.sup.J-1.sub.i], [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] using [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] based on the convolution theorem can be expressed, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (96) where [c.sub.k] [member of] R is a real constant. For different values of [N.sub.1], the polynomial polynomial, mathematical expression which is a finite sum, each term being a constant times a product of one or more variables raised to powers. With only one variable the general form of a polynomial is a0xn+a coefficients, [c.sub.k], are given as: [N.sub.1] = 2, [c.sub.0] = [c.sub.1] = 1/2, and [N.sub.1] = 3, [c.sub.0] = [c.sub.1] = 3/8, and [c.sub.1] = 1/8 and so on. According to the Eq. (95) and (96), as the pdf of wavelet coefficients at [j.sub.th] level, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] ([TAU]), is a obtained by convolving the pdf of r.v. [Y.sub.i], therefore, based on the CLT, the pdf of the subband coefficients move to words Gaussianity as value of [N.sub.1] increases or in other words, pdf of wavelet coefficients at coarser level is closer to the Gaussianity than higher level coefficients. This is because at coarser level, for each wavelet coefficient more audio samples contribute in the weighted-sum equation (given by Eq. (89)) than higher level coefficients. In order to provide evidence in support of this model, a 4-level DWT decomposition of an arbitrary frame of the music clip I Want It That Way ... by Backstreet backstreet Noun a street in a town far from the main roads Adjective denoting secret or illegal activities: a backstreet abortion backstreet n Boys, using 'Daubechies-8' decomposition filter, is given in Fig. 13. The pdf (based on histogram histogram or bar graph Graph using vertical or horizontal bars whose lengths indicate quantities. Along with the pie chart, the histogram is the most common format for representing statistical data. approximation) of corresponding wavelet coefficients at different levels is plotted in Fig. 13. This is clear from Fig. 13 that the higher level, wavelet coefficients exhibit non-Gaussian distribution and distribution moves towards Ganssianity for coarser coefficients due to longer weighted-sum effect at the coarser level. Therefore, the pdf of each subband coefficient (at higher level) of the host signal, [S.sub.i], can be approximated by Laplacian distribution, which is given as, [f.sub.s]([tau]) = [beta] / 2 [e.sup.-[beta][absolute value of [tau]] : [absolute value of [tau]] < [infinity] (97) [FIGURE 13 OMITTED] Received: September 18, 2008 References [1] Cox, I.J., Miller, M.L., and Bloom, J.A.:(2001) Digital Watermarking, Morgan Kaufmann, San Francisco San Francisco (săn frănsĭs`kō), city (1990 pop. 723,959), coextensive with San Francisco co., W Calif., on the tip of a peninsula between the Pacific Ocean and San Francisco Bay, which are connected by the strait known as the Golden . [2] Cox, I.J., Kilian, J., Leighton, T., and Shamoon, T.:(1997) Secure Spread Spectrum Watermarking for Multimedia, IEEE (Institute of Electrical and Electronics Engineers, New York, www.ieee.org) A membership organization that includes engineers, scientists and students in electronics and allied fields. Trans. on Image Processing image processing Set of computational techniques for analyzing, enhancing, compressing, and reconstructing images. Its main components are importing, in which an image is captured through scanning or digital photography; analysis and manipulation of the image, accomplished , vol. 6(12), pp. 1673-1687. [3] Swanson, M.D., Zhu, B., Tewfik, A.H., and Boney, L.:(1998) Robust Audio Watermarking using Perceptual Masking, Signal Processing See DSP. , vol. 66(3), pp. 337-355. [4] Wu, C.-P., Su, P.-C., and Kuo, C.-C. J.:(1999) Robust Audio Watermarking for Copyright Protection, Proc. SPIE's 44th Ann. Meet. Adv. Sig. Proc. Alg. Arch. Impl. IX (SD39), vol. 3807, pp. 387-397. [5] Malik, H., Khokhar, A., and Ansari, R.:(2004) Robust Audio Watermarking using Frequency Selective Spread Spectrum Theory, Proc. IEEE Int. Conf. Acoustics acoustics (ək `stĭks) [Gr.,=the facts about hearing], the science of sound, including its production, propagation, and effects. , Speech, Signal Processing (ICASSP'04),
Montreal, Canada, pp. 385-388.
[6] P-Gonzalez, E, Balado, E, and Hernandez, J.R.:(2003) Performance Analysis of Existing and new Methods for Data Hiding with Known-Host Information in Additive Channels, IEEE Trans. on Signal Processing, vol. 51 (4), pp. 960-980. [7] Hernandez, J., Amado, M., and Perez-Gonzalez, E:(2000) DCT-domain watermarking techniques for still images: Detector performance analysis and a new structure, IEEE Trans. on Image Processing, vol. 9(1), pp. 55-68. [8] Briassouli, A., and Strintzis, M.:(2004) Locally Optimum Nonlinearities for DCT (Discrete Cosine Transform) An algorithm that is widely used for data compression. Similar to Fast Fourier Transform, DCT converts data (pixels, waveforms, etc.) into sets of frequencies. The first frequencies in the set are the most meaningful; the latter, the least. Watermark Detection, IEEE Trans. on Image Processing, vol. 13(12), pp. 1604-1617. [9] Noel, S., and Szu, H.:(2000) Multimedia Authenticity with ICA Watermarks, IS&T/SPIE. Proc., Wavelet Applications VII, vol. 4056, pp. 175-184. [10] Serrano ser·ra·no n. pl. ser·ra·nos A cultivar of the tropical pepper Capsicum annuum having small, blunt, highly pungent red or green fruit used in cooking. , E, and Fuentes, J.:(2001) Independent Component Analysis Applied to Digital Image Watermarking, Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'01), vol. 3, pp. 1997-2000. [11] Toch, B., Lowe, D., and D. Saad, D.:(2003) Watermarking of Audio Signals using using Independent Component Analysis, Proc. 3rd Int. Conf. WEB Delivering of Music, pp. 71-74. [12] Sener, S., and Gunsel gun·sel n. Slang A hoodlum or other criminal, especially one who carries a gun. [Perhaps alteration (influenced by gun) of Yiddish gendzl, gosling, diminutive of gandz , B.:(2004) Blind Audio Watermark Decoding using Independent Component Analysis, Proc. 17th Int. Conf. Patt. Reco. (ICPR'04), vol. 2, pp. 875-878. [13] Bounkong, S., Toch, B., Saad, D., and Lowe, D.:(2002) ICA for Watermarking Digital Images, J. Machine Learning Research 1, pp. 1-25. [14] Yu, D., Sattar, E, and Ma, K.:(2002), Watermark Detection and Extraction using Independent Component Analysis, EURASIP EURASIP European Association for Signal and Image Processing J. Applied Signal Processing, pp. 92-104. [15] Malik, H., Khokhar, A., and Ansari, R.:(2005) Improved Watermark Detection for Spread-Spectrum based Watermarking using Independent Component Analysis, Proc. 5th ACM (Association for Computing Machinery, New York, www.acm.org) A membership organization founded in 1947 dedicated to advancing the arts and sciences of information processing. In addition to awards and publications, ACM also maintains special interest groups (SIGs) in the computer field. Workshop On Digital Rights Management (DRM'05), Washington DC, pp. 102-111. [16] Malik, H., Khokhar, A., and Ansari, R.:(2006) New detector for spread-spectrum based image watermarking using underdetermined ICA, IS&T/SPIE Conf. Security, Steganography, and Watermarking of Multimedia Contents VIII '06, vol. 6072, pp. 747-758. [17] Malik, H., Khokhar, A., and Ansari, R.:(2006) Blind Detection for Additive Embedding Using Underdetermined ICA, Proc. 8th IEEE Int. Symposium on Multimedia, (ISMapos'06), pp. 758-761. [18] Steinebach, M., Lang, A., Dittmann, J., and Prtitcolas, F.A.P.:(2002) StirMark Benchmark: Audio Watermarking Attacks based on Lossy Compression, Proc. SPIE SPIE International Society for Optical Engineering SPIE Society of Photo-Optical Instrumentation Engineers SPIE Source Path Isolation Engine SPIE Special Purpose Insertion Extraction SPIE Software Process Improvement Experimentation SPIE Standard Protocols in Effect Security Watermarking Multimedia, vol. 4675, pp. 79-90. [19] Lang, A., Dittmann, J., Spring, R., and Vielhauer, C.:(2005), Audio watermark attacks : from single to profile attacks, Proc. ACM Multimedia and Security Workshop (MM & Sec'05), New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of , NY, USA, pp. 39-50. [20] StirMark Benchmark for Audio, available at http://amsl-smb.cs.uni-magdeburg. de/smfa/main.php, accessed on June 23, 2008. [21] Bell, A., and Sejnowski, T.:(1995) An Information Maximisation Approach to Blind Separation and Blind Deconvolution The introduction to this article provides insufficient context for those unfamiliar with the subject matter. Please help [ improve the introduction] to meet Wikipedia's layout standards. You can discuss the issue on the talk page. , Neural Computation, MIT MIT - Massachusetts Institute of Technology Press Journals, vol. 7(6), pp. 1129-1159. [22] Hyvarinen, A., Karhunen, J., and Oja E.:(2001) Independent Component Analysis, John Wiley John Wiley may refer to:
[23] Comon, p.:(1994) Independent Component Analysis, A New Concept?, EURASIP Signal Processing, vol. 36(3), pp. 287-314. [24] Cao, X.-R., and Liu, R.-W.:(1996) Genaral Approach to Blind Source Separation, IEEE Trans. Signal Processing, vol. 44(3), pp. 562-571. [25] Eriksson, J., and Koivunen, V.:(2004) Identifiability, Separability sep·a·ra·ble adj. Possible to separate: separable sheets of paper. sep , and Uniqueness of Linear ICA Models, IEEE Signal Processing Letters, vol. 11(7), pp.601-604. [26] Davis, M.:(2004) Identifiability Issues in Noisy ICA, IEEE Signal Processing Letters, vol. 11(5), pp. 601-604. [27] De Lathauwer, L., Comon, P., De Moor, B., and Vandewalle, J.:(1999) ICA Algorithms for 3 Sources and 2 Sensors, Proc. IEEE Signal Processing Workshop Higher-Order Statistics (HOS'99), pp. 116-120. [28] Gaeta, M., and Lacoume, J.-L.:(1990) Source Separation Without Prior Knowledge: The Maximum Likelihood Solution, Proc. EUSIPCO'90, pp. 621-624. [29] Moulinesand, E., Cardoso, J-E, and Gassiat, E.:(1997) Maximum Likelihood for Blind Separation and Deconvolution In mathematics, deconvolution is an algorithm-based process used to reverse the effects of convolution on recorded data.[1] The concept of deconvolution is widely used in the techniques of signal processing and image processing. of Noisy Signals using Mixture Models, Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'97), pp. 3617-3620. [30] Cardoso, J.-F.:(1999) High-order Contrasts for Independent Component Analysis, Nural Computation, Elsevier Science, vol. 11(1), pp. 157-192. [31] Bofill, P., and Zibulevsky, M.:(2001) Underdetermined Blind Source Separation using Sparse Representations, EURASIP Signal Processing, vol. 81(11), pp. 2353-2362. [32] Zibulevsky, M., and Zeevi, Y.Y.:(2002), Extraction of a Single Source from Multichannel Data using Sparse Decomposition, Neurocomputing, Elsevier Science, vol. 49, pp. 163-173. [33] Li, Y., Cichocki, A., and Amari, S.:(2004), Analysis of Sparse Representation and Blind Source Separation, Neural Computation, MIT Press Journals, vol. 16(6), pp. 1193-1234. [34] Gribonval, R., Benaroya, L., Vincent, E., and Fevotte, C.:(2003) Proposals for Performance Measurement in Source Separation, Proc. 4th Int. Sym. Independent Component Analysis and Blind Source Separation, pp. 763-768. [35] Li, Y., Powers, D., and Peach, J.:(2000) Comparison of Blind Source Separation Algorithms, Advances in Neural Networks and Applicaitons, N. Mastorakis (Ed.), WSES WSES World Society of Emergency Surgery WSES Surface Effect Craft (USCGC) WSES Warhead Systems Engineering Services , pp. 18-21. [36] Cichocki, A., Douglas, S., and Amari, S.:(1998) Robust Techniques for Independent Component Analysis (ICA) with Noisy Data, Neurocomputing, Elsevier Science, vol. 22, pp. 113-129. [37] Pajunen, P.:(1997) Blind Separation of Binary Sources With Less Sensors Than Sources, Proc. Int. Conf. on Neural Networks (ICNN'97), vol. 3, pp. 1994-1997. [38] Hyvarinen, A.:(1999) Fast Independent Component Analysis with Noisy Data using Gaussian Moments, Proc. ISCS'99. [39] Hojen-Sorensen, P., Winther, O., and Hansen, L.K.:(2002) Mean-Field Approaches to Independent Component Analysis, Neural Computation, MIT Press Journals, vol. 14, pp. 889-918. [40] Hansen, L.K., and Petersen, K.B.:(2003) Monoaural ICA of While Noise Mixture is Hard, Proc. of Sym ICA and BSS (ICA2003), pp. 815-820. [41] Poor, H. V.:(1994) An Introduction to Signal Detection and Estimation, Springer-Verlag, New York, 2nd-ed. [42] Kay, S.:(1998) Fundamentals of Statistical Signal Processing Statistical signal processing is an area of signal processing that treats signals as stochastic processes, dealing with their statistical properties (e.g., mean, covariance, etc.). : Detection Theory, Prentice Hall Prentice Hall is a leading educational publisher. It is an imprint of Pearson Education, Inc., based in Upper Saddle River, New Jersey, USA. Prentice Hall publishes print and digital content for the 6-12 and higher education market. History In 1913, law professor Dr. , Upper Saddle River, New Jersey Upper Saddle River is a Borough in Bergen County, New Jersey, United States. As of the United States 2000 Census, the borough population was 7,741. It is not to be confused with the neighboring borough of Saddle River. . [43] Swanson, M., Zhu, B., and Tewfik, A.:(1996) Robust Data Hiding for Images, Proc. IEEE Digital Signal Processing See DSP. Digital Signal Processing - (DSP) Computer manipulation of analog signals (commonly sound or image) which have been converted to digital form (sampled). Workshop, pp. 37-40. [44] J. Su, J., Eggers Eggers may refer to:
[45] SQAM Sound Quality Assessment Material, http://sound.media.mit.edu/mpeg4/ audio/sqam/, accessed on June 23, 2008. [46] Zwicker, R. E., and Fastl, H.:(1999)Psychoacoustics: Facts and Models, Springer-Verlag, Berlin. [47] Noll, P.:(1997) MPEG Digital Audio Coding, IEEE Signal Processing Magazine, vol. 14(5), pp. 59-81. [48] Papoulis, A., and Pillai, S.:(2002) Probability, Random Variables and Stochastic By guesswork; by chance; using or containing random values. stochastic - probabilistic Processes, McGrawHill, New York, 4th Ed. [49] Mallat, S.:(1989) A Theory for Multiresolution Signal Decomposition, the Wavelet Representation IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 11 (7), pp. 674-693. Hafiz Hafiz (häfēz`) [Arab.,=one who has memorized the Qur'an], 1319–1389?, Persian lyric poet, b. Shiraz. His original name was Shams al-Din Muhammad. He acquired the surname from having memorized the Qur'an at an early age. Malik Electrical and Computer Engineering Department University of Michigan-Dearborn The University of Michigan-Dearborn, located in Dearborn, Michigan, USA, is part of the University of Michigan system. It was established in 1959 after a gift of 196 acres (793,000 m²) from the Ford Motor Company. , Dearborn, MI 48128, USA E-mail: hafiz@umich.edu, URL URL in full Uniform Resource Locator Address of a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, or an application program. : http://www-personal.engin.umd.umich.edu/~hafiz
Table 1: Audio Clips used for Performance Evaluation
Singer Name, Song Title Genre Duration (sec)
Back Street Boys, Pop, 22
I Want It That Way ... (Pop 1)
L. Mangeshkar, Melodic, (Melodic) 15
Kuch Na Kaho ... (Melodic)
A. Bhosle, & R. Sharma, Pop, (Pop2) 10
Kahin Aag Laga ... (Pop2)
N. F. A. Khan, Semi-Classic, 20
Afreen Afreen ... (Classical)
Suzanne Vega, Female Vocal, 5
Tom's diner ... (Spoken Language)
Table 2: Performance Comparison for StirMark Audio Benchmark Attacks
Decoding Bit Error Probability, [P.sub.e]
StirMark Attack NCWD ICAWD
addbrumm_100 0.088 0.0091
addbrumm_1100 0.088 0.0091
addbrumm_2100 0.088 0.0091
addbrumm_3100 0.1023 0.0091
addbrumm_4100 0.1257 0.0091
addbrumm_5100 0.1412 0.0091
addbrumm_6100 0.1477 0.0091
addbrumm_7100 0.1904 0.0091
addbrumm_8100 0.2228 0.0091
addbrumm_9100 0.2293 0.0234
addbrumm_10100 0.2293 0.0491
addfftnoise 1 1
addnoise_100 0.088 0.0491
addnoise_300 0.088 0.0491
addnoise_500 0.088 0.0491
addnoise_700 0.088 0.0634
addnoise_900 0.088 0.0634
addsinus 0.088 0.0634
amplify 0.088 0.0491
compressor 0.088 0.0491
copysamples 0.529 0.1749
cutsamples 0.791 0.4835
dynnoise 0.1056 0.0667
echo 0.0818 0.0667
exchange 0.1056 0.0818
extrastereo_30 0.1056 0.0818
extrastereo_50 0.1056 0.0818
extrastereo_70 0.1056 0.0818
fft_hlpass 0.1074 0
fft_invert 0.1056 0.0818
fft_real_reverse 0.1056 0.0818
fft_stat1 0.1295 0.0238
fft_test 0.1056 0.0238
flippsample 0.1281 0.0725
invert 0.088 0.0491
lsbzero 0.1056 0.0818
normalize 0.088 0.0673
rc_highpass 0.0945 0.0491
rc_lowpass 0.088 0
smooth 1 1
resample 0.1056 0
smooth2 0.1056 0
stat1 0.1056 0
stat2 0.1056 0.0818
voiceremove 1 1
zerocross 0.088 0
zeroremove 0.2759 0.0363
zerolength 0.2189 0.0607
|
|
||||||||||||||||||||

i·cal·ly adv.
`stĭks)
Printer friendly
Cite/link
Email
Feedback
Reader Opinion