Results 1  10
of
53
Matching Pursuit and Atomic Signal Models Based on Recursive Filter Banks
 IEEE Transactions on Signal Processing
, 1902
"... The matching pursuit algorithm can be used to derive signal decompositions in terms of the elements of a dictionary of timefrequency atoms. Using a structured overcomplete dictionary yields a signal model that is both parametric and signaladaptive. In this paper, we apply matching pursuit to the d ..."
Abstract

Cited by 45 (1 self)
 Add to MetaCart
(Show Context)
The matching pursuit algorithm can be used to derive signal decompositions in terms of the elements of a dictionary of timefrequency atoms. Using a structured overcomplete dictionary yields a signal model that is both parametric and signaladaptive. In this paper, we apply matching pursuit to the derivation of signal expansions based on damped sinusoids. It is shown that expansions in terms of complex damped sinusoids can be efficiently derived using simple recursive filter banks. We discuss a subspace extension of the pursuit algorithm which provides a framework for deriving realvalued expansions of real signals based on such complex atoms. Furthermore, we consider symmetric and asymmetric twosided atoms constructed from underlying onesided damped sinusoids. The primary concern is the application of this approach to the modeling of signals with transient behavior such as music; it is shown that timefrequency atoms based on damped sinusoids are more suitable for representing trans...
Denoising by Sparse Approximation: Error Bounds Based on RateDistortion Theory
, 2006
"... If a signal x is known to have a sparse representation with respect to a frame, it can be estimated from a noisecorrupted observation y by finding the best sparse approximation to y. Removing noise in this manner depends on the frame efficiently representing the signal while it inefficiently repres ..."
Abstract

Cited by 44 (7 self)
 Add to MetaCart
If a signal x is known to have a sparse representation with respect to a frame, it can be estimated from a noisecorrupted observation y by finding the best sparse approximation to y. Removing noise in this manner depends on the frame efficiently representing the signal while it inefficiently represents the noise. The meansquared error (MSE) of this denoising scheme and the probability that the estimate has the same sparsity pattern as the original signal are analyzed. First an MSE bound that depends on a new bound on approximating a Gaussian signal as a linear combination of elements of an overcomplete dictionary is given. Further analyses are for dictionaries generated randomly according to a sphericallysymmetric distribution and signals expressible with single dictionary elements. Easilycomputed approximations for the probability of selecting the correct dictionary element and the MSE are given. Asymptotic expressions reveal a critical input signaltonoise ratio for signal recovery.
New Applications of the Sound Description Interchange Format
 Proc. ICMC98, Ann Arbor
, 1998
"... This paper describes the goals and design of SDIF and its standard frame types, followed by a review of recent SDIF work at CNMAT, IRCAM, and IUA. ..."
Abstract

Cited by 40 (5 self)
 Add to MetaCart
This paper describes the goals and design of SDIF and its standard frame types, followed by a review of recent SDIF work at CNMAT, IRCAM, and IUA.
Sound Source Separation in Monaural Music Signals
, 2006
"... Sound source separation refers to the task of estimating the signals produced by individual sound sources from a complex acoustic mixture. It has several applications, since monophonic signals can be processed more efficiently and flexibly than polyphonic mixtures. This thesis deals with the separat ..."
Abstract

Cited by 33 (4 self)
 Add to MetaCart
Sound source separation refers to the task of estimating the signals produced by individual sound sources from a complex acoustic mixture. It has several applications, since monophonic signals can be processed more efficiently and flexibly than polyphonic mixtures. This thesis deals with the separation of monaural, or, onechannel music recordings. We concentrate on separation methods, where the sources to be separated are not known beforehand. Instead, the separation is enabled by utilizing the common properties of realworld sound sources, which are their continuity, sparseness, and repetition in time and frequency, and their harmonic spectral structures. One of the separation approaches taken here use unsupervised learning and the other uses modelbased inference based on sinusoidal modeling. Most of the existing unsupervised separation algorithms are based on a linear instantaneous signal model, where each frame of the input mixture signal is
Extending Spectral Modeling Synthesis with . . .
, 2000
"... Sinusoidal modeling has enjoyed a rich history in both speech and music applications, including sound transformations, compression, denoising, and auditory scene analysis. For such applications, the underlying signal model must efficiently capture salient audio features (Goodwin 1998). In this artic ..."
Abstract

Cited by 31 (1 self)
 Add to MetaCart
Sinusoidal modeling has enjoyed a rich history in both speech and music applications, including sound transformations, compression, denoising, and auditory scene analysis. For such applications, the underlying signal model must efficiently capture salient audio features (Goodwin 1998). In this article, we present an accurate, efficient, and flexible threepart model for audio signals consisting of sines, transients, and noise by extending spectral modeling synthesis (SMS) (Serra and Smith 1990) with an explicit flexible transient model called transientmodeling synthesis (TMS). The sinusoidal transformation system (STS) (McAulay and Quatieri 1986) and SMS find the slowly varying sinusoidal components in a signal using spectralpeakpicking algorithms. Subtracting the synthesized sinusoids from the original signal creates a residual consisting of transients and noise (Serra 1989; George and Smith 1992). However, sinusoids do not model this residual well. Although it is possible to model transients and noise by a sum of sinusoidal signals (as with the Fourier transform), it is neither efficient, because transient and noisy signals require many sinusoids for their description, nor meaningful, because transients are shortlived signals, while the sinusoidal model uses sinusoids that are active on a much larger time scale. In the STS system (generally applied to speech), the transient + noise residual is often masked sufficiently to be ignored (McAulay and Quatieri 1986). In music applications, this residual is often important to the integrity of the
Advances in parametric audio coding
 in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk, New Paltz
, 1999
"... For very low bit rate audio coding applications in mobile communications or on the internet, parametric audio coding has evolved as a technique complementing the more traditional approaches. These are transform codecs originally designed for achieving CDlike quality on one hand, and specialized spee ..."
Abstract

Cited by 29 (5 self)
 Add to MetaCart
(Show Context)
For very low bit rate audio coding applications in mobile communications or on the internet, parametric audio coding has evolved as a technique complementing the more traditional approaches. These are transform codecs originally designed for achieving CDlike quality on one hand, and specialized speech codecs on the other hand. Both of these techniques usually represent the audio signal waveform in a way such that the decoder output signal gives an approximation of the encoder input signal, while taking into account perceptual criteria. Compared to this approach, in parametric audio coding the models of the signal source and of human perception are extended. The source model is now based on the assumption that the audio signal is the sum of “components,” each of which can be approximated by a relatively simple signal model with a small number of parameters. The perception model is based on the assumption that the sound of the decoder output signal should be as similar as possible to that of the encoder input signal. Therefore, the approximation of waveforms is no longer necessary. This approach can lead to a very efficient representation. However, a suitable set of models for signal components, a good decomposition, and a good parameter estimation are all vital for achieving maximum audio quality. We will give an overview on the current status of parametric audio coding developments and demonstrate advantages and challenges of this approach. Finally, we will indicate possible directions of further improvements. 1.
Methods for separation of harmonic sound sources using sinusoidal modeling
 in Proc. AES 106th Convention
, 1999
"... Methods are proposed for separation of harmonic sound sources using sinusoidal modeling. A local nonlinear leastsquares (NLS) frequency estimator is proposed to resolve sinusoids that are close in frequency. An iterative analysis scheme using interpolated parameter trajectories and subtraction of d ..."
Abstract

Cited by 21 (1 self)
 Add to MetaCart
(Show Context)
Methods are proposed for separation of harmonic sound sources using sinusoidal modeling. A local nonlinear leastsquares (NLS) frequency estimator is proposed to resolve sinusoids that are close in frequency. An iterative analysis scheme using interpolated parameter trajectories and subtraction of detected components is presented. A measure is proposed for testing the accuracy of the model. 0
Analysis and Synthesis of PseudoPeriodic Job Arrivals in Grids: A Matching Pursuit Approach
 In Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid, Rio De Janeiro
, 2007
"... Pseudoperiodicity is one of the basic job arrival patterns on dataintensive clusters and Grids. In this paper, a signal decomposition methodology called matching pursuit is applied for analysis and synthesis of pseudoperiodic job arrival processes. The matching pursuit decomposition is well lo ..."
Abstract

Cited by 11 (6 self)
 Add to MetaCart
(Show Context)
Pseudoperiodicity is one of the basic job arrival patterns on dataintensive clusters and Grids. In this paper, a signal decomposition methodology called matching pursuit is applied for analysis and synthesis of pseudoperiodic job arrival processes. The matching pursuit decomposition is well localized both in time and frequency, and it is naturally suited for analyzing nonstationary as well as stationary signals. The stationarity of the processes can be quantitatively measured by permutation entropy, with which the relationship between stationarity and modeling complexity is excellently explained. Quantitative methods based on the power spectrum are also provided to measure the degree of periodicity present in the data. Matching pursuit is further shown to be able to extract patterns from signals, which is an attractive feature from a modeling perspective. Real world workload data from production clusters and Grids are used to empirically evaluate the proposed measures and methodologies. 1
A Comparison between Fixed and Multiresolution Analysis for Onset Detection in Musical Signals
 In Proc. Digital Audio Effects Workshop (DAFx
, 2004
"... A study is presented for the use of multiresolution analysisbased onset detection in the complex domain. It shows that using variable timeresolution across frequency bands generates sharper detection functions for higher bands and more accurate detection functions for lower bands. The resulting me ..."
Abstract

Cited by 9 (0 self)
 Add to MetaCart
(Show Context)
A study is presented for the use of multiresolution analysisbased onset detection in the complex domain. It shows that using variable timeresolution across frequency bands generates sharper detection functions for higher bands and more accurate detection functions for lower bands. The resulting method improves the localisation of onsets on fixedresolution schemes, by favouring the increased time precision of higher subbands during the combination of results.