DMCA
A Music Retrieval System based on Query-by-singing for
Citations: | 1 - 1 self |
Citations
4312 |
Estimating the dimension of a model
- Schwarz
- 1978
(Show Context)
Citation Context ...hods are proposed to extract vocal’s melody from accompanied Karaoke tracks bysreducing the interference from the background accompaniments. In parallel, we applysBayesian Information Criterion (BIC) =-=[16]-=- to detect the onset time of each phrase insthe accompanied vocal channel, which enables the subsequent DTW-based similarityscomparison to be performed more efficiently. The proposed system further us... |
140 |
Fastdtw: Toward accurate dynamic time warping in linear time and space
- Salvador, Chan
- 2004
(Show Context)
Citation Context ...s shown in Fig. 4(a), the computational complexity in terms of the number of thesnecessary distance computation D(⋅) for constructing a T×L table iss. 4 )( 2 2 2 2 22 LTTL LLTTLTComplexity +−= ×−×−×=s=-=(15)-=-sAs mentioned earlier, since L is usually set to be kT, where , thescomputational complexity can be rewritten ass22/1 ≤≤ k . 4 )14( 22 TkkComplexity −−=s(16)sAlthough the DTW recursion allows a docume... |
84 | Scaling up Dynamic Time Warping for Datamining Applications. In
- Keogh, Pazzani
- 2000
(Show Context)
Citation Context ...al DTW.sSince the complexity is O(T 2 ), the most promising way to speed up the searchingsprocess is to reduce the value of T. Motivated by Keogh and Pazzani’ssPiecewisesAggregate Approximation (PAA) =-=[14]-=-, we propose a dimensionality reductionstechnique, called Multi-Level Data Abstraction (MLDA). Unlike PAA, which dividessa time series into equal-length frames, and then calculates the mean value of t... |
57 |
Query by Humming”,
- Ghias, Logan, et al.
- 1995
(Show Context)
Citation Context ...ween mt and m′t. The optimal b can besfound by choosing one of the possible values within a pre-set range, B, that results insthe smallest estimation error, i.e.,s||minarg* * btbt BbB ab +≤≤− −= mc ,s=-=(1)-=-swheresis the optimal amplitude scaling factor given mt+b. Letting ∂|ct −sabmt+b|2/∂ab = 0, we have a minimum mean-square-error solution of ab ass* ba . |||| 2 * bt btt ba + +′= m mcs(2)sAccordingly, ... |
49 |
A practical query-by-humming system for a large music database
- Kosugi, Nishihara, et al.
- 2000
(Show Context)
Citation Context ...nventory of possiblesnotes performed by a singer, and xt,j denote the signal’s energy with respect to FFTsindex j in frame t, where 1 ≤ j ≤ J. The sung note of a recording in frame t issdetermined bys=-=(3)-=- ,smaxarg 0 12 , 1 ⎟⎠ ⎞⎜⎝ ⎛= ∑ = +≤≤ C c cnt c t yho Nn where C is a pre-set number of harmonics concerned, h is a positive value less than 1sfor discounting higher harmonics, and yt,n is the signal’s... |
46 | A Predominant-F0 Estimation Method for CD Recordings: MAP Estimation using EM Algorithm for Adaptive Tone Models
- Goto
(Show Context)
Citation Context ... L, wheresD(t, l) is the distance between note sequences {q1, q2,…, qt} and {u1, u2,…, ul},scomputed using:s,s),()2,1( ),()1,1( ),(2)1,2( min),( ⎪⎩ ⎪⎨ ⎧ +−− −+−− ×+−− = ll ll ll l tdtD tdtD tdtD tD εs=-=(9)-=-sandsd(t, l) = | qt − ul| ,s(10)swhere ε is a small constant that favors the mapping between notes qt and ul, given thesdistance between note sequences {q1, q2,…,qt-1} and {u1, u2,…, ul-1}.sTo compens... |
36 |
Robust polyphonic music retrieval with N-grams, Journal of Intelligent Information Systems 21(1) (2002) 53–70. at
- Doraisamy, Ruger
(Show Context)
Citation Context ... dropout errors, note insertionserrors, etc. To handle these errors, various approximate matching methods, such assdynamic time warping (DTW) [5][11-12], hidden Markov model [13], and N-gramsmodel [8]=-=[10]-=-, have been studied, with DTW being the most popular. However, duesto the considerable time consumption for DTW, another key issue on designing asquery-by-singing MIR system is how to speed up the sim... |
25 |
K.: Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System
- Song, Bae, et al.
- 2002
(Show Context)
Citation Context ...sted note sequence o′ =s{o′1, o′2,…, o′T } is obtained bys⎪⎪ ⎪ ⎩ ⎪⎪ ⎪ ⎨ ⎧ −<−⎥⎦ ⎥⎢⎣ ⎢ −−×− >−⎥⎦ ⎥⎢⎣ ⎢ +−×− ≤− =′ , )2/(sif,s12 2/12s)2/(sif,s12 2/12 )2/(||ifs,sRooRooo RooRooo Rooo o t t t t t t tt ts=-=(6)-=-swhere R is the normal varying range of the sung notes in a sequence, say 22, and osissthe mean note computed by averaging all the notes in o. In Eq. (6), a note ot issconsidered as a wrong note and n... |
24 | An Approach Towards a Polyphonic Music Retrieval System
- Doraisamy, Rüger
- 2001
(Show Context)
Citation Context ...ote dropout errors, note insertionserrors, etc. To handle these errors, various approximate matching methods, such assdynamic time warping (DTW) [5][11-12], hidden Markov model [13], and N-gramsmodel =-=[8]-=-[10], have been studied, with DTW being the most popular. However, duesto the considerable time consumption for DTW, another key issue on designing asquery-by-singing MIR system is how to speed up the... |
23 | Hierarchical Filtering Method for Content-based Music Retrieval via Acoustic
- Jang
- 2001
(Show Context)
Citation Context ...rs,sa sung query may contain inevitable tempo errors, note dropout errors, note insertionserrors, etc. To handle these errors, various approximate matching methods, such assdynamic time warping (DTW) =-=[5]-=-[11-12], hidden Markov model [13], and N-gramsmodel [8][10], have been studied, with DTW being the most popular. However, duesto the considerable time consumption for DTW, another key issue on designi... |
21 |
An Approximate String Matching Algorithm for Content-Based Music Data Retrieval.
- Liu, Hsu, et al.
- 1999
(Show Context)
Citation Context ...w phrase onset time t′os as tos-r/2, clone the subsequencesof notes of a music document starting from t′os with a length L of kT+r to u, andsdefine the boundary conditions for the DTW recursion as,s.s=-=(11)-=-s)2,3(2)1,1()2,3( 1, 12),,2()1,1( ),2( , 1),,1( ),1( 4,)2,( 2,)1,( )1,1()1,1( ⎪⎪ ⎪⎪ ⎪ ⎩ ⎪⎪ ⎪⎪ ⎪ ⎨ ⎧ ×+= ⎩⎨ ⎧ ≤<+∞ +≤≤+−= ⎩⎨ ⎧ ≤<∞ ≤≤= ≤≤∞= ≤≤∞= = ddD Llr rdd D Llr rd D TttD TttD dD llll lll After the... |
20 | Music signal spotting retrieval by a humming query using start frame feature dependent continuous dynamic programming
- Nishimura, Hashiguchi, et al.
- 2001
(Show Context)
Citation Context ... number of harmonics concerned, h is a positive value less than 1sfor discounting higher harmonics, and yt,n is the signal’s energy on note en in frame t,sestimated bys,smax ,, )( , jtnnt xy ejUj =∀=s=-=(4)-=-sands, 5.69 440 )(log12)( 2 ⎥⎦ ⎥⎢⎣ ⎢ +⎟⎠ ⎞⎜⎝ ⎛⋅= jFjUs(5)swhere ⎣ ⎦sis a floor operator, F(j) is the corresponding frequency of FFT index j, andsU(⋅) represents a conversion between the FFT indices an... |
8 |
Music Retrieval by Humming
- Kosugi, Nishihara, et al.
- 1999
(Show Context)
Citation Context ... +≤≤− −= mc ,s(1)swheresis the optimal amplitude scaling factor given mt+b. Letting ∂|ct −sabmt+b|2/∂ab = 0, we have a minimum mean-square-error solution of ab ass* ba . |||| 2 * bt btt ba + +′= m mcs=-=(2)-=-sAccordingly, the underlying vocal signal in frame t can be estimated bysst = ct − mt+b*. Fig. 2(c) shows the resulting waveform of the accompanied vocalschannel after background music reduction. We c... |
8 |
A MelodyBased Similarity Computation Algorithm for Musical Information
- Mo, Han, et al.
- 1999
(Show Context)
Citation Context ...+−= ⎩⎨ ⎧ ≤<∞ ≤≤= ≤≤∞= ≤≤∞= = ddD Llr rdd D Llr rd D TttD TttD dD llll lll After the distance matrix D is constructed, the similarity between q and u can besevaluated bys).,(min),( 2/ l l TDS LT ≤≤=uqs=-=(12)-=-s5.2 Multiple-Pass DTW to Improve Retrieval AccuracysSince a query may be sung in a different key or register than the target musicsdocument, i.e., the so-called transposition, the resulting note sequ... |
5 | A query-by-singing technique for retrieving polyphonic objects of popular music
- Yu, Tsai, et al.
- 2005
(Show Context)
Citation Context ...ote Sequence GenerationsAfter reducing the undesired background accompaniments, the next step is to convertseach recording from its waveform representation into a sequence of musical notes.sFollowing =-=[7]-=-, the converting method begins by computing the short-term FastsFourier Transform (FFT) of a signal. Let e1, e2,…, eN be the inventory of possiblesnotes performed by a singer, and xt,j denote the sign... |
1 |
Effectiveness of HMM-based Retrieval on Large
- Shifrin, Burmingham
- 2003
(Show Context)
Citation Context ...vitable tempo errors, note dropout errors, note insertionserrors, etc. To handle these errors, various approximate matching methods, such assdynamic time warping (DTW) [5][11-12], hidden Markov model =-=[13]-=-, and N-gramsmodel [8][10], have been studied, with DTW being the most popular. However, duesto the considerable time consumption for DTW, another key issue on designing asquery-by-singing MIR system ... |