Questions tagged [mfcc]

The mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. MFCCs are commonly used as features in speech recognition systems

9 questions

votes

1 answer

Are MFCCs the optimal method of representing music to a retrieval system?

A signal processing technique, the Mel frequency Cepstrum, is often used to extract information from a musical piece for use in a machine learning task. This method gives a short-term power spectrum, and the coefficients are used as input. In…

asked Feb 13 '12 at 04:43

jonsca

1,790
3
20
30

votes

2 answers

Neural Network for sound classification

I am currently trying to automate some identification process of characteristic noise sounds. For acoustic feature, I calculate MFCC. I have downloaded a free MATLAB toolbox from Dan Ellis'es website. As a classifier, I am trying Neural Network…

classification neural-networks mfcc

asked Sep 16 '16 at 08:31

Celdor

votes

0 answers

A best measure for speaker recognition

I have a set $E_{1}$, with a finite cardinality $n$ of rectangular matrices which contains the useful MFCC coefficients generated from $n$ speech signals. Similary I have a set $E_{2}$ of same cardinality as that of $E_{1}$ which is a collection of…

distance-functions signal-processing mfcc

asked Jun 06 '11 at 09:51

Dinesh

votes

1 answer

MFCCs and MoG-HMMs for speech recognition

BACKGROUND MFCCs are coefficients which represent the most important parts of speech, and about 12 of them are used to model a one 512 points long frame (of speech). Along with them you would use delta coeffients, which track the change of MFCCs…

hidden-markov-model baum-welch speech-recognition mfcc

asked Dec 17 '17 at 18:38

Desperado

votes

0 answers

Defining norm of a matrix of MFCC coefficients

Scenario Construction: I have a MFCC generator block which gets the speech samples from the user and generates a rectangular matrix say $A$ of the order $m \times n$, whose elements are the Cesptral Coefficients(MFCC). Now, suppose I maintain a…

matlab matrix signal-processing mfcc

asked Dec 08 '11 at 15:39

Dinesh

vote

1 answer

Hidden Markov Model with MFCC coefficients

I don't know whether this is the correct forum for this but here goes: I'm trying to implement a Hidden Markov Model to be able to predict and find the best sequence/path for a training file. So far, I have the mel-frequency cepstral coefficients…

machine-learning hidden-markov-model signal-processing mfcc

asked Feb 18 '13 at 16:57

Phorce

vote

1 answer

Principle Components Analysis – using variance as a variable?

I am following a collaborator’s methods to analyze a set of audio recordings, and I have found that she is using principal components analysis in an unexpected way. I am confused by her approach, and would appreciate feedback on whether this is…

variance pca dimensionality-reduction mfcc

asked Jan 28 '19 at 22:24

Amanda S

vote

1 answer

Normalization for MFCC?

I'm planning on using MFCCs extracted from audio signals to make a speaker recognizer. I noticed that the first MFCC term tends to be very large, compared to the others. That's why I think that normalization is needed when working with machine…

normalization hidden-markov-model lstm mfcc

asked May 29 '18 at 10:20

Isaac

votes

1 answer

Validation of Support Vector Machine using sklearn

I have made a recording of two different sounds and I want to use an SVM in order to be able to distinct between the two. The process I have followed is: Divided each sound in multiple 20ms frames. For every frame I calculate the MFCCs, deltas and…

machine-learning classification svm mfcc

asked Feb 08 '16 at 09:30

user103394