The mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. MFCCs are commonly used as features in speech recognition systems
Questions tagged [mfcc]
9 questions
10
votes
1 answer
Are MFCCs the optimal method of representing music to a retrieval system?
A signal processing technique, the Mel frequency Cepstrum, is often used to extract information from a musical piece for use in a machine learning task. This method gives a short-term power spectrum, and the coefficients are used as input.
In…

jonsca
- 1,790
- 3
- 20
- 30
3
votes
2 answers
Neural Network for sound classification
I am currently trying to automate some identification process of characteristic noise sounds. For acoustic feature, I calculate MFCC. I have downloaded a free MATLAB toolbox from Dan Ellis'es website. As a classifier, I am trying Neural Network…

Celdor
- 527
- 4
- 13
3
votes
0 answers
A best measure for speaker recognition
I have a set $E_{1}$, with a finite cardinality $n$ of rectangular matrices which contains the useful MFCC coefficients generated from $n$ speech signals. Similary I have a set $E_{2}$ of same cardinality as that of $E_{1}$ which is a collection of…

Dinesh
- 71
- 5
2
votes
1 answer
MFCCs and MoG-HMMs for speech recognition
BACKGROUND
MFCCs are coefficients which represent the most important parts of speech, and about 12 of them are used to model a one 512 points long frame (of speech). Along with them you would use delta coeffients, which track the change of MFCCs…

Desperado
- 131
- 4
2
votes
0 answers
Defining norm of a matrix of MFCC coefficients
Scenario Construction:
I have a MFCC generator block which gets the speech samples from the user and generates a rectangular matrix say $A$ of the order $m \times n$, whose elements are the Cesptral Coefficients(MFCC). Now, suppose I maintain a…

Dinesh
- 71
- 5
1
vote
1 answer
Hidden Markov Model with MFCC coefficients
I don't know whether this is the correct forum for this but here goes:
I'm trying to implement a Hidden Markov Model to be able to predict and find the best sequence/path for a training file.
So far, I have the mel-frequency cepstral coefficients…

Phorce
- 111
- 4
1
vote
1 answer
Principle Components Analysis – using variance as a variable?
I am following a collaborator’s methods to analyze a set of audio recordings, and I have found that she is using principal components analysis in an unexpected way. I am confused by her approach, and would appreciate feedback on whether this is…

Amanda S
- 11
- 2
1
vote
1 answer
Normalization for MFCC?
I'm planning on using MFCCs extracted from audio signals to make a speaker recognizer. I noticed that the first MFCC term tends to be very large, compared to the others. That's why I think that normalization is needed when working with machine…

Isaac
- 115
- 1
- 4
0
votes
1 answer
Validation of Support Vector Machine using sklearn
I have made a recording of two different sounds and I want to use an SVM in order to be able to distinct between the two. The process I have followed is:
Divided each sound in multiple 20ms frames.
For every frame I calculate the MFCCs, deltas and…

user103394
- 3
- 1