Skip to content

kjy7567/speech_emotion_recognition_from_log_Mel_spectrogram_using_vertically_long_patch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

speech emotion recognition from log-Mel spectrogram using vertically long patch

This repo is the official implementation of "Accuracy Enhancement Method for Speech Emotion Recognition from Spectrogram using Temporal Frequency Correlation and Positional Information Learning through Knowledge Transfer".

How to load model

I saved the model as python dict() format like below:

torch.save({
            'model_state_dict': model.state_dict(),
            'CE': cross_entropy_loss
            'L1': L1_loss
            ...
            }, PATH)

So, you can load the pretrained weight like below:

# model.load_state_dict(torch.load(PATH_WEIGHT_FILE)['model_state_dict'])
model.load_state_dict(torch.load('./weight/teacher_92.64_CREMA_D.ckpt')['model_state_dict'])

About

speech emotion recognition from log mel spectrogram

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages