speech emotion recognition from log-Mel spectrogram using vertically long patch

This repo is the official implementation of "Accuracy Enhancement Method for Speech Emotion Recognition from Spectrogram using Temporal Frequency Correlation and Positional Information Learning through Knowledge Transfer".

How to load model

I saved the model as python dict() format like below:

torch.save({
            'model_state_dict': model.state_dict(),
            'CE': cross_entropy_loss
            'L1': L1_loss
            ...
            }, PATH)

So, you can load the pretrained weight like below:

# model.load_state_dict(torch.load(PATH_WEIGHT_FILE)['model_state_dict'])
model.load_state_dict(torch.load('./weight/teacher_92.64_CREMA_D.ckpt')['model_state_dict'])

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
fig		fig
weight		weight
README.md		README.md
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech emotion recognition from log-Mel spectrogram using vertically long patch

How to load model

About

Releases

Packages

Languages

kjy7567/speech_emotion_recognition_from_log_Mel_spectrogram_using_vertically_long_patch

Folders and files

Latest commit

History

Repository files navigation

speech emotion recognition from log-Mel spectrogram using vertically long patch

How to load model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages