Transformers-based Approach for Speech Emotion Recognition

dc.contributor.author KERKAR , Aya ; HAZMOUNE , Samira
dc.date.accessioned2024-10-07T09:21:13Z
dc.date.available2024-10-07T09:21:13Z
dc.date.issued2024
dc.description.abstractMost of the smart devices voice assistants or robots present in the world are not smart enough to understand emotions. They are just like command and follow devices they have no emotional intelligence. When people are talking to each other based on their voice they understand the situation and react to it, for instance, if someone is angry then another person will try to calm him by conveying in a soft tone, these kinds of harmonic changes are not possible with smart devices or voice assistants as they lack emotional intelligence. So adding emotions and making devices understand emotions will significantly enhance their capabilities and take them one step further to human-like intelligence. To address this limitation, our system introduces a novel approach to integrating emotional intelligence into smart devices. The proposed approach in this thesis follows a typical machine learning workflow, encom- passing data preparation, model training, and evaluation. It leverages pre-trained models and transfers learning for feature extraction from emotion datasets, with key components including Mel-frequencyspectrogramextractionalongsidetheWev2vecpre-trainedTransformermodelfor feature extraction. Other steps involve dataset splitting, fine-tuning the HuBERT pre-trained model for SER, and emotion classification. The system also facilitates speakergender identifica- tion (male or female). Standard datasets RAVDESS and CREMA-D were utilized for training and evaluation, yielding accuracies of 84.25% and 71%, respectively
dc.identifier.urihttp://dspace.univ-skikda.dz:4000/handle/123456789/2547
dc.language.isoen
dc.publisherFaculty of Sciences
dc.titleTransformers-based Approach for Speech Emotion Recognition
dc.title.alternativeArtificial Intelligence
dc.typeMasters degree Thesis
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Transformers_based_approach_for_speech_emotion_recognition-1.pdf
Size:
7.02 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:
Collections