当前位置:   article > 正文

部分语音情感识别数据集解析(EMO-DB,RAVDESS,SAVEE)_savee数据集

savee数据集
  1. EMO-DB:
    1. 德语,10 个人(5 名男性,5 名女性)的大约 500 个音频,表达了 7 种不同的情绪(倒数第二个字母表示情绪类别):N = neutralW = angryA = fearF = happyT = sadE = disgustL = boredom
    2. 文件名每个字母的对应:
      1. 有一些版本可能还有第7个letter,暂时不清楚含义,但也应该没有太大作用。
      2. positon 6 对应 情感:
        1. W:anger
        2. L:boredom
        3. E:disgust
        4. A:anxiety/fear
        5. F:happiness
        6. T:sadness
        7. N:neutral version

      3. Positions 3-5 对应的语音内容(Code of texts,此处写出的是由语音中的德语转为了英语):
        1. a01 the tablecloth is lying on the frigde.
        2. a02 she will hand it in on wednesday.
        3. a04 tonight I cound tell him.
        4. a05 the black sheet of paper is located up there besides the piece of timber.
        5. a07 in seven hours it will be.
        6. b01 what about the bags standing there under the table?
        7. b02 they just carried it upstairs and now they are going down again.
        8. b03 currently at the weekends i always went home and saw agnes.
        9. b09 i will just discard this and then go for a drink with karl
        10. b10 it will be in the place where we always store it.
      4. Positions 1-2 对应的人的性别及年龄,Information about the speakers:
        1. 03 - male, 31 years old
        2. 08 - female, 34 years
        3. 09 - female, 21 years
        4. 10 - male, 32 years
        5. 11 - male, 26 years
        6. 12 - male, 30 years
        7. 13 - female, 32 years
        8. 14 - female, 35 years
        9. 15 - male, 25 years
        10. 16 - female, 31 years
  2. RAVDESS:文件名由 7 部分数字标识符组成(例如,02-01-06-01-02-01-12.mp4)。这些标识符定义了刺激特征:
    1. 文件名标识符
      1. Modality (01 = full-AV, 02 = video-only, 03 = audio-only).
      2. Vocal channel (01 = speech, 02 = song).
      3. Emotion (01 = neutral, 02 = calm, 03 = happy, 04 = sad, 05 = angry, 06 = fearful, 07 = disgust, 08 = surprised).
      4. Emotional intensity (01 = normal, 02 = strong). NOTE: There is no strong intensity for the 'neutral' emotion.
      5. Statement (01 = "Kids are talking by the door", 02 = "Dogs are sitting by the door").
      6. Repetition (01 = 1st repetition, 02 = 2nd repetition).
      7. Actor (01 to 24. Odd numbered actors are male, even numbered actors are female).
    2. 文件名示例:02-01-06-01-02-01-12.mp4
      1. Video-only (02)
      2. Speech (01)
      3. Fearful (06)
      4. Normal intensity (01)
      5. Statement "dogs" (02)
      6. 1st Repetition (01)
      7. 12th Actor (12)
      8. Female, as the actor ID number is even
    3. 英文,24 个人(12 名男性,12 名女性)的大约 1500 个音频,表达了 8 种不同的情绪(第三位数字表示情绪类别):01 = neutral02 = calm03 = happy04 = sad05 = angry06 = fearful07 = disgust08 = surprised
  3. SAVEE
    1. Speaker:“DC”、“JE”、“JK”和“KL”是为SAVE数据库记录的四位男性演讲者
    2. Audio data:
      1. 音频文件由以44.1 kHz采样的WAV音频文件组成
      2. 7种情绪类别中的每一种都有15个句子。
      3. 文件名的首字母表示情感类别,后面的数字表示句子编号。
      4. The letters 'a', 'd', 'f', 'h', 'n', 'sa' and 'su' represent 'anger', 'disgust', 'fear', 'happiness', 'neutral', 'sadness' and 'surprise' emotion classes respectively. 
      5. E.g., 'd03.wav' is the 3rd disgust sentence. 
声明:本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:【wpsshop博客】
推荐阅读
相关标签
  

闽ICP备14008679号