当前位置:   article > 正文

librosa 语音识别 学习笔记_librosa声音识别

librosa声音识别

目录

阿里达摩院FunASR:一款高效的端到端语音识别工具包

不错的功能介绍

librosa安装

语音识别


阿里达摩院FunASR:一款高效的端到端语音识别工具包

不错的功能介绍

librosa,一个很有趣的 Python 库! - 简书

音频转特征向量

GitHub - librosa/librosa: Python library for audio and music analysis

librosa安装

2024.04.27 测试ok Win11系统

pip install librosa

  1. import os
  2. import numpy as np
  3. from transformers import Wav2Vec2Processor, Wav2Vec2Model
  4. import torch
  5. import librosa
  6. def load_example_input(audio_path, processor=None):
  7. if processor is None:
  8. processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base-960h")
  9. speech_array, sampling_rate = librosa.load(os.path.join(audio_path), sr=16000)
  10. audio_feature = np.squeeze(processor(speech_array, sampling_rate=sampling_rate).input_values)
  11. audio_feature = np.reshape(audio_feature, (-1, audio_feature.shape[0]))
  12. return torch.FloatTensor(audio_feature)
  13. audio_path=r'demo/wav/man.wav'
  14. load_example_input(audio_path)

语音识别

pip install SpeechRecognition

pip install pyaudio

  1. import librosa
  2. import speech_recognition as sr
  3. # 录制音频
  4. r = sr.Recognizer()
  5. with sr.Microphone() as source:
  6. print("请开始说话...")
  7. audio = r.listen(source)
  8. # 将音频转换为文本
  9. try:
  10. text = r.recognize_google(audio)
  11. print("识别结果:", text)
  12. except sr.UnknownValueError:
  13. print("无法识别音频")
  14. except sr.RequestError as e:
  15. print(f"请求出错:{e}")

本文内容由网友自发贡献,转载请注明出处:【wpsshop博客】
推荐阅读
相关标签
  

闽ICP备14008679号