All Questions

0 votes
1 answer
67 views

I can't seem to get my Android app to work with Vosk on macOS 12.7

I have been through the wringer trying to get voice recognition working for an Android app I'm developing on macOS 12.7, using Python 3.10, Kivy 2.3, the speechrecognition requirement, and Vosk 3.44, I've been ...
GX1705
  • 3
0 votes
0 answers
26 views

How to Capture Loopback Audio with SpeechRecognition (PyAudio)?

I’m working on a project where I need to use the speech_recognition module to process audio in real time. However, from my research, it seems that speech_recognition (which works with pyaudio) only ...
Priyal Deep
0 votes
0 answers
21 views

SpeechRecognition and PocketSphinx don't work

I'm following a YouTube tutorial to make an assistant in Python. When I say "Piggy", it responds as usual, but after that, when I say "how are you", it gives me this error. The guy in the ...
Jhpark0303
0 votes
0 answers
32 views

"BrokenPipeError: [Errno 32] Broken pipe" when sending a Python script's output to a while loop

I've been racking my brain for a few hours now and cannot get this working. I have been to the third page of Google results but still cannot get it right. Code: #!/bin/bash python3.12 -m ...
usr_40476
0 votes
0 answers
40 views

PyAnnote Speaker Verification: All Speakers Getting Perfect 1.000 Similarity Scores

I'm experiencing an issue with PyAnnote's speaker verification where all speakers are getting perfect similarity scores (1.000), even when they are clearly different voices. Environment pyannote....
user29588450
0 votes
0 answers
118 views

How to use speech_recognition and pyannote.audio simultaneously

How can I use the data from speech_recognition's listen() function as an embedding to compare with previously recorded .wav files of different speakers talking so that I can print (speaker): (...
Flamethrower
0 votes
0 answers
37 views

Why is Google recognizer missing in my SpeechRecognition library?

I'm trying to use the Google recognizer from the SpeechRecognition library in Python: import speech_recognition as sr rec = sr.Recognizer() with sr.Microphone() as mic: rec....
Quasartioon
1 vote
0 answers
50 views

speech_recognition and gtts don't understand numbers lower than 11

I put together straightforward code that asks the user to choose between option 1, oranges, and option 2, pears: options = { (1, "1", "one", "number one", "...
Louie Morais
0 votes
2 answers
2k views

Use Vosk speech recognition with Python

I'm trying to use Vosk speech recognition in a Python script, but the result is always: { "text" : "" }. It's not a problem with my file because when I use in DOS "vosk-...
Rémi Descamps
0 votes
0 answers
147 views

Montreal Forced Aligner (MFA) taking too much time (almost 18 days and still going) to train on a 33 GB corpus

We are using Montreal Forced Aligner (MFA) 3.x to train an acoustic model on a large dataset (~33 GB of audio and transcripts in an Indian language). The training process takes an extremely long time (...
Swayangjit
  • 1,881
2 votes
0 answers
37 views

How can I prevent muffled audio?

I am making an assistant in Python using SpeechRecognition and some other libraries. I use this library for both getting the voice and turning it into text. But when I try to listen to some audio, ...
PrinceMask
0 votes
2 answers
5k views

I am using the speech_recognition module, but it gives me this error: ModuleNotFoundError: No module named 'aifc'

import speech_recognition as sr r = sr.Recognizer() with sr.Microphone() as source: print("Talk") audio_text = r.listen(source) print("Time over, thanks") try: ...
Budithi Sai Sudhanshu
0 votes
0 answers
82 views

Google Cloud Speech-To-Text tagging requests for billing purposes

I am looking to reduce the amount of hassle we get by managing multiple customers within the same app. Today we're using service account JSON files for authentication, one file for each customer. We ...
user3218338
0 votes
0 answers
60 views

Reading from a queue in sounddevice with a long lag

I am writing a script for real-time speech translation. Sound from a microphone is put in a Python queue, with another process getting sound chunks from the queue, transforming speech into text and ...
Denis Kazakov
1 vote
0 answers
24 views

Python `speech_recognition` Script Stuck on `recognizer.listen()` Without `phrase_time_limit`

I'm working with the speech_recognition library in Python, and my script is getting stuck on the recognizer.listen() call. I want to continuously listen for commands after detecting the wake word but ...
Sandip Mishra
