596 questions
1
vote
0
answers
65
views
alternative_language_codes with hi-IN causes English speech to be transliterated into Devanagari script
Environment:
API: Google Cloud Speech-to-Text v1
Model: default
Audio: LINEAR16, 16kHz
Speaker: Indian English accent
Issue:
When alternative_language_codes=["hi-IN"] is configured, ...
0
votes
0
answers
66
views
Tracking Speech-to-Text usage per client with gRPC custom metadata in Google Cloud Logs
I am implementing a system using Google Cloud Speech-to-Text (STT) services via gRPC streams and need a reliable way to track usage per client for quota management. I want Google Cloud's logging ...
-3
votes
1
answer
125
views
Is this a Google.Cloud.Speech.V2 client package bug related to PhraseSets or am I using it wrong?
I am migrating code from the v1 package to the v2 package. As far as I can tell the quotas are the same however when I use the v2 package with larger phrase sets I get an error even though I have ...
1
vote
1
answer
74
views
Google cloud speech v1 not giving proper transcription for Spanish when audio contains English and Spanish both
We are using Google Cloud Speech V1 libraries available in c#.
We are using a wav file which contains bilingual content. It is English and Spanish in the same audio.
The problem is Spanish text is not ...
1
vote
0
answers
45
views
ttsClient.SynthesizeLongAudio undefined (type *"cloud.google.com/go/texttospeech/apiv1".Client has no field or method SynthesizeLongAudio)
Triggering a Cloud Run Function 2nd Generation based on changes to files in a Google Cloud Storage bucket.
go.mod
go 1.24
require (
cloud.google.com/go/storage v1.51.0
cloud.google.com/go/...
0
votes
0
answers
29
views
Google Speech-to-Text V2 SpeechAdaption didn't help for special words
I'm having a problem using the cloud stt V2 SpeechAdaption.
I added the some following adaption_phrases to expect the api could recognize them more likely, but actually it's not, still output the &...
1
vote
0
answers
125
views
Google Cloud Speech-to-Text V2 (Android): `SPEECH_ACTIVITY_END` not triggered with `ja-JP` but works with `en-US`
Environment:
Platform: Android (Java)
Google Cloud Speech-to-Text Version: V2
Model: chirp_2
Endpoint: us-central1-speech.googleapis.com:443
Description:
I’m using Google Cloud Speech-to-Text V2 ...
0
votes
0
answers
113
views
Google Cloud Speech-to-Text V2 StreamingRecognize CANCELLED when using `cmn-Hans-CN`
I’m using Google Cloud Speech-to-Text V2 in an Android (Java) app. With languageCode set to "ja-JP" or "en-US", streaming recognition works perfectly. As soon as I switch only the ...
2
votes
1
answer
115
views
How to fix av_interleaved_write_frame() broken pipe error in php
I have an issue using ffmpeg to stream audio and parse to google cloud speech to text in PHP.
It returns this output.
I have tried delaying some part of the script, that did not solve it.
I have also ...
0
votes
0
answers
168
views
Speech to Text error - "Permission 'speech.recognizers.recognize' denied on resource (or it may not exist)"
I have been able to get speech to text v1 working in Postman no problem, however, getting v2 to work has been a bit of a nightmare.
I am currently getting the error ""Permission 'speech....
1
vote
1
answer
63
views
extracting uri from results in BatchRecognizeResponse in Python
I'm using Google Cloud Speech-to-Text V2 API, specifically the BatchRecognizeResponse type. This response is structured as a protobuf object with multiple embedded fields, but I’m having difficulty ...
0
votes
0
answers
113
views
How to get fine grained smaller time resolution srt from google cloud speech to text API?
I have a working code in python which generates an srt file using google cloud speech to text.
from google.api_core.client_options import ClientOptions
from google.cloud.speech_v2 import SpeechClient
...
0
votes
0
answers
134
views
Does Google Speech-To-Text service supports OGG_OPUS or WEBM_OPUS for realtime streaming recognize?
I was able to get it working with LINEAR16 encoding, but can't with OGG_OPUS. So question is, is it really supported? I didn't find any related information in docs unfortunately.
0
votes
0
answers
175
views
Get lyrics of song by google speech to text
In my Nodejs server am using Google's speech to text API to get the lyrics of song, but it doesn't seem to work well with music. I loose most part of words, so my question is, does this api work with ...
0
votes
0
answers
103
views
Google Cloud Speech-to-Text API: Unauthenticated Error in Production but Works in Development
I'm using the Google Cloud Speech-to-Text API in my ASP.NET Core web application, and it works perfectly in my local development environment. However, when I deploy the application to the production ...