javascript - Web Speech API: Setting a Custom Audio Output Device (Speaker) for TTS Playback with sinkId

I'm building a Text-to-Speech (TTS) component in React, where I want to specify the audio output device using sinkId (e.g., a selected speaker). Despite setting sinkId on the audio element, the audio consistently plays through the default output device instead of the selected one.

In this setup, I'm using SpeechSynthesisUtterance to generate TTS playback, and I tried routing the audio through an AudioContext with a MediaStreamDestination to control the output. However, sinkId does not seem to take effect. Any suggestions for correctly setting the output device?

example code :

import { useEffect, useRef, useState } from "react";
import { useDispatch, useSelector } from "react-redux";

const UseTTS = () => {
    const [isSpeaking, setSpeaking] = useState(false);
    const [isPaused, setIsPaused] = useState(false);

    const audioContextRef = useRef(null);
    const audioElement = useRef(null);
    const utteranceRef = useRef(null);

    const { selectedSpeakerSrc } = useSelector(state => state.setup);

    useEffect(() => {
        // Initialize AudioContext and audio element
        if (!audioContextRef.current) {
            audioContextRef.current = new (window.AudioContext || window.webkitAudioContext)();
        }
        if (!audioElement.current) {
            audioElement.current = new Audio();
        }
    }, []);

    const handlePlay = async () => {
        const ttsText = "Hello, this is a test message!";
        
        if (!utteranceRef.current) {
            utteranceRef.current = new SpeechSynthesisUtterance(ttsText);
            utteranceRef.current.lang = 'en-US';
            utteranceRef.current.onend = handleSpeechEnd;
        }

        try {
            if (audioElement.current.setSinkId && selectedSpeakerSrc) {
                await audioElement.current.setSinkId(selectedSpeakerSrc);
            }
        } catch (error) {
            console.error(`Failed to set sinkId: ${error}`);
        }

        utteranceRef.current.voice = speechSynthesis.getVoices()[0];
        speechSynthesis.speak(utteranceRef.current);
        setSpeaking(true);
    };

    const handleSpeechEnd = () => {
        setSpeaking(false);
        utteranceRef.current = null;
    };

    return {
        isSpeaking,
        handlePlay,
    };
};

export default UseTTS;

asked Nov 11, 2024 at 17:33

Aadarsh velu

351 silver badge7 bronze badges

1

There was github.com/WICG/speech-api/issues/69 that should have gotten there, but it stall.
– Kaiido
Commented Nov 11, 2024 at 23:38
Thanks @kaiido, seen the thread, seems stall
– Aadarsh velu
Commented Nov 12, 2024 at 3:04

Add a comment |

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Web Speech API: Setting a Custom Audio Output Device (Speaker) for TTS Playback with sinkId

0

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.