@david_chisnall I scrolled through a lot of the replies here, it's unfortunate I don't see accessibility mentioned. You alluded to it with your car example.
I don't think I get why the issue isn't that LLMs can't understand what you're saying, not that speech input can be ambiguous.
"OK google, play XYZ"
Sure, playing XYZ on YouTube music. You will need to install..
"Cancel. Play XYZ on Spotify"
Sure, playing XYZ on Spotify
An actual interaction I've had with my car (Gemini wasn't used)