I think there was a part in Google IO that demoed something like this. "Coming later". But the state of affairs *today* is tragic.
Open the instagram app, turn on a screen reader, and listen to the dog shit generated alt text. It just lists out the number of people and objects in the photo. "May be a meme of 2 people. Bath robe, laundry basket, and text."