I've been thinking a lot about alt text recently. People don't write alt text as often as they should. And when they do, most people are awful at it.
I think this is one of the single best applications for a multimodal LLM: let it step in and write the alt text for you.
I think there was a segment at Google I/O that demoed something like this. "Coming later". But the state of affairs *today* is tragic.
Open the Instagram app, turn on a screen reader, and listen to the dogshit auto-generated alt text. It just lists the number of people and the objects in the photo. "May be a meme of 2 people. Bath robe, laundry basket, and text."