sam henri gold

I've been thinking a lot about alt text recently. People don't write alt text as often as they should. And when they do, most people are awful at it.

I think this is one of the single best applications for some multi-modal LLM to step in and write alt text for you.

12 May 2023 at 14:41 | Open on hachyderm.io

10 comments

sam henri gold

I think there was a part in Google I/O that demoed something like this. "Coming later". But the state of affairs *today* is tragic.

Open the Instagram app, turn on a screen reader, and listen to the dog shit generated alt text. It just lists out the number of people and objects in the photo. "May be a meme of 2 people. Bath robe, laundry basket, and text."

12 May 2023 at 14:44 | Open on hachyderm.io

@samhenrigold i kinda feel like education is a big thing here… people are awful at it because very few people know what good alt text reads like. feels like an opportunity to guide / teach people — in context — vs glossing over it and making it happen in the background without people thinking about it…

(and then fall back to machine generated / community suggested alt text if the author intentionally skips it)

12 May 2023 at 14:45 | Open on mastodon.social

sam henri gold

@edwellbrook Agreed. For an upcoming Mastodon app update, I've added a dialog that sheds some additional info and tips for alt text. Nothing groundbreaking but a worthwhile first step imo

12 May 2023 at 14:49 | Open on hachyderm.io

DELETED

@samhenrigold

What would an LLM say this is?

color edit of a black and white screenshot in brownish yellow, red and black. twin serpentine shapes are actually synchronized-swimming women shot from above. they have been made uniform by identical swimsuits and bathing caps with hands on the shoulders of the swimmer in front. the effect is an abstract design, two twin snakes in identical, twisted shapes

12 May 2023 at 14:52 | Open on mstdn.social

sam henri gold

@DemocracySpot needs more time in the oven, i guess.

An AI caption generator's output for that image. All of its suggestions are some variant of "food on a table".
a bunch of donuts that are on a table, a close up of a plate of food on a table, a cake on a plate.

12 May 2023 at 14:56 | Open on hachyderm.io

DELETED

@samhenrigold

Wow! 😂 Thanks.

12 May 2023 at 14:58 | Open on mstdn.social

Kevin T

@samhenrigold Yeah, it def feels like a task that could really get the benefits of this kind of tech. At least for providing some kind of basic template framework for a user to start from.

I had to take a class on website accessibility compliance, and part of it focused on writing good image descriptor text; there are a lot of things that people don't even consider which are actually pretty important, and also a ton of stuff that people think is important to include in an image descriptor which is actually really unnecessary/unhelpful. At the very least, a tool providing some kind of heuristic GOOD/OK/BAD could be super useful and shouldn't be toooooo crazy hard to train. :thaenkin:

12 May 2023 at 14:55 | Open on infosec.exchange

sam henri gold

@ovmoro And, ya know, give that model a little nudge in the right direction.

```
if (altText.startsWith("An image of")) {
    return Score.Awful
}
```

12 May 2023 at 15:16 | Open on hachyderm.io

Kevin T

@samhenrigold lol, there are def some worse ways to start a descriptor. But yeah, at the very least "a photo of", "a drawing of", and "a screenshot of" need to rank higher than just "it's a picture" XP

12 May 2023 at 16:11 | Open on infosec.exchange
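
A minimal sketch of the GOOD/OK/BAD heuristic floated in the exchange above, written in TypeScript as a guess at the language of the snippet in the thread. The `Score` values, the two opener lists, and the `scoreAltText` name are all assumptions made for illustration. The only rules taken from the thread are that "An image of" scores badly and that "a photo of", "a drawing of", and "a screenshot of" should rank above "it's a picture".

```typescript
// Hypothetical three-level rating, echoing the GOOD/OK/BAD idea in the thread.
const Score = { Awful: "AWFUL", Ok: "OK", Good: "GOOD" } as const;
type Score = (typeof Score)[keyof typeof Score];

// Generic openers called out in the thread (list is illustrative, not exhaustive).
const GENERIC_OPENERS = ["an image of", "it's a picture"];

// Openers that name the medium, which the thread says should rank higher.
const MEDIUM_OPENERS = ["a photo of", "a drawing of", "a screenshot of"];

function scoreAltText(altText: string): Score {
  const text = altText.trim().toLowerCase();

  // Missing alt text is treated as the worst case (an assumption of this sketch).
  if (text.length === 0) return Score.Awful;

  if (GENERIC_OPENERS.some((opener) => text.startsWith(opener))) return Score.Awful;
  if (MEDIUM_OPENERS.some((opener) => text.startsWith(opener))) return Score.Good;

  // Anything else gets a neutral score; a real tool would need to look at the content.
  return Score.Ok;
}

console.log(scoreAltText("An image of a dog"));
console.log(scoreAltText("A photo of a dog asleep on a red couch"));
```

A prefix check like this only looks at how a description opens, so it could at most nudge an author while they type. It says nothing about whether the description is accurate or useful.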

Thomas

@samhenrigold my suggestion would be that you could propose alt texts for others as a Mastodon feature. Either for images with no text or bad text. As an author, you could accept the proposal and it would edit the post.

12 May 2023 at 16:15 | Open on det.social
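
The propose-and-accept flow suggested above could be modeled roughly as follows. This is a sketch under stated assumptions: every type and function name here (`Post`, `MediaAttachment`, `AltTextProposal`, `acceptProposal`) is hypothetical and none of it reflects Mastodon's actual API or data model.

```typescript
// All names below are hypothetical; this is not Mastodon's API.
interface MediaAttachment {
  id: string;
  altText: string | null; // null when the author skipped alt text
}

interface Post {
  id: string;
  authorId: string;
  media: MediaAttachment[];
}

interface AltTextProposal {
  postId: string;
  mediaId: string;
  proposedBy: string;
  text: string;
}

// Returns an edited copy of the post when its author accepts a proposal.
function acceptProposal(post: Post, proposal: AltTextProposal, acceptedBy: string): Post {
  if (proposal.postId !== post.id) {
    throw new Error("proposal is for a different post");
  }
  if (acceptedBy !== post.authorId) {
    throw new Error("only the author can accept a proposal");
  }
  return {
    ...post,
    media: post.media.map((m) =>
      m.id === proposal.mediaId ? { ...m, altText: proposal.text } : m
    ),
  };
}
```

Returning a copy instead of mutating the post keeps the original around, which would make it easier to show the author a before/after comparison.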