A.V.

@dalias @bedast Speech recognition has used language models for decades now. It was one of the original applications of language models, way before they scaled up to aping Shakespeare.

But even without language models, the act of transcription is very close to generative AI, as it's the task of predicting the next text token, given the previous tokens and an encoded audio sequence.
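A minimal sketch of that loop, to make the "predict the next token" framing concrete. Everything here is hypothetical: `greedy_transcribe`, `toy_scores`, and the three-word vocabulary are made up for illustration, and the "audio encoding" is just a list of integers standing in for what a real encoder would produce. A real recognizer would use a trained network and usually a beam search rather than a greedy pick.

```python
from typing import Callable, Dict, List, Sequence

EOS = "<eos>"
VOCAB = ["hello", "world", EOS]  # hypothetical toy vocabulary


def greedy_transcribe(
    audio_encoding: Sequence[int],
    next_token_scores: Callable[[Sequence[int], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> List[str]:
    """Emit text tokens one at a time, each conditioned on the encoded
    audio and on the tokens emitted so far."""
    tokens: List[str] = []
    for _ in range(max_len):
        scores = next_token_scores(audio_encoding, tokens)
        best = max(scores, key=scores.get)  # greedy: take the top-scoring token
        if best == EOS:
            break
        tokens.append(best)
    return tokens


def toy_scores(audio_encoding: Sequence[int], prefix: List[str]) -> Dict[str, float]:
    """Stand-in for a trained model (an assumption, not a real API).
    Each integer in audio_encoding indexes the token 'heard' at that step;
    once the audio is used up, the end-of-sequence token wins."""
    pos = len(prefix)
    target = VOCAB[audio_encoding[pos]] if pos < len(audio_encoding) else EOS
    return {tok: (1.0 if tok == target else 0.0) for tok in VOCAB}


transcript = greedy_transcribe([0, 1], toy_scores)
```

The shape is the same as text generation: the only difference is that the scoring function also sees the audio.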

10 January at 13:10 | Open on sigmoid.social

3 comments

Rich Felker

@varavs @bedast Then don't call it "AI".

But also, question what harms are coming out of the predictive models. The more they force the output to sound natural and fix misrecognitions, the greater the chance they're altering meaning. Same as autocorrect vs typed text with typos and misspellings.
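A toy illustration of that risk, under loud assumptions: the two hypotheses and all the log-probabilities below are invented numbers, and `pick_hypothesis` is a hypothetical helper, not any real recognizer's API. It combines an acoustic score with a weighted language-model score (similar in spirit to what is often called shallow fusion) and shows the output flipping away from what the audio favours once the language model is given enough weight.

```python
from typing import Dict


def pick_hypothesis(
    acoustic_logp: Dict[str, float],
    lm_logp: Dict[str, float],
    lm_weight: float,
) -> str:
    """Return the hypothesis with the best combined score:
    acoustic log-probability plus lm_weight times LM log-probability."""
    return max(
        acoustic_logp,
        key=lambda h: acoustic_logp[h] + lm_weight * lm_logp[h],
    )


# Invented numbers: the audio slightly favours the unusual phrase,
# while the language model strongly prefers the common one.
acoustic = {"wreck a nice beach": -1.0, "recognize speech": -1.5}
lm = {"wreck a nice beach": -8.0, "recognize speech": -3.0}

audio_only = pick_hypothesis(acoustic, lm, lm_weight=0.0)
with_lm = pick_hypothesis(acoustic, lm, lm_weight=0.5)
```

With `lm_weight=0.0` the unusual phrase the audio favours is kept; with `lm_weight=0.5` the more natural-sounding phrase wins, whether or not it is what was said.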

10 January at 13:14 | Open on hachyderm.io

Rich Felker

@varavs @bedast Also ask if the model is ethically and legally sound. Was it produced from professional training material with compatible license terms? Or stolen from millions of movies or YouTube videos?

10 January at 13:16 | Open on hachyderm.io

LisPi

@dalias @bedast @varavs Aren't basically all the embeddable models that don't have absurd spec requirements sourced & produced by university projects?

10 January at 21:04 | Open on udongein.xyz
