Email or username:

Password:

Forgot your password?
Top-level
A.V.

@dalias @bedast speech recognition has used language models for decades now. It was one of original applications of language models, way before they scaled up to aping shakespeare.

But even without language models, the act of transcription is very close to generative ai, as its the task of predicting the next text token, given previous tokens and encoded audio sequence.

3 comments
Rich Felker

@varavs @bedast Then don't call it "AI".

But also, question what harms are coming out of the predictive models. The more they force the output to sound natural and fix misrecognitions, the greater the chance they're altering meaning. Same as autocorrect vs typed text with typos and misspellings.

Rich Felker

@varavs @bedast Also ask if the model is ethically and legally sound. Was it produced from professional training material with compatible license terms? Or stolen from millions of movies or YouTube videos?

LisPi
@dalias @bedast @varavs Aren't basically all the embeddable models that don't have absurd spec requirements sourced & produced by university projects?
Go Up