Jason Lefkowitz

Bookmark this link in case ten years from now you are wondering why your refrigerator talks like an Edwardian dowager

wired.com/story/harvard-ai-tra

Harvard University announced Thursday it’s releasing a high-quality dataset of nearly one million public-domain books that could be used by anyone to train large language models and other AI tools. The dataset was created by Harvard’s newly formed Institutional Data Initiative with funding from both Microsoft and OpenAI. It contains books scanned as part of the Google Books project that are no longer protected by copyright.
4 comments
Jason Bowen

@jalefkowit I'd be on board. My fridge is an asshole

Rocketman

@jalefkowit Not quite what we meant when we said “bring back proper manners”

clew

unexpected rationale for the Diamond Age style @jalefkowit

AN/CRM-114

@jalefkowit “It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of Oikos Triple Zero High Protein Nonfat Greek Yogurt. Would you like to order some for Peapod delivery?”
