Email or username:

Password:

Forgot your password?
70 posts total
internetarchive

Busily spending part of my weekend enjoying one aspect of my job at the Archive - sorting through uploads of a certain type - in this case, anything calling itself a "Manual". There are hundreds of people uploading a single manual for the most obscure of things, and the style and breadth is amazing.

Stu

@textfiles Whoa, that Amstrad printer is a blast from the past!

IA is the first place I'd go for a manual now, no contest.

internetarchive

Why is The Wayback Machine so important for preserving our digital culture? According to new analysis from Pew Research Center:
❌ 38% of webpages that existed in 2013 are no longer accessible
❌ 23% of news webpages contain at least one broken link
🔗 pewresearch.org/data-labs/2024

Show previous comments
devflow

@internetarchive does the wayback machine preserve a snapshot of google reviews? I find that a lot of times businesses remove bad reviews. It would be very helpful to be able to go back in time to see these bad reviews that were removed.

Peter Motte

@internetarchive Does The Wayback Machine accept html files? In other words: do they have room for websites and webpages?

PhoenixChief

@internetarchive The large language model industy loves the internet achive data sets? What about copyright and data ownership?

Stack overflow questions are okay to be achieved since the answers are in the public domain.[1] en.m.wikipedia.org/wiki/Stack_

But if a site is bloated trash and the content is locked, please let it die.

internetarchive

It has become increasingly common for publishers to limit access to #eBooks by imposing highly restrictive licensing conditions, demanding excessive prices, or simply denying licenses. This needs to stop. Join us in two weeks for our salon!

Register here: communia-association.org/2024/

internetarchive

📣 If you’re in the Bay Area this afternoon, come celebrate the 10th anniversary of Authors Alliance, featuring a keynote from Cory Doctorow! Tickets are still available 🎟️ eventbrite.com/e/authorship-in

internetarchive

🌐 Ever wonder what happens to .gov websites before & after presidential elections? We archive them!

Help our team preserve US government info by nominating sites today 👉 blog.archive.org/2024/05/08/en #EOTArchive

internetarchive

Internet Archive Canada, Vancouver HQ, is a great place to have events, and it now does on the regular.

internetarchive

"CARRYING QUEENS IN TUBULAR CAGES. on the tips of my fingers"

A random image from the @internetarchive account on Flickr that caught my eye today.

flickr.com/photos/internetarch

#flickr #bees #fingers

internetarchive

The story of how conglomeration has reshaped the publishing industry & the books we read is a fascinating & cautionary tale that continues to be written to this day. Learn more from yesterday's #booktalk with BIG FICTION author, Dan Sinykin 👉
archive.org/details/big-fictio

internetarchive

The current legal framework doesn’t enable libraries to play their crucial role in ensuring access to knowledge & culture in the digital environment. Join us on wed 29/05 from 15-16h for our salon on e-lending to learn what needs to be done! Register here communia-association.org/2024/

COMMUNIA

We will be joined by Stephen Wyber (Director External Affairs at IFLA) who will give an overview of the state of e-lending in Europe.

Konrad Gliściński (Jagiellonian University/Centrum Cyfrowe), the principal investigator of a forthcoming study, commissioned by KR21 will present his findings on secure digital lending as well as suggestions for a legal intervention.

The event will be moderated by COMMUNIA Legal Director Teresa Nobre, who will also present our new policy paper on this issue.

internetarchive

When the Harkness Eye Institute moved after 90+ years, the fate of the library within was in question. Some materials could move, but not the large runs of bound,
print journals. That's when our donations team sprang into action to help preserve these useful ophthalmology journals 👉 blog.archive.org/2024/05/06/ey #preservation #access

Madagascar_Sky

@internetarchive
You guys are the Superman of conserving humanity's long term memory! I wish I had more money to give you.

internetarchive

InterLibrary loan lives!

Record numbers are getting articles and chapters through this reliable free method from the Internet Archive.

The long tail holdings are needed!

384 libraries are now getting docs through one system.

Go @internetarchive go #ILL

Xenograg

@brewsterkahle @internetarchive ILL is awesome. I have gotten access to 4 books already this year, with at least that many requests waiting for some kind library to fulfill.

internetarchive

🕰️ From forgotten blog posts to vanished news articles, the Wayback Machine is a crucial tool for exploring our shared online history. Begin your search today ➡️ web.archive.org/ #preservation

Claudio Segovia

@internetarchive
Querida gente de Internet Archive:
¿Qué características o requisitos debe cumplir un sitio web para ser incluído en vuestro servicio?
¿O se van agregando automáticamente?
Mí duda se debe a que no he logrado encontrar información de determinados medios en español, como por ejemplo, www.pagina12.com.ar.
Desde ya, muchas gracias por lo que puedan informarme.
¡Saludos desde la Patagonia! 👋🏽

Claudio Segovia

@internetarchive

Dear Internet Archive people:
What characteristics or requirements must a website meet to be included in your service?
Or are they added automatically?
My doubt is because I have not been able to find information from certain media in Spanish, such as www.pagina12.com.ar.
Thank you very much in advance for what you can inform me.
Greetings from Patagonia! 👋🏽

internetarchive

Thanks to @internetarchive for the pointer to @bourgwick 's excavation of Psyche Pscene, a late 60's early 70's independent zine published here in Chicago

jessejarnow.com/2024/04/psyche

internetarchive

🗣️ Free range archivist Jason Scott (@textfiles) takes Whisper, the open source speech recognition project, out for a spin on 5+ years worth of his weekly podcasts, preserved on archive.org ▶️ blog.archive.org/2024/04/28/ta x

Chris A.

@internetarchive @textfiles Whisper is pretty amazing. It's integrated into the open source video editor "Kdenlive" and works great.

Steven Sandoval

@internetarchive @textfiles I can confirm that Whisper is useful for generating transcripts of podcasts that are 80% “good enough” for local text searching with non-LLM #FLOSS tools such as Recoll recoll.org/ packages.debian.org/bookworm/r #search

LLM audio transcription is like a bulldozer tunneling a straight line into an otherwise inaccessible jungle; sure, stuff gets mangled, but it gives you quick access to biomes too costly to survey on foot.

internetarchive

The California Historical Radio Society contributed hundreds of video and audiotapes documenting various DXpeditions, going back as far as the 1960s (though most from the '80s and '90s.) 80 of these presentations have been digitized so far — check them out at archive.org/details/dx-peditio

internetarchive

"No one buys [their] books" a report on the big publisher's court testimony.

wow:

"The DOJ’s lawyer collected data on 58,000 titles published in a year and discovered that 90 percent of them sold fewer than 2,000 copies and 50 percent sold less than a dozen copies. "

Yet, they sue to make sure libraries can not buy them (above and beyond copyright). They changed the laws so copyright lasts 95 years-- so no one can get to them.

good stewards of our cultural legacy?

elysian.press/p/no-one-buys-bo

"No one buys [their] books" a report on the big publisher's court testimony.

wow:

"The DOJ’s lawyer collected data on 58,000 titles published in a year and discovered that 90 percent of them sold fewer than 2,000 copies and 50 percent sold less than a dozen copies. "

Yet, they sue to make sure libraries can not buy them (above and beyond copyright). They changed the laws so copyright lasts 95 years-- so no one can get to them.

Bill Seitz

@brewsterkahle and they forced a huge increase in the prices of ebooks, to match paper books.

Keith Mann

@brewsterkahle Owning and controlling information is the same kind of imperative for publishers that preserving and disseminating it is for the IA. It, not books, is their asset. Yet it's so hard to value individually that they simply hoard it until some nugget shows potential, and they will milk it dry and speculate and diversify to improve their odds. It is as much quantity as quality, and the more, the better. Past sales be damned; there's always a chance for more as long as you own it.

OddOpinions5

@brewsterkahle apparently this is wrong, but I hae lost the post that rebuts it sorry

internetarchive

The all-hands monday meeting at Internet Archive featured Wylie Gustafson, who you may have heard as the Yahoo! yodel.

Go Up