Kevin Marks

Maybe Google shouldn't strip non-ascii characters when training AI

64 comments
BogDrakonov

@sil @KevinMarks that episode had a plot which could have lasted 2-3 episodes to really flesh out the slow shrinking of the known universe. Such a cool story.

dasparadoxon

@sil that was my first thought ! :) :) :) :)

Daniel Marks

@KevinMarks I'm made of perhaps 1026 of them, leaving 52 to 56 of them left for everyone else.

Dragoniff

@profdc9 @KevinMarks "the universe is made out of 1082 atoms, and I'm made of 1026 of them" is a level of egocentric I never thought possible.

Fragarach

@dragoniff2 @profdc9 @KevinMarks

"If there's anything in here more important than my ego, I want it caught and killed at once!" - Zaphod Beeblebrox in THHGTTG by Douglas Adams.

SpaceLifeForm

@KevinMarks

But ^ is an Ascii character, so it may be that it was originally a superscript markup.

Even so, that should not force removal of valid UTF-8.

Ariaflame

@SpaceLifeForm @KevinMarks It usually uses the sup html tag unless they're doing weird things now.

John Cormier

@KevinMarks swear if I hear Trump complaining that the Haitian immigrants are taking all the atoms imma lose my shit

Jay

@KevinMarks Speaking as someone who’s at least 4 million times worse than the average person at estimation, this seems good enough to me!

Neil E. Hodges

I could swear there were at least 1083! :P

The Turtle

@KevinMarks looks like they corrected it. Still hasn't told me how much Ohio weighs.

Eugene Meidinger

@KevinMarks Google's Gemini doesn't have an issue with it. Probably a problem with the output not the training?

Taurus :ms_18_plus:

@KevinMarks i have the impression this is not correct

the universe is just 6 atoms we are just very good at sharing them

:jan:‍:abreath:‍‍🌬:dandelion:

@KevinMarks I asked Google for some prime numbers and it gave me 4 prime numbers and the letter "Q"...

erik

@KevinMarks They clearly state it's an approximation.

Dragoniff

@KevinMarks the bleeding edge in computing, with the full power of a neural network of fifteen and a half neurons. The finest in the world.

Dragoniff

@KevinMarks How can there be 1.5 billion Chinese if the universe only got about 1078 atoms? I'll tell you what, that whole China business is some CIA Conspiracy!

Ainsley Lowbeer

@KevinMarks
Each of these atoms forms the brain cell of an orange cat.

n8chz ⒶⒺ

@KevinMarks That, or it's been "reading" a lot of pdf files.

Joseph Meyer

@KevinMarks
Atoms are apparently much bigger than we once thought.

OddOpinions5

@KevinMarks

Engineers: Accurate to two decimal places

Physicists: Accurate to an order of magnitude

Astrophysicists: Accurate to an order of magnitude in the exponent

Martin Ueding

@failedLyndonLaRouchite @KevinMarks
The estimates for the dark energy density that come from astronomy and quantum field theory differ by a factor of 10^120. That doesn't even satisfy that.

David Nash

@KevinMarks @nellie_m It’s pretty close to correct if you use base 10^27 (1000 in that base = 10^81 in decimal). However, “unlikely to be too far off the mark” is the best you’re likely to get, since no one I’ve met has been able to remember all the different digits required to write any given number in that base.
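The base 10^27 arithmetic above can be checked with a short sketch. The helper name `positional_value` is made up for illustration; it evaluates a list of digits in an arbitrary base using Python's arbitrary-precision integers.

```python
def positional_value(digits, base):
    """Evaluate a most-significant-first list of digits in the given base."""
    value = 0
    for digit in digits:
        value = value * base + digit
    return value

BASE = 10**27

# "1000" written in base 10^27 is one base-cubed: (10^27)^3 = 10^81.
thousand_in_base = positional_value([1, 0, 0, 0], BASE)
print(thousand_in_base == 10**81)
```

Note that a single digit in this base can take any of 10^27 distinct values, which is the practical obstacle the comment jokes about.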

Jack Linke 🦄

@KevinMarks It's a *ROUGH ESTIMATE*, Kevin 😒

(😂😂😂)

Karsten Johansson

@KevinMarks This is unacceptable. People use Google to find information. The results produced by Google themselves should not be so erroneous that the feature lacks all reason for existing.

Michael Weiss

@ksaj @KevinMarks it's becoming unacceptable to use Google to find out information.

Oblomov

@KevinMarks this looks more like it stripped HTML sup tags, honestly

Steveg58

@KevinMarks
Google has done this before. Back in the early days of Google search you could not search for C++ because only alphanumeric characters were valid in search terms.

Eugene Glover

@KevinMarks @glennf I’ve always leaned towards 1078; however, I’m willing to go as high as 1080. 1082, though, is just absurd.

Martin Vogel

@KevinMarks
<sup> and </sup> are made of totally fine ASCII characters.
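The competing explanations in this thread can be compared with a minimal sketch. It does not claim to reproduce what Google's pipeline actually does; the function names and sample strings are assumptions for illustration. It shows three ways a superscript exponent might be lost: removing HTML tags, dropping non-ASCII characters, and Unicode compatibility normalization (NFKC).

```python
import re
import unicodedata

def strip_tags(html: str) -> str:
    """Naively remove anything that looks like an HTML tag."""
    return re.sub(r"<[^>]+>", "", html)

def strip_non_ascii(text: str) -> str:
    """Drop every character outside the ASCII range."""
    return text.encode("ascii", errors="ignore").decode("ascii")

def normalize_nfkc(text: str) -> str:
    """Apply compatibility normalization, which folds superscript digits to plain digits."""
    return unicodedata.normalize("NFKC", text)

html_form = "10<sup>82</sup> atoms"      # exponent marked up with sup tags
unicode_form = "10\u2078\u00b2 atoms"    # exponent written with superscript characters

print(strip_tags(html_form))          # 1082 atoms
print(strip_non_ascii(unicode_form))  # 10 atoms
print(normalize_nfkc(unicode_form))   # 1082 atoms
```

In this sketch, dropping non-ASCII characters loses the exponent entirely and leaves "10", while tag stripping and NFKC normalization both produce "1082". That is consistent with the sup-tag explanation offered in the replies, though it does not rule out other causes.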

Sonikku

@KevinMarks And companies are falling over themselves to get on this AI bandwagon, and boast about how it is already being used to handle important decisions such as people's taxes? Brave new world!

Shambolic Matter

@KevinMarks Well, that would simplify the fuck out of the traveling salesman problem.

BogDrakonov

@KevinMarks I have a uBlock filter that hides “Slop Overview” from my search results. Clearly it’s a filter worth keeping.

Fragarach

@KevinMarks

I think, rather than laugh, we should be very, very afraid!

Hugo Mills

@KevinMarks That particular one's not just Google or AI. The number of times I've seen that kind of error in newspapers is just silly.
