Email or username:

Password:

Forgot your password?
248 posts total
Simon Willison

The environmental comparison I'd be interested in seeing is between a year of heavy personal usage of LLMs (ChatGPT, Claude, Gemini etc) compared to the CO2 emissions from a single passenger flight

Can I do my own personal carbon offsetting by skipping one trip a year?

Does that question even make sense?

Show previous comments
Daniel

@simon I've found the self-reported Llama2 training (!) emissions interesting. See section 2.2.1 here:

arxiv.org/abs/2307.09288

That doesn't say much about inference, tho, but it's a start.

You'd also have to include not just GPU energy but e.g. emissions of the data center and supporting software around it, cluster utilization, the emissions of people working on this, traveling, and so on.

There's Scope1/2/3 etc. categories to make this a bit more structured but in the end it's tough.

mborus

@simon The part where you calculate your useβ€˜s footprint makes sense (and would be very interesting to me, too) - justifying it by offsetting it against a specific unrelated β€žbad footprintβ€œ activity like flying not so much unless you set a hard CO2 budget for yourself.

Trond Olsen

@simon Knowing the location of datacenters and using a source like app.electricitymaps.com/map you can guestimate the carbon footprint. My employer did the same for its datacenters in Norway but its somewhat complicated by a complex energy exchange market (we're connected to UK, Germany, Netherlands, Denmark, Sweden).

Simon Willison

PSA for web developers: if you plug your iPhone into a Mac with a USB-C cable and turn on the iPhone setting "Safari -> Advanced -> Web Inspector" you can run the full Safari web inspector on your Mac against live pages displayed on your phone

Wildly useful, not just for debugging but even for things like tweaking a page a bit in order to get a better screenshot

Simon Willison

My notes on the new OpenAI audio input/output APIs - they're a lot of fun to play with, but the cost is HIGH

gpt-4o-audio-preview audio input is currently 1,066 times more expensive than Google Gemini 1.5 Flash 8B audio input!
simonwillison.net/2024/Oct/18/

Simon Willison

The sheer volume of meticulously reported content in @molly0xfff's newsletter is really something else - absolutely no idea how she can churn out that much high quality reporting on a weekly basis, I have enough trouble just finding the time to read it!
citationneeded.news/issue-68/

Quinn Comendant

@simon @molly0xfff You always want the wikipedia editor prodigy on your side!

Simon Willison

I guess a great way to forge a journalist is in the fires of Wikipedia for a decade or so

Simon Willison

I am delighted to report that NotebookLM now lets you provide custom instructions when you use it to generate a podcast...

"You are both pelicans who work as data journalist at a pelican news service. Discuss this from the perspective of pelican data journalists, being sure to inject as many pelican related anecdotes as possible"

7m40s audio and transcript here: simonwillison.net/2024/Oct/17/

Show previous comments
Simon Willison

> You ever find yourself wading through mountains of data trying to pluck out the juicy bits? It's like hunting for a single shrimp in a whole kelp forest, am I right?

...

> Think of those facial recognition systems they have for humans. We could have something similar for our finned friends. Although, gotta say, the ethical implications of that kind of tech are a whole other kettle of fish. We pelicans gotta use these tools responsibly and be transparent about it.

Leaping Woman

@simon You continually delight and amaze me.

Jote πŸ³οΈβ€πŸŒˆ :verified_gay:

@simon I'm sad that audio is not several minutes of pelican chaos and noises. πŸ˜‚

Simon Willison

Video scraping: extracting JSON data from a 35 second screen capture for less than 1/10th of a cent simonwillison.net/2024/Oct/17/

I needed to extract information from a dozen emails in my inbox... so I ran a screen capture tool, clicked through each of them in turn and then got Google's Gemini 1.5 Flash multi-modal LLM to extract (correct, I checked it) JSON data from that 35 second video.

Total cost for 11,018 tokens: $0.00082635

Show previous comments
Frederik Elwert

@simon So you basically re-implemented Recall? πŸ˜‰

[DATA EXPUNGED]
th0ma5

@simon great documentation ... Any details on accuracy? How much did you have to clean up the output and did you have to check it all by hand?

Simon Willison

Put together some notes on the Gemini terms-of-service: it looks like their paid API tier doesn't train on your inputs to the model, but the free API tier does:

simonwillison.net/2024/Oct/17/

Data used to create fine-tuned models, even on the free tier (they have a free tier!) won't be used for training. I hadn't realized you could fine-tune a model there for free, that's pretty wild: ai.google.dev/gemini-api/docs/

Simon Willison

You only have 10 days to submit your nominations for DSF board: djangoproject.com/weblog/2024/

There is an interesting thread if you are wondering if you're a good fit or what's the requirements: forum.djangoproject.com/t/2025
Please consider it if you want to improve the community, this is one way to do so.

I would like to have a board representative of the Django community: diversity and inclusivity 🌍

If you have any questions, feel free to ask in the forum, discord, or privately to me or @thibaudcolas πŸ’š

You only have 10 days to submit your nominations for DSF board: djangoproject.com/weblog/2024/

There is an interesting thread if you are wondering if you're a good fit or what's the requirements: forum.djangoproject.com/t/2025
Please consider it if you want to improve the community, this is one way to do so.

Simon Willison

Inspired by the new Markdown-based testing framework added to Ruff, I wrote up some notes on "literate testing", including trying out a pytest plugin called pytest-markdown-docs simonwillison.net/2024/Oct/16/

Simon Willison

We're starting regular DSF Office Hours! Anyone’s welcome if you want to work on DSF things with others. I'll try to be there every week, and other board members will likely attend too.

If you're thinking about running for the DSF Board β€” this would be a great time to come and ask questions / get help with your nomination!

It'll be each Wednesday at 6PM UTC β€” about 4 hours from now. Short notice I know, but mark it down for next week.

DM/email me (or other board members) for joining info!

Simon Willison

I have absolutely no idea what the consequences of agreeing to this particular dialog are, but I keep on seeing variants of it from all sorts of different applications

macOS dialog

Allow β€œGoogle Chrome” to find devices on local networks?

This will allow you to select from available devices and display content on them.

Buttons: Don't Allow or Allow
Show previous comments
Hugo ι›¨ζžœ

@simon Sounds to me like it would grant an application permission to discover and connect to devices in your LAN via mDNS.

Dr. JΓΌrgen KnΓΆdlseder

@simon @juandesant It’s a Faustian question. If you click Β« Allow Β» you sold your soul to the Devil. 🀣

Codepope

@simon Previously, Apple required some granular permissions to access particular local network features, but it was generally open. From around iOS 14 to iOS 18, they’ve been turning up the frog steamer on the feature (so I had to add the request into my SuperConnector app to control my radio) and looks like we're in pre-boiled frog territory now.

Simon Willison

Extremely basic audio engineering question: if I am recording audio on my Mac laptop through microphones plugged into that laptop, what is the best way for me to listen to that audio in order to check that it sounds right?

How do I get my headphones to reflect the input from that external microphone?

Show previous comments
Jan Lehnardt :couchdb:

@simon garage band, set a track to the mic and set it on [M]onitor

Joe Cotellese

@simon I do all of my audio processing through audio hijack plus black hole a loop back audio device.

sayrer

@simon probably not the answer you are looking for, but the easiest way to work is to get a basic audio interface (like Focusrite etc), and a mic that plugs into its XLR rather than a USB mic. Then, the audio interface connects via USB3 to the mac. There will be dials that let you hear the mic itself, and the computer.

This is all really easy, since it’s made for musicians, not computer people.

Simon Willison

ChatGPT will happily write you a thinly disguised horoscope: simonwillison.net/2024/Oct/15/

I wrote about the currently circulating meme where ChatGPT appears to provide deep insights into your personality if you ask it "From all of our interactions what is one thing that you can tell me about myself that I may not know about myself" - when actually all it's doing is spinning up a pseudo-horoscope for you based on short notes it added to its "memory"

Show previous comments
Simon Willison

(I shared this on Twitter and it's interesting how some people there are very resistant to the idea that the deeply personal insights ChatGPT gave them about themselves might be bogus junk)

Neil Kandalgaonkar

@simon I tried asking ChatGPT to back up its assertions with evidence, which it did. That told me there was a β€œbio” somewhere, though I didn’t know that existed.

ChatGPT will also criticize you if you ask. I was told I was β€œperfectionistic” to which I replied that I only go to it when I’m trying to optimize something, and that was duly recorded in my bio

EmmanuelOga

@simon "Whether you're exploring new tools like Bun or diving into Kubernetes storage solutions, you're always pushing the envelope, seeking not just solutions but also better, more efficient approaches for future work."

That's totally me! lol

Simon Willison

Up at 7am for the Half Moon Bay World Championship Pumpkin Weigh-Off

A sign reads Half Moon Bay World Championship Pumpkin Weigh-Off - in front of it are two pick up trucks each containing absurdly oversized pumpkins that fill the entire pick up bed
Show previous comments
Simon Willison

The big ones have started to come out now... here's 748lb

A large misshapen pumpkin on digital scales reading 748
Simon Willison

Built a little tool for turning a web page into Markdown and easily copying it to my clipboard - it's a very thin wrapper around the Jina Reader API tools.simonwillison.net/jina-r

Details on how I built it (by prompting Claude) here: simonwillison.net/2024/Oct/14/

Screenshot of Jina Reader tool - a URL has been pasted in and the submit button clicked, below is a textarea full of markdown, a copy to clipboard button and the markdown rendered as HTML
Robert Atkins

@simon How do you get a 500kg pumpkin onto a truck, then off the truck and onto a scale?!

Simon Willison

If I have a video where the audio from two mics is uneven - so one participant sounds a lot quieter than the other - is there a very low effort magic trick I can play on that video to boost the quiet sound? I don't have separate tracks

Show previous comments
Mans R

@simon Do the speakers (mostly) take turns? If so, it shouldn't be all that difficult to boost the quiet segments.

Lovell Fuller

@simon If you're using Audacity then its Limiter feature can do this. For a low effort workflow try: 1. Normalise to 0dB, 2. Effect > Plugin > Limiter > Soft Limiter (use defaults), 3. Normalise to -1dB.

blan πŸŽ‘

@simon

> very low effort magic trick

You might try using something like autoeditor?

But to answer your question more directly, if you can split the track into segments either by using silence or some fancy AI thing that can detect the turn-taking then you only need to apply normalize to each segment separately and it should be even

Simon Willison

Wrote up some notes on Cloudflare's fascinating new SQLite-backed "Durable Objects" system, which encourages an architectural style where your application creates thousands of tiny read-write SQLite databases scattered across Cloudflare's network simonwillison.net/2024/Oct/13/

Simon Willison

Just learned about this neat site which runs experiments to determine where in Cloudflare's network Durable Objects are being created and stored where.durableobjects.live/

Where Durable Objects Live

Created by the wonderful Jed Schmidt, and now maintained with ❀️ by Alastair. Source code available on Github.

Cloudflare Durable Objects are a novel approach to stateful compute based on Cloudflare Workers. They aim to locate both compute and state closest to end users.

This page tracks where new Durable Objects are created; for example, when you loaded this page from Half Moon Bay, a worker in San Jose, California, United States (SJC) created a durable object in Los Angeles, California, United States (LAX).

Currently, Durable Objects are available in 11.35% of Cloudflare PoPs.

To keep data fresh, this application is constantly creating/destroying new Durable Objects around the world. In the last hour, 394,046 Durable Objects have been created(and subsequently destroyed), FOR SCIENCE!

And a map of the world showing lots of dots
Simon Willison

Apple Music tip for dog friends: you can use lyrics mode in Apple Music and turn down the volume of the lyrics, which means if your dog needs to take a tablet you can play Gay Bar by Electric Six, dial down the lyrics and sing "You... do you want this in your dog face? Do you want this in your dog face? Do you want this in your dog face, dog face, dog face!" - and then give them the tablet (strategically coated in peanut butter)

Boo Ramsey πŸ§›πŸ»β€β™‚οΈπŸ§Ÿβ€β™‚οΈπŸ‘»πŸŽƒ

@simon I think they also refer to that as karaoke mode in some places. There’s a microphone icon you can tap to get to that mode.

Clifford Adams

@simon
... and one day your dog replies: "No, I want it in my OTHER face!" πŸ€ͺ

Go Up