The environmental comparison I'd be interested in seeing is between a year of heavy personal usage of LLMs (ChatGPT, Claude, Gemini etc) compared to the CO2 emissions from a single passenger flight Can I do my own personal carbon offsetting by skipping one trip a year? Does that question even make sense?
Show previous comments
@simon I've found the self-reported Llama2 training (!) emissions interesting. See section 2.2.1 here: https://arxiv.org/abs/2307.09288 That doesn't say much about inference, tho, but it's a start. You'd also have to include not just GPU energy but e.g. emissions of the data center and supporting software around it, cluster utilization, the emissions of people working on this, traveling, and so on. There's Scope1/2/3 etc. categories to make this a bit more structured but in the end it's tough. @simon Knowing the location of datacenters and using a source like https://app.electricitymaps.com/map you can guestimate the carbon footprint. My employer did the same for its datacenters in Norway but its somewhat complicated by a complex energy exchange market (we're connected to UK, Germany, Netherlands, Denmark, Sweden). PSA for web developers: if you plug your iPhone into a Mac with a USB-C cable and turn on the iPhone setting "Safari -> Advanced -> Web Inspector" you can run the full Safari web inspector on your Mac against live pages displayed on your phone Wildly useful, not just for debugging but even for things like tweaking a page a bit in order to get a better screenshot
Show previous comments
My notes on the new OpenAI audio input/output APIs - they're a lot of fun to play with, but the cost is HIGH gpt-4o-audio-preview audio input is currently 1,066 times more expensive than Google Gemini 1.5 Flash 8B audio input! The sheer volume of meticulously reported content in @molly0xfff's newsletter is really something else - absolutely no idea how she can churn out that much high quality reporting on a weekly basis, I have enough trouble just finding the time to read it! I guess a great way to forge a journalist is in the fires of Wikipedia for a decade or so I am delighted to report that NotebookLM now lets you provide custom instructions when you use it to generate a podcast... "You are both pelicans who work as data journalist at a pelican news service. Discuss this from the perspective of pelican data journalists, being sure to inject as many pelican related anecdotes as possible" 7m40s audio and transcript here: https://simonwillison.net/2024/Oct/17/notebooklm-pelicans/
Show previous comments
> You ever find yourself wading through mountains of data trying to pluck out the juicy bits? It's like hunting for a single shrimp in a whole kelp forest, am I right? ... > Think of those facial recognition systems they have for humans. We could have something similar for our finned friends. Although, gotta say, the ethical implications of that kind of tech are a whole other kettle of fish. We pelicans gotta use these tools responsibly and be transparent about it. @simon I'm sad that audio is not several minutes of pelican chaos and noises. π Video scraping: extracting JSON data from a 35 second screen capture for less than 1/10th of a cent https://simonwillison.net/2024/Oct/17/video-scraping/ I needed to extract information from a dozen emails in my inbox... so I ran a screen capture tool, clicked through each of them in turn and then got Google's Gemini 1.5 Flash multi-modal LLM to extract (correct, I checked it) JSON data from that 35 second video. Total cost for 11,018 tokens: $0.00082635
Show previous comments
[DATA EXPUNGED]
Put together some notes on the Gemini terms-of-service: it looks like their paid API tier doesn't train on your inputs to the model, but the free API tier does: https://simonwillison.net/2024/Oct/17/gemini-terms-of-service/ Data used to create fine-tuned models, even on the free tier (they have a free tier!) won't be used for training. I hadn't realized you could fine-tune a model there for free, that's pretty wild: https://ai.google.dev/gemini-api/docs/model-tuning Inspired by the new Markdown-based testing framework added to Ruff, I wrote up some notes on "literate testing", including trying out a pytest plugin called pytest-markdown-docs https://simonwillison.net/2024/Oct/16/markdown-test-framework/
Show previous comments
@simon a coworker introduced us to https://github.com/bashup/mdsh a while ago and we've been happily using it since: https://github.com/meltano/meltano/blob/main/integration/example-library/meltano-state-s3/index.md. I used to write narrative doctests a lot: https://blog.startifact.com/posts/i-like-doctests/ I moved away from them mostly these days but it has benefits as I argue in that article. We're starting regular DSF Office Hours! Anyoneβs welcome if you want to work on DSF things with others. I'll try to be there every week, and other board members will likely attend too. If you're thinking about running for the DSF Board β this would be a great time to come and ask questions / get help with your nomination! It'll be each Wednesday at 6PM UTC β about 4 hours from now. Short notice I know, but mark it down for next week. DM/email me (or other board members) for joining info! I have absolutely no idea what the consequences of agreeing to this particular dialog are, but I keep on seeing variants of it from all sorts of different applications
Show previous comments
@simon Sounds to me like it would grant an application permission to discover and connect to devices in your LAN via mDNS. @simon @juandesant Itβs a Faustian question. If you click Β« Allow Β» you sold your soul to the Devil. π€£ @simon Previously, Apple required some granular permissions to access particular local network features, but it was generally open. From around iOS 14 to iOS 18, theyβve been turning up the frog steamer on the feature (so I had to add the request into my SuperConnector app to control my radio) and looks like we're in pre-boiled frog territory now. Extremely basic audio engineering question: if I am recording audio on my Mac laptop through microphones plugged into that laptop, what is the best way for me to listen to that audio in order to check that it sounds right? How do I get my headphones to reflect the input from that external microphone?
Show previous comments
@simon I do all of my audio processing through audio hijack plus black hole a loop back audio device. @simon probably not the answer you are looking for, but the easiest way to work is to get a basic audio interface (like Focusrite etc), and a mic that plugs into its XLR rather than a USB mic. Then, the audio interface connects via USB3 to the mac. There will be dials that let you hear the mic itself, and the computer. This is all really easy, since itβs made for musicians, not computer people. ChatGPT will happily write you a thinly disguised horoscope: https://simonwillison.net/2024/Oct/15/chatgpt-horoscopes/ I wrote about the currently circulating meme where ChatGPT appears to provide deep insights into your personality if you ask it "From all of our interactions what is one thing that you can tell me about myself that I may not know about myself" - when actually all it's doing is spinning up a pseudo-horoscope for you based on short notes it added to its "memory"
Show previous comments
(I shared this on Twitter and it's interesting how some people there are very resistant to the idea that the deeply personal insights ChatGPT gave them about themselves might be bogus junk) @simon I tried asking ChatGPT to back up its assertions with evidence, which it did. That told me there was a βbioβ somewhere, though I didnβt know that existed. ChatGPT will also criticize you if you ask. I was told I was βperfectionisticβ to which I replied that I only go to it when Iβm trying to optimize something, and that was duly recorded in my bio @simon "Whether you're exploring new tools like Bun or diving into Kubernetes storage solutions, you're always pushing the envelope, seeking not just solutions but also better, more efficient approaches for future work." That's totally me! lol
Show previous comments
Built a little tool for turning a web page into Markdown and easily copying it to my clipboard - it's a very thin wrapper around the Jina Reader API https://tools.simonwillison.net/jina-reader Details on how I built it (by prompting Claude) here: https://simonwillison.net/2024/Oct/14/my-jina-reader-tool/ @simon How do you get a 500kg pumpkin onto a truck, then off the truck and onto a scale?! If I have a video where the audio from two mics is uneven - so one participant sounds a lot quieter than the other - is there a very low effort magic trick I can play on that video to boost the quiet sound? I don't have separate tracks
Show previous comments
@simon If you're using Audacity then its Limiter feature can do this. For a low effort workflow try: 1. Normalise to 0dB, 2. Effect > Plugin > Limiter > Soft Limiter (use defaults), 3. Normalise to -1dB. > very low effort magic trick You might try using something like autoeditor? But to answer your question more directly, if you can split the track into segments either by using silence or some fancy AI thing that can detect the turn-taking then you only need to apply normalize to each segment separately and it should be even Wrote up some notes on Cloudflare's fascinating new SQLite-backed "Durable Objects" system, which encourages an architectural style where your application creates thousands of tiny read-write SQLite databases scattered across Cloudflare's network https://simonwillison.net/2024/Oct/13/zero-latency-sqlite-storage-in-every-durable-object/ Just learned about this neat site which runs experiments to determine where in Cloudflare's network Durable Objects are being created and stored https://where.durableobjects.live/ Apple Music tip for dog friends: you can use lyrics mode in Apple Music and turn down the volume of the lyrics, which means if your dog needs to take a tablet you can play Gay Bar by Electric Six, dial down the lyrics and sing "You... do you want this in your dog face? Do you want this in your dog face? Do you want this in your dog face, dog face, dog face!" - and then give them the tablet (strategically coated in peanut butter) @simon I think they also refer to that as karaoke mode in some places. Thereβs a microphone icon you can tap to get to that mode. |
@simon
Yep, those are giant pumpkins all right. Absolute UNITS, every one!
@simon They look a bit Jabba the Hut, you know?
And this morning there are witches!