Simon's wall

I still don't like Accept headers

I just found out if you hit https://pypi.org/simple/pydantic/ with "Accept: application/vnd.pypi.simple.v1+json" you get back super-useful JSON about that package... but I can't link to a demo because I can't include the Accept header in a link!

Here's a Gist instead: https://gist.github.com/simonw/8cf8a850739e2865cf3b9a74e6461b28

Like 15 November at 2:18 | Open on fedi.simonwillison.net

Show previous comments

Stefan Eissing

@simon http content-negotiation sucks. It always gets in the way.

Many thanks to the github people who showed how to do good url design.

15 November at 8:39 | Open on chaos.social

Pradyun Gedam

@simon FYI: ?format= works.

https://pypi.org/simple/pydantic/?format=application/vnd.pypi.simple.v1+json

15 November at 9:13 | Open on mastodon.social

danbri

@simon yeah the RDF / Linked Data community somehow got itself addicted to content negotiation as a favoured way to publish data views. Horrible!

15 November at 18:13 | Open on mastodon.social

Simon Willison

Posted some notes on the new PyPI digital attestations feature released today, providing digital signatures that help demonstrate that the package you are downloading from PyPI was built from a specific version of the underlying code on GitHub https://simonwillison.net/2024/Nov/14/pypi-digital-attestations/

Like 14 November at 20:00 | Open on fedi.simonwillison.net

Hugo 雨果

@simon I understand what this does, but I don’t understand the value of it. It provides validation that the build happened on MS’s server and that they used used a specific checkout. But if builds are not reproducible (eg: use unchecksumed external resources), this guarantees nothing. If builds are properly reproducible, what value does the attestation add?

15 November at 5:44 | Open on fosstodon.org

Show 1 reply

Simon Willison

Adam Johnson :django: :python:

13 November at 21:25

Shout out to @simon ’s shot-scraper for grabbing web browser screenshots from the command line.

I redid dozens of screenshots for the upcoming book update, and shot-scraper made it way easier than my previous approach with Firefox’s screenshot tool.

https://shot-scraper.datasette.io/en/stable/

Like 13 November at 21:59 | Open on fedi.simonwillison.net

Simon Willison

On the one hand, I'm very sympathetic to the argument that "AI" is an over-hyped buzzword that is rapidly losing all meaning, if it ever had any (beyond being a combination of science-fiction and an academic discipline from the 1950s)

On the other hand though, I'm building a feature where LLMs help a user build a SQL query using an English-language question and I need to decide what label to put on that button, it's hard to come up with anything that's as clear as "Use AI to write this query"

Like 13 November at 5:56 | Open on fedi.simonwillison.net

Show previous comments

Alex Bradbury

@simon looks like in Chrome, Google is going for "Ask AI" and "AI Assistance" https://developer.chrome.com/docs/devtools/ai-assistance/network

13 November at 10:07 | Open on fosstodon.org

Petr Viktorin

@simon “Generate” (or “autogenerate”)?
To the end user it doesn't matter that the heuristics use many floats rather than few ints/pointers. Traditional autocorrect/autocomplete (and other tools that deal with human language) are also often wrong. If the engine actually works, no one should care what's inside.

13 November at 10:08 | Open on mastodon.social

postweber

@simon Grafana has graphical query builders for some languages. You could have an SQL/Plain English toggle.

13 November at 11:34 | Open on mstdn.social

Simon Willison

Thanks to the combo of Ollama and the llm-ollama plugin you can now run Meta's Llama 3.2 Vision image model (7.9GB) on a Mac and use it to run prompts against images https://simonwillison.net/2024/Nov/13/ollama-llama-vision/

If you have Ollama installed you can fetch the 11B model (7.9 GB) like this:

ollama pull llama3.2-vision

Or the larger 90B model (55GB) like this:

ollama pull llama3.2-vision:90b

I was delighted to learn that Sukhbinder Singh had already contributed support for LLM attachments to Sergey Alexandrov's llm-ollama plugin, which means the following works once you've pulled the models:

llm install --upgrade llm-ollama
llm -m llama3.2-vision:latest 'describe' \
-a https://static.simonwillison.net/static/2024/pelican.jpg

A photograph of a California Brown Pelican in a harbor

$ llm -m llama3.2-vision:latest 'describe' \
-a https://static.simonwillison.net/static/2024/pelican.jpg
This image features a brown pelican standing on rocks, facing the camera and positioned to the left of center. The bird's long beak is a light brown color with a darker tip, while its white neck is adorned with gray feathers that continue down to its body. Its legs are also gray.

In the background, out-of-focus boats and water are visible, providing context for the pelican's environment.

Like 13 November at 2:01 | Open on fedi.simonwillison.net

Jan

@simon Curious how you’re running Ollama - is it just in your laptop or you have some beefy server running it?

13 November at 3:09 | Open on mastodon.hidupmanis.studio

Show 2 replies

Jeff Triplett

@simon the 90B (55GB) might confuse people.

You do need ~88GB of RAM, not counting your context window, just to run the 90B model size. So 128 GB of RAM, or else you are going to get 1 token per 30 to 45 seconds or more of output while everything swaps around.

That small model is going to run very, very well on any M-series Mac with enough RAM.

13 November at 3:28 | Open on mastodon.social

Show 2 replies

Simon Willison

Wrote up some notes on the new Qwen2.5-Coder-32B model, which is the first model I've run on my own Mac (64GB M2) that appears to be highly competent at writing code
https://simonwillison.net/2024/Nov/12/qwen25-coder/

Like 12 November at 23:39 | Open on fedi.simonwillison.net

Show previous comments

Stefano Pacifico 🧬 🇺🇦

@simon besides offline use and additionally privacy, did you detect any other advantage running locally?

13 November at 1:33 | Open on sigmoid.social

Show 1 reply

Drew Breunig

@simon Did you notice a speed difference between mlx and ollama?

13 November at 1:47 | Open on note.computer

Show 1 reply

balloob

@simon qwen is amazing. It’s the best performing local model in the Home Assistant AI benchmarks. https://github.com/allenporter/home-assistant-datasets/tree/main/reports

13 November at 5:17 | Open on fosstodon.org

Simon Willison

The more experience I gain as a software developer the less tolerance I have for the idea that something doesn't need documenting if you can go and read the source code instead

(That's despite getting much, much better at reading source code to answer my own questions as I gain experience)

Like 12 November at 18:58 | Open on fedi.simonwillison.net

Show previous comments

Marty Fouts

@simon The thing that makes me sad about this discussion is that it has gone on since the late 1950s; there have not been any new arguments in 30 years and the only good explanation was Knuth’s Literate Programming book but nobody ever really understood it.

13 November at 0:02 | Open on mastodon.online

SpaceLifeForm

@simon

If you do not have at least as many comments in your source code as the actual compiliable source code, you are making a mistake.

13 November at 1:14 | Open on infosec.exchange

Simon Willison

I should clarify: when I talk about documentation here I'm not talking about code comment style docs - I'm talking about "this is how to use this library / API" docs

If your code is clearly written and nothing else ever needs to call it then I don't particularly mind if there's no additional documentation - but if I'm expected to use call your library from my own code I'm very much not keen on being told I have to read all of that code myself just to use it!

13 November at 1:22 | Open on fedi.simonwillison.net

Show 1 reply

Simon Willison

jacoBOOian 👻

11 November at 22:59

Like a lot of people I'm really concerned about what the incoming regime is going to do, so here's one small way I'm trying to help: https://jacobian.org/2024/nov/11/digital-security-checkup/

Like 12 November at 1:13 | Open on fedi.simonwillison.net

Simon Willison

Jamelle Bouie's TikTok account is one of my favorite sources of political commentary right now - he's a columnist for the New York Times who has basically perfected the very different art of TikTok

Here's his latest, about tariffs and domestic supply chains

https://www.tiktok.com/@jamellebouie/video/7435779269323803947

Like 11 November at 15:43 | Open on fedi.simonwillison.net

Coty Rosenblath

@simon He’s great in all formats.

11 November at 15:45 | Open on mastodon.social

Andy Baio

@simon He’s great on Bluesky too. https://bsky.app/profile/jamellebouie.net

11 November at 15:56 | Open on xoxo.zone

Simon Willison

Glyph

11 November at 4:05

There are many “what should we do next” thinkpieces, but this one is mine.

If you want an abstract summary, the idea is “we need to run a year-round parallel campaign apparatus that just introduces people to progressive ideas by making their lives better in whatever ways we can”.

That is a staggeringly huge project and if it does even happen, I can only be a tiny part of it, so I will need your help. Contact info is at the end of the blog post.

https://blog.glyph.im/2024/11/its-time-for-democrats-to-get-more-annoying.html

There are many “what should we do next” thinkpieces, but this one is mine.

If you want an abstract summary, the idea is “we need to run a year-round parallel campaign apparatus that just introduces people to progressive ideas by making their lives better in whatever ways we can”.

That is a staggeringly huge project and if it does even happen, I can only be a tiny part of it, so I will need your help. Contact info is at the end of the blog post.

Expand text...

Like 11 November at 15:13 | Open on fedi.simonwillison.net

64 Islands Airship Co-op

@glyph i remember saying this in 2001. it’s not wrong!

11 November at 4:09 | Open on cloudisland.nz

Glyph

my other idea that I have _zero_ chance of actually making happen is that Crooked Media or someone like them needs to poach Matt Levine from Bloomberg and stand up a full-fledged competitor to CNBC where they cover economic and market issues from a progressive standpoint. If everyone gets all their financial and economic news from conservatives of course the population is going to keep trusting conservatives on the economy

11 November at 4:54 | Open on mastodon.social

Show 9 replies

M. Treasurer commandasaurus 🦖

@glyph I'm down. Building bridges across my experiences, communities, and identities is absolutely the plan.

11 November at 14:43 | Open on hachyderm.io

Simon Willison

Got distracted digging around in the belly of the MDN browser compatibility tables, and found out their API is served with access-control-allow-origin: *... so now I've built my own little browser support timeline viewer tool! https://tools.simonwillison.net/mdn-timelines#ViewTransition

More details here: https://simonwillison.net/2024/Nov/11/mdn-browser-support-timelines/

Like 11 November at 3:46 | Open on fedi.simonwillison.net

Show previous comments

Magyk

@simon
It seems that the back/forward navigation doesn't allow exiting from the page using the back button
(tested on Firefox Android)

11 November at 7:42 | Open on tooot.im

Show 1 reply

Stuart Knightley

@simon you might also be interested in https://bcd-watch.igalia.com to get updates when new APIs get browser support (based on the same MDN data you’re using!)

11 November at 16:11 | Open on mastodon.social

Rachel Andrew

@simon You might be interested in what we're building over at https://webstatus.dev/ (which does have an API, though it needs docs). You can already do some pretty interesting queries.

13 November at 7:06 | Open on front-end.social

Show 2 replies

Simon Willison

I want to enable comments on my blog again, but (I'm current possibly overthinking things in that) I'm worrying if I need a privacy policy, or how I should think about things like GDPR, and should users be able to delete their comments?

Never thought about this stuff for a second back in the 2000s!

Like 9 November at 14:56 | Open on fedi.simonwillison.net

Show previous comments

Jeff Triplett

@simon If you go the federated route, I like how these daily prompts work. See ,https://kmcd.dev/posts/daily-prompts/ for details and then click on the /prompts section to see them in action. (there is pretty low engagement)

That said, I saw your post about using GitHub Auth and that's what I default to these days. The stakes are higher for not being a jerk plus you have GH's moderations rules/team should you need to have to report someone.

10 November at 17:29 | Open on mastodon.social

Hope

Sensitive content

@simon Here's mine. https://hopesnotes.net/privacy-policy/

11 November at 23:59 | Open on theres.life

Hope

Sensitive content

@simon Although I like to call it a statement, because it's something I believe in, not just something I found a form to generate for me.

11 November at 23:59 | Open on theres.life

Simon Willison

Saw dolphins in Half Moon Bay on Thursday!

(I'm pretty sure this is a dolphin and not a porpoise, I think porpoises are smaller)

Like 9 November at 14:50 | Open on fedi.simonwillison.net

Ryan Hiebert

@simon TIL I learned that they aren't the same thing.

9 November at 15:18 | Open on fosstodon.org

Simon Willison

Kicking off our first ever Discord chat + video + live demos Datasette Public Office Hours in 10 minutes time, details here:
https://simonwillison.net/2024/Nov/7/datasette-public-office-hours/

Like 8 November at 21:50 | Open on fedi.simonwillison.net

Simon Willison

Here are detailed notes from our public office hours, showing how myself and @alexgarciaxyz imported San Mateo County election results into Datasette, cleaned them up and then used them to build geospatial visualizations in an @observablehq notebook https://simonwillison.net/2024/Nov/9/visualizing-local-election-results/

9 November at 23:36 | Open on fedi.simonwillison.net

Show 1 reply

Simon Willison

Wrote up a few notes on trying out ChainForge, a Yahoo-Pipes-style "visual programming" tool for evaluating prompts against different LLMs https://simonwillison.net/2024/Nov/8/chainforge/

Like 8 November at 20:54 | Open on fedi.simonwillison.net

Raphael Fetzer :kirby:

@simon Same style, different purpose: https://github.com/comfyanonymous/ComfyUI

9 November at 6:27 | Open on mastodon.social

Simon Willison

I am finding myself turning to gpt-4o-mini a whole lot more since they added prompt caching last month - where you get an automated 50% discount if you send the same tokens twice or more

It is fantastic for use-cases like answering questions about a medium sized codebase

Like 8 November at 17:25 | Open on fedi.simonwillison.net

Sviatoslav Abakumov

@simon Curious, how do you feed the whole codebase into the context?

9 November at 15:24 | Open on mastodon.social

Simon Willison

Hearing about the death of June Spencer at the age of 105 - who was in the Archers from 1950 to 2022 - made me curious as to the world record for longest time playing the same character

Depends on how you count: June started in 1950 but took some breaks for family, while Patricia Greene's Jill Archer started in 1957 and has a 67 year uninterrupted run

https://en.m.wikipedia.org/wiki/List_of_longest-serving_soap_opera_actors

Table showing longest-serving soap opera actors:

Patricia Greene as Jill Archer (The Archers, 67 years), June Spencer as Peggy Woolley/Rita Flynn (The Archers, 66 years), William Roache as Ken Barlow (Coronation Street, 63 years), Ludmiła Łączyńska as Wisia Matysiakowa (Matysiakowie, 62 years), and Lesley Saweard as Christine Barford (The Archers, 60 years).

Like 8 November at 16:14 | Open on fedi.simonwillison.net

Marcin Wichary

@simon TIL Matysiakowie, and I’m Polish

8 November at 16:29 | Open on mastodon.online

Paul Bowsher

@simon Jill is starting to sound all of those 67 years :(

8 November at 19:01 | Open on mastodon.me.uk

Simon Willison

Currently enjoying browsing stock photos of pelicans on their nests https://www.alamy.com/stock-photo/brown-pelican-nest.html?sortBy=relevant

Like 8 November at 15:51 | Open on fedi.simonwillison.net

Simon Willison

A thing I have learned about voting in US elections in California is that you should do vote by mail, but be sure to return the ballot before 1st November

If you instead of drop off your pre-filled ballot on 5th of November it doesn't get counted for several more days (still waiting here)

Like 7 November at 20:01 | Open on fedi.simonwillison.net

Simon Willison

This is particularly frustrating when one of your hyper-local elections only attracts a few thousand votes total and two of the candidates are within 100 votes of each other!

7 November at 20:02 | Open on fedi.simonwillison.net

Show 5 replies

Marcello Bastéa-Forte

@simon I did that and it got counted early this morning

8 November at 3:59 | Open on hci.social

Simon Willison

I interviewed Rajiv Sinclair about his team's new project, VERDAD - an outstanding piece of data journalism that tracks 48 US talk radio stations (many in Spanish), archives their audio, transcribes it and uses Gemini 1.5 to help identify potential snippets of misinformation - then presents the results in a UI for human review

https://simonwillison.net/2024/Nov/7/project-verdad/

Like 7 November at 18:48 | Open on fedi.simonwillison.net

Simon Willison

I'm hoping to turn this into a series of YouTube interviews with people building cool data projects where we nerd out about what they've built and how they built it, so I'm optimistically thinking of this as episode one! https://www.youtube.com/watch?v=t_S-loWDGE0

7 November at 18:50 | Open on fedi.simonwillison.net

Show 9 replies

Jay Nakrani

@simon That is a superb use of LLMs. I've seen a lot of text-classification tasks (that previously required expensive model training) can now be done rather cheaply using LLMs + engineered prompts. Cost and development velocity has improved quite a bit with this new LLM-as-rater approach compared to previous approaches of custom-model-training.

The next bottleneck is human evals, but I guess we can't completely remove them until LLMs stop making mistakes.

8 November at 5:03 | Open on mastodon.world