248 posts total
Simon Willison

New release of sqlite-utils, my combined CLI tool and Python library for doing useful things with SQLite databases sqlite-utils.datasette.io/en/s

Simon Willison

Looking for something to do this weekend? Come on an artist treasure hunt down on the coast!

There are stalls and open studios you can visit and #shoplocal to support artists. colonyofcoastsideartists.com/c

Here are some of the pots I’ll have for sale! I’m #19 on the map.

Simon Willison

Weeknotes: asynchronous LLMs, synchronous embeddings, and I kind of started a podcast simonwillison.net/2024/Nov/22/

Simon Willison

Wrote about a delightfully subversive use of a Bluesky custom labeler: displaying labels on accounts belonging to British public figures showing which expensive private school they went to and what the current fees are for that school
simonwillison.net/2024/Nov/22/

heracl.es

@simon Imagine such labelers provided by public watchdogs and rights-awareness groups!
Though bad actors can also quickly turn that to a doxxing tool, so a labeler for labelers will be needed.

Simon Willison

We need input from our community 💚 Take and re-share the 2024 #Django Developers Survey – jb.gg/75ipes - collab with @jetbrains, makers of @pycharm 🫶

Simon Willison

Google released a new LLM today - gemini-exp-1121, hot on the heels of last week's gemini-exp-1114

It's currently at the top of the Chatbot Arena. I've updated my llm-gemini plugin to support it and used that to run my pelican on a bicycle SVG benchmark

My notes: simonwillison.net/2024/Nov/22/

Simon Willison

Amazon S3 just grew "append" support! It's only available for the more expensive, lower latency "S3 Express One Zone" bucket class but you can now append data to an object up to 10,000 times - previously you could only ever atomically replace a whole object with an updated version simonwillison.net/2024/Nov/22/

Simon Willison

Released a new version of my llm-gguf LLM plugin adding support for GGUF embedding models - which means you can use models like the bafflingly small (30.8MB in its smallest quantization) mxbai-embed-xsmall-v1 model with LLM

simonwillison.net/2024/Nov/21/

Prem Kumar Aparanji 👶🤖

@simon do you see any convergence being possible on the file format for the model weights? GGUF, ONNX, etc?

Simon Willison

TIL Fabrice Bellard has a closed-source REST server for serving LLMs (and image generation models) called TextSynth, which he's been hacking on since 2019 starting with GPT-2 simonwillison.net/2024/Nov/21/

Hynek Schlawack

@simon Fabrice will never stop making me feel inadequate as a programmer

Simon Willison

Foursquare just open sourced their 100 million place point of interest dataset! Some notes on poking around with it using DuckDB (it's Parquet files on S3) simonwillison.net/2024/Nov/20/

Simon Willison

Built my first experiment on top of Bluesky's API (actually the Jetstream WebSocket proxy) - it took ~15s of prompting in Claude to get this working: tools.simonwillison.net/bluesk

More details including the prompt transcript here: simonwillison.net/2024/Nov/20/

Simon Willison

Let's Encrypt is 10 years old today!
Let's Encrypt is a free, automated, and open certificate authority brought to you by the nonprofit Internet Security Research Group (ISRG). Huge thanks to everyone involved in making HTTPS available to everyone for free

letsencrypt.org/

#tech #technology #security #privacy #encryption #https #letsencrypt #ISRG

Django

@Some_Emo_Chick I'm not sure, but will I use certificates from a so-called phishing CA? .... a difficult question ...

Ciprian Cucuruz

@Some_Emo_Chick They were not the first (anyone remember startssl.com?) but they sure did a great job with the automation.
The web is now safer because of @letsencrypt

Kevin Karhan :verified:

@Some_Emo_Chick I do congratulate @letsencrypt even though @cacert was there way earlier and only got #cickblocked by #GAFAMs like #Apple & #Microsoft who refused to integrate it and @mozilla who didn't integrate it either.

- The reasons why are the most absurd given compromised CAs as well as free, non-#KYC-Certs were accepted without warning...

Meanwhile #LetsEncrypt can be set up fully automatically.

Simon Willison

Here's video and a bunch of links from the conversation I had today with @benjedwards about that memorable time when Microsoft Bing went feral simonwillison.net/2024/Nov/19/

Simon Willison

I used a trick to help write the shownotes: dump a Whisper transcript into Claude and prompt "List of potential articles and other resources to link to in show notes - be as comprehensive as possible, no need to provide URLs, just provide a description of each one" gist.github.com/simonw/865c1b1

Simon Willison

Looks like it's a big LLM release Monday today - so far Qwen 2.5 Turbo (API-only) and a new vision model from Mistral called Pixtral Large (open weights)

qwenlm.github.io/blog/qwen2.5-
mistral.ai/news/pixtral-large/

Jeff Triplett

@simon I wonder what kind of hardware it takes to support a 1M token context window. That's amazing.

Simon Willison

Notes on accessing Pixtral Large via LLM and llm-mistral on my blog: simonwillison.net/2024/Nov/18/

Salvatore Zappalà

@simon do you reckon this is a coincidence or is there coordination between different labs?

Simon Willison

New release of my LLM combined CLI tool and Python library for interacting with LLMs - the big new feature in 0.18 is support for async models llm.datasette.io/en/stable/cha

Simon Willison

And a new plugin release: llm-claude-3 0.9, adding support for asynchronous access to the Claude family of models github.com/simonw/llm-claude-3

Simon Willison

Video and notes from yesterday's session with @phildini talking about civic.band/ - his project to gather minutes and agendas from 100+ US local governments and make them searchable using @datasette simonwillison.net/2024/Nov/16/

Sharon Machlis

@simon @phildini @datasette Bookmarked to watch later! I'm scraping my city's agendas and minutes now and making them full-text searchable using R and the R data.table package in an R Shiny app. Someday I may move to more robust infrastructure and maybe look at adding embeddings and natural-language queries.

apps.machlis.com/shiny/framing

Simon Willison

Some notes on NuExtract, a family of small LLMs fine-tuned for structured data extraction simonwillison.net/2024/Nov/16/

James Williams

@simon Did you have much luck with this? I played around with the GGUF quant but it seemed to falter and miss fields once I deviated from the published json templates even slightly.

Simon Willison

Starting in 1.5 hours we'll be hosting the second Datasette Public Office Hours livestream - covering embeddings, vector search, enrichments and with a special guest appearance by @phildini talking about civic.band/ - come join us in our Discord: discord.gg/jFWyFW8A?event=1306

Simon Willison

Very impressed by Recraft AI - a new image generation service that can generate editable vector graphics that you can export as SVG

This seems massively more useful than tools that can only output raster graphics
simonwillison.net/2024/Nov/15/
