I spun up a new LLM benchmark: how well can they handle this prompt?
Generate an SVG of a pelican riding a bicycle
I find the results so far utterly delightful: https://simonwillison.net/2024/Oct/25/pelicans-on-a-bicycle/
I spun up a new LLM benchmark: how well can they handle this prompt? Generate an SVG of a pelican riding a bicycle I find the results so far utterly delightful: https://simonwillison.net/2024/Oct/25/pelicans-on-a-bicycle/ 8 comments
The Llama models I tried both did terribly, but Gemini 1.5 Flash 8B wins for weird charm (even if it doesn't really look like a pelican at all) @simon This is great! For awhile I was testing them by asking them to draw a deserted island in Processing. It was hilarious. Paul Calcraft extended this idea into an implementation of Pictionary where different vision LLMs generate SVGs and race to guess what the others are drawing and it is absolutely brilliant https://twitter.com/paul_cal/status/1850262678712856764 @simon how long until some of those models start to optimize to deal with your pelican obsession? :rofl: @simon saw this and thought of you https://bsky.app/profile/socalleslie.bsky.social/post/3l7ewtd4koe2x @simon |
OpenAI's models are quite good at it (not as good as Claude 3.5 Sonnet though)