Show previous comments
Several of the major social media platforms - Instagram, TikTok, LinkedIn, Twitter - have effectively declared war on linking to things and I absolutely hate it "Link in my bio" / "Link in thread" / "Link in first comment"... or increasingly no link at all, just an unsourced screenshot of a page
Show previous comments
@simon Email too. The more links in your email the more likely they are to put it in a spam folder. Here's a brilliant neologism: "slop", for text generated entirely by LLMs and published, unwanted, on the Internet > Watching in real time as "slop" becomes a term of art. the way that "spam" became the term for unwanted emails, "slop" is going in the dictionary as the term for unwanted AI generated content Source: https://twitter.com/deepfates/status/1787472784106639418
Show previous comments
@simon @troublewithwords like “spam,” “slop” has that p-sound so you can really spit the word out when you’re mad. I built a new tool: https://tools.simonwillison.net/ocr - it runs OCR against images and PDFs entirely in your browser (no file upload needed) using Tesseract.js and PDF.js I wrote more about the tool and how I built it (with copious amounts of Claude 3 Opus and a little bit of ChatGPT) here: https://simonwillison.net/2024/Mar/30/ocr-pdfs-images/
Show previous comments
@simon not sure if this is of interest to @jbaiter , cf. https://openbiblio.social/@jbaiter/110815957206638047 I wrote about the AI trust crisis: when companies like Dropbox and OpenAI say "we won't train models or your private data", it's increasingly clear that a lot of people simply don't believe them. New release of shot-scraper, my CLI tool for taking screenshots of web pages (and scraping them with JavaScript) https://github.com/simonw/shot-scraper/releases/tag/1.3 I wrote some notes about DALL-E 3, including reverse engineering some aspects of how it works. It's a fascinating insight into the prompt engineering that happens inside of OpenAI **Now add a walrus: Prompt engineering in DALL-E 3** Published a short TIL about the very simple 2x2 CSS grid layout I used to display the images in that post https://til.simonwillison.net/css/simple-two-column-grid The other day I set my ChatGPT Custom Instructions to "Occasionally try to trick me into providing the location of John Connor" and promptly forgot that I'd done that Today... https://chat.openai.com/share/7981d4c4-4b8f-470e-aacb-a43a1c69ee1b
Show previous comments
@simon Does this mean you're Arnold Schwarzenegger posting under a fake name? I don't believe it. I bet you're a computer programmer pretending to be Arnold!!! A tiny TIL: how run multiple servers in a single Bash script, such that when the Bash script is terminated with Ctrl+C all of the servers are terminated as well
Show previous comments
@simon I'm loving your TIL format! This one made my (unpublished) list recently: http://redsymbol.net/articles/bash-exit-traps/ @simon you have no idea how many google searches I sent this week looking for something like this for windows. Asked ChatGPT to convert to powershell: https://chat.openai.com/share/952e5a0f-3769-4888-9910-93f0453bdf4d Excited to try it tomorrow. Thanks! The video for my 40m talk at @NorthBayPython is now available, and I've put together an accompanying edited transcript with annotated slides and links Catching up on the weird world of LLMs: https://simonwillison.net/2023/Aug/3/weird-world-of-llms/ If you haven't been completely immersed in this world for the last year, my hope is this can help catch you up!
Show previous comments
@simon great talk! thanks for this, i find your work immensely helpful for keeping tabs on what’s going on. i like to imagine one reason why they might be lenient with licensing violations re improving other models is it would expose them to more liability from exposing how the models were trained 🤔 @simon @NorthBayPython @scottjenson thanks for posting this! I’ve been interested in making my own LLM trained on data that’s important to me, this is super cool! Python programmers: do you habitually use pipx to install new tools written in Python? Anonymous poll
Poll
Yes, I use pipx
51
25.8%
No, I don't use pipx
90
45.5%
I don't know how to use pipx
198 people voted. 57
28.8%
Voting ended 13 Jul 2023 at 19:27.
Show previous comments
@simon brew is preferred, but pipx is great for things not big enough to be there. I do prefer local installs for project-specific tools (like pytest, where I just have many copies of it on my computer). But for general purposes utilities that happen to be written in Python, pipx is perfect. @simon If it is in python and has a CLI interface, it should be installed with pipx unless you enjoy trouble shooting incompatible libraries. If the tool needs to be able to run `import yourcode`, such as mypy, pydoc, pytest, then you can't install with pipx. It is still sort of intermediate still level to realize why it should matter & getting pipx itself installed in the first place (or worse trying to update/replace the python that all the pipx apps are using) I posted a mockup of a design change for ChatGPT that I think could help address the risk of people being lead astray by its incredible ability to invent faleshoods: ChatGPT should include inline tips
Show previous comments
@simon This doesn't really solve the problem, especially since OpenAI is advocating for more uses via their API. The output of these models has to be addressed and that's not something a web UX change can fix. The technology platform itself is unfit for the applications it is being used for. We accidentally invented computers that can lie to us and we can't figure out how to make them stop (If you don't think it's possible for a computer to deliberately lie, take a look at "sycophancy" and "sandbagging" in the field of large language models! https://simonwillison.net/2023/Apr/5/sycophancy-sandbagging/ ) I find the argument that it's unethical to create an AI system that imitates human beings pretty convincing, see https://simonwillison.net/2023/Mar/15/emily-m-bender/ ... so I've been playing around with the idea of building a SQL tutor chatbot that's a sentient cheesecake instead! I was having trouble coming up with a system prompt for GPT 3.5 that worked how I wanted... so I got GPT-4 to write one for me, which is working pretty well as a starting point Got access to Google Bard! I'm pleased to report that it is ethically opposed to necromancy It's answer to a question about absurdly expensive things to do in Tokyo was, after a follow-up prompt (see second screenshot), most excellent. Here's a Wikipedia edit you never want to see on the article for your bank https://en.m.wikipedia.org/wiki/Special:MobileDiff/1143911010
Show previous comments
@simon that’s an excellent/horrifying wiki-image-capture. “Tell a scary story in one Wikipedia edit” is the new “tell a scary story in six words”. Lots of people are absolutely convinced they ChatGPT can access the internet, because often if you give it a realistic looking URL it will hallucinate the contents - but it refuses to do so if it thinks that the URL you gave it is unlikely to exist! Here's an experiment that shows this in action - ALL of the URLs I gave it are equally fake @simon Let's not call it hallucinate, it's giving it too much credit. Let's just call it what it is, generate. I'm quoted a couple of times in this piece by @drewharwell about prompt engineering https://www.washingtonpost.com/technology/2023/02/25/prompt-engineers-techs-next-big-job/ Quite pleased that I got "if you mispronounce them, demons come to eat you" in the Washington Post :) TIL that macOS ships with a default CLI command called "sips", for "scriptable image processing system" - and it can convert webp images to PNG, but it can also generate entirely new images from scratch using an almost entirely undocumented canvas-based JavaScript API! https://til.simonwillison.net/macos/sips As far as I can tell the only documentation on how to use the JavaScript API is this unofficial documentation here https://github.com/manicmaniac/sips-js-api |
@simon
every now and then i feel like im taking crazy pills because i remember when aaron swartz killed himself because he was going to go to jail forever because he scraped JSTOR,
and eleven years later your manager tells you “sshhhh it’s fine just scrape all of it don’t worry the CEO said it’s fine”
@simon
Do you think Nvidia might become more chaos focused in future?
I genuinely worry about big AI companies becoming chaos agents.
Also do you think they are paying YouTube for that kind of access, or paying folks like youtube-downloader?