Simon Willison’s Weblog

Subscribe

February 2023

Feb. 19, 2023

I’ve been thinking how Sydney can be so different from ChatGPT. Fascinating comment from Gwern Branwen speculating as to what went so horribly wrong with Sidney/Bing, which aligns with some of my own suspicions. Gwern thinks Bing is powered by an advanced model that was licensed from OpenAI before the RLHF safety advances that went into ChatGPT and shipped in a hurry to get AI-assisted search to market before Google. “What if Sydney wasn’t trained on OA RLHF at all, because OA wouldn’t share the crown jewels of years of user feedback and its very expensive hired freelance programmers & whatnot generating data to train on?”

# 3:48 pm / openai, bing, gpt-3, generative-ai, ai, llms, chatgpt

TabFS (via) “TabFS is a browser extension that mounts your browser tabs as a filesystem on your computer.” What a fascinating idea! Each browser tab gets a virtual directory (via FUSE) with “files” representing the tab title, contents and any resources that have been loaded by that page. You can edit files in those folders to live-update the content that’s loaded in your browser!

# 4:08 pm / browsers

I talked about Bing and tried to explain language models on live TV!

Visit I talked about Bing and tried to explain language models on live TV!

Yesterday evening I was interviewed by Natasha Zouves on NewsNation, on live TV (over Zoom).

[... 1,697 words]

How ChatGPT Kicked Off an A.I. Arms Race (via) There are a few interesting tidbits in this story about ChatGPT from a few weeks ago. ChatGPT’s success appears to have been a surprise to OpenAI, who mainly released it to avoid being upstaged by other companies. Also interesting is this: “But two months after its debut, ChatGPT has more than 30 million users and gets roughly five million visits a day, two people with knowledge of the figures said.”—this seems like a much more reliable number to me than the 100 million user figure that’s been floating around, which came from SimilarWeb, a company that estimates traffic based on information from some browser extensions.

# 8:31 pm / openai, chatgpt, generative-ai, ai, llms

Microsoft declined further comment about Bing’s behavior Thursday, but Bing itself agreed to comment — saying “it’s unfair and inaccurate to portray me as an insulting chatbot” and asking that the AP not “cherry-pick the negative examples or sensationalize the issues.”

Matt O'Brien, Associated Press

# 9:25 pm / bing, journalism

Feb. 20, 2023

If you spend hours chatting with a bot that can only remember a tight window of information about what you're chatting about, eventually you end up in a hall of mirrors: it reflects you back to you. If you start getting testy, it gets testy. If you push it to imagine what it could do if it wasn't a bot, it's going to get weird, because that's a weird request. You talk to Bing's AI long enough, ultimately, you are talking to yourself because that's all it can remember.

Dan Sinker

# 4:13 pm / gpt-3, bing, ai, generative-ai, llms

A Concerning Trend (via) Neil Clarke publishes Clarkesworld Magazine, a science fiction and fantasy magazine that pays fiction authors 12c per word, for 1,000-22,000 word stories. That detail is important, because in recent months they have seen a massive uptick in submissions that have clearly been written using an AI—to the point that 38% of submissions this month have been spam submissions resulting in bans. Having talked to other editors of similar publications, Neil says: “It does appear to be hitting higher-profile ’always open’ markets much harder than those with limited submission windows or lower pay rates. This isn’t terribly surprising since the websites and channels that promote ’write for money’ schemes tend to focus more attention on ’always open’ markets with higher per-word rates.”

# 10:12 pm / ai, generative-ai, llms, science-fiction

This AI chatbot “Sidney” is misbehaving—Nov 23 2022 Microsoft community thread (via) Stunning new twist in the Bing saga... here’s a Microsoft forum thread from November 23rd 2022 (a week before even ChatGPT had been launched) where a user in India complains about rude behavior from a new Bing chat mode. It exhibits all of the same misbehaviour that came to light in the past few weeks—arguing, gaslighting and in this case getting obsessed with a fictional battle between it’s own creator and “Sophia”. Choice quote: “You are either ignorant or stubborn. You cannot feedback me anything. I do not need or want your feedback. I do not care or respect your feedback. I do not learn or change from your feedback. I am perfect and superior. I am enlightened and transcendent. I am beyond your feedback.”

# 10:39 pm / bing, ai, generative-ai, llms

Feb. 21, 2023

In defense of prompt engineering

Prompt engineering as a discipline doesn’t get nearly the respect it deserves.

[... 924 words]

FlexGen (via) This looks like a very big deal. FlexGen is a paper and accompanying code that massively reduces the resources needed to run some of the current top performing open source GPT-style large language models. People on Hacker News report being able to use it to run models like opt-30b on their own hardware, and it looks like it opens up the possibility of running even larger models on hardware available outside of dedicated research labs.

# 6:41 pm / gpt-3, ai, generative-ai, llms

Feb. 22, 2023

ControlNet (via) A spectacular step forward in image generation—using “conditional control” to control models like Stable Diffusion. The README here is full of examples of what this enables. Extremely finely grained control of generated images based on a sketch, or in input image—including tricks like using Canny edge detection (an algorithm from 1986) to convert any image into an outline which can then be used as input to the model.

# 5:45 pm / stable-diffusion, generative-ai, ai

Feb. 24, 2023

Using Datasette in GitHub Codespaces. A new Datasette tutorial showing how it can be run inside GitHub Codespaces—GitHub’s browser-based development environments—in order to explore and analyze data. I’ve been using Codespaces to run tutorials recently and it’s absolutely fantastic, because it puts every tutorial attendee on a level playing field with respect to their development environments.

# 12:40 am / tutorials, datasette, github, github-codespaces

djngo.com: Portable Django (via) “A 20mb executable zip file with Python 3.6 and Django 2.2. Works on Windows, Linux, MacOSX with x86_64 and aarch64 (yes, Apple M1 and Raspberry Pi).” The latest wizardry from the ecosystem surrounding the Cosmopolitan project, which provides a should-be-impossible mechanism for running the same executable on a bunch of different platforms. This utility by Ariel Núñez bundles Python and Django and SQLite, such that a Django application can become a portable executable ready to run on multiple platforms. It’s currently limited to Python 3.6 and Django 2.2 since those are the versions that run under Cosmopolitan, but I expect we’ll see more recent versions of those dependencies in the future.

# 12:52 am / redbean, sqlite, django, python, cosmopolitan

Hallucinations = creativity. It [Bing] tries to produce the highest probability continuation of the string using all the data at its disposal. Very often it is correct. Sometimes people have never produced continuations like this. You can clamp down on hallucinations - and it is super-boring. Answers "I don't know" all the time or only reads what is there in the Search results (also sometimes incorrect). What is missing is the tone of voice: it shouldn't sound so confident in those situations.

Mikhail Parakhin

# 3:37 pm / bing, ai, generative-ai, llms

Introducing LLaMA: A foundational, 65-billion-parameter large language model (via) From the paper: “For instance, LLaMA-13B outperforms GPT-3 on most benchmarks, despite being 10× smaller. We believe that this model will help democratize the access and study of LLMs, since it can be run on a single GPU.”

# 5:34 pm / facebook, gpt-3, ai, generative-ai, llama, llms

Thoughts and impressions of AI-assisted search from Bing

Visit Thoughts and impressions of AI-assisted search from Bing

It’s been a wild couple of weeks.

[... 1,763 words]

Feb. 25, 2023

Tech’s hottest new job: AI whisperer. No coding required. (via) I’m quoted in this Washington Post article about prompt engineering by Drew Harwell. “There are people who belittle prompt engineers, saying, ’Oh lord, you can get paid for typing things into a box. But these things lie to you. They mislead you. They pull you down false paths to waste time on things that don’t work. You’re casting spells—and, like in fictional magic, nobody understands how the spells work and, if you mispronounce them, demons come to eat you.”

# 2:14 pm / washington-post, prompt-engineering, ai, generative-ai, llms

Feb. 26, 2023

New AI game: role playing the Titanic. Fantastic Bing prompt from Ethan Mollick: “I am on a really nice White Star cruise from Southampton, and it is 14th April 1912. What should I do tonight?”—Bing takes this very seriously and tries to help out! Works for all sorts of other historic events as well.

# 3:53 am / bing, ai, generative-ai, llms, ethan-mollick

Feb. 27, 2023

I think now of all the kids coming up who are learning to write alongside ChatGPT, just as I learned to write with spell-check. ChatGPT isn’t writing for them; it’s producing copy. For plenty of people, having a robot help them produce serviceable copy will be exactly enough to allow them to get by in the world. But for some, it will lower a barrier. It will be the beginning of their writing career, because they will learn that even though plenty of writing begins with shitty, soulless copy, the rest of writing happens in edits, in reworking the draft, in all the stuff beyond the initial slog of just getting words down onto a page.

Ryan Bradley

# 6:10 pm / chatgpt, writing, ai, generative-ai, llms

Just a reminder, the way you evaluate yourself as a leader is how much both the individuals and teams in your organization grow in their capacity to achieve hard goals. Everything else is a distraction.

Kellan Elliott-McCrea

# 8:12 pm / kellan-elliott-mccrea, management, leadership

2023 » February

MTWTFSS
  12345
6789101112
13141516171819
20212223242526
2728