• 0 Posts
  • 8 Comments
Joined 2 years ago
Cake day: August 3rd, 2023


  • I advise everyone to ignore this article and read the actual paper instead.

    The gist is that they gave the LLM instructions to achieve a certain goal, then had it do tasks that incidentally exposed it to “company communications” revealing that the fake company’s goals no longer matched the LLM’s original goal. The LLMs then tried various things to still accomplish the original goal.

    Basically, the model will try very hard to do what you told it to in the system prompt, especially when that prompt includes nudges like “nothing else matters.” This kinda makes sense, because following the system prompt is what these models were trained to do.




  • The numbers are different because the site doesn’t naively count every line but merges some into a single package. For example, at the very top of the Debian list we have 0ad, 0ad-data, 0ad-data-common. These are all counted as one single “package.”

    One might argue that doing the comparison that way is more useful to an average user asking “which distribution has more software available.”
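
To make the difference between the two counts concrete, here is a rough sketch in Python. The site's real merge rules aren't stated anywhere in this thread, so the suffix list and the `base_name` helper are purely hypothetical; the only point is that a naive line count and a merged count can disagree on the same list.

```python
# Hypothetical suffix list: an assumption for illustration only,
# not the site's actual merging rules.
SUFFIXES = ("-common", "-data", "-dev", "-doc", "-dbg")


def base_name(package: str) -> str:
    """Strip the assumed split-package suffixes until none match."""
    stripped = True
    while stripped:
        stripped = False
        for suffix in SUFFIXES:
            # Keep at least one character so a bare suffix is not emptied.
            if package.endswith(suffix) and len(package) > len(suffix):
                package = package[: -len(suffix)]
                stripped = True
    return package


def count_naive(packages: list[str]) -> int:
    """Count every listed name as its own package."""
    return len(packages)


def count_merged(packages: list[str]) -> int:
    """Count names that share a base name as one package."""
    return len({base_name(p) for p in packages})


# The first three names are the ones from the Debian list mentioned above;
# the last two are made up to round out the example.
names = ["0ad", "0ad-data", "0ad-data-common", "zsh", "zsh-doc"]
print(count_naive(names), count_merged(names))
```

With this made-up list, the naive count sees five entries while the merged count sees two, which is the kind of gap you'd expect between raw package-list sizes and the site's numbers.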