The hacker sent by Anthropic to calm the government's nerves about AI safety
Posted by Brajeshwar 2 hours ago
Comments
Comment by cmiles8 2 hours ago
You can’t jump up and down screaming how amazing, powerful, and dangerous your new tech is and then act surprised and annoyed when the government shows up looking to regulate it.
Their new argument now seems be that this was marketing hype/fluff that backfired, in a pretty obvious and predicable way, and now they’re trying to reset the conversation.
Comment by ChadNauseam 1 hour ago
True, you can't. But, you can think certain regulations are helpful and certain other regulations are not. And you can be annoyed when unhelpful "regulations" are put in place.
This is like if I say that pitbulls are dangerous, and then the government comes and shoots my pitbull, who I've spent a lot of effort training to not be dangerous. Then you say "well you said pitbulls were dangerous, so you can't really complain." Well, I can complain because If you took me seriously, you wouldn't have responded by shooting only my pitbull!
Think of what incentives this creates for other people. Do you think that OpenAI will be candid about the possible dangers of their technology now? They might not even release it now, seeing that Anthropic releasing their model was what got it export-controlled.
Comment by dogleash 1 hour ago
Comment by mips_avatar 1 hour ago
Comment by drtz 1 hour ago
It's entirely possible that models could be "dangerous" to fully release to the general public without guardrails and at the same time the government majorly overreacted in this case.
Releasing Mythos to selected researchers and companies at least gives those researchers a head start at addressing vulnerabilities before the model hits mainstream.
Comment by foo-bar-baz529 1 hour ago
Comment by ChadNauseam 1 hour ago
Maybe there weren't that many serious vulnerabilities in curl? It's like asking why it didn't find any vulnerabilities in fn main() {println!("hello, world");}.
Anyway, people who have used it seem to say that Mythos was better than other models at creating exploits. From cloudflare https://blog.cloudflare.com/cyber-frontier-models/
> When we ran other frontier models through the same harness, they found a fair number of the same underlying bugs, and in some cases they got further than we expected on the reasoning side too. Where they fell short was at the point of stitching the pieces together. A model would identify an interesting bug, write a thoughtful description of why it mattered, and then stop, leaving the actual chain unfinished and the question of exploitability open. What changed with Mythos Preview is that a model can now take those low-severity bugs (which would traditionally sit invisible in a backlog) and chain them into a single, more severe exploit.
Comment by dofm 2 hours ago
Comment by Bender 1 hour ago
Comment by reje 1 hour ago
You think you can become more powerful so much so the govt questions its own power? Don’t be stupid. They will simply send in the army to first seize the assets and then nationalise.
It almost seems as if very few people actually understand how the world works. If the govt thinks this is the tech to end all future tech, you think future money flows for invesment matter? Hahaha. No
Comment by micromacrofoot 1 hour ago
There's really not even a ban here, they could slot in Fable under the Opus label and no one would really be able to tell. It's all part of the same show to pump up valuation.
Comment by bloppe 1 hour ago
Comment by teaearlgraycold 2 hours ago
Comment by stvltvs 2 hours ago
I'm skeptical about the existential threat of AI, but a lot of smart people have been beating that drum for so long that people are afraid.
Comment by tychez 32 minutes ago
This bizarre social media meme that AI just performative when Opus 4.8 is just unbelievably good. As if it is so difficult to believe that a more capable model than Opus 4.8 might actually be dangerous and not just entirely a marketing stunt like a person waving to cars in a chicken outfit.
I think it is really this strange form of socialization that people have internalized an anonymous audience they are always performing to themselves. What is going to be the most popular and upvoted thing the anonymous audience agrees with is what I am going to think.
Why would anyone disagree and get downvoted by the anonymous audience like this post?
Comment by colonCapitalDee 1 hour ago
Comment by mrandish 47 minutes ago
To be clear, I'm not saying there aren't legit security concerns around Fable's release. I think Andy Jassy did forward a concerning report about an apparent jailbreak in Fable, and he probably did so in good faith. The difference is if the same concern had arisen about a new model from OAI, Google, etc the action taken would not have been inventing an all-new, hyper-extreme punitive remedy and dropping it after 5p on a Friday under a very rare mechanism forcing Ant to comply in 90 mins or be subject to immediate arrest. And the "no non-U.S. citizens anywhere, anytime" restriction is functionally unprecedented.
This is the Trump admin inventing new regulatory power that's never existed before and deploying it in a punitive way to demonstrate what can happen to those who aren't sufficiently cooperative with this administration. There are half a dozen less extreme levels of restriction, which already exist, and one of those would have been deemed sufficient had it been another company.
That said, I'm certainly no Anthropic fanboy. Anthropic did play their initial Mythos self-restriction for PR value. But I think it's likely the Mythos self-restriction was a responsible action initially suggested by their AI safety team in good faith. Giving security researchers time to evaluate it and major companies time to test it against their code bases probably was reasonable and prudent. That doesn't mean it wasn't also good for PR and brand perception. I think there are senior people inside Anthropic who are genuinely concerned about AI safety. Personally, I don't have the expertise to gauge if those concerns are justified, but I believe they believe it. I also think there are senior people at Anthropic who are focused more on building the business, doing the IPO and "winning" the silicon valley game. All of these things can be simultaneously true.
Comment by matheusmoreira 1 hour ago
Comment by ChadNauseam 1 hour ago
Comment by matheusmoreira 1 hour ago
Comment by gAI 1 hour ago
I want a company to be able to point out that its industry needs more regulation without making itself a special target.
Comment by matheusmoreira 59 minutes ago
They were calling for bans on open weight models. Bans on their competitors. Bans on anyone not as "enlightened" as them.
It is absolutely hilarious that they were the first to get regulated, and that it got to the point they had to turn off Fable as though it had been banned even for american citizens.
Comment by gAI 47 minutes ago
Source for that? Cause all I could find is:
>Our view is that regulation of frontier models should focus on empirically measured risks, not on whether a system is open-or closed-weights.
-https://www.anthropic.com/news/the-case-for-targeted-regulat...
Comment by matheusmoreira 38 minutes ago
Comment by gAI 24 minutes ago
>Clarifying the scope of a “full shutdown.” SB 1047’s “full shutdown” requirement has been a source of constant consternation for the open-source community. CalChamber explains:
>Under SB 1047, developers must build “full shutdown” capabilities into their models and may be held liable for downstream uses over which they have no control, impeding their ability to open-source their models. Ultimately, liability should rest with the user who intended to do harm, as opposed to automatically defaulting to the developer who could not foresee, let alone block, any and all conceivable uses of a model that might do harm. While recent amendments seemingly seek to narrow what is meant by “full shutdown” capabilities, the exclusions are unnecessarily difficult to interpret as drafted (full shutdown “does not mean the cessation of operation of a covered model to which access was granted pursuant to a license that was not created by the licensor…”) and altogether insufficient.
>Committee amendments simplify and clarify the definition of “full shutdown” such that the shutdown capability can be implemented into hardware used to train or run a model, rather than the model itself. The amendments also serve to exclude covered model derivatives that are outside of the developer’s control.
-https://apcp.assembly.ca.gov/system/files/2024-06/sb-1047-wi...
Comment by tennfown 2 hours ago
Comment by cyanydeez 2 hours ago
Regulation in a functional democracy: Cool, lets figure this out, write up a bill for us, do some research in congress, lets find something that makes sense.
Regulation in a function fascism: Cool, wheres my bribe? My boots not shiny, lick it till I say stop.
See, Anthropic wasn't licking enough boot when Biden got discharged and they thought Amazon and OpenAI and Elon were just going to let them capture a market without fealty to the boot.
Comment by voidfunc 2 hours ago
Comment by calvinmorrison 1 hour ago
Comment by redsocksfan45 2 hours ago
Comment by bellowsgulch 2 hours ago
Comment by binary132 1 hour ago
Comment by xeonmc 2 hours ago
Comment by boramdd 2 hours ago
Comment by speedgoose 2 hours ago
Comment by trhway 2 hours ago
Comment by sigmar 2 hours ago
>followed initial frustration Friday among some administration officials when they couldn’t immediately get Amodei on the phone, the people said.
That he didn't drop everything to talk to them seems like the major crux? But Dario doesn't even do the day-to-day operations Daniela does. Feel like Anthropic should just hire Dean Ball to be their liason or something
Comment by bonsai_spool 2 hours ago
Comment by micromacrofoot 1 hour ago
Comment by theplumber 2 hours ago
Comment by jonathanstrange 1 hour ago
Comment by james2doyle 1 hour ago
The rollout of Mythos was clearly manufactured to stoke the fears of companies that didn’t have access to it. They also bragged (for Fable) about how they "ran an external bug bounty that produced no universal jailbreaks in over 1,000 hours of testing" only for it be circumvented almost immediately.
So them standing on the high horse and saying it is _so powerful, yet so safe_ only to have that blow up in their face just made it that much easier to make an excuse to do this. Again, not disagreeing, but they made themselves the tall poppy here.
Comment by eiejeqq 1 hour ago
Business 101 - never take on an entity who has ultimate power over you and can conduct a course of action to put your existence at threat
Comment by SpicyLemonZest 1 hour ago
Comment by xiphias2 43 minutes ago
Twitter and Facebook also did what they ,,had to''.
The thing that's new here is that Antropic's growth rate was so enormous that Dario didn't have time to learn to lobby.
Comment by SpicyLemonZest 35 minutes ago
Comment by matheusmoreira 1 hour ago
Comment by tiahura 2 hours ago
Comment by 0o_MrPatrick_o0 14 minutes ago
New guy learns nessus, now tells everyone at the bar he's basically Mr. Robot.
A pox on the labs and the government. InfosecDrama.exe just took out a frontier model because a noob learned how to use a tool.
Comment by winstonp 2 hours ago
Comment by jasonlotito 2 hours ago
Comment by yieldcrv 1 hour ago
Comment by fnordsensei 2 hours ago
Say that Trump has weird elbows or something, Trump sues for defamation, they settle, bribe completed.
Comment by yeeetz 2 hours ago
OpenAI, Meta, SpaceX are savvy enough to play ball, but Anthropic's public posturing and government affairs has always seemed too aloof and intellectual
Comment by thewebguyd 1 hour ago
Comment by trhway 2 hours ago
Comment by Simon321 2 hours ago
Now they need to convince the government that they didn't mean anything of the previous things they claimed.
Comment by AbrahamParangi 1 hour ago
Comment by thewebguyd 2 hours ago
This isn't 100% Anthropic's fault, although I'm sure that's part of it. This is the current corrupt administration executing on a grudge they have against Anthropic, and the government's new found love of picking winners and losers.
Comment by yreg 1 hour ago
We now live in a world where captchas don't work, astroturfing is indistinguishable, school essays and theses don't prove any learning took place, open source maintainers gradually cease to accept stranger contributions, …
Comment by 0l 2 hours ago
FYI, this was when Dario was still at OpenAI.
Comment by hgoel 2 hours ago
IIRC Anthropic claimed to have been working with the government on securing things with Mythos, but then they seemed to have been blindsided by this.
My read is that the guys making the decision to restrict it were not the ones that Anthropic had been working with, and it's more about Anthropic getting caught between infighting within an incoherent government.
Comment by yifanl 2 hours ago
Comment by lompad 2 hours ago
Comment by theplumber 1 hour ago
Comment by FergusArgyll 1 hour ago
tic tac toe in printf https://github.com/carlini/printf-tac-toe
Recently Regex Chess: A 2-ply minimax chess engine in 84,688 regular expressions https://github.com/carlini/regex-chess
Comment by slopinthebag 1 hour ago
Anthropic reaping: Well this fucking sucks. What the fuck.
Comment by moralestapia 2 hours ago
Many such cases, he was just hungry.
Comment by BoorishBears 2 hours ago
Comment by lelandfe 1 hour ago
Comment by parl_match 2 hours ago
"It is difficult to get a man to understand something, when his salary depends on his not understanding it."
You've never heard such strong one-sided cope until you've talked to an NVDA employee about AI. I'm not even against AI. It's just that a combination of intense financial incentives around a product that provides a good simulation of the Chinese Room has really fucked peoples brains up.