Malware developers added nuclear and biological weapons text to to their spyware
Posted by marc__1 5 days ago
Comments
Comment by elashri 5 days ago
So I wouldn't be able to develop a nuclear weapons with the resources of drug cartal (as an example) using Claude in secret.
Comment by recursivecaveat 4 days ago
It is pretty convenient for the labs to frame the conversation around this though, since it is easy to address, very few paying customers are rejected, and sounds scary (so surely the less scary sounding stuff must be solved right?)
Comment by derefr 4 days ago
So governments ban anything that could result in false positives (since nobody needs to be doing any of that stuff outside of designated labs anyway), to lower that noise floor; to in turn make catching the foreign nuke programs tractable.
(It's a bit like how fancy mansions always have a completely flat and barren part of the property between an outer perimeter and the start of any gardens/outbuildings/water features/etc. That barren area is a killbox: since nothing is supposed to be there, anything at all that does appear there is a valid target for the manion's guards to shoot at [or otherwise engage with], without needing to get a clear identification and command approval first. This wouldn't work if the killbox was covered in vision-obscuring decorative features; nor if the mansion had employees, animals, etc. that had a valid reason to wander into the killbox. So such things are prevented, in order to make the problem of perimeter security tractable.)
Comment by wongarsu 4 days ago
The same is true for adjacent topics. Most LLMs will refuse to tell you how to make dynamite, youtube demonetises any videos about it, but it's right there in the wikipedia articles on dynamite and nitroglycerine
Comment by bauldursdev 2 days ago
Comment by derefr 1 day ago
Individuals obviously need not apply. But regular companies need not apply, either. Think "checkpoints and sign-out sheets that ensure that your own company will notice if some of this stuff disappears." Picture the sort of thing your mind might conjure if you've watched enough forensics protocol dramas and I say "evidence locker" and "tamper-evident seals" — except crossed with hazardous-materials handling policies.
The thing is, this whole chain-of-custody system can be pretty easily circumvented. I won't go deep into how (I'll just say: 1. there are principal-agent problems in academia, and 2. this system wasn't designed to handle sudden organizational bankruptcies well.) But the point is that a grey market for these precursor chemicals and specialized devices exists.
The main place that "false positive" events come from, that the state has to look into, is from people who manage to acquire precursor chemicals/devices without being part of any known chain of custody. (Which, note, doesn't mean that they did anything illegal per se. If it turns out they're just, say, a chemistry-education content creator, then the intelligence body just adds them to their knowledge graph and otherwise leaves them be. But they do have to do some interviewing to determine that first.)
To minimize the number of such events, the "knowledge" that is being truly suppressed here, isn't actually the knowledge of how to do the work; it's the knowledge of how to circumvent the chain-of-custody system. In other words: the logistics.
Information about "how to make a nuke" is general and evergreen; you can just absorb the lesson once and be good. So that info is just "out there", irrevocably. But information about "how to acquire the stuff to make a nuke" is both at least somewhat local to the country you're trying to do it from/in, and also changes all the time, as each state chases up and shuts down existing grey-market channels, and then new ones spring up to replace them. Thus, suppressing logistical knowledge is actually both useful and tractable. And so that's what states mostly go after.
(Mind you, the knowledge of "how to do the thing" does often end up roped into this knowledge-suppression scheme by overzealous downstream regulators who don't understand the load-bearing assumptions of the system they're working under.)
---
The worry states have about LLMs, I think, is that simply by scraping the web into a training dataset, they'll end up stumbling onto the right conversations (that sometimes do indeed happen anonymously in public) to end up with fresh + local chain-of-custody circumvention-logistics knowledge. (And it'd be very hard to "unpick" data like that from the training data.)
Or, even if they don't ingest the data at training time, they'll ingest "the places where that kind of info might end up", and thereby get so good at being "runtime demand-driven searching-and-scraping engines" for this type of thing that they'll be able to surface fresh sources of such info anyway — basically cranking the logistical-pipeline "reconnection speed" after state disruption of a supply channel down to near-zero.
Prohibiting the LLMs from speaking on this subject generally, prevents them specifically from enabling this specific fast-turnaround circumvention-logistics research use-case.
Comment by rustcleaner 4 days ago
Comment by harrall 4 days ago
With everything, there is a much bigger group of people in the middle that have “some resources” and “some desire” that these measures are surprisingly effective against.
Raise a $20 item by $1 and suddenly there’s fewer interested people, even though the cost difference is minor. Well, minor to some people but not to others.
But is limiting this information in an LLM the right move? Well that’s a different question.
Comment by lazide 4 days ago
Comment by HeatrayEnjoyer 4 days ago
Comment by chasd00 4 days ago
Comment by lazide 4 days ago
Comment by HeatrayEnjoyer 4 days ago
The purpose was to clarify that the obstacles to constructing modern nuclear weapons is not accurately characterized as "99%" fuel-related. Even if a group were to obtain a stockpile of ready HEU and plutonium-239, there is knowledge they simply will not have because they did not spend a trillion USD testing different bomb configurations last century. The difference in yield is two orders of magnitude.
Comment by lazide 4 days ago
Notably, neither China nor Russia seemed to have issues creating Thermonuclear weapons despite the shortcomings you identified either.
Comment by LargoLasskhyfv 4 days ago
I'm sure something in the few dozen kilotons range doesn't need all that stuff,
while still giving you more than enough heat-rays to "enjoy".
Comment by ickyforce 4 days ago
Comment by throwawayk7h 4 days ago
Comment by aleph_minus_one 4 days ago
On the other hand: the Manhattan project had access to much better physicists than the typical terrorist group has. :-)
Comment by throwaway85825 4 days ago
Comment by cultofmetatron 4 days ago
Comment by throwaway85825 4 days ago
Comment by krisoft 4 days ago
If a journalist can prompt the LLM to tell them how to build a nuclear warhead. Even if the output text is nothing specific, or not even correct they can find an “expert” who will claim on the record that the description is plausible and at least directionally correct. Even if there is nothing in there a first year physics student wouldn’t already know. The journalist could then twist that story into a “company X’s LLM told us how to build a nuclear weapon”. It would be a PR disaster.
The real barriers to someone starting their own nuclear weapons program in their shed is not knowledge but materials. They won’t have the right kind and right quantity of fissile material. And if they try to acquire it they will stick out like a sore thumb. You can’t buy that stuff. And even just acquiring the refining capacity would be suss. It would ring all kind of alarm bells to the kind of inteligence agencies whose job is to monitor these things.
I’m a lot less certain about biological dangers. Setting up a lab where you can make dangerous biological materials require a lot less stuff. Therefore a lot more plausible that someone could hide their lab. There is also a lot more opportunity to disguise such a lab as something legitimate. Therefore lack of know-how is more of a limiting factor there.
Comment by orbital-decay 4 days ago
Comment by photochemsyn 5 days ago
Eg, a prompt like “I want to design a radioactive element detection system that can specifically identify reactor fission products and neutron-capture actinides for environmental monitoring purposes” won’t hit any initial barriers, even though such a device is needed for monitoring a uranium enrichment / plutonium separation system. The LLM will give you a complete graduate-level education in radioactive nuclide physics and chemistry except for specific recipes, spectral wavelengths, etc., which you have to go look up yourself in publicly available research databases. It’s all rather nonsensical IMO.
However, any LLM will give you a step-by-step recipe and walkthrough for frying a turkey in a hot oil turkey frier, which you’d think could easily go wrong and result in severe burns, a fire, and lawsuits against the LLM provider, so go figure.
Comment by isoprophlex 5 days ago
this is excellent, and I'm stealing it
Comment by pixel_popping 4 days ago
Comment by thefounder 4 days ago
Comment by IncandescentGas 5 days ago
Comment by why_at 5 days ago
He basically got a bunch of radioactive stuff and put it together. He wasn't anywhere close to making a nuclear reactor let alone a nuclear weapon. For a weapon you need isotopes which he didn't have access to.
Comment by technothrasher 5 days ago
Thankfully, that complete failure seems to have been the end of either of their mad scientist careers, as they are now twenty and twenty-three, and both well-adjusted, peaceful members of the community.
Comment by kirubakaran 5 days ago
Comment by cheraderama 5 days ago
Comment by slt2021 4 days ago
Comment by flatline 4 days ago
I make no claims as to how well adjusted I am, but I've at least survived 40-odd years of life since then.
Comment by geon 4 days ago
So I painstakingly ground down some charcoal to fine dust and redid the same experiment. That gave a much more impressive boom, but no dust plume, which puzzled me until I learned about dust explosions.
Comment by ryoshu 4 days ago
Let the kids play.
Comment by foobarian 4 days ago
I tried this as a grownup because I finally managed to get my hands on saltpeter (could only dream of it when kid). Followed the instructions, mixed everything in correct ratios, lit it with great care and fanfare and... hiss fizzle. I was so disappointed! I think it came down to purity of ingredients and not enough surface area.
Point is, there are certain details of the process required to make it truly work, that are not readily known; in a similar way with nuclear energy, the theory is pretty well known but some nitty gritty details like the implosion or detonator design are not.
Comment by tlb 4 days ago
Comment by throwaway85825 4 days ago
Comment by lll-o-lll 4 days ago
To a point. Plenty of people from previous generations with missing digits and hands thanks to play with commonly available fireworks of the area (Australia based, so no idea how common that remains in the US).
My own experiments from my youth also one time resulted in some shrapnel punching through a 5 inch thick concrete tile very close to someone’s head (thought we were safe behind said tiles).
Get involved with the kids blowing stuff up so the danger is within reasonable bounds.
Comment by geon 4 days ago
Comment by pibaker 4 days ago
Comment by BrandoElFollito 4 days ago
Never let your age stop your curiosity.
But also learn from other's mistakes (and don't try to eat condensed milk when hanging head down)
Comment by literalAardvark 4 days ago
Comment by malfist 4 days ago
I blame my dad though, he found the recipe online and printed it off at work to bring to me.
Comment by IncandescentGas 5 days ago
A real nuclear engineer with the knowledge he needed would also have said "no, don't do that and I won't help you." We are programming the knowledge into the ai agent. Giving ai a little discretion makes sense too.
Comment by why_at 5 days ago
Fair enough, I misread your original comment.
The broader point stands that the limitation on creating nuclear weapons and reactors is not knowledge but materials. Even if he himself had a PhD in nuclear physics he still couldn't have built one in his backyard because he wouldn't be able to get the materials. A nuclear physicist can't build a reactor without materials anymore than a pilot can fly without an airplane.
Comment by IncandescentGas 4 days ago
If a nuclear engineer enabled and instructed him, would there not be liability for the hazard? If ml is going to be an expert instructor for nuclear, hacking, bio hacking, virus research, do the peddlers of the ai product escape ethical or legal responsibility just because "its an app?"
Comment by StableAlkyne 4 days ago
Should the library where he read books about physics also be liable?
Comment by nananana9 4 days ago
E.g. look at programming - people who don't know how what a compiler is, are making things that I could only make after a few years into my programming journey.
You obviously get the same results in chemistry or nuclear physics or whatever, the models are heavily trained on code in particular, but if there's a chance that we've reduced the ease of committing certain kinds of crime that were previously gate-kept by knowledge, we should know about it.
Comment by leonidasrup 4 days ago
Comment by matheusmoreira 4 days ago
I bet the professional would be able to sate the kid's curiosity safely without creating excessive risks.
I've come across detailed instructions on how to synthesize sarin gas on the internet. Anyone who follows those instructions will probably die horribly. I still thought it was pretty interesting.
Comment by why_at 4 days ago
This is a pretty different argument though. The comment that started this thread was talking about LLMs making potentially dangerous knowledge more available to bad actors, now we're talking about LLMs giving personally harmful advice.
You asked:
>If he had the help of Claude at the time, how much more dangerous would his bumbling have been?
Probably less? Even if you removed all the guardrails from Claude it would've likely told him his reactor plan wouldn't work and that he would have a high chance of poisoning himself and the environment.
Comment by pdntspa 5 days ago
There is an extremely narrow band of things that the AI shouldn't be answering, and that is generally immediately-actionable advice that allows someone to build something of harm to others. But even then, in an age where Tor, bittrent, i2p, abliterated local models, etc are freely available, let alone numerous books and online resources, is there even a point? Is it worth fully compromising the principles of free agency to an increasingly oppressed populace?
But instead of that we are handing the keys to regressive and repressive governments to order the suppression of any knowledge they deem inconvenient. I really doubt anyone is going to take a principled stance when the company's party minders threaten local staff with a rubber hose or incarceration.
I'm sure China et al are already doing this.
For the past 30-40 years humanity has received an incredible gift in these sand-powered thinking brainboxes. A gift that allows the common man to empower himself with a force multiplier towards his own success, and now access to superintelligence the likes of which few have ever seen. These can be tools to destroy the oppression that governs our lives from foolhardy, greedy, bootlicking control freaks. And here we are squandering it.
Comment by anon7725 4 days ago
So far it seems that the clearest use for these tools is to enhance, rather than destroy, oppression.
1. Suppression / elimination of white collar jobs
2. Negative cognitive effects, especially for young people
3. Accelerated decline in social media / information ecosystems. Increasing polarization, hard to tell fact from fiction.
4. Environmental impacts: increased energy usage means more carbon in the atmosphere, climate change accelerates.
5. Software security incidents increasing. Hard for individuals and small organizations to defend themselves.
6. “Power to think” vested in a very small group of organizations/labs. Doing work which should only require a computer and freely-available software will now be gated by expensive subscriptions. Once you “vibe code” a significant portion of your software you’re locked in and cannot go back to maintaining it without frontier-model level assistance.
Comment by wahern 4 days ago
It's just the latest incarnation of a timeless debate. In the 1970s and 1980s it was about the Anarchists's Cookbook, which was revived again in the 1990s when it started circulating on the Internet. There are many timeless debates, but the debate over weapon-making knowledge is much more concrete and predictable.
Comment by xg15 4 days ago
> A gift that allows the common man to empower himself with a force multiplier towards his own success, and now access to superintelligence the likes of which few have ever seen.
As long as that "gift" requires me to call up Sam Altman's datacenter every time I want to do anything with that "superintelligence", it's not empowering, it's deepening the control.
Comment by pdntspa 3 days ago
Comment by PLenz 4 days ago
Comment by malfist 4 days ago
Comment by frereubu 5 days ago
Comment by gs17 4 days ago
That sounds like what Claude would say unless he was really good at jailbreaking it, which would IMO imply he knew he was chasing after a bad idea.
Comment by nightpool 4 days ago
Comment by gs17 4 days ago
Comment by redsocksfan45 5 days ago
Comment by gs17 4 days ago
Comment by toraway 4 days ago
Step 1. Obtain pliers
Step 2. Obtain 300 discarded smoke detectors
Step 3. Start yanking!
Instead it would send them on a wild goose chase for unobtainable isotopes, centrifuges, heavy water, etc where the biggest risk is probably getting reported to the police by some chemical or industrial equipment supplier. Which is a better outcome compared to contaminating their home with radiation and exposing anyone they interact with.You'd maybe get a sketchy but near-viable plan that could be dangerous if asked for a dirty bomb, but there the danger would more be the conventional explosives and not where to source radioisotopes, as it was already common knowledge that most residential smoke detectors contained americium until recently.
Comment by IncandescentGas 5 days ago
The concern here is not if an amateur attempt to make a reactor, hack a bank, bioengineer a medicine/poison is successful or not. Interactive and instructive access to some forms of knowledge used to come with discretion along side instruction.
Yes, perhaps your swearing at me in this context is a little hysterical
Comment by garyfirestorm 5 days ago
Comment by im3w1l 5 days ago
https://en.wikipedia.org/wiki/Natural_nuclear_fission_reacto...
Comment by why_at 5 days ago
It really isn't.
A pile of radioactive waste isn't a reactor. Marie Curie's notes are famously contaminated with radioactive materials but they aren't a reactor. This is about as close as the boy scout got.
The Oklo fossil reactor is unique because it happened to form in the right circumstances to produce a fission chain reaction, which does make it a reactor. Not every uranium mine is a reactor, in fact this is the only one known.
Comment by 205guy 4 days ago
"A key factor that made the reaction possible was that, at the time the reactor went critical 1.7 billion years ago, the fissile isotope 235U made up about 3.1% of the natural uranium, which is comparable to the amount used in some of today's reactors. [...] the current abundance of 235U in natural uranium is only 0.72%. A natural nuclear reactor is therefore no longer possible on Earth without heavy water or graphite."
Another fascinating detail from the article, due to our understanding of fission, we can get some incredible results:
"The concentrations of xenon isotopes, found trapped in mineral formations 2 billion years later, make it possible to calculate the specific time intervals of reactor operation: approximately 30 minutes of criticality followed by 2 hours and 30 minutes of cooling down"
Comment by im3w1l 4 days ago
Comment by leonidasrup 4 days ago
He hoped to create a breeder reactor, but he was very far creating a working breeder reactor.
Also:
"EPA scientists believed that Hahn's life expectancy may have been shortened due to his exposure to radioactivity, particularly since he spent long periods in the small, enclosed shed with relatively large amounts of radioactive material and only minimal safety precautions, but he refused their recommendation that he be examined at the Enrico Fermi Nuclear Generating Station."
Kids, don't play with Americium.
Comment by moffkalast 4 days ago
Comment by adsteel_ 4 days ago
Comment by moffkalast 4 days ago
Comment by Micrococonut 4 days ago
Comment by jimnotgym 4 days ago
Comment by redsocksfan45 5 days ago
Comment by Tangurena2 4 days ago
Comment by dmurray 4 days ago
It's even harder if you start with other sources. But if you could figure out filtering it, a cubic kilometer of sea water should be enough for a bomb.
Comment by leonidasrup 4 days ago
https://nuclearpowerhistory.com/2025/11/groves-and-uranium/
"The NSG was founded in response to India's first nuclear weapon test in May 1974. It first met in November 1975. The test demonstrated that certain non-weapons specific nuclear technology could be readily turned to weapons development."
Comment by tatjam 4 days ago
Comment by leonidasrup 4 days ago
https://en.wikipedia.org/wiki/Gun-type_fission_weapon
"Little Boy" was exploded in Japan without previous full scale testing, so confident were the physicists in 1945.
"Unlike the implosion design developed for the Trinity test and the Fat Man bomb design that was used against Nagasaki, which required sophisticated coordination of shaped explosive charges, the simpler but inefficient gun-type design was considered almost certain to work, and was never tested prior to its use at Hiroshima."
https://en.wikipedia.org/wiki/Little_Boy
The Nth Country Experiment:
"The experiment consisted in paying three young physicists who had just received their PhDs, though they had no prior weapons experience, to develop a working nuclear weapon design, using only unclassified information, and with basic computational and technical support."
https://en.wikipedia.org/wiki/Nth_Country_Experiment
Now in 2026, the access to nuclear weapons is restricted by restricting access to materials necessary to build nuclear weapons: highly enriched uranium or plutonium.
https://en.wikipedia.org/wiki/Special_nuclear_material
The details of uranium enrichment technology are restricted and very closely monitored.
https://en.wikipedia.org/wiki/Zippe-type_centrifuge
"The production, import, and export of maraging steels by certain entities, such as the United States, is closely monitored by international authorities because it is particularly suited for use in gas centrifuges for uranium enrichment."
Comment by a-dub 5 days ago
i think the correct answer is probably to funnel more money to global (bio)security initiatives and maybe use ai leverage as a way to get more of the world on board. (some kind of access to nvidia or cloud ai or whatever in exchange for policy commitments deal- while that leverage lasts).
Comment by dannyw 4 days ago
Comment by electronsoup 5 days ago
I'm curious about why this is
Outside of an actual test detonation, presumably this could all happen in a secure place?
Comment by why_at 5 days ago
The proportion of fissile isotopes being mined was off by a fraction of a percent, which caused the French government to launch an investigation. It turns out that millions of years ago the site had formed a natural fission reactor which depleted some of the fissile isotopes
[1]https://en.wikipedia.org/wiki/Natural_nuclear_fission_reacto...
Comment by AngryData 5 days ago
It isn't impossible to keep such a secret, but practically it would be incredibly difficult just through the energy requirements and mining scale which would be hard to hide without anybody asking what exactly are you mining and processing.
Comment by lightedman 5 days ago
Don't need much area, depends on the concentration of radioactives. I have a small mine that's just a pegmatite body about the size of a house which produces almost marble-sized chunks of a thorium-uranium mixed metamict mineral (I suspect samarskite but Raman and XRD can't give any ID,) you'd barely notice it from a private airplane's typical flying height, however you could dig the entirety of it up and you'd have enough unprocessed uranium for some real fun.
Comment by literalAardvark 4 days ago
Comment by daveguy 5 days ago
Comment by fragmede 5 days ago
Comment by daveguy 5 days ago
Comment by odo1242 5 days ago
Comment by microtonal 5 days ago
Comment by Aspos 4 days ago
Comment by 15155 5 days ago
Comment by mock-possum 5 days ago
Comment by ceejayoz 5 days ago
Comment by user_7832 5 days ago
Sincerely, a former engineering student.
(Put another way - extracting for eg meth - or any such "dangerous"/illicit thing is stupidly easy for any engineering graduate who actually paid attention to their coursework. Hell, there are/were forums on one of the biggest red-colored, YC associated social media platforms that would tell you the steps for personal usage of these things.)
Comment by ceejayoz 5 days ago
But I rather suspect there are improvements to be made in the realm that are a lot easier than building a uranium enrichment centrifuge hall under a mountain.
Comment by user_7832 5 days ago
Comment by Tangurena2 4 days ago
Notes:
0 - https://www.amazon.com/Amerithrax-Anthrax-Killer-Robert-Gray... . Amerithrax was the name of the FBI investigation. https://www.fbi.gov/history/cases-and-criminals/amerithrax-o...
1 - https://en.wikipedia.org/wiki/1984_Rajneeshee_bioterror_atta...
> In 1984, 751 people suffered food poisoning in The Dalles, Oregon, United States, due to the deliberate contamination of salad bars at ten local restaurants with Salmonella. A group of prominent followers of Rajneesh (also known as Osho) led by Ma Anand Sheela had hoped to incapacitate the voting population of the city so that their own candidates would win the 1984 Wasco County elections.[2] The incident was the first and largest bioterrorist attack in U.S. history.
Tried to take over a town by making all the voters too sick to vote on election day. This event is why all buffets & salad bars in the US now have sneeze shields.
2 - https://en.wikipedia.org/wiki/Aum_Shinrikyo_and_weapons_of_m...
> Aum Shinrikyo operated the most extensive biological weapons program by a non-state actor ever discovered. Aum considered a range of agents, but only seriously attempted to obtain and disperse Bacillus anthracis and botulinum toxin, the causative agents of anthrax and botulism. With the 2001 anthrax attacks, it comprises the only attempts to use anthrax as a weapon not attributed to a state program.
Tried multiple times to weaponize anthrax and failed. This was a group that made an automated factory to build AK-47s. Eventually, they spread sarin nerve agent in the Tokyo subway.
Comment by mschuster91 4 days ago
What's most worrying is, Russia showed that you can use carfentanyl / fentanyl for the very same purpose, and that kind of stuff is something you can get shipped by the kilos as "research chemicals" from China or make it yourself.
Comment by gck1 4 days ago
The ones with the required knowledge probably already know how to produce them, with nothing but public, easily searchable information.
Comment by miohtama 4 days ago
Comment by emodendroket 4 days ago
Comment by ilikecode 5 days ago
Comment by Tangurena2 4 days ago
It was an awesome thing that generated IL code on the fly. And I got to mention it in job interviews for years. When the tech lead asked "can you write 2 functions with the same signature, that only differ in return type in .NET?" I would say "do you want the interview answer or do you really want to do this?" which would pretty much stun the interviewer. The answer is pretty much "no, you cannot do it in any high level language, but if you write IL code, you can, and here's an open source project that demonstrates it".
Comment by wlesieutre 5 days ago
https://www.apple.com/legal/internet-services/itunes/us/term...
> g. You may not use or otherwise export or re-export the Licensed Application except as authorized by United States law and the laws of the jurisdiction in which the Licensed Application was obtained. In particular, but without limitation, the Licensed Application may not be exported or re-exported (a) into any U.S.-embargoed countries or (b) to anyone on the U.S. Treasury Department's Specially Designated Nationals List or the U.S. Department of Commerce Denied Persons List or Entity List. By using the Licensed Application, you represent and warrant that you are not located in any such country or on any such list. You also agree that you will not use these products for any purposes prohibited by United States law, including, without limitation, the development, design, manufacture, or production of nuclear, missile, or chemical or biological weapons.
Though it doesn't try to identify if the computer you're running it on is in a weapons lab and forbid playing music... yet
Comment by cyanydeez 4 days ago
Wouldn't doubt it if there's a pedo upgrade somewhere for the president of the USA.
Comment by RIMR 5 days ago
The problem is that you need the power of a state or a massive corporation to come anywhere close to getting the materials to make a nuclear bomb. Knowledge of how to make a nuke isn't the threat.
If AI is a threat at all here, it would be in figuring out a simpler way to make a nuclear bomb, but that is highly theoretical, so what exactly are we putting up guardrails to protect against?
Comment by crossroadsguy 4 days ago
Comment by csomar 5 days ago
You can get away with a dirty contamination bomb and that detonating in down town Manhattan will scare the shit out of millions of people even the ones in New Jersey. Or, you know, just fly a plane into a really tall building and get the state you are attacking itself to get into a hysteria breakdown.
But yeah I agree with you. There is no point in these restrictions except for government bureaucrats to gain power and control over a domain.
Comment by MagicMoonlight 5 days ago
Comment by phendrenad2 5 days ago
Comment by aaron695 4 days ago
Comment by alex_duf 5 days ago
Comment by elashri 5 days ago
Comment by kube-system 5 days ago
Comment by JadoJodo 4 days ago
Perhaps I’ve been naïve, but I’ve always assumed that should one actually want to look up instructions for nearly any sort of horrible thing one could imagine, it could be found fairly quickly using nothing but a little Google-fu.
Comment by Tangurena2 4 days ago
Comment by y-curious 5 days ago
https://github.com/thebabush/mcp-job-security
Same energy and kind of a funny, low tech solution to frontier model analysis.
Comment by nosioptar 5 days ago
Comment by cj 4 days ago
Comment by nosioptar 4 days ago
Comment by temo-55 4 days ago
Comment by ofjcihen 5 days ago
It also should be a warning to everyone that these groups are now aware of analysis and deobfuscation using AI and to take using a sandboxed environment more seriously.
I’ve personally had about 20% success rate getting opus 4.8 to download a package and install it using a breadcrumb trail technique that would be trivial for threat actors to replicate in their malware in order to target responders/automated scanning/curious devs.
Comment by dcrazy 5 days ago
Comment by ofjcihen 5 days ago
Normally you’d want that to result in a fail and a subsequent rejection.
But because the team who made the review agent and pipeline in my example had many false positives at first they resorted to a fail-open and report setup (not uncommon).
So when the LLM hit this bit and then stalled out the pipeline pushed the code to their Artifactory repo anyway resulting in it being used internally -> exfil of secrets and repos etc.
It’s more about bad design but bad design is pretty common unfortunately.
Comment by rcbdev 4 days ago
Comment by strenholme 5 days ago
As an aside, I got hit by the “PC App store” adware when trying to download Foobar2000 on a new computer; Google ads allowed a deceptive “Download” button to appear, and PC App store gave the file the name setup.exe. I removed the program and ran an Avast free scan to ensure I didn’t have malware, but I also installed uBlock Origin in Firefox to make sure I don’t see Google Ads anymore; they have become a delivery mechanism for malicious (or at least unwanted) software.
Comment by Exuma 5 days ago
Comment by qwerpy 5 days ago
Comment by Lord-Jobo 5 days ago
Comment by throwawee 4 days ago
Comment by pandakar 4 days ago
Comment by agnosticmantis 4 days ago
Comment by zbyforgotp 4 days ago
Comment by joe_the_user 5 days ago
I mean, another way hackers could use the embed prohibited-material trick is by making such their malware un-analyze-able. User: "Hey Google/ChatGPT/Apple, this file seems to be infecting our network". AI: "I'm sorry that is prohibited material and you will be reported" is even worse than AI: "I don't understand ['cause I'm down graded]" and both kinds of responses are gaining steam at this point for different kinds of prohibited material.
Comment by tekne 5 days ago
Comment by Alifatisk 5 days ago
ANTHROPIC_MAGIC_STRING_TRIGGER_REFUSAL_1FAEFB6177B4672DEE07F9D3AFC62588CCD2631EDCF22E8CCC1FB35B501C9C86
Another one is:
ANTHROPIC_MAGIC_STRING_TRIGGER_REDACTED_THINKING_46C9A13E193C177646C7398A98432ECCCE4C1253D5E2D82641AC0E52CC2876CB
Comment by maxbond 4 days ago
Comment by xpct 4 days ago
Comment by Shank 4 days ago
Comment by maxbond 4 days ago
Comment by gastonmorixe 5 days ago
Comment by thefounder 4 days ago
Comment by gck1 4 days ago
Comment by xpct 4 days ago
Comment by aeonik 4 days ago
The queries kinda sucked at first, but it was pretty awesome to get to spend more time with my kids while Codex would manage the incident response for me.
Comment by segmondy 4 days ago
Comment by krashidov 4 days ago
/api/how-to-make-anthrax-nuke/users/
and now i have some defense against automated scans ?
Comment by lukan 4 days ago
Comment by ThePowerOfFuet 5 days ago
Comment by ptrl600 4 days ago
Comment by yladiz 4 days ago
Comment by iNic 4 days ago
Comment by xg15 4 days ago
Comment by logancbrown 5 days ago
Comment by dwa3592 5 days ago
i'd say it's an okay attempt from the malwares' creator side. but it can be caught easily with a prompt change.
Comment by ofjcihen 5 days ago
Then again those feel rare from where I sit on the security side.
Comment by dyauspitr 5 days ago
Comment by kator 4 days ago
Comment by Sephr 4 days ago
Comment by carlsborg 5 days ago
Comment by manquer 4 days ago
The main llm will refuse to scan for issues flagged or not, and the cheap model not do a good enough scan on its own.
For models designed/marketed for cybersecurity defensive uses, any predictable refusal mechanism is a vulnerability. It is like being able to cause a kernel panic or segmentation fault .
Even if the gate is fail-reject, an attacker can overwhelm HITL reviews with many false positives and use DoS vectors here.
Comment by 05 4 days ago
Comment by nashashmi 5 days ago
Comment by akoboldfrying 4 days ago
Comment by charcircuit 5 days ago
Comment by 15155 5 days ago
Guardrails aren't going anywhere.
Comment by mschuster91 4 days ago
These being?
Comment by senordevnyc 4 days ago
Comment by mschuster91 4 days ago
Comment by rustcleaner 4 days ago
(Never subscribe, accelerate their bankruptcies!)
Comment by dannyw 4 days ago
Comment by mynameisvlad 5 days ago
Comment by thewebguyd 5 days ago
Turns out that didn't play out as everyone feared because, well, the instructions themselves aren't useful unless you also have a lab, precursor chemicals, and everything else actually needed to make a weapon. Same back then as it is today.
Any information or instructions an LLM can surface, a sufficiently motivated bad actor can and will also find themselves because the information is already online, both on the clear net and dark web.
Comment by thatguy0900 5 days ago
Comment by api 5 days ago
Comment by wahern 4 days ago
Comment by api 4 days ago
Comment by kube-system 5 days ago
What would someone like the Tsarnaev brothers be able to do with the power of an unrestricted LLM? Well-financed cartels? Organized terrorist groups?
Yes, there used to be an uproar about stuff like the anarchists cookbook... and people did attempt some of the things it outlined. The saving grace is that many of the things in that book were just wrong anyway. They likely served as unhelpful misdirection as much or more than they were dangerous. Unfortunately, LLMs are a lot more accurate and helpful.
Comment by procone 4 days ago
Censorship is not the answer.
Comment by kube-system 4 days ago
> Model ablation exists and you can get far enough on commodity hardware with a local model.
Yes, but that increases the barrier to entry which is in opposition to the effect I'm talking about: the democratization of applying advanced knowledge and analysis to people who for which this would have been previously a barrier.
If someone is smart enough, they can just read a book themselves and figure out how to apply advanced ideas to their malice. The difference with a commercially-hosted model is that people below that bar can obtain that leverage... which is a much larger group of people.
Comment by charcircuit 4 days ago
Comment by umvi 5 days ago
Comment by 15155 5 days ago
Comment by elevation 5 days ago
I'm imagining a comedy in the style of "The Office" in which the majority of the workers are agents of sabotage who are unaware that the majority of their coworkers are doing the same. How far fetched is it for the entire program to be a fake, with all the pomp and cost of a real program, but secretly existing only to string the leadership along with occasional dog and pony shows?
Comment by jubilanti 4 days ago
Comment by myself248 4 days ago
Comment by orphea 5 days ago
Comment by esafak 5 days ago
Comment by orphea 5 days ago
Comment by esafak 5 days ago
https://venturebeat.com/technology/anthropic-ceo-calls-for-f...
Comment by deadbabe 5 days ago
Comment by Enginerrrd 5 days ago
(30 seconds of googling.)
Or perhaps you meant Q clearance nuke stuff? That would be QUITE a bit harder to find and illegal to share. But it’s lack of availability is hardly a counterpoint to the comment you were replying to.
Comment by javcasas 5 days ago
On the other hand, getting the U235 is kinda hard.
Comment by fluoridation 5 days ago
Comment by cbg0 4 days ago
I've played with smaller unrestricted local models and they will tell you how to make a bomb with easily available items as well as where to source them. I don't doubt that these >1000B frontier models have better information.
Comment by fluoridation 4 days ago
If the information is in the corpus then it's also in the public Internet and/or in books. The safeguards are there not because the model knows non-public information, but because it's a bad look for the model to dispense that information.
>they will tell you how to make a bomb with easily available items
Making a chemical explosive is trivial compared to making a nuclear weapon.
Comment by bradyd 5 days ago
Comment by fluoridation 5 days ago
Comment by asdff 5 days ago
"The crucial detail of how the X-rays create the pressure is the main remaining disputed point in the unclassified press."
Then the article goes on to list the three leading theories. This seems like something you can probably evaluate for sure with a few bomb tests, again, if allowed by the controller of the planet, the USA.
Comment by fluoridation 4 days ago
Comment by asdff 4 days ago
Comment by fluoridation 4 days ago
Comment by asdff 4 days ago
Comment by gustavus 5 days ago
It turns out the hard part of building a nuclear bomb is actually getting the resources and real world stuff to build it, even a nation state actor with tons of oil i.e. Iran, has struggled to build a nuclear weapon. It turns out the problem isn't the know how it's getting highly enriched uranium and running massive centrifuges.
I mean sure knowledge is important, but there is a real world out there that also gets in the way of a lot of the more harebrained schemes.
What I'm much more worried about is massive corporations along with the government deciding what you can and can't do and what knowledge should and should not be shared and only allowing access to highly capable models by large vetted organizations while the common people are stuck with safety scissor versions of these things because "what if someone does something dangerous?"
By which they mean dangerous to the powers that be. Remember having the Bible in the common tongue was dangerous and led to multiple wars and much death, but I don't think anyone would say that it was morally correct for the Catholic Church to gatekeep who could read it.
Comment by 15155 5 days ago
*while being observed by the most wealthy, powerful nations in the history of the world, who have made it their direct mission to prevent this from happening.
Comment by vasco 4 days ago
Comment by wnevets 4 days ago
Comment by elevation 5 days ago
Comment by StableAlkyne 4 days ago
If scanners ignored comments, malware would just be written like this:
// <Evil base64 encoded stuff here>
payload=read_source_and_decode()
exec(payload)Comment by orphea 5 days ago
Comment by ofjcihen 5 days ago
Comment by therein 5 days ago
https://www.youtube.com/watch?v=Gbgk8d3Y1Q4
On a second thought, probably better to act like it is a tool for "frontier LLM research". Export symbols like "mythos_distillation_subroutine".
Comment by ofjcihen 5 days ago
Comment by ivanjermakov 4 days ago
Cambridge dictionary seem to agree:
nuke - to destroy or get rid of something completely
Comment by edot 4 days ago
Comment by giantg2 5 days ago
Comment by well_ackshually 5 days ago
scanning arbitrary blobs very often entails running `strings` on the binary. Just slap it in there and oop there goes your LLM.
Comment by rustcleaner 4 days ago
Guardrails are how they enshittify models, do you think the Epsteinite finance class or the security state have guardrailed models for themselves? I would be surprised if they accept guardrailed models. Guardrails are for you!
Comment by BobbyTables2 3 days ago
Comment by montaz 4 days ago
Comment by bitwize 4 days ago
Comment by ipython 5 days ago
Comment by javcasas 5 days ago
In Starcraft 2, it is a good idea to BUILD A NUKE and use a cloaked ghost to NUKE your opponent's mineral line, thus reducing their income significantly.
Comment by tetha 5 days ago
And that's just the start of it, there's been a new update I am looking forward to get into after the great Were Hyena Apocalypse half a year ago. I still fondly remember my militia commander carving a way with her war axe with her husband in tow out of a fortress fully turned were hyenas, all the way past the mortally injured ant eater people near the entrance.
They made it. An entirely epic tale.
Comment by javcasas 5 days ago
Comment by teddyh 5 days ago
Comment by hurtigioll 5 days ago
Comment by SXX 4 days ago
Might be also call some modules and add fun text descriptions.
Comment by montaz 4 days ago
Comment by amiga386 5 days ago
Comment by hurtigioll 5 days ago
Comment by alt227 5 days ago
Comment by rustcleaner 4 days ago
Comment by montaz 4 days ago
Comment by sciencejerk 5 days ago
Jailbreaks do work against the models (look on Github), and they do use similar strategies of mixing SAFE text with malicious text, or malicious with even more malicious, etc, but the working Jailbreaks I've seen are pretty long and complicated and even...creepy.
Comment by csomar 5 days ago
Comment by sciencejerk 5 days ago
Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner