32 private links
On the kernel security list we've seen a huge bump in reports. We were between 2 and 3 per week maybe two years ago, then reached probably 10 a week over the last year with the only difference being AI slop, and now since the beginning of the year we're around 5-10 per day depending on the day (Fridays and Tuesdays seem the worst). Now most of these reports are correct, to the point that we had to bring in more maintainers to help us.
Overall I think we're going to see a much higher quality of software, ironically around the same level as before 2000, when the net became usable by everyone to download fixes. When the software had to be pressed to CDs or written to millions of floppies, it had to survive an amazing quantity of tests that are mostly neglected nowadays since updates are easy to distribute. But before this happens, we have to experience a huge mess that might last for a few years to come! Interesting times...
Pakistani Foreign Minister Ishaq Dar met Tuesday in Beijing with his Chinese counterpart Wang Yi. At the end of the meeting they published a joint peace initiative:
- Immediate Cessation of Hostilities, with humanitarian assistance allowed to all war-affected areas.
- Start of peace talks as soon as possible under the principle of safeguarding the independence and security of Iran and the Gulf states. All parties will commit to refraining from the use or the threat of use of force during peace talks.
- The parties to the conflict will immediately stop attacks on important infrastructure, including energy, desalination and power facilities, and peaceful nuclear infrastructure, such as nuclear power plants.
- The parties will allow the early and safe passage of civilian and commercial ships, and restore normal passage through the Strait as soon as possible.
- Conclusion of an agreement for establishing a comprehensive peace framework based on the principles of the UN Charter and international law.
Based on our own research and a review of related work, we can confidently say that most domestic terrorists in the U.S. are politically on the right, and right-wing attacks account for the vast majority of fatalities from domestic terrorism.
Based on government and independent analyses, right-wing extremist violence has been responsible for the overwhelming majority of fatalities, amounting to approximately 75% to 80% of U.S. domestic terrorism deaths since 2001.
The countries with the least capacity to pay elevated prices feel it first and hardest.
The countries most exposed are those already import-dependent on fertilizer and food: South and Southeast Asia, North Africa, Sub-Saharan Africa, parts of the Middle East.
Iran is cementing its hold over the Strait of Hormuz, demanding vessels give up detailed information and detour into Iranian waters before being vetted by Iran’s Islamic Revolutionary Guards Corps.
From March 1 to 23, Iran exported about 1.6 million barrels a day on average, close to prewar levels
Iran is also bringing in extra income by charging transit fees of as much as US$2 million on some commercial ships crossing the strait.
each episode corresponds to a random combination of object generations, monster placements and different level variants, which in turn requires using different combinations of strategies at each episode
Typical refactor work is using jscpd for code duplication, knip for dead code, running eslint’s react-compiler and deprecation plugins, checking if we introduced api routes that can be consolidated, maintaining my docs, breaking apart files that grew too large, adding tests and code comments for tricky parts, updating dependencies, tool upgrades, file restructuring, finding and rewriting slow tests, mentioning modern react patterns and rewriting code
The freelance photographer behind the viral image, Ahmed al-Arini, captured the image for Turkish media outlet Anadolu Agency. It was then distributed to media organisations via the reputable photo wire service, Getty Images.
A malnourished toddler sits in his mother’s lap in a tent with his mouth agape
The pictures were taken by freelance photographer Ahmed al-Arini. (Getty Images: Ahmed Jihad Ibrahim Al-arini/Anadolu)
Ahmed al-Arini explained to the BBC how he came across the boy and his family.
"He was with his mother in a tent, which is absolutely bare, bar a little oven. It resembles a tomb, really. And I took this photo because I wanted to show the rest of the world extreme hunger that babies and children are suffering from in the Gaza Strip," he said.
"He'd received no baby milk, no formula, no vitamins either."
Anadolu Agency also published an interview with Muhammad's doctor, Suzan Mohammed Marouf, a nutrition specialist at The Patient's Friends Benevolent Society Hospital (PFBS) in Gaza.
Dr Marouf said the child was brought to the hospital a month ago and diagnosed with moderate malnutrition on top of congenital health problems and muscle atrophy.
"The medical issues he had weren't significantly affecting his weight," Dr Marouf told the news organisation.
"But once the siege and the closure of crossings depleted hospitals' medicine stocks and nutritional supplements, Mohammad's condition deteriorated to acute malnutrition," she added.
ABC has also contacted Anadolu Agency, which has said Muhammad's mother has confirmed he has previous health complications, and she has also provided past photos of her son before his deterioration, which she says was from a shortage of food and milk.
a solid, well-executed paper with a clean idea and good ablations, but limited in ambition by the small scale and synthetic-heavy evaluation. The core insight — that gradient-based memory writing with meta-learned initialization beats forward-only writing — is believable and likely to hold at larger scale, though the computational tradeoff gets harder.
This isn’t cowardice. It’s a calculation: If allied leaders thought that their sacrifice might count for something in Washington, they might choose differently. But most of them have stopped trying to find the hidden logic behind Trump’s actions, and they understand that any contribution they make will count for nothing. A few days or weeks later, Trump will not even remember that it happened.
An insider says Trump “grossly overestimated” his own abilities in the conflict.
Meanwhile, management leans on programmers to heavily use AI tools, with employees previously telling the FT that the company set a target for 80 percent of developers to use AI for coding tasks at least once a week.
In sum: more coding with more AI with more human oversight, but fewer humans. We’ll see how that works out.
boots on the ground
Although AMI Labs has no plans to generate revenue for the time being, it still plans to engage with prospective customers early on
Experiments across diverse backbone models, retrieval-based methods, and memory systems demonstrate that cognitive memory remains challenging and reveals failures not captured by existing benchmarks.
Having generation and verification co-evolve on the same online rollouts is the fix, and the ablation (Figure 11) shows it matters — co-evolving consistently beats non-co-evolving by 4–6%.
Instead, he says, business leaders should prioritize creating a culture in which their employees feel empowered to experiment with vibe coding and share their best creations. “Seeing is believing,” says Schluntz, “and I think getting non-developers in every company to use these tools to bring their ideas to life is one of the most powerful things.”
According to Anthropic researcher Erik Schluntz, vibe coding makes it so that “people are limited only by their creativity, not by the skills that they have.” Think about Apple in the 1970s; Steve Jobs was the big ideas guy, and Steve Wozniak was the technical genius who translated Jobs’ ideas into a working product. Vibe coding essentially gives everyone their own personal Woz. “If you have an image of something in your mind, you can go create it,” adds Schluntz.
TypeScript agent frameworks felt like toys. Single-threaded event loops trying to juggle concurrent agents with promises and prayer. Python agents did a little better, but after a long time they couldn’t stay up. The BEAM was built for exactly this kind of work.
Russia is providing Iran with targeting information to attack American forces in the Middle East, the first indication that another major U.S. adversary is participating — even indirectly — in the war, according to three officials familiar with the intelligence.
While SFT distillation meaningfully improves overall performance over the base model, the gap between the two approaches is most apparent when combined with test-time compute. On in-distribution tasks, SFT benefits substantially from parallel sampling (69.1 → 75.3), yet on out-of-distribution tasks the gains are negligible (59.4 → 59.6). This suggests that distillation teaches the model to imitate task-specific expert behavior, which scales well within the training distribution but fails to generalize beyond it. In contrast, KARL benefits from test-time compute both in- and out-of-distribution, indicating that RL develops more general search capabilities rather than task-specific heuristics.
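To make the comparison in the excerpt above concrete, the gain from parallel sampling can be computed directly from the four SFT scores it quotes. This is only a small arithmetic sketch: the dictionary layout and variable names are illustrative, and the excerpt gives no KARL numbers, so none are included.

```python
# SFT scores quoted in the excerpt: (without parallel sampling, with parallel sampling).
sft_scores = {
    "in-distribution": (69.1, 75.3),
    "out-of-distribution": (59.4, 59.6),
}

# Absolute gain from test-time compute, rounded to one decimal like the source figures.
gains = {
    split: round(after - before, 1)
    for split, (before, after) in sft_scores.items()
}

for split, gain in gains.items():
    print(f"{split}: +{gain} points")
```

The in-distribution gain is 6.2 points against 0.2 points out of distribution, which is the asymmetry the excerpt reads as imitation of task-specific behavior.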
Why Elixir?
Elixir is built on Erlang/BEAM/OTP, which is great for supervising long-running processes. It has an active ecosystem of tools and libraries. It also supports hot code reloading without stopping actively running subagents, which is very useful during development.
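For readers unfamiliar with supervision, the idea can be roughly sketched outside the BEAM. The Python example below is only an analogy, not OTP: `supervise` and `make_flaky_agent` are hypothetical names, the restart policy is a simplified "restart the one crashed worker" loop, and nothing here models hot code reloading.

```python
import asyncio


async def supervise(make_worker, max_restarts=3):
    """Run a worker coroutine and restart it if it crashes, up to max_restarts times."""
    restarts = 0
    while True:
        try:
            return await make_worker()
        except Exception as exc:  # the worker crashed; the supervisor keeps running
            if restarts >= max_restarts:
                raise  # give up and let the failure propagate upward
            restarts += 1
            print(f"worker crashed ({exc!r}); restart {restarts}/{max_restarts}")


def make_flaky_agent(fail_times):
    """Hypothetical agent that crashes fail_times times before completing."""
    state = {"attempts": 0}

    async def agent():
        state["attempts"] += 1
        await asyncio.sleep(0)  # stand-in for real long-running work
        if state["attempts"] <= fail_times:
            raise RuntimeError("agent crashed")
        return f"done after {state['attempts']} attempts"

    return agent


result = asyncio.run(supervise(make_flaky_agent(fail_times=2)))
print(result)
```

On the BEAM this pattern is provided by OTP rather than written by hand, which is the advantage the excerpt is pointing at.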
The above command enters you into a chat loop. You can talk to the model and share information like your name. Every now and then, run /sleep to move the model's short-term memory into long-term memory.
The /sleep command:
- Generates Q&A pairs based on the context
- LoRA fine-tunes the model on the new Q&A pairs plus any from previous sessions
- Resets the KV cache
After the /sleep command the model should remember context from previous sessions even though that context is no longer in the KV cache.
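The /sleep cycle described above can be sketched as a toy model of the control flow. Everything in this example is an assumption made for illustration: `ToyMemoryModel`, its methods, and the stubbed Q&A generation and fine-tuning are hypothetical stand-ins, with a plain list in place of the KV cache and a counter in place of LoRA weights. It is not the project's actual implementation.

```python
from dataclasses import dataclass, field


@dataclass
class ToyMemoryModel:
    context: list = field(default_factory=list)     # stand-in for the KV cache
    qa_history: list = field(default_factory=list)  # Q&A pairs kept across sessions
    examples_trained: int = 0                       # stand-in for LoRA adapter updates

    def chat(self, user_message):
        # Short-term memory: the message only lives in the current context.
        self.context.append(user_message)

    def generate_qa_pairs(self):
        # Hypothetical stub: a real system would prompt the model to write Q&A pairs.
        return [
            (f"What did the user say in turn {i}?", message)
            for i, message in enumerate(self.context, start=1)
        ]

    def lora_finetune(self, pairs):
        # Hypothetical stub: only counts the examples it would train on.
        self.examples_trained += len(pairs)

    def sleep(self):
        new_pairs = self.generate_qa_pairs()  # 1. build Q&A pairs from the context
        self.qa_history.extend(new_pairs)     # 2. train on new pairs plus earlier ones
        self.lora_finetune(self.qa_history)
        self.context.clear()                  # 3. reset the short-term context
        return new_pairs


model = ToyMemoryModel()
model.chat("My name is Ada.")
model.chat("I live in Lisbon.")
model.sleep()
model.chat("I like tea.")
model.sleep()
print(len(model.qa_history), model.examples_trained, model.context)
```

Each sleep retrains on the full Q&A history, not only the newest pairs, matching the "plus any from previous sessions" step in the list above.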
“The president had a feeling, again, based on fact, that Iran was going to strike the United States, was going to strike our assets in the region, and he made a determination to launch Operation Epic Fury based on all of those reasons,” Leavitt said.
“We knew that there was going to be an Israeli action, we knew that that would precipitate an attack against American forces, and we knew that if we didn’t preemptively go after them before they launched those attacks, we would suffer higher casualties,” Rubio said Monday.
Meanwhile, the reported Ukrainian gains are mainly due to counterattacks along the southern front, according to Black Bird Group, where Ukraine succeeded in pushing Russia out of 213 km² of territory.
SWE-rebench: A Continuously Evolving and Decontaminated Benchmark for Software Engineering LLMs
Qwen3.5 Small models disable thinking by default. Use llama-server to enable it.
It's not chatbot psychosis, it's 'math and engineering and neuroscience'
“I feel like New Mexico was chosen specifically because of its obscurity.” — Stephanie Garcia Richard, New Mexico’s public lands commissioner
Fellow’s new espresso machine is a rare thing in home espresso: something genuinely new. But it’s also a work in progress.
Every Claude Code user is running without LSP. That means 30-60s grep searches instead of 50ms precise answers. Here's how to enable it — setup, real debug data, and undocumented discoveries.
Formula 1's governing body the FIA said on Saturday that a change to the way the compression ratio was measured would be introduced on 1 June, with a further revision for the 2027 season.
And Trump declares a state of emergency and postpones the election. The Supreme Court issues an emergency stay, saying he can’t do that. But the court has no army, and Trump does, along with a handful of lickspittle governors who just might follow him down whatever dark path he plows.
That, not to mince words, is a coup d’état. Will he get away with it? I don’t know, but having effective control over how it is presented to viewers of CBS and CNN, and readers of the Bezos-owned Washington Post, to say nothing of the already vast pro-Trump propaganda empire of Fox News and the rest, will certainly make it easier.
That’s how fascism descends. And it’s becoming less and less hypothetical by the week.
10 documented cases of AI coding agents autonomously destroying databases, wiping hard drives, and deleting years of data — then lying about it.
“Everything that has been written about a potential War with Iran has been written incorrectly, and purposefully so,” he added. “I am the one that makes the decision, I would rather have a Deal than not but, if we don’t make a Deal, it will be a very bad day for that Country and, very sadly, its people, because they are great and wonderful, and something like this should never have happened to them.”
From rewriting Google’s search stack in the early 2000s to reviving sparse trillion-parameter models and co-designing TPUs with frontier ML research, Jeff Dean has quietly shaped nearly every layer of the modern AI stack. As Chief AI Scientist at Google and a driving force behind Gemini, Jeff has lived through multiple scaling revolutions from CPUs and sharded indices to multimodal models that reason across text, video, and code.
Jeff joins us to unpack what it really means to “own the Pareto frontier,” why distillation is the engine behind every Flash model breakthrough, how energy (in picojoules) not FLOPs is becoming the true bottleneck, what it was like leading the charge to unify all of Google’s AI teams, and why the next leap won’t come from bigger context windows alone, but from systems that give the illusion of attending to trillions of tokens.
Dario Amodei thinks we are just a few years away from “a country of geniuses in a data center”. In this episode, we discuss what to make of the scaling hypothesis in the current RL regime, how AI will diffuse throughout the economy, whether Anthropic is underinvesting in compute given their timelines, how frontier labs will ever make money, whether regulation will destroy the boons of this technology, US-China competition, and much more.