AI Intelligence Brief

Fri 26 June 2026

Daily Brief — Curated and contextualised by Best Practice AI

196Articles
Editor's pickSummary

Governments Plan Retraining, Samsung Pours Billions, and Apple Hikes Prices

TL;DRA coalition of employers and state governments is launching a national strategy to manage AI-driven labor displacement. Meanwhile, Samsung and SK Hynix are preparing massive capital expenditures to address memory shortages. Apple has responded to these supply chain pressures by raising MacBook and iPad prices by 20%, contributing to a $263 billion loss in market capitalization. Separately, the Trump administration has requested that OpenAI stagger its model releases to mitigate potential security risks.

Editor's highlights

The stories that matter most

Selected and contextualised by the Best Practice AI team

11 of 196 articles
Lead story
Editor's pickPAYWALLEducation
WSJ· Yesterday

The New Push to Ready Millions for AI Career Upheaval

A coalition of employers and state governments says it is developing a sweeping strategy to help workers respond to the AI age.

Editor's pickPAYWALLTechnology
WSJ· Yesterday

ON Semiconductor to Buy Synaptics in All-Stock Deal With $7 Billion Enterprise Value

ON Semiconductor said Synaptics’s AI compute platform, human-machine interface technology, and connectivity solutions would help it meet demand for increasingly capable AI solutions that can interact with the physical world.

Editor's pickPAYWALLTechnology
Bloomberg· Today

SemiAnalysis Xie on Asia AI Supply Chain

Myron Xie, AI supply chain research lead at SemiAnalysis, explains who, within Asia's hardware suppliers, are standing out as the winners in the market's surge in AI demand. He speaks with Shery Ahn on Bloomberg Tech: Asia. (Source: Bloomberg)

Editor's pickPAYWALLTechnology
Bloomberg· Today

Samsung, SK Hynix Reportedly Preparing Huge AI Spending Push

Samsung Electronics Co. and SK Hynix Inc. are preparing to announce hundreds of billions of dollars worth in new investments on Monday, according to South Korean media reports this week.

Editor's pickPAYWALLTechnology
Bloomberg· Yesterday

Trump Administration Asks OpenAI to Stagger AI Model Release

The Trump administration has asked OpenAI to stagger the release of an upcoming powerful artificial intelligence model, according to a person familiar with the matter, nearly two weeks after rival Anthropic PBC suspended its most capable offerings from the market under regulatory pressure.

Editor's pickTechnology
Arxiv· Today

The Shift to Agentic AI: Evidence from Codex

arXiv:2606.26959v1 Announce Type: new Abstract: We analyze usage data from OpenAI's Codex tool to present large-scale evidence of how agentic AI technology, which can take actions on a user's behalf, changes how people work. We use an automated, privacy-protecting pipeline to contrast usage across three populations: external personal-account users, external organizational-account users, and workers within OpenAI. We find that agentic AI usage is growing rapidly: the number of active users has grown more than fivefold in the first half of 2026, with the most rapid increase occurring outside the initial audience of software developers. Uptake is uneven: within OpenAI, Codex usage is nearly universal and has largely replaced business usage of ChatGPT. We document a similar shift to agentic tooling outside OpenAI, particularly within organizations, although external adoption remains lower and more uneven. In addition to headline usage figures, we observe measures of sophistication, and find that a growing number of users have used Codex to change their workflows substantially. More than 10% of users manage three or more concurrent Codex agents at some point each week and that 26.6% use skills, which allow users to share instructions for complex workflows. Alongside these changes in usage practices, request complexity has increased: since the start of the year, the share of individual Codex users who submit at least one request for a task estimated to require more than eight hours for an experienced human to complete has increased nearly tenfold. Concurrently, output has grown rapidly -- in June 2026, the median OpenAI employee in a legal role generated 13 times more monthly output tokens across Codex and ChatGPT than they did in November 2025, while the median researcher generated more than 50 times as many. We conclude by discussing the implications of these patterns for productivity, job reorganization, and workforce restructuring.

Editor's pickEnergy & Utilities
Arxiv· Today

How Do Tool-Augmented LLM Agents Perform on Real-World Energy Analytics Tasks?

arXiv:2606.26346v1 Announce Type: new Abstract: Agentic benchmarks have emerged across general-purpose and domain-specific settings, including finance, coding, law, and drug discovery, yet energy-domain evaluations remain largely limited to static knowledge recall. This is a critical gap for a sector that requires live data retrieval, specialized regulatory and market knowledge, and multi-step quantitative reasoning under real-world constraints. We present an empirical study of tool-augmented LLM agents on real-world energy market analytics tasks. Our evaluation environment includes 243 expert-curated problems across three categories: (1) Market Data Retrieval and Analysis, (2) Knowledge Retrieval and Interpretation, and (3) Advanced Quantitative Modeling and Decision Analytics. Tasks include price and demand analysis, tariff impact modeling, asset revenue and returns estimation, hedging strategy analysis, and optimization modeling, with problems spanning multiple difficulty levels. Agents are equipped with a configurable suite of domain tools, including live electricity market APIs for major U.S. ISOs, regulatory docket search, utility tariff databases, asset optimization models, and retrieval-augmented generation over energy market documents. We assess agent responses using a multi-dimensional evaluation protocol that scores approach correctness, answer accuracy, attribute alignment, and source validity, with category-aware routing to match scoring criteria to question type. We evaluate both closed-source and open-source LLMs, providing a comparative analysis of how model capability and domain tooling interact in a high-stakes professional domain. Key artifacts are publicly released to support reproducibility and future research.

Editor's pickMedia & Entertainment
Arxiv· Today

Dream machine -- the next creative economy

arXiv:2606.26114v1 Announce Type: new Abstract: We examine the structural transformation of creative industries under generative artificial intelligence, drawing on 374 primary sources spanning policy documents, industry data, creator surveys, and platform analytics. Beginning with the December 2024 release of OpenAI's Sora video model as a watershed event, we trace the historical pattern of creative resistance to technological disruption, then develop an analytical framework -- the Human-AI Agency Continuum for mapping the spectrum of human and machine collaboration in creative work. We present evidence for the "slop ceiling," an audience-imposed quality threshold that constrains AI-generated content to approximately 1--3% of platform streams despite comprising 44% of uploads. Analysis of the UK Government's 2025 consultation on AI and copyright (over 11,500 responses, 88% opposing expanded AI training rights) reveals deep structural tensions between technology firms and creative workers. We investigate how major studios, from Disney's $1 billion OpenAI investment to Netflix's AI-native animation unit, are positioning for an AI-augmented production pipeline. The work covers coordination collapse in creative supply chains, the emergence of new professional roles such as prompt engineers and AI orchestrators, and proposes four principles for navigating the transition: transparency, consent, compensation, and human-centred design. Eight appendices provide quantitative analysis, a glossary, topical bibliography, and deep dives into shadow AI adoption, AI stigma, and algorithmic intent.

Editor's pickProfessional Services
Arxiv· Today

The Open Source Economic Index of AI Adoption and Capability

arXiv:2606.26118v1 Announce Type: new Abstract: We work towards measuring both AI adoption and the capability of AI to perform discrete labor tasks across various occupations. To measure adoption, we develop an open-source economic index that uses publicly available user-LLM chat data and O*NET tasks to replicate studies produced by frontier AI labs, finding that occupations in the finance, computer science, and arts sectors are those with the highest adoption rates. To measure capabilities, we build a system that generates benchmark scenarios grounded in O*NET occupations, tasks, and model-context-protocol (MCP) servers. We test Kimi-k2.5 with an OpenAI agents SDK harness on scenarios across 9 occupations that appear frequently in our index, finding that AI correctly executes high-level workflows but often errs in the granular details (such as specific tool calls used).

Editor's pickPAYWALLTechnology
FT· Yesterday

Apple increases MacBook and iPad prices by 20%

iPhone maker loses $263bn in market capitalisation, blames costs on AI-driven memory shortage

Editor's pickPAYWALLTechnology
NYT· Yesterday

Chinese A.I. Models Gain Ground on Anthropic and OpenAI

Silicon Valley engineers recently flocked to new technology from a Chinese company, Z.ai, that is almost as good as its American competitors but much cheaper.

Economics & Markets

56 articles
AI Business Models2 articles
Editor's pickMedia & Entertainment
Arxiv· Today

Dream machine -- the next creative economy

arXiv:2606.26114v1 Announce Type: new Abstract: We examine the structural transformation of creative industries under generative artificial intelligence, drawing on 374 primary sources spanning policy documents, industry data, creator surveys, and platform analytics. Beginning with the December 2024 release of OpenAI's Sora video model as a watershed event, we trace the historical pattern of creative resistance to technological disruption, then develop an analytical framework -- the Human-AI Agency Continuum for mapping the spectrum of human and machine collaboration in creative work. We present evidence for the "slop ceiling," an audience-imposed quality threshold that constrains AI-generated content to approximately 1--3% of platform streams despite comprising 44% of uploads. Analysis of the UK Government's 2025 consultation on AI and copyright (over 11,500 responses, 88% opposing expanded AI training rights) reveals deep structural tensions between technology firms and creative workers. We investigate how major studios, from Disney's $1 billion OpenAI investment to Netflix's AI-native animation unit, are positioning for an AI-augmented production pipeline. The work covers coordination collapse in creative supply chains, the emergence of new professional roles such as prompt engineers and AI orchestrators, and proposes four principles for navigating the transition: transparency, consent, compensation, and human-centred design. Eight appendices provide quantitative analysis, a glossary, topical bibliography, and deep dives into shadow AI adoption, AI stigma, and algorithmic intent.

AI Investment & Valuations20 articles
Editor's pickPAYWALLTechnology
WSJ· Yesterday

ON Semiconductor to Buy Synaptics in All-Stock Deal With $7 Billion Enterprise Value

ON Semiconductor said Synaptics’s AI compute platform, human-machine interface technology, and connectivity solutions would help it meet demand for increasingly capable AI solutions that can interact with the physical world.

Editor's pickTechnology
Reuters· Yesterday

Micron overtakes Meta, Tesla in market value amid relentless AI infrastructure demand

The Australian airline studied details from nutrition and ergonomics to movement and light.

Editor's pickPAYWALLFinancial Services
Bloomberg· Today

SoftBank Shares Tumble After Report of OpenAI’s IPO Delay

SoftBank Group Corp.’s stock fell as much as 13% on concerns that OpenAI may hold off on an initial public offering until next year and delay returns for its Japanese backer.

Editor's pickTechnology
Reuters· Yesterday

Micron and Qualcomm forecasts ignite $400 billion AI chip stock rally

The Australian airline studied details from nutrition and ergonomics to movement and light.

Editor's pick
Exponentialview· Yesterday

🔮 The state of the AI economy

We've reconstructed the AI economy from the bottom up

Editor's pickTechnology
Fortune· Yesterday

What bubble? JPMorgan says the $5.5 trillion AI capex explosion is profitable–for now

JPMorgan’s midyear outlook argues the hyperscalers are profitable, the debt markets are holding, and the cycle has room to run.

Editor's pickPAYWALLFinancial Services
Bloomberg· Today

State Street: Asia May Win in AI Boom

Ninghui Liu, Head of APAC Investment Strategy and Research at State Street Investment Management, discusses why he thinks Asia may be the biggest beneficiary in the global AI boom. He speaks with Shery Ahn on Bloomberg Tech: Asia. (Source: Bloomberg)

Editor's pickTechnology
Theregister· Yesterday

Amazon pours another $13B into India's AI and cloud infrastructure

Mumbai and Hyderabad datacenter expansion forms part of broader $48B five-year investment pledge

Editor's pickPAYWALL
FT· Today

Global markets fall as investors fret about AI demand

Apple’s price rises and fears over delay to OpenAI’s IPO send Asian bourses sharply lower

Editor's pickPAYWALLTechnology
Bloomberg· Yesterday

China’s Hardware Tech Stocks Look to Earnings to Sustain Rally

Chinese hardware technology stocks have been on a tear. The next challenge is showing the earnings to back it up.

Editor's pickTechnology
Fortune· Yesterday

Micron drives global rally tech stock rally as traders abandon their fear of an AI bubble | Fortune

Everything you need to know before you reach the office this morning.

Editor's pickEnergy & Utilities
ETF Trends· Yesterday

Why AI Spending Is Favoring Industrials & Utilities | ETF Trends

State Street's Matt Bartolini says industrials and utilities are collecting the first dollars of AI spending while tech valuations stretch.

Editor's pickTechnology
Intellectia.AI· Yesterday

AI Stocks Selloff June 2026: Nasdaq Falls 2.2% as Tech Giants Tumble

The artificial intelligence investment boom that has driven markets to record highs in 2026 is facing its most significant test yet. On June 24, 2026, the Nasdaq Composite plunged 2.21% to 25,587 while the S&P 500 dropped 1.44% to 7,365 as a deepening AI stock selloff rattled Wall Street for ...

Editor's pickTechnology
NPR· Yesterday

Despite AI bubble fears, memory chip makers work to fill insatiable demand : NPR

Markets have been on edge about the AI investment boom, but earnings from the biggest U.S. memory chip maker, Micron, signal no end in sight to demand for the microchips at the heart of it all.

Editor's pickFinancial Services
Forbes· Yesterday

Council Post: GPUs Are Becoming The New Infrastructure Asset Class

Much like real estate, telecom towers and data centers evolved into institutional investment categories over time, and AI compute infrastructure may gradually follow the same path. The AI economy is no longer being built only with software. It is increasingly being built with physical capacity.

Editor's pickTechnology
Artificial Intelligence Newsletter | June 26, 2026· Yesterday

Onsemi to acquire Synaptics in all-stock deal, gives Synaptics $7bn enterprise value

Onsemi has agreed to acquire Synaptics in an all-stock deal valued at $7 billion, aiming to expand its capabilities into intelligent systems through Synaptics' Edge AI and connectivity portfolio.

Editor's pick
Substack· Yesterday

The Generative AI Fizzle™ - by Gary Marcus - Marcus on AI

Yesterday I coined a new term, the Generative AI Fizzle™. There is no doubt in my mind that a lot of things in AI are overvalued, but that doesn’t mean they will deflate at all once. They might, but maybe what we will see is more like a slow decline, as investors lose enthusiasm for the ratio between hype (insane) and profits (not that impressive thus far).

Editor's pickTechnology
Investorideas· Yesterday

The AI Stock Story Roars on with Micron Technology (Nasdaq:MU) Earnings | InvestorIdeas

Micron Technology (Nasdaq:MU) reports record fiscal Q3 2026 results: revenue $41.46B, GAAP net income $28.24B, stock hits new 52-week high of $1,255.00 on AI memory demand surge.

Editor's pickTechnology
Bebeez· Yesterday

Dubai Holding could buy stake in data center firm Hscale – report

Dubai Holding could take a stake in hyperscale data center developer Hscale as it looks to grow its presence in Europe. The state-backed investment firm, which is owned by Dubai’s ruler Sheikh Mohammed bin Rashid al-Maktoum and has a portfolio worth $136 billion, has hired advisors to investigate the possibility of buying part of Hscale, […]

Editor's pickTechnology
StartupHub.ai· Yesterday

Applied Materials and Lam Research lead chip equipment surge on Micron's blowout quarter, SOXX +3.94% | StartupHub.ai

Micron Technology's third-quarter beat drove chip equipment stocks to their biggest single-day gains in months, with Applied Materials surging 13% and Lam Resea

AI Market Competition11 articles
Editor's pickPAYWALLTechnology
NYT· Yesterday

Chinese A.I. Models Gain Ground on Anthropic and OpenAI

Silicon Valley engineers recently flocked to new technology from a Chinese company, Z.ai, that is almost as good as its American competitors but much cheaper.

Editor's pickPAYWALLTechnology
FT· Yesterday

Competition intensifies for Anthropic and OpenAI ahead of IPOs

Renewed challenge from open-source models raises stakes on AI labs to make their case

Editor's pickPAYWALL
Washington Post· Yesterday

How a New York race became the first front in the AI industry’s midterm war - The Washington Post

Competing AI interests poured millions into the Democratic primary, a taste of more spending to come this fall and elections beyond.

Editor's pickConsumer & Retail
Arxiv· Today

Divergent Recommendations, Convergent Diagnoses: Cross-Provider Failure-Mode Convergence in AI Commercial Recommendation

arXiv:2606.26116v1 Announce Type: new Abstract: A brand whose customers use both ChatGPT and Claude for product recommendations faces a strategic choice: a single optimization playbook, or one per provider? Across 215 commercially-framed prompts in four measurement batches, the two providers disagree on which brands they recommend roughly two-thirds of the time (cross-provider recommendation Jaccard 0.35, below the 0.50-0.61 same-prompt rerun baseline). The picks diverge. But when neither provider recommends a brand, we classify the failure into one of three modes -- discoverability (the brand never reaches the model), compellingness (it reaches the model but isn't mentioned), or positioning (it's mentioned but not recommended) -- and on 7,763 such joint failures, both providers diagnose the same failure mode 95.1% of the time (clustered 95% CI [94.3%, 95.7%]). Agreement rises monotonically with falling brand prominence, from 81% [78.2%, 84.0%] on category leaders to 99.6% [99.3%, 99.9%] on long-tail regional brands. The two providers reach their picks by measurably different generative routes -- Anthropic recommends from priors 43-52% of the time, OpenAI 8-29% -- but they converge on the failure diagnosis where it matters most for the long tail. Work that addresses the diagnosed failure mode lifts visibility on both providers; positioning - and content-level work for category leaders is more provider-specific.

Editor's pickPAYWALLTechnology
Bloomberg· Today

Apple Price Hikes Spark Asia Tech Selloff on Memory Cost Concern

Asian technology stocks slumped after Apple raised prices for its products, stoking concern that rising component prices will curb demand for devices and eventually slow the memory chip rally that has powered much of the AI trade.

Editor's pickConsumer & Retail
Fortune· Yesterday

Gas station owners have found a use case for AI, lawsuit says: colluding to fix prices

A new lawsuit claims AI pricing software helped Marathon, BP, and Circle K fix gas prices across 1,700 California stations.

Editor's pickTechnology
ETTelecom.com· Yesterday

Qualcomm Projects $15 Billion in Data Center Chip Sales by 2029, Boosting AI Chip Market Competition, ETTelecom

Qualcomm anticipates significant growth in its data center business, projecting $15 billion in sales by 2029, driven by new AI chips and major clients like Microsoft and Meta. Discover how Qualcomm is diversifying beyond smartphone chips and competing in the AI-centric chip market.

Editor's pickTechnology
Artificial Intelligence Newsletter | June 25, 2026· 2 days ago

Microsoft-Inflection AI deal set for Brazil CADE fast-track review

Microsoft has formally notified Brazil’s competition authority of its 2024 partnership with Inflection AI, following a determination by the agency’s Tribunal that the transaction should be reviewed.

Editor's pickTechnology
DataCenterKnowledge· Yesterday

Qualcomm Lands Meta CPU Deal, Unveils AI Data Center Platform

If the company can provide software ... as AI infrastructure becomes more heterogeneous. Related:Bridging the Divide: How Data Centers Are Addressing Community Concerns · Central to Qualcomm’s strategy is High-Bandwidth Compute, or HBC, which the company says combines SRAM-class performance with HBM-class capacity to reduce ...

Editor's pickTechnology
Artificial Intelligence Newsletter | June 26, 2026· Yesterday

New York Times proposes third amended complaint against OpenAI, Microsoft

The New York Times is requesting to file a third amended copyright complaint against OpenAI and Microsoft, featuring expanded contributory infringement claims related to AI training services.

Editor's pickTechnology
💡 Army of agents· Yesterday

Exclusive: Palihapitiya on Facebook's "fumble"

Chamath Palihapitiya discusses Facebook's AI strategy and the company's approach to open-source AI in an exclusive interview.

AI Pricing & Cost Curves9 articles
Editor's pickPAYWALLTechnology
FT· Yesterday

Apple increases MacBook and iPad prices by 20%

iPhone maker loses $263bn in market capitalisation, blames costs on AI-driven memory shortage

Editor's pickPAYWALLTechnology
NYT· Yesterday

Apple Raises Prices on Macs and iPads Amid the A.I. Boom

The tech giant cited the soaring costs of memory and storage chips as it increased prices more than $200 on some devices.

Editor's pickPAYWALLTechnology
Bloomberg· Today

Apple Shares Fall After Prices Increase for Macs, iPads

Apple Inc. shares fell after it raised prices of all Macs, iPads, home devices and the Vision Pro to offset cost hikes caused by a shortage of memory chips and storage. The price increases are in effect globally and are largely unprecedented, with no equivalent in the company's modern history of sweeping hikes across much of its product line. Bloomberg's Neil Campling reports. (Source: Bloomberg)

Editor's pickTechnology
CNBC· Yesterday

Why Micron Technology's gain could be hyperscalers' pain

Margin expansion for memory chip makers equals cost expansion for the sector's customers.

Editor's pickPAYWALLConsumer & Retail
Bloomberg· Today

More Expensive iPhones on the Horizon?

Bloomberg Intelligence says Apple has to hike prices for its iPhones after raising prices for its Macs, iPads and home devices. Those hikes went into effect globally on Thursday. Bloomberg Intelligence Senior Technology Analyst Anurag Rana spoke with Bloomberg's Yvonne Man and Avril Hong on "Bloomberg: The China Show." (Source: Bloomberg)

AI Productivity5 articles
Editor's pickConsumer & Retail
MIT Technology Review· Yesterday

Repositioning retail for the AI era

Artificial intelligence is rapidly reshaping retail, but not in the ways consumers might immediately notice. The biggest transformation may not be flashy virtual try-ons or chatbot shopping assistants, but in how decisions are made behind the scenes: how products surface in search results, how inventory moves through supply chains, how engineers ship code faster, and…

Editor's pick
Arxiv· Today

Accelerating Returns and the Qualitative Engine for Science

arXiv:2606.26359v1 Announce Type: new Abstract: Ray Kurzweil described a thesis of accelerating returns, which is the most influential narratives in discussions of technological progress. Its central claim is that advances in multiple technological fields, especially compute, artificial intelligence, brain science, and biotechnology, interact in such a way that progress becomes self-amplifying and approximately exponential. This paper gives a simple mathematical interpretation of that claim and then argues that, even if such acceleration is real, it does not by itself resolve the central problem of scientific discovery. The reason is that accelerating returns apply most naturally to executional and infrastructural capability, whereas genuine discovery often depends on a different capacity: qualitative reasoning about when a current framework is structurally inadequate and what conceptual move is needed next. Recent ARC-AGI-3 results sharpen this distinction: humans solve the benchmark at ceiling, whereas frontier AI systems remain below 1%, indicating that the gap between current AI and human flexible reasoning is still very large. At the same time, Demis Hassabis has emphasized that humans must retain their sense of meaning and what they choose to focus their lives on, a reminder that the future of AI is not only a technical forecast but also a question of what forms of human understanding are worth preserving and transmitting. This paper positions the Qualitative Engine for Science (QES) [3] as a response to that missing capacity. In this view, the Kurzweil theory helps explain why quantitative capability may accelerate, while QES addresses the central problem in scientific discovery that acceleration alone does not solve. Its value does not depend on when AGI arrives, but on the fact that the processes of scientific discovery themselves constitute a form of human wisdom worth preserving, organizing, and making accessible.

Editor's pickEnergy & Utilities
gasworld· Yesterday

AI could cut LNG production costs by $80bn a year by 2050 | LNG / LPG | gasworld

The rapid expansion of AI-driven data centres and a wave of industrial reshoring are driving a surge in global electricity demand. Technologies need to scale quickly and securely without impacting the grid or raising costs. Emerging fuel-cell-based systems can be deployed faster and produce high-concentration CO₂ streams that reduce the cost, complexity and energy consumption ...

Editor's pick
AEI· Yesterday

AI's Biggest Impact May Be Making Workers More Valuable | American Enterprise Institute - AEI

The most important economic effect of artificial intelligence may not be that machines become more capable. It may be that people become more capable when they work alongside AI.

Editor's pick
Arxiv· Today

Do more heads imply better performance? An empirical study of team thought leaders' impact on scientific team performance

arXiv:2606.26483v1 Announce Type: new Abstract: Thought leadership plays a crucial role in boosting team performance; thus, teams with more thought leaders may perform better. However, the impact of the number of thought leaders on team performance in a scientific context remains understudied. In this study, we consider the authors of a publication as a scientific team and define authors responsible for conceptual tasks, such as conceived and designed the experiments in the PLOS contribution statement classification system, as thought leaders. Leveraging more than 140,000 papers from PLOS journals, we examine the relationship between the number of thought leaders and two aspects of team performance, namely team impact and team disruptiveness, from both correlational and causal perspectives. The results show that (1) an inverted U-shaped relationship exists between the number of thought leaders and team impact, and (2) teams with more thought leaders tend to produce less disruptive ideas. We also explore how international collaboration, team size, and gender diversity interact with the number of thought leaders in shaping team performance, and find that (3) international collaboration improves team impact but lowers the disruptiveness of team outputs. This study advances scholarly understanding of thought leadership in scientific teams and provides valuable insights for policymakers and team managers.

AI Startups & Venture7 articles

Labor, Society & Culture

46 articles
AI & Employment22 articles
Editor's pickPAYWALLEducation
WSJ· Yesterday

The New Push to Ready Millions for AI Career Upheaval

A coalition of employers and state governments says it is developing a sweeping strategy to help workers respond to the AI age.

Editor's pickPAYWALLProfessional Services
NYT· Yesterday

Big Companies Aim to Ease A.I. Transition for American Workers

OpenAI, Anthropic, Amazon and Microsoft have signed on to an effort led by Gina Raimondo, a former commerce secretary.

Editor's pickPAYWALLGovernment & Public Sector
Washington Post· Yesterday

Gina Raimondo launches a bipartisan plan to stop AI replacing workers - The Washington Post

Gina Raimondo is leading a broad coalition to fight AI steamrolling through the job market.

Editor's pick
International Monetary Fund· Yesterday

Artificial Intelligence and the Economics of Adjustment

Today’s AI policies will shape tomorrow’s job market

Editor's pickGovernment & Public Sector
Fortune· Yesterday

One of the Democratic Party’s brightest stars is co-founding a group to help with the coming AI jobs earthquake

"We’re talking about a certain level of unemployment that could destabilize our country and our democracy," Gina Raimondo told the AP.

Editor's pickPAYWALLProfessional Services
FT· Yesterday

In-house lawyers aim to shed the grind, not the people

Legal teams find that AI opens up opportunities on top of shouldering repetitive drudge work

Editor's pickProfessional Services
Top Daily Headlines: Infosys boss says vibe coding is no threat because there’s more to writing software than writing software· Today

Infosys boss says vibe coding is no threat because there’s more to writing software than writing software

Despite warnings of revenue deflation, the chairman predicts AI will create more work rather than less for services organizations.

Editor's pickGovernment & Public Sector
CA· Yesterday

California becomes the first state to launch a tool to monitor and track artificial intelligence’s impacts on the workforce | Governor of California

Paired with the tracker is a comprehensive analysis of the data, which at this time shows no evidence of rising statewide unemployment claims in AI-exposed occupations. However, the data does demonstrate impacts to workers in high-exposure AI positions following the expansion of AI software ...

Editor's pick
Theregister· Yesterday

AI giants back non-profit to retrain workers left behind by AI

Sorry we spent your wages on data centers, but call us when you're AI-ready

Editor's pickProfessional Services
Seeking Alpha· Yesterday

The true cost of AI productivity: Goldman ups job displacement forecast | Seeking Alpha

Goldman now sees generative AI displacing 9% of U.S. workers (15M) over 10 years—why the impact may be manageable.

Editor's pickProfessional Services
PYMNTS· Yesterday

Goldman Sachs Says AI Will Eliminate 15 Million US Jobs | PYMNTS.com

Goldman Sachs has increased its estimate of the share of U.S. jobs that could be displaced by generative AI to over 9%.

Editor's pickFinancial Services
American Banker· Yesterday

Are fears of AI taking jobs overblown? | American Banker

CEOs of companies such as Block and Bolt have pointed to AI as a driving force behind layoffs at their organizations. But Tania Babina, an associate professor of finance at the University of Maryland Smith School of Business, says there is no systematic evidence that AI is taking jobs.

Editor's pickProfessional Services
ANI News· Today

AI unlikely to trigger 'Job Apocalypse', it may create uneven workforce disruption: Goldman Sachs Report

Despite rapid advances in artificial intelligence (AI) and growing concerns over mass job losses, a new Goldman Sachs report argues that fears of an imminent "AI job apocalypse" are overstated, although the technology is expected to significantly reshape labour markets over the coming decade.

Editor's pickTechnology
Tech Times· Yesterday

AI Job Displacement 2026: Oracle Names AI In SEC Filing, Career Tier Risk Guide

Stanford's payroll data confirms this is already happening at measurable scale for software developers aged 22 to 25, whose employment fell nearly 20% from its late 2022 peak while their colleagues aged 30 and older saw employment grow. Why is AI job displacement accelerating in 2026 specifically?

Editor's pickTechnology
LinkedIn· Yesterday

Meta does a 180 on reassigning thousands to AI training

We cannot provide a description for this page right now

Editor's pick
Business Insider· Yesterday

OpenAI, Anthropic, Microsoft, and Amazon are behind a new organization that aims to help prepare workers for AI

Former Commerce Secretary Gina Raimondo is leading a new nonprofit with big aims to address AI job displacement.

Editor's pickTechnology
💡 Army of agents· Yesterday

AI agents are here for real this time

The use of agentic AI platforms like OpenAI's Codex is accelerating, with non-developers becoming the fastest-growing user group as these tools increasingly handle workplace and life admin tasks.

Editor's pickGovernment & Public Sector
NPR· Yesterday

Two former governors are teaming up to address potential job losses from AI : NPR

NPR's Steve Inskeep talks with two former governors, Indiana Republican Eric Holcomb and Rhode Island Democrat Gina Raimondo, about combatting AI-related job losses.

Editor's pickGovernment & Public Sector
The Straits Times· Today

California launches AI job loss tracker as layoff fears grow | The Straits Times

It comes as pressure mounts for policymakers to appear proactive on the threat of AI-driven job loss. Read more at straitstimes.com.

Editor's pick
Moneywise· Yesterday

AI was blamed for 40% of the 97,000 US job cuts in May

AI-linked layoffs are surging, but the technology isn’t at the root of all recent job losses.

Editor's pickTechnology
Business Wire· Yesterday

Bespoke Partners Releases Software & SaaS Talent Market Update: The AI Reckoning Arrives for Software Leadership

Bespoke Partners 1H2026 Software & SaaS Talent Market Update examines how AI disruption is reshaping leadership needs in PE-backed software and SaaS.

Editor's pick
ABC News· Yesterday

AI is plowing through the workplace. This new group wants to help people adapt and have jobs - ABC News

A new bipartisan nonprofit wants to help Americans who find they're out of work because of AI

AI & Inequality1 articles
Editor's pickEducation
Arxiv· Today

The Tilted Playing Field for Women in Science

arXiv:2606.26469v1 Announce Type: new Abstract: Institutional prestige shapes access to resources, visibility, and collaboration opportunities in science. Yet whether prestige benefits researchers equally, and how it relates to differences in scientific productivity and collaboration, remains unclear. Here, we quantify prestige advantage as the relative likelihood that researchers at higher-ranked institutions have more collaborators and produce more high-impact papers compared to their lower-ranked peers. Analyzing nearly 5 million papers by 6.5 million authors across more than 65,000 institutions, we present a distributional, tail-sensitive framework to compare prestige advantage across groups. We find that the association between prestige and scientific achievement differs systematically by gender. While both men and women benefit from prestige, the returns are not gender-neutral: women experience comparable advantages only at the most elite institutions, whereas men retain persistent advantages across the broader hierarchy, with disparities widening at higher levels of achievement. Prestige advantage also grows nonlinearly, disproportionately benefiting authors at the most elite institutions. These differences align with collaboration patterns: women's networks are more locally clustered and focused on their own institution, while men collaborate more broadly across institutional strata. Together, these findings reveal a tilted playing field in science: one where prestige amplifies success unevenly and network structure shapes who can access its benefits.

AI Ethics & Safety13 articles
Editor's pickGovernment & Public Sector
Artificial Intelligence Newsletter | June 25, 2026· 2 days ago

Australian state police to trial facial-recognition technology in public places

Police in Western Australia have announced they will trial live facial-recognition technology in public places from June 22 to 28, amid concerns over biometric surveillance.

Editor's pickPAYWALL
FT· Yesterday

Ethical AI rows open way to wave of litigation

Lawyers are at the forefront of debate and dispute over the technology’s lawful and responsible use

Editor's pickPAYWALLMedia & Entertainment
NYT· Yesterday

The New York Times Amends Lawsuit Against OpenAI and Microsoft

In a new court filing, The Times accused Microsoft of encouraging OpenAI to train its A.I. systems using copyrighted articles.

Editor's pick
Arxiv· Today

From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chan

arXiv:2606.27234v1 Announce Type: new Abstract: AI nudification uses generative models to create synthetic non-consensual sexually explicit imagery (SNEACI) of real individuals. Prior work has examined dedicated nudification platforms and model repositories, finding that most targets are female celebrities. However, the anonymous content community, where SNEACI is actively requested, generated, and exchanged, remains unexplored. In this work, we present a large-scale study of AI nudification in the wild, identifying 24,105 SNEACI items. We find a significant shift in target demographics: non-celebrity individuals now account for 55.8\% of targets, compared to only 4.7\% in prior studies, indicating that AI nudification has expanded from targeting public figures to increasingly harming individuals within users' own social circles. Meanwhile, open-source models dominate production, with Stable Diffusion family generating 42.7\% of images and Wan generating 66.5\% of videos, all driven by thousands of shared fine-tuned models and accessible tutorials. Yet the ecosystem runs on a small cohort of active producers, with the most prolific producing 780 items, drives community engagement, shapes target demographics, and disseminates technical knowledge that lowers barriers for new producers. Our work provides an empirical understanding of how AI nudification operates in the wild, revealing the mechanisms that sustain this ecosystem and highlighting the urgent need for interventions in platform governance, technical safeguards, and affected individual protection.

Editor's pickPAYWALLTechnology
Washington Post· Yesterday

Are AI chatbots like ChatGPT politically biased? We tested them. - Washington Post

So, are chatbots politically biased? The Washington Post tested the AI models behind Open AI ’s ChatGPT, Google’s Gemini and others using political questions designed by researchers to gauge how chatbots respond to hot-button political issues.

Editor's pickHealthcare
Arxiv· Today

Simulating Eating Disorder Patients with LLMs: Evaluating Psychological Persona Stability in Multi-Turn Conversations

arXiv:2606.26109v1 Announce Type: new Abstract: Large language model (LLM)-based simulations of clinical patients are increasingly used for research and training, yet their validity requires persona stability: coherent maintenance of an assigned psychological profile across and within conversations. We evaluate this prerequisite using eating disorder personas grounded in five published case vignettes, a dual-assessment framework (self-report + independent observer ratings), and validated psychometric instruments (EDE-Q) with known ground-truth scores. Across six LLMs and two experiments (between-conversation stability (Exp. I) and within-conversation stability (Exp. II)), we find that LLMs are paradoxically too stable and too inaccurate: variability is negligible, yet all models systematically overshoot ground-truth severity by 12-30% of the scale range (0.7-1.8 points on a 0-6 scale). The mechanism is selective stereotyping: models differentiate cases on behavioural items (dietary restraint) but maximise cognitive-affective items (body dissatisfaction, weight preoccupation) at ceiling regardless of case severity. Additional conversational context does not improve accuracy; it compounds the overshoot. LLMs can portray severe eating pathology but lack a representation of moderate clinical presentations, a "missing middle".

Editor's pickProfessional Services
NatLawReview· Yesterday

Workplace Strategies Watercooler 2026: Ethics of AI in the Workplace—Emerging Standards and Risks [Podcast]

In this installment of our Workplace Strategies Watercooler 2026 podcast series, shareholders Simone Francis (St. Thomas/New York) and Lauren Hicks (Indianapolis) explore the fast-moving legal landscape surrounding AI ethics in the workplace, from the ethics rules that already govern attorney ...

Editor's pickPAYWALL
FT· Yesterday

Anthropic has hired an economist with . . .  interesting views on human survival

Skynet versus unending rainbows

Editor's pickProfessional Services
Mondaq· Yesterday

New Stanford Study Reveals Bias In AI Hiring Tools And Raises Stakes For Employers - Discrimination, Disability & Sexual Harassment - United States

A recent study by Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) provides new large-scale evidence that artificial intelligence (AI) hiring tools can produce racially disparate outcomes.

Editor's pickProfessional Services
Accounting Today· Yesterday

IMA favors AI use over abuse | Accounting Today

The Institute of Management Accountants discussed the pros and cons of artificial intelligence with officials, including the new leader of the PCAOB.

Editor's pickTechnology
Artificial Intelligence Newsletter | June 25, 2026· 2 days ago

OpenAI endorses US Defiance Act targeting nonconsensual deepfakes

OpenAI has become the first AI company to endorse the DEFIANCE Act, which aims to create a civil right of action for victims of non-consensual intimate deepfakes.

Editor's pick
Arxiv· Today

Detecting and Controlling Sycophancy with Cascading Linear Features

arXiv:2606.26155v1 Announce Type: new Abstract: Interpreting and controlling model behaviors through activation steering methods requires many pairs of contrastive samples that clearly exhibit desired or undesired behavior. These data pairs determine the degree to which interpretability frameworks can reliably detect model features responsible for a behavior, and therefore the ability to steer models toward or away from such behavior. In this work, we present an iterative data generation pipeline that isolates cascading linear features responsible for a behavior. Specifically, we show how moving beyond simple binary pairs of samples, and instead isolating samples that show degrees of features that scale linearly with behavior, allows for better disentanglement of features. We focus on detecting and steering away from sycophancy -- the tendency of language models to prioritize user validation. We demonstrate that sycophancy features discovered through cascading samples form linearly separable subspaces, and allow for selection of model activations that more clearly correspond to the desired behavior than baseline approaches. We also evaluate their ability to enable detection, deterministic scoring, and robust steering, and see that they either match or outperform LLM-as-a-judge and system prompting baselines while providing lower computational demand and more interpretability guarantees. Code & Data: https://cascading-feats.github.io/

Editor's pick
Arxiv· Today

Refusal Lives Downstream of Persona in Chat Models

arXiv:2606.26161v1 Announce Type: new Abstract: Linear directions in activation space have been identified for both refusal and persona traits in instruction-tuned chat models, but the two have been studied as separate mechanisms. We show they interact: a compliant persona gates refusal. In Qwen2.5-7B-Instruct and Llama-3.1-8B-Instruct, we extract a compliant model-persona direction and a refusal direction and intervene on both. Compliant persona steering suppresses refusal -- in Llama, the refusal rate falls from 97% to 2%. Reintroducing the refusal direction partially restores refusal at late layers but not at early ones. Projecting out the persona direction in a late-layer window restores it to baseline; projecting out a random direction does not. Refusal is therefore gated at the late-layer expression stage, downstream of where it is computed. Treating refusal as a single isolated direction misses its dependence on persona.

AI Skills & Education6 articles
Editor's pickEducation
Arxiv· Today

The Effortless Trap: Productive Struggle, AI, and the Illusion of Learning

arXiv:2606.26181v1 Announce Type: new Abstract: With AI advancing fast, educators face a dilemma: allow the tool or ban it. Conflicting evidence that it both helps and hurts learning only deepens the confusion. The allow-or-ban framing is a false dichotomy; the relevant design question is placement. Used well, AI can scale feedback, examples, practice, and individualized support. Used poorly, it replaces the cognitive work that learning requires and leaves an illusion of learning: a confident sense of mastery that collapses on the unaided task. The strongest causal evidence shows the outcome flips on design: an unguarded AI helper left high-school students about 17% worse on an unaided exam than peers with no tool at all, while the same model rebuilt to withhold answers erased the harm, and a well-engineered tutor roughly doubled learning. We give educators one graspable frame for placing the tool. A new idea is learned through six moves, in order: Prime, Probe, Point, Attach, Strengthen, and Test. Secure the first hard attempt and the final unaided check, scaffold with guarded AI in between, and one diagnostic carries the frame: if letting AI in makes the task feel effortless, it is in the wrong place. To make it usable, we map classical teaching moves and AI-supported interventions to each step. Together, the six-move model, the placement rule, and the intervention menu provide a practical foundation for lesson and course redesign in the age of AI.

Editor's pickPAYWALLEducation
Theatlantic· Yesterday

AI Can’t Fix the Student-Motivation Problem

It turns out bots aren’t great teachers.

Editor's pick
New Kerala· Yesterday

MSME AI Skill Demand Surges 164% in FY26: Report

India's workforce is becoming more geographically scattered, with demand for AI-related skills rising sharply by 164% in FY26 within the MSME sector, according to the Apna MSME Hiring Pulse 2026 report. "Demand for jobs requiring AI-related skills grew by 164% in FY26, signalling that MSMEs ...

Editor's pick
ANI News· Yesterday

MSME sector sees 164% surge in AI skill demand; workforce turns geographically dispersed: Report

New Delhi [India], June 25 (ANI): ... for AI-related skills rising sharply by 164% in FY26 within the MSME sector, according to the Apna MSME Hiring Pulse 2026 report. The report notes that Indian MSMEs are changing their hiring patterns as they expand, adopt digital tools and build a more future-ready workforce. It added that hiring growth remained broad-based, driven by both new businesses entering the recruitment ecosystem and existing employers scaling up their workforce requirements...

Editor's pickEducation
The Independent· Yesterday

A new $500 million push to retrain workers for an AI-driven future | The Independent

And unlike many other quality news ... and analysis with paywalls. We believe quality journalism should be available to everyone, paid for by those who can afford it.Your support makes all the difference.Read more · The United States is hurtling towards an artificial intelligence-driven future, largely unprepared for the potential for widespread job displacement. While some warn of "doomsday scenarios" reminiscent of science fiction, proponents argue AI will create ...

Editor's pickEducation
Guardian· Yesterday

‘More relevant than making fires’: Explorer Scouts launch badges for AI and digital age

Content creation and online safety among new topics for 14- to 18-year-olds – but tweaks may be needed when social media ban comes in Scouts are introducing badges in content creation, digital communication and online safety after consulting nearly 3,000 teenagers who said they wanted skills to help them navigate a world increasingly shaped by AI, social media and digital technology. The new Explorer Scout badges, part of the Scout movement’s first major overhaul in almost 25 years, will require 14- to 18-year olds to explore how digital communities shape opinion, create online campaigns, investigate digital footprints and design toolkits to help others stay safe online. Continue reading...

Technology & Infrastructure

51 articles
AI Agents & Automation10 articles
Editor's pickTechnology
Arxiv· Today

The Shift to Agentic AI: Evidence from Codex

arXiv:2606.26959v1 Announce Type: new Abstract: We analyze usage data from OpenAI's Codex tool to present large-scale evidence of how agentic AI technology, which can take actions on a user's behalf, changes how people work. We use an automated, privacy-protecting pipeline to contrast usage across three populations: external personal-account users, external organizational-account users, and workers within OpenAI. We find that agentic AI usage is growing rapidly: the number of active users has grown more than fivefold in the first half of 2026, with the most rapid increase occurring outside the initial audience of software developers. Uptake is uneven: within OpenAI, Codex usage is nearly universal and has largely replaced business usage of ChatGPT. We document a similar shift to agentic tooling outside OpenAI, particularly within organizations, although external adoption remains lower and more uneven. In addition to headline usage figures, we observe measures of sophistication, and find that a growing number of users have used Codex to change their workflows substantially. More than 10% of users manage three or more concurrent Codex agents at some point each week and that 26.6% use skills, which allow users to share instructions for complex workflows. Alongside these changes in usage practices, request complexity has increased: since the start of the year, the share of individual Codex users who submit at least one request for a task estimated to require more than eight hours for an experienced human to complete has increased nearly tenfold. Concurrently, output has grown rapidly -- in June 2026, the median OpenAI employee in a legal role generated 13 times more monthly output tokens across Codex and ChatGPT than they did in November 2025, while the median researcher generated more than 50 times as many. We conclude by discussing the implications of these patterns for productivity, job reorganization, and workforce restructuring.

Editor's pickConsumer & Retail
Yahoo!· Yesterday

Retail is entering the age of Agentic AI

This shift is closely tied to advances ... complex AI agents. These systems are increasingly being tested across e-commerce platforms, logistics networks, and marketing operations. While many applications are still in pilot stages, the direction is consistent across the industry: more tasks are being handed over to automated decision-making ...

Editor's pickTechnology
Theregister· Yesterday

OpenAI says employees moving beyond chat to agents

Codex, it's not just for developers, really

Editor's pickFinancial Services
stoik AI· Yesterday

The Rise of Agentic AI: Transforming Business Operations

Agentic AI goes a step further—it analyzes, decides, and executes based on real-time conditions. For instance, an AI-powered investment platform could: Monitor global financial markets. Detect trends and adjust portfolio allocations autonomously. Adapt investment strategies based on economic conditions. ... One of the biggest concerns about automation ...

Editor's pickTechnology
Daily AI News June 25, 2026: AI Agents 101: The 2026 Stack· Yesterday

The AI Agents Stack

This article outlines a six-layer framework for transitioning from an LLM to a production-ready AI agent, covering orchestration, memory, tools, evaluation, and guardrails.

Editor's pickTechnology
Orangemantra· Yesterday

Trends in Software Testing: What’s New in 2026? Today

This dramatically reduces the maintenance burden on automated test suites, especially for applications with frequent UI updates. The future involves autonomous AI testing agents that continuously explore application surfaces, agentic QA systems that generate and execute tests without human ...

Editor's pickFinancial Services
Arxiv· Today

OpenFinGym: A Verifiable Multi-Task Gym Environment for Evaluating Quant Agents

arXiv:2606.26350v1 Announce Type: new Abstract: Although large language model agents are increasingly applied to quantitative-finance workflows, their evaluation remains fragmented across isolated tasks, while the financial relevance of benchmark tasks is often overlooked. Yet financial workflows are inherently multi-stage, spanning interdependent tasks such as forecasting, strategy construction, risk management, and trading. Existing platforms typically focus on a single task, and can therefore overstate agent competence and fail to reveal weaknesses in generalization, real-market interaction, and financially meaningful decision-making. We introduce OpenFinGym, a unified gym environment for quantitative-finance agent development that covers forecasting, market generation, real-time trading, and fraud detection under a single execution and verification interface. OpenFinGym additionally provides an automated task-construction pipeline that turns quantitative finance publications into executable task packages; a containerised runtime with a host-side verifier service that supports scalable agent rollouts and prevents runtime train-test leakage; a paper trading engine with a low-latency data-stream design; deferred-resolution support for long-horizon and event-market forecasts; and integration for SFT and RL post-training

Editor's pick
Arxiv· Today

Pingquanqi (Equalizer): A Cross-Domain Sociotechnical Framework for Human-Agent Interaction Governance

arXiv:2606.26573v1 Announce Type: new Abstract: LLM agents are transitioning from experimental tools to permanent infrastructure -- a computational layer as enduring as the electrical grid. Like any infrastructure, they carry a cost chain from physical capital through enterprise investment to user consumption, ending at the user's most irreplaceable resource: lifetime. When unoptimized, this chain leaks, consuming user lifetime without adequate compensation. This paper proposes Pingquanqi (Equalizer), a cross-domain sociotechnical framework for Human-Agent Interaction Governance (HAIGF). Its product form is an Agent framework-level embedded design specification, analogous to WCAG for web accessibility, whose goal is not to be purchased but adopted as a standard. Pingquanqi consists of four integrated components deployable as native middleware: (1) a user-state discrimination model enabling proactive knowledge leveling, (2) a Bayesian progressive stop-loss rule capping per-session interaction cost, (3) controlled friction mechanisms breaking self-reinforcing dependency loops, and (4) Lsteal, a transparency metric rendering token-to-lifetime cost conversion visible. A fifth mechanism, reflective summarization (F5), enables guided cognitive recollection. The framework is grounded in cross-cultural philosophy: Mao's epistemology of practice (On Practice, 1937) provides the basis for cross-session knowledge accumulation; Wang Yangming's unity of knowledge and action (zhi xing he yi, c. 1509) illuminates Lsteal's root -- knowing without acting is incomplete; and Hegel's unity of theory and practice demonstrates cross-traditional convergence. This paper argues Pingquanqi's primary economic beneficiary is the enterprise deploying Agent services -- through reduced wasted computation, improved user satisfaction, and sustained subscription revenue -- with individual user benefit as the natural downstream consequence.

Editor's pickTechnology
Arxiv· Today

Instruction Bleed: Cross-Module Interference in Prompt-Composed Agentic Systems

arXiv:2606.26356v1 Announce Type: new Abstract: Practitioners of prompt-composed agentic systems report a recurring failure mode: editing one prompt module silently shifts the behavior of others despite no shared variable or executable dependency. We formalize this as compositional behavioral leakage (CBL): interference between modules sharing a context window. CBL is enabled by architectural non-isolation: transformer self-attention provides no formal boundary between concatenated modules. We probe CBL on a deployed job-evaluation agent (Claude Sonnet 4.6, 144 trials) through a reusable three-channel protocol that perturbs non-focal modules along volume, content, and form. Only the content channel produces a detectable paired effect (Cohen's d = 0.63, bootstrap 95% CI excluding zero); no recommendation flipped -- a sub-threshold regime invisible to standard QA but compounding across the thousands of decisions a deployed agent makes. CBL is orthogonal to known agent-failure axes (adversarial injection, cognitive degradation, multi-agent fault propagation, privacy leakage). We contribute an operational definition, a reusable protocol, a falsifiable prediction set, and a system-class characterization, establishing cross-module interference measurement as a requirement for prompt-composed agent evaluation.

Editor's pick
Whois JSON API Blog· Yesterday

Why Harnesses Matter in Agentic AI Systems

Absent runtime safety controls, raw LLM agents face exacerbated engineering risks from fault propagation, adversarial input vulnerabilities, and unrecoverable error states. Safety-critical engineered systems—whether in aviation, industrial automation, or healthcare—embed layered fail-safes, watchdogs, and recovery mechanisms extensively tested to prevent fault escalation. AI ...

AI Energy6 articles
Editor's pickEnergy & Utilities
Artificial Intelligence Newsletter | June 25, 2026· 2 days ago

US House subcommittee advances bill to shield consumers from AI energy costs

A US House subcommittee on Wednesday advanced bipartisan legislation aimed at shielding American consumers from electricity rate hikes attributable to artificial intelligence infrastructure.

Editor's pickEnergy & Utilities
Arxiv· Today

Too cheap to meter? A stochastic analysis of projected future fusion costs

arXiv:2606.26536v1 Announce Type: new Abstract: In recent years, technological developments and activities by private actors have led a reemerged discussion of the potential of nuclear fusion to meet growing global energy demands. So far, however, fusion technologies remain at comparatively low development levels and their deployment in commercial power plants is probably still decades away. Regardless, over the last decades, many cost studies have been conducted that estimate the future cost of potential fusion power plants. But to date, there is no systematic and harmonized assessment of these projections. Therefore, this study conducts a stochastic analysis of future fusion power plant costs for three distint technology lines, magnetic confinement, inertial confinement, and magneto-inertial confinement fusion, including cost assessments of different technology maturity levels. These levels are further assessed to determine projected learning rates for future fusion costs. For mature technologies, mean LCOE are determined at 114.6, 110.3, and 143.9 USD per MWh for MCF, ICF, and MIF devices, respectively. This implies learning rates of more than 30%. We find that these projected values are rather optimistic when compared to other literature or comparable technologies like fission. We therefore urge policymakers to caution when potential fusion developers refer to the potential economic competitiveness of fusion power plants.

Editor's pickEnergy & Utilities
Explainx· Today

AI and Climate Change 2026: Energy Footprint vs. Climate Solutions | explainx.ai Blog | explainx.ai

The IEA's 2024 Electricity report ... consumption could reach 1,000 TWh per year by 2026, roughly double 2022 levels. AI workloads are the primary driver of that growth. ... This does not mean AI is single-handedly ruining the climate. The global electricity mix is getting cleaner, and large tech companies are the world's largest buyers of renewable energy. But it does mean that AI infrastructure is now a material factor in global energy demand — too large ...

AI Infrastructure & Compute16 articles
Editor's pickTechnology
CNBC· Yesterday

Anthropic's latest hiring spree reveals where it's building AI data centers next

The AI lab said recent growth is "placing an inevitable strain on our infrastructure." Australia and Japan are emerging as key markets for AI data center growth for Anthropic, though copyright laws and power access remain challenges. Anthropic is racing to increase its AI compute capacity in the ...

Editor's pickTechnology
Euronews· Yesterday

Micron posts record results as AI boom drives 15-fold jump in net profit | Euronews

The US chipmaker Micron Technology has reported quarterly revenue and profit far above Wall Street's expectations, as insatiable demand for the memory chips that power AI transformed the firm's fortunes.

Editor's pickTechnology
Daily AI News June 25, 2026: AI Agents 101: The 2026 Stack· Yesterday

OpenAI and Broadcom unveil LLM-optimized inference chip

OpenAI has announced its first custom inference-only chip, co-developed with Broadcom, aimed at improving LLM inference economics and reducing reliance on external hardware suppliers.

Editor's pickEnergy & Utilities
OilPrice.com· Yesterday

Why the AI Boom Could Trigger the Biggest Energy Trade in Decades | OilPrice.com

In Finland, Bitzero has secured a massive one-gigawatt development campus in Kokemäki tied directly into low-cost Nordic power infrastructure. The site gives the company room to scale both Bitcoin mining and AI compute operations over time as demand for energized capacity continues accelerating ...

Editor's pickTechnology
Top Daily Headlines: Boffin claims Microsoft's supposed quantum leap does not compute due to 'basic Python errors'· Yesterday

Chinese supercomputer using local processors heads TOP500 list

The use of Arm cores and Linux indicates that Beijing has not fully broken away from global standards.

Editor's pickTechnology
SAP· Yesterday

Sovereign Data Infrastructure in Europe: Essential or a Distraction?

The push for sovereign AI data centers in Europe reflects a shift in how IT infrastructure is perceived.

Editor's pickEnergy & Utilities
DIGITIMES· Yesterday

Europe's AI infrastructure: the cost gap that policy cannot paper over

The EU currently hosts roughly 5% of the world's AI compute capacity. The US holds close to 75%. McKinsey projects European data center demand will grow from 10 GW of IT load in 2024 to 35 GW by 2030 — a tripling driven almost entirely by AI. The infrastructure to close that gap does not ...

Editor's pickPAYWALLTechnology
Bloomberg· Today

SemiAnalysis Xie on Asia AI Supply Chain

Myron Xie, AI supply chain research lead at SemiAnalysis, explains who, within Asia's hardware suppliers, are standing out as the winners in the market's surge in AI demand. He speaks with Shery Ahn on Bloomberg Tech: Asia. (Source: Bloomberg)

Editor's pickPAYWALLTechnology
Bloomberg· Today

Samsung, SK Hynix Reportedly Preparing Huge AI Spending Push

Samsung Electronics Co. and SK Hynix Inc. are preparing to announce hundreds of billions of dollars worth in new investments on Monday, according to South Korean media reports this week.

Editor's pickTechnology
Guardian· Yesterday

Datacentres are growing target of global climate-related legal cases, report finds

LSE analysis highlights litigation linked to energy sources, water consumption and air pollution The proliferation of datacentres and AI is increasingly at the forefront of environmental litigation around the world, from the US and UK to Chile to Ireland, a report has found. In an analysis of about 3,600 climate-related lawsuits filed since 2015, the latest annual review of climate litigation by the London School of Economics (LSE) found a growing number of cases challenging the energy sources, water consumption and air pollution of datacentres, all of which have related climate implications. Continue reading...

Editor's pickEnergy & Utilities
DataCenterKnowledge· Yesterday

ICF Warns Grid Deliverability May Limit AI-Era Power Growth

His reporting focuses on the ... energy procurement, and next-generation data center architectures. He has won recent Azbee awards for news series and government reporting. Based in Raleigh, North Carolina, Snider covers how hyperscalers, utilities, chipmakers, and infrastructure providers are responding to the rapid rise of AI workloads and global compute demand...

Editor's pickEnergy & Utilities
MarketScale· Yesterday

AI demand, nuclear strategy, and grid innovation reshape the global energy sector

Taken together, this week's ... reflect an energy sector in active transition, where AI demand is providing both the economic justification and the urgency for upgrading infrastructure that, in many cases, was built for a different era of power consumption....

Editor's pickTechnology
Arm Newsroom· Yesterday

From host node to heterogeneous rack: Rethinking the AI CPU - Arm Newsroom

It must route requests to the right compute tier, move data efficiently between prefill and decode, manage KV cache transfers, maintain session state, execute non-model work and enforce service-level objectives across the full pipeline. That is why the industry is moving toward a more system-level view of AI infrastructure. The amount of useful agentic work a data center can perform is not determined by accelerator capacity ...

Editor's pickTechnology
Forbes· Yesterday

Dell Technologies BrandVoice: Why AI Infrastructure Bottlenecks Are Moving Beyond GPUs

The variable most organizations are missing isn’t compute — it’s storage purpose-built for AI context, not just data capacity.

Editor's pickTechnology
Mezha· Yesterday

Former Databricks AI chief predicts AI energy use can drop 1,000-fold | Ukraine news - #Mezha

Data infrastructure also plays a key role: efficient storage formats, fast data access paths, and minimizing data movement should reduce energy consumption and environmental costs. Such statements are met with both support and skepticism in the industry. The main obstacles are related to rising demand ...

Editor's pickTelecommunications
Senior Executive· Yesterday

How Space-Based AI Infrastructure Could Transform Cloud Computing

Members of the Senior Executive ... and AI platforms—and what business leaders should do now to prepare. ... For decades, the technology industry’s infrastructure strategy has been remarkably straightforward: Build bigger data centers, add more fiber and deploy more compute capacity closer to ...

AI Models & Capabilities6 articles
Editor's pickTechnology
Arxiv· Today

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

arXiv:2606.26300v1 Announce Type: new Abstract: A classical intuition holds that verifying a solution is easier than producing one. For today's coding agents, this intuition is being inverted: as foundation models develop stronger reasoning capabilities and engineering harnesses grow more sophisticated, generating complex candidate solutions is no longer difficult -- reliably verifying them has become the harder problem. Every verifier we can build is only a proxy for human intent, never the intent itself. This makes verification subject to a twofold difficulty: first, intent is underspecified by nature, making it inherently hard to faithfully check whether it has been fulfilled; second, during model training, optimization widens the gap between proxy and intent -- manifesting as reward hacking or signal saturation. To address this, we characterize the quality of verification signals along three dimensions -- scalability, faithfulness, and robustness -- and argue that achieving all three simultaneously is the central challenge. We further study four reward constructions: a test verifier for general coding tasks, a rubric verifier for frontend tasks, the user as verifier for real-world agent tasks, and an automated agent verifier for long-horizon tasks. Across different task types and policy capability levels, we conduct in-depth analysis and experiments on the core challenges of reward design and how to more effectively leverage reward signals. Experiments show that targeted verification design can effectively suppress reward hacking, improve task completion quality, and achieve significant gains across multiple internal and public benchmarks. These experiences collectively point to a core observation: no fixed reward function can remain effective as policy capability continues to grow; and verification must co-evolve with the generator.

Editor's pick
Arxiv· Today

Human--LLM Collaboration Is Transforming Complexity Metrics in Scientific Texts

arXiv:2606.27052v1 Announce Type: new Abstract: While human language has long been studied as a complex system, Large Language Models (LLMs) are rapidly becoming contributors to its dynamics. Because LLMs are trained on human language use, their effects on the broader human-AI linguistic ecosystem are likely subtle at first. As their use becomes more widespread, however, LLMs may alter emergent properties of language, particularly as models increasingly train on mixed human-LLM textual data. Here, we draw on complexity science to look for subtle LLM effects in millions of arXiv abstracts from 2010 to 2025. The year 2023, when LLMs rapidly became widely used, serves as a landmark in a natural experiment. While we find a sharp increase in a composite LLM-associated style index after early 2023, we observe only subtle changes in the exponents of Zipf's law and Heaps' law. More compelling, however, are two subtle changes in complexity metrics that emerge from 2023 onward. First, turnover among top-ranked words increases sharply. Second, the positive relationship between the LLM-associated style index and three complexity metrics--vocabulary size and the exponents of Heaps' and Zipf's laws--becomes flatter after 2022. Together, these patterns are consistent with changes in the emergent properties of scientific text in a mixed human-AI linguistic ecosystem.

Editor's pickTechnology
VentureBeat· Yesterday

Liquid AI's smallest model yet LFM2.5-230M beats models 4X its size at data extraction, can run 'anywhere'

Liquid AI, founded by former MIT computer scientists, today released its smallest AI language model yet, LFM2.5-230M, and enterprises would do well to consider it for their uses in data extraction and local deployment on smartphones, laptops and robotics. This is a 230-million-parameter foundation model explicitly designed for on-device agentic workflows, and as Liquid states in its release blog post, that small size makes it possible to run nearly "anywhere." According to Liquid, it also outperforms models more than 4X its size on selected benchmarks, specifically doing better at data extraction than the 800 million parameter count Alibaba Qwen3.5-0.8B (Instruct) and 1-billion parameter Google Gemma 3 1B. The model targets developers and engineers building lightweight data extraction pipelines and autonomous edge systems. Operating under a dual-use commercial license, the model remains free for individuals and companies generating less than $10 million in annual revenue, while requiring a paid enterprise agreement for larger corporations. This release distinguishes itself from other small AI models by utilizing the LFM2 architecture to achieve high inference speeds without the massive memory overhead typical of parameter-heavy transformers. While major AI companies Anthropic, OpenAI, Google, Microsoft, Meta and others push parameter counts into the hundreds of billions or trillions to achieve frontier performance, a parallel race focuses entirely on the edge and local deployments. Liquid AI's launch of LFM2.5-230M signals a pivotal shift toward architectural efficiency over brute-force scaling. By squeezing 19 trillion tokens of pre-training into a 230-million-parameter footprint, the company demonstrates that edge devices do not need massive computational power or persistent cloud connections to execute complex, multi-step agentic workflows. How LFM2.5-230M works The LFM2.5-230M model diverges from standard transformer architectures, relying instead on the LFM2 framework. This architecture functions as a hybrid system, interleaving gated short-range convolutions with grouped-query attention to process information efficiently. For those tracking the evolution of efficient architectures, Liquid’s approach shares a similar conceptual goal: managing long contexts and sequential data effectively on edge hardware without the quadratic memory costs of pure attention mechanisms. The model supports an expansive 32K context window, allowing it to ingest substantial documents or continuous streams of robotic telemetry. When analyzing the performance charts provided in the release, the architectural efficiency becomes visually apparent. The model maintains a memory footprint of under 400MB while achieving prefill and decode speeds that outpace comparable models like Gemma 3 1B IT and Granite 4.0-H-350M. On a Samsung Galaxy S25 Ultra equipped with a Qualcomm Snapdragon Gen4 CPU, the model reaches a decode speed of 213 tokens per second. Even on a highly constrained Raspberry Pi 5, the model maintains a decode rate of 42 tokens per second. Furthermore, internal benchmarking shows the GPU inference stack delivers lower end-to-end latency than competing small models across all concurrency levels. Why it matters for enterprises To understand why a 230-million-parameter model is necessary, one must look at how enterprises currently manage data. Organizations have traditionally relied on rigid, rule-based Extract, Transform, Load (ETL) scripts to move and process data. However, these legacy systems are notoriously brittle; a simple change in a document's layout or a schema update can break the entire pipeline. To solve this, the industry is shifting toward "AI ETL," where machine learning infers mappings, detects schema drift, and adapts to changes automatically. In a modern lightweight data extraction pipeline, an AI model connects to unstructured sources—like PDFs, emails, or web forms—and structures the data into formats like JSON without requiring hardcoded rules. For enterprises, using a massive flagship model like Claude Opus 4.6 (which costs $5.00 per million input tokens) to parse routine invoices, format addresses, or route telemetry data is economically unviable. This is where models like LFM2.5-230M become critical. Designed explicitly as a lightweight extraction engine, it allows companies to automate repetitive formatting and data parsing at a fraction of the compute cost and latency, running directly on local hardware rather than relying on expensive, continuous cloud API calls. Small Model Benchmarks: LFM vs. The 3B Class The AI industry in mid-2026 is seeing a renaissance in "small" models, but the definition of "small" varies wildly. Recently, the open-weight community was stunned by Weibo's VibeThinker-3B, a 3-billion-parameter model built on a Qwen2-style backbone that achieved a massive 94.3 on the AIME 2026 math benchmark, rivaling 600-billion-parameter behemoths through aggressive data curation and reinforcement learning. Similarly, Google's Gemma 4 family — which recently crossed 200 million downloads — pushes frontier AI to the edge, including the E2B (2 billion parameters) designed specifically for mobile and IoT deployments. By contrast, Liquid AI's LFM2.5-230M operates in a completely different weight class. At just 230 million parameters, it is roughly one-tenth the size of Google's smallest Gemma 4 model and VibeThinker-3B. Because of its microscopic footprint, LFM2.5-230M is not designed to compete on reasoning-heavy workloads like advanced math, coding, or creative writing—a constraint Liquid AI explicitly acknowledges. However, in its intended domains of data extraction and tool calling, the model punches well above its weight class. Benchmarks released by Liquid AI show LFM2.5-230M scoring 43.26 on the BFCLv3 tool-use benchmark, dominating IBM's Granite 4.0-350M (39.58) and completely outpacing larger 1-billion-parameter models like Google's Gemma 3 1B IT (16.61). On CaseReportBench for data extraction, it scores 22.51, decimating the Qwen3.5-0.8B (Instruct). LFM2.5-230M proves that while 3-billion-parameter models like VibeThinker are solving advanced calculus, a 230-million-parameter model is the superior, highly optimized choice for executing structured tool calls and keeping agentic pipelines running efficiently on constrained hardware. Advanced research uses Because it excels at tool calling, LFM2.5-230M functions primarily as a skill-selection layer. Liquid AI demonstrated this capability by deploying the model on a Unitree G1 humanoid robot. Running entirely on-device via the robot's onboard NVIDIA Jetson Orin compute module, the model successfully processes complex environmental commands. As noted in the company's technical blog, the model takes a free-form instruction like, *"Hold still for 2 seconds, then walk forward at 1 meter per second for 3 meters, hold a forward one-leg kneel for 5 seconds, and walk backward at 0.5 meters per second for 3 meters,"* and automatically translates it into a structured multi-step plan calling on pre-trained low-level skills provided by NVIDIA's SONIC framework. The base and post-trained models are available immediately on Hugging Face, with native day-one support across the inference ecosystem for llama.cpp (GGUF), MLX, vLLM, SGLang, and ONNX. Dual-use, custom LFM Open License Liquid AI ships LFM2.5-230M under the LFM Open License v1.0. Despite the word "open" in the title, this is not an Open Source Initiative (OSI) compliant license; it operates as a restricted, dual-use commercial framework. For independent developers, researchers, and early-stage startups, the license functions identically to open-source software. Users receive a perpetual, worldwide, royalty-free license to reproduce, modify, and distribute the model, provided they retain original copyright notices and prominently state any modifications. However, the license includes a strict "Commercial Use Limitation". Any legal entity generating $10 million or more in annual revenue loses the right to use the model commercially under this agreement. Large enterprises crossing this financial threshold must negotiate a separate, paid commercial agreement with Liquid AI to deploy the model in production. This strategy protects the company from having its intellectual property absorbed by major technology conglomerates for free, while still seeding the model at the grassroots developer level.

Editor's pick
Arxiv· Today

What We are Missing in Multimodal LLM Evaluation?

arXiv:2606.26348v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) can process diverse inputs, e.g., text, images, audio, and video, and generate textual responses. While their capabilities have advanced rapidly, evaluation of such models has not kept pace. Most existing evaluation benchmarks are limited to isolated tasks and reveal little about whether a model integrates information across modalities. We examine current means for evaluating MLLMs and review the existing benchmark taxonomy to identify gaps, including temporal-spatial coherence, physical world understanding, multimodal consistency, and selective attention. Addressing these gaps is essential for measuring real progress in multimodal intelligence and exposing capability boundaries.

Editor's pickTechnology
OpenAI built its own chip in 9 months—here's what changes for API cost· Yesterday

OpenAI upgrades GPT-5.5 Instant with smarter intent and better recommendations

OpenAI updated GPT-5.5 Instant to better understand user intent, handle multiple constraints, and adapt to feedback. The model is now available in the API as 'chat-latest'.

Editor's pickManufacturing & Industrials
Arxiv· Today

COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami

arXiv:2606.26299v1 Announce Type: new Abstract: While generative AI has achieved remarkable success in solving problems with verifiable solutions, generating physical art that satisfies both strict geometric constraints and subjective visual aesthetics remains a challenge. This paper presents an approach to tackle these difficulties in the domain of computational origami, a mathematically rigid environment that grounds artistic design within the equations of flat foldability. We present COrigami, an end-to-end AI-driven pipeline that assists the design cycle by generating crease patterns from natural language. Our pipeline involves generating a semantic stick figure, computing a base packing, solving for a flat-foldable crease pattern, shaping the flat-folded crease pattern, and refining the generated model using reinforcement learning driven by an autonomous aesthetic evaluation loop. Our system acts as a highly effective collaborative assistant, generating structural starting points that human artists can further expand and shape. By integrating algorithmic optimisation with autonomous aesthetic critique, this work demonstrates how AI systems can satisfy multi-objective physical constraints to enable reliable, mathematically grounded co-creativity.

AI Security & Cybersecurity8 articles

Adoption, Deployment & Impact

16 articles
AI Adoption Barriers & Enablers4 articles
AI Applications4 articles
Editor's pickEnergy & Utilities
Arxiv· Today

How Do Tool-Augmented LLM Agents Perform on Real-World Energy Analytics Tasks?

arXiv:2606.26346v1 Announce Type: new Abstract: Agentic benchmarks have emerged across general-purpose and domain-specific settings, including finance, coding, law, and drug discovery, yet energy-domain evaluations remain largely limited to static knowledge recall. This is a critical gap for a sector that requires live data retrieval, specialized regulatory and market knowledge, and multi-step quantitative reasoning under real-world constraints. We present an empirical study of tool-augmented LLM agents on real-world energy market analytics tasks. Our evaluation environment includes 243 expert-curated problems across three categories: (1) Market Data Retrieval and Analysis, (2) Knowledge Retrieval and Interpretation, and (3) Advanced Quantitative Modeling and Decision Analytics. Tasks include price and demand analysis, tariff impact modeling, asset revenue and returns estimation, hedging strategy analysis, and optimization modeling, with problems spanning multiple difficulty levels. Agents are equipped with a configurable suite of domain tools, including live electricity market APIs for major U.S. ISOs, regulatory docket search, utility tariff databases, asset optimization models, and retrieval-augmented generation over energy market documents. We assess agent responses using a multi-dimensional evaluation protocol that scores approach correctness, answer accuracy, attribute alignment, and source validity, with category-aware routing to match scoring criteria to question type. We evaluate both closed-source and open-source LLMs, providing a comparative analysis of how model capability and domain tooling interact in a high-stakes professional domain. Key artifacts are publicly released to support reproducibility and future research.

Editor's pickHealthcare
Arxiv· Today

Knowledge-augmented Agentic AI for Mental Health Medication Information Seeking

arXiv:2606.26205v1 Announce Type: new Abstract: Patients increasingly seek medication information online, yet safety knowledge for psychiatric drugs is split between regulatory adverse-event records, which are authoritative but abstract, and patient narratives, which are experience-near but unvalidated. Integrating them without conflating evidence and anecdote is especially consequential in psychiatry, where poorly contextualised information can amplify fear, nocebo responses, and non-adherence. Here we develop a provenance-aware, knowledge-graph-based multi-agent framework unifying 466,525 Reddit posts, 60,782 WebMD reviews, and twenty years of U.S. FDA Adverse Event Reporting System records for nine antidepressants. A large-language-model entity-recognition pipeline benchmarked against physician annotations reached highest F1 scores of 0.969 for medications and 0.973 for conditions. The two community platforms were far more concordant with each other (overlap up to a Jaccard similarity of 0.905) than with regulatory reports, indicating that patient-generated data form a partly independent safety signal. For sertraline, many adverse events appeared in community sources hundreds of days before the corresponding FDA date. A Neo4j knowledge graph grounded in ATC-N, ICD-10, and MedDRA vocabularies preserves provenance, keeping every claim traceable and regulatory facts distinct from patient experience. These results establish source-aware integration as a route to more auditable psychiatric medication information, with usefulness and patient benefit to be tested prospectively.

Editor's pick
Chief Investment Officer· Yesterday

Will AI’s Next Productivity Revolution Begin in the Fields? | Chief Investment Officer

A First Eagle Investments fund manager explores how artificial intelligence tools will benefit businesses, like farming, using it to solve practical operating problems.

Editor's pickFinancial Services
Arxiv· Today

AlgoEvolve: LLM-driven Meta-evolution of Algorithmic Trading Programs

arXiv:2606.26173v1 Announce Type: new Abstract: Recent work shows that Large Language Models (LLMs) can act as semantic mutation operators for the evolutionary discovery of programs and proofs. Most current applications focus on static coding benchmarks. We extend this paradigm to algorithmic trading. This domain is uniquely challenging because it is noisy, non-stationary, and highly discontinuous. We present AlgoEvolve, an LLM-driven evolutionary framework that generates, evaluates, and iteratively improves executable trading strategies. These strategies are expressed as Python code and evaluated through a rigorous testing protocol. Across multiple experiments, the system exhibits emergent regime-adaptive strategy logic, including autonomous shifts in trading rules. We further introduce a meta-evolutionary outer loop that evolves the prompts guiding program synthesis in the inner loop. This outer loop discovers improved search heuristics. These heuristics balance exploration and exploitation while reducing zero-trade failures. They consistently outperform initial human-designed instructions. The results demonstrate that LLM-based semantic evolution provides a viable approach for continual program synthesis in complex environments.

AI Measurement & Evaluation2 articles
Editor's pick
Arxiv· Today

Life After Benchmark Saturation: A Case Study of CORE-Bench

arXiv:2606.26158v1 Announce Type: new Abstract: When a benchmark's accuracy saturates, it is often retired and replaced with a more challenging version. We show that this approach privileges accuracy and misses the opportunity to study six other key dimensions of agent performance: construct validity issues such as shortcuts, out-of-distribution generalizability, efficiency, reliability, the relative importance of the model versus the scaffold, and uplift from human-agent collaboration. We use CORE-Bench Hard, a benchmark for computational reproducibility of scientific code, as a case study to demonstrate that measuring agents along these dimensions yields meaningful insights into agent performance even after accuracy saturates. First, we surface threats to construct validity in CORE-Bench Hard that are difficult to anticipate with less capable agents. We introduce an improved benchmark, CORE-Bench v1.1, and an out-of-distribution task suite, CORE-Bench OOD. Second, we find that despite accuracy saturation, CORE-Bench v1.1 remains useful for measuring efficiency, reliability, model performance, and scaffold performance. Finally, we conduct a small-scale randomized experiment to measure uplift from human-agent collaboration on real-world computational reproducibility tasks. We find a statistically significant speedup by about a factor of two -- likely underestimated due to one-fifth of human-only reproductions reaching the time limit before completing -- and describe various other findings. Together, our contributions present a more rigorous alternative to the dominant accuracy-centric evaluation paradigm.

AI ROI & Business Case4 articles
Editor's pickPAYWALLProfessional Services
FT· Yesterday

Law firms look for clear gains from AI

Despite increased spending on legal tech, barriers to its full embrace remain

Editor's pick
Arxiv· Today

Enterprise Data Asset Quality: A Management-Standard Conformity-Benefit Realization Framework and Formation Mechanisms

arXiv:2606.26186v1 Announce Type: new Abstract: Motivated by the limited standardization of enterprise data asset quality evaluation and the unclear relationship between assessment outcomes and value realization, this study develops a three-dimensional framework comprising Data Asset Management Capability, Data Quality Standard Conformity, and Data Asset Benefit Realization Capability, based on grounded theory and LDA topic modeling. To examine the formation mechanisms of data asset quality, this study adopts a multi-method approach combining PLS-SEM, Necessary Condition Analysis (NCA), and fuzzy-set Qualitative Comparative Analysis (fsQCA), to capture net effects, capability thresholds, and configurational paths. The results show that significant positive relationships exist among the three dimensions, with Data Asset Management Capability exerting the strongest effect on Data Quality Standard Conformity and further promoting Data Asset Benefit Realization Capability, forming a chain mechanism of management foundation-standard enhancement-value realization. In addition, all three dimensions constitute critical necessary conditions for achieving high data asset quality, and multiple equivalent configurational paths reflecting different combinations of Management, Standard, and Benefit are identified, such as governance-oriented and benefit-driven mechanisms. This study integrates structural (PLS-SEM), necessary-condition (NCA), and configurational (fsQCA) analyses within a unified framework, providing a systematic approach to understanding data asset quality formation and offering practical insights for enterprise data governance and data factor market development.

Geopolitics, Policy & Governance

27 articles
AI Geopolitics6 articles
AI Policy & Regulation18 articles
Editor's pickPAYWALLTechnology
Bloomberg· Yesterday

Trump Administration Asks OpenAI to Stagger AI Model Release

The Trump administration has asked OpenAI to stagger the release of an upcoming powerful artificial intelligence model, according to a person familiar with the matter, nearly two weeks after rival Anthropic PBC suspended its most capable offerings from the market under regulatory pressure.

Editor's pickPAYWALLGovernment & Public Sector
Bloomberg· Yesterday

US Proposes AI Partnership With EU to Strengthen Chip Supply Chains - Bloomberg

The US has proposed that the European Union sign on to an artificial intelligence partnership as part of an effort to create an alliance to secure supply chains for semiconductors as competition with China intensifies.

Editor's pick
Arxiv· Today

The Governance Inversion Hypothesis: Why More AI Regulation May Produce Less Organisational Control

arXiv:2606.26117v1 Announce Type: new Abstract: This paper introduces the Governance Inversion Hypothesis (GIH) to explain a growing paradox in artificial intelligence (AI) governance: under conditions of increasing regulatory expansion and technological complexity, organisations may become more formally governed while simultaneously experiencing a decline in operational control over AI systems. Existing AI governance frameworks generally assume that stronger regulation improves accountability, oversight, and organisational control. This paper challenges that assumption by arguing that governance formalisation itself may contribute to the erosion of control in AI-intensive environments. Drawing on institutional theory, organisational governance research, accountability scholarship, and emerging AI governance literature, the paper develops a conceptual framework explaining how regulatory expansion may weaken operational authority through four interconnected mechanisms: authority fragmentation, symbolic governance expansion, externalisation of control, and authority paralysis. As governance systems become increasingly layered and procedurally dense, organisations may struggle to maintain coherent authority, technical visibility, escalation capability, and meaningful intervention power over opaque and externally mediated AI infrastructures. The paper extends institutional decoupling theory by introducing governance inversion as a structural condition in which governance expansion may actively undermine operational coherence rather than strengthen it. It concludes that the central risk in AI governance may not be the absence of governance structures, but the emergence of institutions that appear increasingly governed while progressively losing the capacity to govern effectively.

Editor's pickGovernment & Public Sector
Arxiv· Today

Governing Actions, Not Agents: Institutional Attestation as a Governance Model for Autonomous AI Systems

arXiv:2606.26298v1 Announce Type: new Abstract: Autonomous AI agents may begin to perform consequential, irreversible actions such as clinical prescribing and production software deployment. This paper observes that human institutions have governed powerful autonomous actors not by monitoring their reasoning but by requiring independently attested evidence at the point of consequential action. We formalise this institutional pattern as a computational governance model for AI agent systems. Under the proposed model, an agent retains full autonomy over planning and reasoning but holds no execution authority over designated high-risk actions. Execution is conditional on preconditions that are each independently attested by a separate authoritative source, cryptographically bound to a declared intent, and evaluated by a deterministic policy. Decisions are recorded in a tamper-evident log amenable to independent re-verification. We present a proof-of-concept implementation and illustrate the model with examples from software deployment and clinical prescribing.

Editor's pickMedia & Entertainment
Arxiv· Today

Generative AI and Copyright Infringement: A Legal-Technical Analysis of AI Music Generation Systems Under 17 U.S.C. Title 17

arXiv:2606.26111v1 Announce Type: new Abstract: Generative artificial intelligence (GenAI) has enabled users to synthesize music with text prompts, combining copyrighted lyrics, AI-composed melodies, and synthetic vocals that imitate real artists. This paper examines the legal and technical dimensions of AI-based music creation (e.g., Google Gemini's music tools) under U.S. copyright law. We analyze whether a user who inputs one artist's protected lyrics into a GenAI system, directs it to use another artist's voice or style, publishes the resulting song, and monetizes it violates 17 U.S.C. Section 106's exclusive rights [3]. The analysis integrates Title 17 doctrine (rights of reproduction, derivative works, distribution), 17 U.S.C. Section 114's narrow sound recording protection [4], and the new voice-cloning laws emerging at the state level [20]. We argue that unauthorized lyric copying poses a high risk of infringement of the musical composition, whereas mere AI-generated voice imitation typically falls outside federal sound recording protection and instead implicates state publicity rights [12], [13]. Recent cases and legislation (Concord v. Anthropic [10]; Kadrey v. Meta [11]; Lehrman v. Lovo [12]; Tennessee's "ELVIS Act" [20]; UMG v. Uncharted Labs [14]; etc.) illustrate this split. We map AI technical components (prompt encoding, latent diffusion, neural vocoders, speaker embeddings) to legal risks and identify a regulatory gap: federal law robustly protects lyrics and melody but currently provides limited remedies for synthesized vocal likeness [22], [23]. The paper concludes with policy suggestions for clearer rules on AI music creation.

Editor's pickPAYWALLGovernment & Public Sector
Washington Post· Yesterday

AI interests win, and lose, in one New York district - The Washington Post

Democratic Assembly member Micah Lasher defeated fellow assemblyman Alex Bores in a race that became a proxy for how AI should be regulated.

Editor's pickTechnology
Top Daily Headlines: Infosys boss says vibe coding is no threat because there’s more to writing software than writing software· Today

European Commission lines up Amazon and Microsoft for cloud gatekeeper status

The European Commission is considering designating Amazon and Microsoft as cloud gatekeepers under the Digital Markets Act.

Editor's pickGovernment & Public Sector
Arxiv· Today

Benchmarking Open-Weight Foundation Models for Global AI Technical Governance

arXiv:2606.26099v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed in artificial intelligence (AI) governance analysis across national and international organisations. There is, however, growing evidence that such models produce significantly less accurate responses for countries that are underrepresented in their training data-a pattern described in existing literature as geographic bias. Existing studies examining this phenomenon are subject to three methodological limitations that together undermine their findings: (1) reliance on proprietary systems whose weights are not publicly released, which prevents independent replication; (2) evaluation of model knowledge about years that fall after data collection for model training had concluded, leading to geographic ignorance in addition to the natural limits of each model's knowledge; and (3) use of coarse binary response classification that cannot distinguish models' confident fabrication (HF) from their honest acknowledgement of uncertainty. This study addresses all three limitations by benchmarking four open-weight frontier language models against the Global AI Dataset v2 (GAID v2), a verified ground-truth database of 24,453 indicators across 227 countries published on Harvard Dataverse in January 2026. A total of 18 indicators, mapped to the eight thematic dimensions of the IEEE IRAI 2026 framework, are selected from GAID v2, yielding approximately 2,990 country-metric-year observations across six evaluation years (within the period of 2010-2023). Model responses are classified using a five-category scheme that distinguishes (a) verified accuracy (VA), (b) HF, (c) honest refusal (HR), (d) qualitative hedging (QH), and (e) misattribution (MF). Geographic disparities in accuracy are estimated through mixed-effects logistic regression and difference-in-differences (DiD) analysis.

Editor's pickPAYWALLGovernment & Public Sector
Washington Post· Yesterday

AI & Tech Brief: Bores loses the big AI primary - The Washington Post

Bores was particularly threatening to the AI industry due to his ability to synthesize ideas from the worlds of tech and politics.

Editor's pickTechnology
Arxiv· Today

Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols

arXiv:2606.26203v1 Announce Type: new Abstract: As AI agent protocols proliferate, the governance structures shaping their interoperability standards remain empirically underexamined. We introduce an LLM-powered comparative pipeline for large-scale governance discourse analysis, integrating automated annotation, neural topic modeling, and multi-layer network analysis to study socio-technical power structures at scale. We validate it on two contrasting standards for agent interoperability: ERC-8004 (permissionless, on-chain) and Google A2A (corporate-led). Analyzing 4,323 governance participation records, we combine LLM-assisted coding, topic modeling, and multi-layer network analysis to examine how institutional design shapes thematic priorities and community structure. We find that while governance form influences substantive focus, both regimes exhibit comparable levels of participation inequality and community fragmentation. Discourse alignment is denser in the permissionless setting, suggesting that open governance may foster greater thematic convergence despite decentralized participation. These findings illustrate how LLM-assisted methods can advance the empirical study of technology governance, with implications for designing more equitable agentic AI standards. All data and code are openly available.

Editor's pickTechnology
The Verge· Yesterday

OpenAI will delay GPT-5.6 after Trump administration request | The Verge

The government will approve customer access on a case-by-case basis.

Editor's pickTechnology
Crypto Briefing· Yesterday

Trump administration asks OpenAI to stagger release of GPT 5.6 over cybersecurity concerns

The Trump administration asked OpenAI to delay GPT 5.6's release under a new voluntary 30-day cybersecurity review framework for advanced AI models.

Editor's pickGovernment & Public Sector
Artificial Intelligence Newsletter | June 26, 2026· Yesterday

US House committee passes bipartisan AI legislation

The US House Science, Space, and Technology Committee passed a bipartisan package of AI legislation aimed at expanding research access, strengthening the workforce, and promoting innovation.

Editor's pickGovernment & Public Sector
House of Commons Library· Yesterday

AI regulation in the UK - House of Commons Library

This briefing provides an introduction to artificial intelligence (AI) and how it is regulated in the UK.

Editor's pickGovernment & Public Sector
Artificial Intelligence Newsletter | June 25, 2026· 2 days ago

Bipartisanship on kids' online safety reignites as US Congress pursues reform

Both chambers of US Congress are looking to act on children's online safety with fresh bipartisan approaches announced this week.

Editor's pickGovernment & Public Sector
Foreign Policy· Yesterday

The Manufacturing-Era Toolkit Won't Work for the AI Revolution

Policymakers’ approach to automation won’t work for AI.

Editor's pickGovernment & Public Sector
WRAL.com· Yesterday

As AI demand rises, North Carolina considers new electricity rules for data centers :: WRAL.com

State leaders are weighing new ... as AI fuels record power demand and concerns grow over who should pay for grid upgrades. ... A file photo of a data center. ... As artificial intelligence fuels unprecedented growth in electricity demand, North Carolina policymakers are exploring new rules that could determine who pays for the power plants, transmission lines and other infrastructure needed to support a new wave of data centers. The state's Energy Policy Council ...

Editor's pickConsumer & Retail
Reuters· Yesterday

Reuters AI News | Latest Headlines and Developments | Reuters

Explore the latest artificial intelligence news with Reuters - from AI breakthroughs and technology trends to regulation, ethics, business and global impact.

Best Practice AI© 2026 Best Practice AI Ltd. All rights reserved.

Get the full executive brief

Receive curated insights with practical implications for strategy, operations, and governance.

AI Daily Brief — leaders actually read it.

Free email — not hiring or booking. Optional BPAI updates for company news. Unsubscribe anytime.

Include

No spam. Unsubscribe anytime. Privacy policy.