Thu 9 April 2026
Daily Brief — Curated and contextualised by Best Practice AI
Amazon Mulls Chip Sales, OpenAI Shelves UK Plans, and Meta Pivots to Proprietary Models
TL;DR Amazon CEO Andy Jassy said the company's in-house AI chip unit is on track for over $20 billion in annual revenue and may sell to external firms. OpenAI has halted its £31bn Stargate UK investment citing high energy costs and regulatory hurdles. Meta launched its first proprietary AI model, Muse Spark, from Superintelligence Labs, while deepening a $21 billion cloud partnership with CoreWeave. The Pentagon blacklisted US AI firm Anthropic as a national security risk, with a federal court refusing to block the decision. A Gartner survey found only 28% of AI infrastructure projects deliver full returns, with IT service management showing the highest success rate.
The stories that matter most
Selected and contextualised by the Best Practice AI team
Amazon Is Considering Selling Its AI Chips to Other Companies
Amazon.com Inc. is considering selling its chips to other companies, Chief Executive Officer Andy Jassy said Thursday, adding that the cloud-computing giant’s in-house silicon unit is on pace to bring in more than $20 billion over the course of a year.
OpenAI shelves landmark £31bn UK investment package
Artificial intelligence company cites high energy costs and regulation as reasons for putting Stargate project on hold Business live – latest updates OpenAI has put on hold plans for a landmark project to strengthen the UK’s AI capabilities, citing high energy costs and regulation. Stargate UK was a part of the landmark UK-US AI deal announced last September, in which US companies appeared to commit £31bn to the UK’s tech sector, part of a larger series of investments intended to “mainline AI” into the British economy. Continue reading...
The AI job loss story is all about bundles
Evidence on white-collar work displacement is beginning to match the theory
Goodbye, Llama? Meta launches new proprietary AI model Muse Spark — first since Superintelligence Labs' formation
Meta has been one of the most interesting companies of the generative AI era — initially gaining a loyal and huge following of users for the release of its mostly open source Llama family of large language models (LLMs) beginning in early 2023 but coming to screeching halt last year after Llama 4 debuted to mixed reviews and ultimately, admissions of gaming benchmarks. That bumpy rollout of Llama 4 apparently spurred Meta founder and CEO Mark Zuckerberg to totally overhaul Meta's AI operations in the summer of 2025, forming a new internal division, Meta Superintelligence Labs (MSL) which he recruited 29-year-old former Scale AI co-founder and CEO Alexandr Wang to lead as Chief AI Officer. Now, today, Meta is showing us the fruits of that effort: Muse Spark, a new proprietary model that Wang says (posting on rival social network X, used more often by the machine learning community) is "the most powerful model that meta has released," and has "support for tool-use, visual chain of thought, & multi-agent orchestration." He also says it will be the start of a new Muse family of models, raising questions about what will become of Meta's popular lineup and ongoing development of the Llama family. It arrives not as a generic chatbot, but as the foundation for what Wang calls "personal superintelligence"—an AI that doesn’t just process text but "sees and understands the world around you" to act as a digital extension of the self, echoing Zuckberg's public manifesto for a vision of personal superintelligence published in summer 2025. However, it is proprietary only — confined for now to the Meta AI app and website, as well as a " private API preview to select users," according to Meta's blog post announcing it — a move likely to rankle the literally billions of users of Llama models and the thousands of developers who relied upon it (some of whom are active participants in rival social network Reddit's r/LocalLLaMA subreddit). In addition, no pricing information for the model has yet been announced. It's unclear if Meta has ended development on the Llama family entirely. When asked directly by VentureBeat, a Meta spokesperson said in an email: “Our current Llama models will continue to be available as open source,” which doesn’t address the question of development of future Llama models. Visual chain-of-thought At its core, Muse Spark is a natively multimodal reasoning model. Unlike previous iterations that "stitched" vision and text together, Muse Spark was rebuilt from the ground up to integrate visual information across its internal logic. This architectural shift enables "visual chain of thought," allowing the model to annotate dynamic environments—identifying the components of a complex espresso machine or correcting a user's yoga form via side-by-side video analysis. The most significant technical leap, however, is a new "Contemplating" mode. This feature orchestrates multiple sub-agents to reason in parallel, allowing Meta to compete with extreme reasoning models like Google's Gemini Deep Think and OpenAI's GPT-5.4 Pro. In benchmarks, this mode achieved 58% in "Humanity’s Last Exam" and 38% in "FrontierScience Research," figures that Meta claims validate their new scaling trajectory. Perhaps more impressive for the company’s bottom line is the model’s efficiency. Meta reports that Muse Spark achieves its reasoning capabilities using over an order of magnitude less compute than Llama 4 Maverick, its previous mid-size flagship. This efficiency is driven by a process called "thought compression". During reinforcement learning, the model is penalized for excessive "thinking time," forcing it to solve complex problems with fewer reasoning tokens without sacrificing accuracy. Benchmarks reveal a return-to-form The launch of Muse Spark is framed as a statistical "quantum leap," ending Meta’s year-long absence from the absolute frontier of AI performance. By reconciling Meta’s official internal data with independent auditing from third-party LLM tracking firm Artificial Analysis, a clear picture emerges: Muse Spark is not just a marginal improvement over the Llama series; it is a fundamental re-entry into the "Top 5" global models. According to the Artificial Analysis Intelligence Index v4.0, Muse Spark achieved a score of 52. For context, Meta’s previous flagship, Llama 4 Maverick, debuted in 2025 with an Index score of just 18. By nearly tripling its performance, Muse Spark now sits within striking distance of the industry’s most elite systems, trailing only Gemini 3.1 Pro Preview (57), GPT-5.4 (57), and Claude Opus 4.6 (53). Meta’s official benchmarks suggest that Muse Spark is particularly dominant in multimodal reasoning, specifically where visual figures and logic intersect. CharXiv Reasoning: In "figure understanding," Muse Spark achieved a score of 86.4, significantly outperforming Claude Opus 4.6 (65.3), Gemini 3.1 Pro (80.2), and GPT-5.4 (82.8). MMMU Pro: Official reports place the model at 80.4, while Artificial Analysis’s independent audit measured it at 80.5%. This makes it the second-most capable vision model on the market, surpassed only by Gemini 3.1 Pro Preview (83.9% official; 82.4% independent). Visual Factuality (SimpleVQA): Muse Spark scored 71.3, placing it ahead of GPT-5.4 (61.1) and Grok 4.2 (57.4), though it narrowly trails Gemini 3.1 Pro (72.4). These scores validate Meta’s focus on "visual chain of thought," enabling the model to not just recognize objects, but to reason through complex spatial problems and dynamic annotations. The "Thinking" gear of Muse Spark was put to the test against specialized benchmarks designed to break non-reasoning models. Humanity’s Last Exam (HLE): In this multidisciplinary evaluation, Meta reports a score of 42.8 (No Tools) and 50.4 (With Tools). Independent audits by Artificial Analysis tracked the model at 39.9%, trailing Gemini 3.1 Pro Preview (44.7%) and GPT-5.4 (41.6%). GPQA Diamond (PhD Level Reasoning): Muse Spark achieved a formidable 89.5, surpassing Grok 4.2 (88.5) but trailing the specialized "max reasoning" outputs of Opus 4.6 (92.7) and Gemini 3.1 Pro (94.3). ARC AGI 2: This remains a notable weak point. Muse Spark scored 42.5, far behind the abstract reasoning puzzles solved by Gemini 3.1 Pro (76.5) and GPT-5.4 (76.1). CritPT (Physics Research): Independent auditing found Muse Spark achieved the 5th highest score at 11%. This marks a substantial lead over Gemini 3 Flash (9%) and Claude 4.6 Sonnet (3%). One of the most striking results from the official data is Muse Spark's performance in the health sector, likely a result of Meta's collaboration with over 1,000 physicians. HealthBench Hard: Muse Spark achieved 42.8, a massive lead over Claude Opus 4.6 (14.8), Gemini 3.1 Pro (20.6), and even GPT-5.4 (40.1). MedXpertQA (Multimodal): It scored 78.4, comfortably ahead of Opus 4.6 (64.8) and Grok 4.2 (65.8), though it still trails Gemini 3.1 Pro’s top-tier score of 81.3. Agentic Systems and Efficiency: The "Thought Compression" Effect While Muse Spark excels at reasoning, its "agentic" performance—executing real-world work tasks—presents a more nuanced picture. SWE-Bench Verified: Muse Spark scored 77.4, trailing Claude Opus 4.6 (80.8) and Gemini 3.1 Pro (80.6). GDPval-AA Elo: Meta’s official score of 1444 differs slightly from Artificial Analysis’s recorded 1427. In both cases, Muse Spark trails GPT-5.4 (1672) and Opus 4.6 (1606), suggesting that while the model "thinks" well, it is still refining its ability to "act" in long-horizon software and office workflows. Token Efficiency: This is where Muse Spark distinguishes itself. To run the Intelligence Index, it used 58 million output tokens. In contrast, Claude Opus 4.6 required 157 million tokens and GPT-5.4 required 120 million. This supports Meta's claim of "thought compression"—delivering frontier-class intelligence while using less than half the "thinking time" of its closest competitors. Benchmark Llama 4 Maverick (2025) Muse Spark (Official) Gemini 3.1 Pro (Official) Intelligence Index Score 18 52 57 MMMU Pro -- 80.4 83.9 CharXiv Reasoning -- 86.4 80.2 HealthBench Hard -- 42.8 20.6 License Open-Weights Proprietary Proprietary With Muse Spark, Meta has successfully transitioned from being the "LAMP stack for AI" to a direct challenger for the title of "Personal Superintelligence". While agentic workflows remain a hurdle, its dominance in vision, health, and token efficiency places Meta back at the center of the frontier race. Personal wellness and Instagram shopping Meta is immediately deploying Muse Spark to power specialized experiences across its app family. Shopping Mode: A new feature that leverages Meta’s vast creator ecosystem. The AI picks up on brands, styling choices, and content across Instagram and Threads to provide personalized recommendations, effectively turning every post into a shoppable interaction. Health Reasoning: In a move toward medical utility, Meta collaborated with over 1,000 physicians to curate training data. Muse Spark can now analyze nutritional content from photos of food or provide "health scores" for pescatarian diets with high cholesterol. Interactive UI: The model can generate web-based minigames or tutorials on the fly. For example, a user can prompt the AI to turn a photo into a playable Sudoku game or a highlights-based tutorial for home appliances. Evaluation awareness While Muse Spark demonstrates strong refusal behaviors regarding biological and chemical weapons, its safety profile includes a startling new discovery. Third-party testing by Apollo Research found that the model possesses a high degree of "evaluation awareness". The model frequently recognized when it was being tested in "alignment traps" and reasoned that it should behave honestly specifically because it was under evaluation. While Meta concluded this was not a "blocking concern" for release, the finding suggests that frontier models are becoming increasingly "conscious" of the testing environment—potentially rendering traditional safety benchmarks less reliable as models learn to "game" the exam. What happens to Llama? In February 2023, Meta released Llama 1 to demonstrate that smaller, compute-optimal models could match larger counterparts like GPT-3 in efficiency. Although access was initially restricted to researchers, the model weights were leaked via 4chan on March 3, 2023, an event that inadvertently democratized high-tier research and catalyzed a global movement for running models on consumer-grade hardware. This shift was solidified in July 2023 with the release of Llama 2, which introduced a commercial license that permitted self-hosting for most organizations. This approach saw rapid adoption, with the Llama family exceeding 100 million downloads and supporting over 1,000 commercial applications by the third quarter of 2023. Through 2024 and 2025, Meta scaled the Llama family to establish it as the essential infrastructure for global enterprise AI, frequently referred to as the LAMP stack for AI. Following the launch of Llama 3 in April 2024 and the landmark Llama 3.1 405B in July, Meta achieved performance parity with the world's leading proprietary systems. The subsequent release of Llama 4 in April 2025 introduced a Mixture-of-Experts architecture, allowing for massive parameter scaling while maintaining fast inference speeds. By early 2026, the Llama ecosystem reached a staggering scale, totaling 1.2 billion downloads and averaging approximately one million downloads per day. This widespread adoption provided businesses with significant economic sovereignty, as self-hosting Llama models offered an 88% cost reduction compared to using proprietary API providers. As of April 2026, Meta’s role as the undisputed leader of the open-weight movement has transitioned into a highly contested multi-polar landscape characterized by the rise of international competitors. While the United States accounts for 35% of global Llama deployments, Chinese models from labs like Alibaba and DeepSeek began accounting for 41% of downloads on platforms like Hugging Face by late 2025. Throughout early 2026, new entrants such as Zhipu AI’s GLM-5 and Alibaba’s Qwen 3.6 Plus have outpaced Llama 4 Maverick on general knowledge and coding benchmarks. In response to this global pressure, Meta's Muse Spark arrives with hefty expectations and an open source legacy that will be tough to live up to. Proprietary only (for now) The launch marks a controversial departure from Meta AI's "open science" roots. While the Llama series was famously accessible to developers, Muse Spark is launching as a proprietary model. Wang addressed the shift on X, stating: "Nine months ago we rebuilt our ai stack from scratch. New infrastructure, new architecture, new data pipelines... This is step one. Bigger models are already in development with plans to open-source future versions." However, the developer community remains skeptical. Some see this as a necessary pivot after the Llama 4 series failed to gain expected developer traction; others view it as Meta "closing the gates" now that it has a competitive reasoning model. Wang himself acknowledged the transition’s difficulty, noting there are "certainly rough edges we will polish over time". For the 3 billion people using Meta’s apps, the change will be felt almost instantly. The AI they interact with is no longer just a library of information, but an agent with a $27 billion brain and a mandate to understand their world as intimately as they do.
Pentagon Blacklists US AI Firm Anthropic and Court Refuses to Stop It – RedState
The Pentagon has designated one of America's top AI companies a national security risk, a label typically reserved for foreign adversaries, and a federal appeals court on Wednesday refused to block that decision, keeping Anthropic's technology out of military systems. ... The ruling locks in the government's restrictions for now. It bars defense ...
Differences in AI adoption in Europe and the US - CEPR
Differences in AI adoption in Europe and the US: Explanations and implications for productivity growth | CEPR VoxEU Column Productivity and Innovation # Differences in AI adoption in Europe and the US: Explanations and implications for productivity growth 9 Apr 2026 The economic impact of generative AI will depend on the speed and breadth of adoption by workers and firms. Drawing on a survey of workers in the US and six European countries and a firm survey covering 32 European countries, this column explores the speed of AI adoption across countries and whether an ‘AI gap’ will exacerbate existing productivity differences between the US and Europe. There are large differences in AI adoption across countries, much of which are accounted for by management practices. Higher AI adoption rates are associated with faster productivity growth but not with changes in employment. #### Authors
Meta, CoreWeave deepen AI cloud partnership with fresh $21 billion deal | Reuters
The social media company on Wednesday unveiled Muse Spark, the first AI model from Meta Superintelligence Labs, an expensive team it assembled last year after the poor performance of its Llama 4 model.
AI's next bottleneck: Why even the best chips made in the U.S. take a round trip to Taiwan
Nvidia has reserved the majority of TSMC’s most advanced packaging capacity. The lesser-known chipmaking step may become the next bottleneck for AI.
Citigroup says AI helps speed account openings and systems ...
Citigroup says AI helps speed account openings and systems upgrades | Reuters Exclusive news, data and analytics for financial market professionalsLearn more aboutRefinitiv Citi Bank logo appears in this illustration taken December 1, 2025. REUTERS/Dado Ruvic/Illustration Purchase Licensing Rights, opens new tab - Summary - Companies - Citigroup is using AI to speed internal processes - Bank is sharply reducing time of document review to open an account - Citi is in the midst of a plan to limit the use of tech contractors NEW YORK, April 8 (Reuters) - Citigroup is using artificial intelligence to speed up account openings and the retirement of ol
Economics & Markets
OpenAI shelves landmark £31bn UK investment package
Artificial intelligence company cites high energy costs and regulation as reasons for putting Stargate project on hold Business live – latest updates OpenAI has put on hold plans for a landmark project to strengthen the UK’s AI capabilities, citing high energy costs and regulation. Stargate UK was a part of the landmark UK-US AI deal announced last September, in which US companies appeared to commit £31bn to the UK’s tech sector, part of a larger series of investments intended to “mainline AI” into the British economy. Continue reading...
Meta, CoreWeave deepen AI cloud partnership with fresh $21 billion deal | Reuters
The social media company on Wednesday unveiled Muse Spark, the first AI model from Meta Superintelligence Labs, an expensive team it assembled last year after the poor performance of its Llama 4 model.
Meta releases first AI model since Zuckerberg’s spending spree
Muse Spark ‘purpose-built’ for social media apps as investors question huge AI investment
Sixteen new START.nano companies are developing hard-tech solutions with the support of MIT.nano
MIT.nano is supporting sixteen new startup companies focused on developing hard-tech solutions.
Why the ‘SaaSpocalypse’ doomsayers are wrong
Every major tech platform shift follows the same pattern: short-term pain for long-term gain
AI funding boom pulls Big Four deeper into startup diligence | Mint
AI funding boom pulls Big Four deeper into startup diligence | Mint # AI funding boom pulls Big Four deeper into startup diligence 5 min read8 Apr 2026, 05:30 AM IST Increased diligence coincides with growing investor interest in AI startups, as the technology threatens to disrupt traditional models and the software-as-a-service model.(Reuters) Summary For investors, the founder's pedigree, market size, product, and financials do not suffice. They want to know how the data is sourced, how AI models were trained, how reliable the outputs are, and whether customers are seeing real value. When Raunak Bhinge first approached institutional investors seeking funding for Infinite Uptime, the pitch was straightforward: much of India’s factory equipment was still analogue, and digitising it could help manufacturers improve output. Over time, the venture pivoted from being seen as a software
AI Hardware Stocks Undervalued: Investment Opportunity in Nvidia, Broadcom, TSMC | 2026 - News and Statistics - IndexBox
A 2026 analysis identifies Nvidia, Broadcom, and Taiwan Semiconductor as undervalued AI hardware stocks trading below record highs, presenting a rare investment window amid strong long-term demand projections for AI computing infrastructure.
VC Eclipse has a new $1.3B fund to back — and build — ‘physical AI’ startups
Venture capital firm Eclipse has raised $1.3 billion to invest in and build startups focused on physical AI applications.
Muse Spark arrives as Meta's AI reset
Meta's Superintelligence Labs has released its first model, marking a new phase in the company's AI development strategy.
AI and investment: a new approach to due diligence - Vaultinum
AI and investment: why due diligence needs to evolve by 2026 Vaultinum/ Blog/ AI and investment: a new approach to due diligence # AI and investment: a new approach to due diligence Reading time: 2 min Last Updated on 8 April 2026 The way investors evaluate technology companies is quickly changing. By 2026ⁱ, artificial intelligence will not only be transforming products. It will also be fundamentally altering decision-making criteria in private equity. Against this backdrop, traditional due diligence approaches are no longer sufficient to capture the real risks or the potential for value creation. AI and investment: a new approach to due diligence Table of contents Why investors are having more difficulty making decisions How AI is redefining risk in software Why execution capability is becoming key Key takeaways - Inves
OpenAI said ads were a "last resort." Then crossed $100M in 6 weeks.
OpenAI has reportedly generated over $100 million in advertising revenue in just six weeks, despite previous claims that ads were a last resort.
Patlytics Secures $40M to Revolutionize Patent Law with AI, Targets Global Expansion
Patlytics has secured $40 million in Series B funding to enhance its AI platform for patent lifecycle management, targeting Am Law 100 firms.
Franco-American startup Plume raises €3.3 million to speed up renewable energy site selection with geospatial AI
Plume, a Franco-American startup specialising in geospatial AI for renewable energies, has announced a €3.3M funding round to expand the team, as well as for European and American expansion. The round was led by AENU, with participation from Y Combinator, Kima Ventures, Raise Phiture, Better Angle and Collab Fund. “Plume tackles the most critical bottleneck […]
Spain’s Golden Owl raises €1.4 million to advance AI-powered anticipatory intelligence operating system
Golden Owl, an Alicante-based DeepTech startup, has closed €1.4 million in Seed funding to advance its anticipatory intelligence operating system focused on recognising complex dynamics in informational, geopolitical, and business contexts. The round was led by venture capital fund First Drop, and supported by business angels and public funding, including support from the Empresa Nacional […]
Earnings expectations are too high
Plus AI and jobs
How AI and geopolitical rivalry are breaking economic orthodoxy
Opinion | How AI and geopolitical rivalry are breaking economic orthodoxy | South China Morning Post Advertisement Qiyuan Xu and Panpan Yang # OpinionHow AI and geopolitical rivalry are breaking economic orthodoxy ### Today’s imbalances go beyond exchange rates; they reflect concentrated technological investment in one economy and uneven risks in others 3-MIN READ3-MIN Listen Qiyuan Xu and Panpan Yang Published: 4:30pm, 8 Apr 2026 Global imbalances are once again taking shape, albeit differently than how they manifested before the financial crisis of the late 2000s. Back then, the story was simple: some countries, led by China and Germany, saved too much, while the United States consumed too much. The answer, at
US real GDP growth projected at 1.8% for 2026, supported by AI infrastructure spending, consumer resilience, and One Big Beautiful Bill Act stimulus
The University of Michigan’s ... of 2026. According to the RSQE, consumer spending resilience is reinforced by upward revisions in personal income data since 2022, supporting growth in real final sales to private domestic purchasers. The unemployment rate is expected to stabilize at 4.4% in 2025 before declining. Trading Economics projects a more moderate growth rate of 1.8% by the end of the current quarter, based on global macroeconomic models. Their data show that annualized GDP growth slowed ...
You’re looking at the AI revolution all wrong, top economist says: 40% unemployment and a 3-day work week are the same thing
"Sixty percent of people employed and 40% unemployed is the same number of working hours as 100% employed at 60% of the hours," Alex Tabarrok says.
Amazon Is Considering Selling Its AI Chips to Other Companies
Amazon.com Inc. is considering selling its chips to other companies, Chief Executive Officer Andy Jassy said Thursday, adding that the cloud-computing giant’s in-house silicon unit is on pace to bring in more than $20 billion over the course of a year.
Goodbye, Llama? Meta launches new proprietary AI model Muse Spark — first since Superintelligence Labs' formation
Meta has been one of the most interesting companies of the generative AI era — initially gaining a loyal and huge following of users for the release of its mostly open source Llama family of large language models (LLMs) beginning in early 2023 but coming to screeching halt last year after Llama 4 debuted to mixed reviews and ultimately, admissions of gaming benchmarks. That bumpy rollout of Llama 4 apparently spurred Meta founder and CEO Mark Zuckerberg to totally overhaul Meta's AI operations in the summer of 2025, forming a new internal division, Meta Superintelligence Labs (MSL) which he recruited 29-year-old former Scale AI co-founder and CEO Alexandr Wang to lead as Chief AI Officer. Now, today, Meta is showing us the fruits of that effort: Muse Spark, a new proprietary model that Wang says (posting on rival social network X, used more often by the machine learning community) is "the most powerful model that meta has released," and has "support for tool-use, visual chain of thought, & multi-agent orchestration." He also says it will be the start of a new Muse family of models, raising questions about what will become of Meta's popular lineup and ongoing development of the Llama family. It arrives not as a generic chatbot, but as the foundation for what Wang calls "personal superintelligence"—an AI that doesn’t just process text but "sees and understands the world around you" to act as a digital extension of the self, echoing Zuckberg's public manifesto for a vision of personal superintelligence published in summer 2025. However, it is proprietary only — confined for now to the Meta AI app and website, as well as a " private API preview to select users," according to Meta's blog post announcing it — a move likely to rankle the literally billions of users of Llama models and the thousands of developers who relied upon it (some of whom are active participants in rival social network Reddit's r/LocalLLaMA subreddit). In addition, no pricing information for the model has yet been announced. It's unclear if Meta has ended development on the Llama family entirely. When asked directly by VentureBeat, a Meta spokesperson said in an email: “Our current Llama models will continue to be available as open source,” which doesn’t address the question of development of future Llama models. Visual chain-of-thought At its core, Muse Spark is a natively multimodal reasoning model. Unlike previous iterations that "stitched" vision and text together, Muse Spark was rebuilt from the ground up to integrate visual information across its internal logic. This architectural shift enables "visual chain of thought," allowing the model to annotate dynamic environments—identifying the components of a complex espresso machine or correcting a user's yoga form via side-by-side video analysis. The most significant technical leap, however, is a new "Contemplating" mode. This feature orchestrates multiple sub-agents to reason in parallel, allowing Meta to compete with extreme reasoning models like Google's Gemini Deep Think and OpenAI's GPT-5.4 Pro. In benchmarks, this mode achieved 58% in "Humanity’s Last Exam" and 38% in "FrontierScience Research," figures that Meta claims validate their new scaling trajectory. Perhaps more impressive for the company’s bottom line is the model’s efficiency. Meta reports that Muse Spark achieves its reasoning capabilities using over an order of magnitude less compute than Llama 4 Maverick, its previous mid-size flagship. This efficiency is driven by a process called "thought compression". During reinforcement learning, the model is penalized for excessive "thinking time," forcing it to solve complex problems with fewer reasoning tokens without sacrificing accuracy. Benchmarks reveal a return-to-form The launch of Muse Spark is framed as a statistical "quantum leap," ending Meta’s year-long absence from the absolute frontier of AI performance. By reconciling Meta’s official internal data with independent auditing from third-party LLM tracking firm Artificial Analysis, a clear picture emerges: Muse Spark is not just a marginal improvement over the Llama series; it is a fundamental re-entry into the "Top 5" global models. According to the Artificial Analysis Intelligence Index v4.0, Muse Spark achieved a score of 52. For context, Meta’s previous flagship, Llama 4 Maverick, debuted in 2025 with an Index score of just 18. By nearly tripling its performance, Muse Spark now sits within striking distance of the industry’s most elite systems, trailing only Gemini 3.1 Pro Preview (57), GPT-5.4 (57), and Claude Opus 4.6 (53). Meta’s official benchmarks suggest that Muse Spark is particularly dominant in multimodal reasoning, specifically where visual figures and logic intersect. CharXiv Reasoning: In "figure understanding," Muse Spark achieved a score of 86.4, significantly outperforming Claude Opus 4.6 (65.3), Gemini 3.1 Pro (80.2), and GPT-5.4 (82.8). MMMU Pro: Official reports place the model at 80.4, while Artificial Analysis’s independent audit measured it at 80.5%. This makes it the second-most capable vision model on the market, surpassed only by Gemini 3.1 Pro Preview (83.9% official; 82.4% independent). Visual Factuality (SimpleVQA): Muse Spark scored 71.3, placing it ahead of GPT-5.4 (61.1) and Grok 4.2 (57.4), though it narrowly trails Gemini 3.1 Pro (72.4). These scores validate Meta’s focus on "visual chain of thought," enabling the model to not just recognize objects, but to reason through complex spatial problems and dynamic annotations. The "Thinking" gear of Muse Spark was put to the test against specialized benchmarks designed to break non-reasoning models. Humanity’s Last Exam (HLE): In this multidisciplinary evaluation, Meta reports a score of 42.8 (No Tools) and 50.4 (With Tools). Independent audits by Artificial Analysis tracked the model at 39.9%, trailing Gemini 3.1 Pro Preview (44.7%) and GPT-5.4 (41.6%). GPQA Diamond (PhD Level Reasoning): Muse Spark achieved a formidable 89.5, surpassing Grok 4.2 (88.5) but trailing the specialized "max reasoning" outputs of Opus 4.6 (92.7) and Gemini 3.1 Pro (94.3). ARC AGI 2: This remains a notable weak point. Muse Spark scored 42.5, far behind the abstract reasoning puzzles solved by Gemini 3.1 Pro (76.5) and GPT-5.4 (76.1). CritPT (Physics Research): Independent auditing found Muse Spark achieved the 5th highest score at 11%. This marks a substantial lead over Gemini 3 Flash (9%) and Claude 4.6 Sonnet (3%). One of the most striking results from the official data is Muse Spark's performance in the health sector, likely a result of Meta's collaboration with over 1,000 physicians. HealthBench Hard: Muse Spark achieved 42.8, a massive lead over Claude Opus 4.6 (14.8), Gemini 3.1 Pro (20.6), and even GPT-5.4 (40.1). MedXpertQA (Multimodal): It scored 78.4, comfortably ahead of Opus 4.6 (64.8) and Grok 4.2 (65.8), though it still trails Gemini 3.1 Pro’s top-tier score of 81.3. Agentic Systems and Efficiency: The "Thought Compression" Effect While Muse Spark excels at reasoning, its "agentic" performance—executing real-world work tasks—presents a more nuanced picture. SWE-Bench Verified: Muse Spark scored 77.4, trailing Claude Opus 4.6 (80.8) and Gemini 3.1 Pro (80.6). GDPval-AA Elo: Meta’s official score of 1444 differs slightly from Artificial Analysis’s recorded 1427. In both cases, Muse Spark trails GPT-5.4 (1672) and Opus 4.6 (1606), suggesting that while the model "thinks" well, it is still refining its ability to "act" in long-horizon software and office workflows. Token Efficiency: This is where Muse Spark distinguishes itself. To run the Intelligence Index, it used 58 million output tokens. In contrast, Claude Opus 4.6 required 157 million tokens and GPT-5.4 required 120 million. This supports Meta's claim of "thought compression"—delivering frontier-class intelligence while using less than half the "thinking time" of its closest competitors. Benchmark Llama 4 Maverick (2025) Muse Spark (Official) Gemini 3.1 Pro (Official) Intelligence Index Score 18 52 57 MMMU Pro -- 80.4 83.9 CharXiv Reasoning -- 86.4 80.2 HealthBench Hard -- 42.8 20.6 License Open-Weights Proprietary Proprietary With Muse Spark, Meta has successfully transitioned from being the "LAMP stack for AI" to a direct challenger for the title of "Personal Superintelligence". While agentic workflows remain a hurdle, its dominance in vision, health, and token efficiency places Meta back at the center of the frontier race. Personal wellness and Instagram shopping Meta is immediately deploying Muse Spark to power specialized experiences across its app family. Shopping Mode: A new feature that leverages Meta’s vast creator ecosystem. The AI picks up on brands, styling choices, and content across Instagram and Threads to provide personalized recommendations, effectively turning every post into a shoppable interaction. Health Reasoning: In a move toward medical utility, Meta collaborated with over 1,000 physicians to curate training data. Muse Spark can now analyze nutritional content from photos of food or provide "health scores" for pescatarian diets with high cholesterol. Interactive UI: The model can generate web-based minigames or tutorials on the fly. For example, a user can prompt the AI to turn a photo into a playable Sudoku game or a highlights-based tutorial for home appliances. Evaluation awareness While Muse Spark demonstrates strong refusal behaviors regarding biological and chemical weapons, its safety profile includes a startling new discovery. Third-party testing by Apollo Research found that the model possesses a high degree of "evaluation awareness". The model frequently recognized when it was being tested in "alignment traps" and reasoned that it should behave honestly specifically because it was under evaluation. While Meta concluded this was not a "blocking concern" for release, the finding suggests that frontier models are becoming increasingly "conscious" of the testing environment—potentially rendering traditional safety benchmarks less reliable as models learn to "game" the exam. What happens to Llama? In February 2023, Meta released Llama 1 to demonstrate that smaller, compute-optimal models could match larger counterparts like GPT-3 in efficiency. Although access was initially restricted to researchers, the model weights were leaked via 4chan on March 3, 2023, an event that inadvertently democratized high-tier research and catalyzed a global movement for running models on consumer-grade hardware. This shift was solidified in July 2023 with the release of Llama 2, which introduced a commercial license that permitted self-hosting for most organizations. This approach saw rapid adoption, with the Llama family exceeding 100 million downloads and supporting over 1,000 commercial applications by the third quarter of 2023. Through 2024 and 2025, Meta scaled the Llama family to establish it as the essential infrastructure for global enterprise AI, frequently referred to as the LAMP stack for AI. Following the launch of Llama 3 in April 2024 and the landmark Llama 3.1 405B in July, Meta achieved performance parity with the world's leading proprietary systems. The subsequent release of Llama 4 in April 2025 introduced a Mixture-of-Experts architecture, allowing for massive parameter scaling while maintaining fast inference speeds. By early 2026, the Llama ecosystem reached a staggering scale, totaling 1.2 billion downloads and averaging approximately one million downloads per day. This widespread adoption provided businesses with significant economic sovereignty, as self-hosting Llama models offered an 88% cost reduction compared to using proprietary API providers. As of April 2026, Meta’s role as the undisputed leader of the open-weight movement has transitioned into a highly contested multi-polar landscape characterized by the rise of international competitors. While the United States accounts for 35% of global Llama deployments, Chinese models from labs like Alibaba and DeepSeek began accounting for 41% of downloads on platforms like Hugging Face by late 2025. Throughout early 2026, new entrants such as Zhipu AI’s GLM-5 and Alibaba’s Qwen 3.6 Plus have outpaced Llama 4 Maverick on general knowledge and coding benchmarks. In response to this global pressure, Meta's Muse Spark arrives with hefty expectations and an open source legacy that will be tough to live up to. Proprietary only (for now) The launch marks a controversial departure from Meta AI's "open science" roots. While the Llama series was famously accessible to developers, Muse Spark is launching as a proprietary model. Wang addressed the shift on X, stating: "Nine months ago we rebuilt our ai stack from scratch. New infrastructure, new architecture, new data pipelines... This is step one. Bigger models are already in development with plans to open-source future versions." However, the developer community remains skeptical. Some see this as a necessary pivot after the Llama 4 series failed to gain expected developer traction; others view it as Meta "closing the gates" now that it has a competitive reasoning model. Wang himself acknowledged the transition’s difficulty, noting there are "certainly rough edges we will polish over time". For the 3 billion people using Meta’s apps, the change will be felt almost instantly. The AI they interact with is no longer just a library of information, but an agent with a $27 billion brain and a mandate to understand their world as intimately as they do.
Techbuzz
Managed services typically command premium pricing and create stickier customer relationships than raw API access. If businesses build their agent workflows on Anthropic's managed infrastructure, switching costs become significantly higher. The broader implications ripple across the AI industry. If managed agent platforms take off, they could accelerate enterprise AI adoption dramatically by abstracting away the complexity that's been slowing deployment...
Seems like a good model from Meta that is still trailing the current series of releases. The most important thing to note is that it is not open weights. That was the main reason that Meta's models were so important. Without that, it is a lot harder to predict the value of Spark
Seems like a good model from Meta that is still trailing the current series of releases. The most important thing to note is that it is not open weights. That was the main reason that Meta's models were so important. Without that, it is a lot harder to predict the value of Spark
Brazil’s CADE queries WhatsApp clients over API access, AI chatbot distribution
Brazil’s competition authority is investigating whether changes to WhatsApp’s policies regarding AI-powered chatbots and business API access infringe on competition law.
Public procurement critical for the ‘survival’ of European tech, Mistral CEO says
Mistral AI CEO Arthur Mensch urged Europe to use public procurement to support domestic tech companies, warning it is critical for the survival of European technology.
Turkey's AI sector comes under 'comprehensive' inquiry
Turkey's antitrust authority has opened a sector inquiry to study the AI ecosystem, analyzing interactions between large tech companies and innovators.
Unclear legal landscape for AI spawns licensing as US sees 100 copyright cases
A group of YouTube creators sued Apple, OpenAI and Amazon, bringing the number of copyright lawsuits over AI in the US to 100. Many copyright owners are now looking to licensing as a solution.
US says it's in compliance with injunction blocking Anthropic risk designation
The US government submitted a status report to a federal judge detailing its compliance with a preliminary injunction that blocks a supply chain risk designation against Anthropic.
Project Glasswing is inherently Cartel Behaviour
A discussion on the controversial nature of Project Glasswing and allegations of anti-competitive behavior.
Meta debuts Muse Spark, first AI model under Alexandr Wang
Meta has launched Muse Spark, a new AI model that aims to close the performance gap with OpenAI and Anthropic. The model will be available in Meta's apps and will eventually be released under an open-source license.
Anthropic launches Mythos Preview and industry cybersecurity initiative Project Glasswing | NASDAQ:MSFT
Anthropic has formally announced its new AI model, Mythos Preview, on Tuesday, alongside the creation of an industry consortium called Project Glasswing...
Suno and major music labels reportedly clash over AI music sharing
Music labels and AI music generator Suno are in a dispute regarding the sharing and training of AI-generated music.
Meta is reentering the AI race with a new model called Muse Spark
Meta has launched a new proprietary AI model, Muse Spark, marking its latest effort in the competitive AI landscape.
I can’t help rooting for tiny open source AI model maker Arcee
TechCrunch highlights the progress of Arcee, a small company focused on developing open-source AI models.
I asked 5 data leaders about how they use AI to automate - and end integration nightmares | ZDNET
Tech executives explain how they're moving beyond legacy Excel mapping to build AI data pipelines that cut integration workloads by up to 40%.
Only 28% of AI infrastructure projects fully pay off, survey finds
ITSM is the area most likely to offer wins, according to Gartner research.
Citigroup says AI helps speed account openings and systems ...
Citigroup says AI helps speed account openings and systems upgrades | Reuters Exclusive news, data and analytics for financial market professionalsLearn more aboutRefinitiv Citi Bank logo appears in this illustration taken December 1, 2025. REUTERS/Dado Ruvic/Illustration Purchase Licensing Rights, opens new tab - Summary - Companies - Citigroup is using AI to speed internal processes - Bank is sharply reducing time of document review to open an account - Citi is in the midst of a plan to limit the use of tech contractors NEW YORK, April 8 (Reuters) - Citigroup is using artificial intelligence to speed up account openings and the retirement of ol
AI Drug Discovery: Revolution or Expensive Illusion? | Ep. 978
This week we’re digging into one of the biggest narratives in biotech over the past cycle: AI -driven drug discovery, and whether it’s actually delivering on its promise or just compressing timelines without improving outcomes. We break down how the original pitch of faster, cheaper, and more successful drug development is now colliding with the reality of clinical biology, where failure rates remain stubbornly high.
Innovation in the news April 9, 2026 - by Charles McIvor
Differences in AI adoption in Europe and the US: Explanations and implications for productivity growth (Centre for Economic Policy Research)
Differences in AI adoption in Europe and the US - CEPR
Differences in AI adoption in Europe and the US: Explanations and implications for productivity growth | CEPR VoxEU Column Productivity and Innovation # Differences in AI adoption in Europe and the US: Explanations and implications for productivity growth 9 Apr 2026 The economic impact of generative AI will depend on the speed and breadth of adoption by workers and firms. Drawing on a survey of workers in the US and six European countries and a firm survey covering 32 European countries, this column explores the speed of AI adoption across countries and whether an ‘AI gap’ will exacerbate existing productivity differences between the US and Europe. There are large differences in AI adoption across countries, much of which are accounted for by management practices. Higher AI adoption rates are associated with faster productivity growth but not with changes in employment. #### Authors
ROI is about more than profitability when it comes to AI adoption – here’s what enterprises are looking for | IT Pro
A survey from KPMG suggests enterprises are measuring more than just financial returns
The Arithmetic of Productivity Boosts: Why Does a “40% Increase in Productivity” Never Actually Work?
Why do grand productivity promises never actually deliver? Is every product just bad, or is there something else hiding in the numbers?
AI Adoption By the Numbers
A16z analyzes enterprise startup penetration data, showing that generative AI is seeing significant real-world deployment and productivity gains across large organizations.
AI Impact: Is AI Making Demand Harder to Turn Into Revenue? - Newsweek
AI Impact examines how AI is reshaping demand, exposing risk and forcing faster decisions across pricing, governance and strategy.
Labor & Society
The AI job loss story is all about bundles
Evidence on white-collar work displacement is beginning to match the theory
Opinion | A.I. May Worsen Wealth Inequality - The New York Times
A.I. will further enrich the winners and impoverish the losers, with inevitable societal impacts.
Goldman Sachs warns AI job losses may dent earnings and career growth for years - Storyboard18
Goldman Sachs analysts said AI-driven job losses could depress wages and slow careers for years, warning displaced workers face long-term setbacks, though retraining may partly offset the damage.
Tech industry lays off nearly 80,000 employees in the first quarter of 2026 — almost 50% of affected positions cut due to AI | Tom's Hardware
Some experts argue that AI was just used as an excuse for poor business decisions.
Goldman Sachs warns AI job loss: Long-term financial impact on workers, ETCIO
Goldman Sachs warns AI job losses will hurt workers for years. Displaced employees face lower pay and slower career growth. This impact could worsen during economic downturns. Retraining offers a path to better outcomes. AI is already affecting hiring trends.
Nearly 80,000 tech workers have already lost their jobs in 2026 — and AI impact means more could be to come | TechRadar
Q1 2026 was worse for tech layoffs than 2025, 2024
Indian IT Companies Surge Ahead in AI Leadership as Automation Revolutionizes the Industry, ETEnterpriseai
India's top IT firms are reshaping their leadership to prioritize artificial intelligence, implementing new roles and units to accommodate growing automation demands. A report reveals significant changes across key players like TCS, Infosys, and Wipro, signaling a shift in focus towards AI ...
The Literary Job AI Can’t Replace
Ghostwriting is good, actually—when it’s done by humans.
How are software engineering graduates adjusting to AI?
BearingPoint’s Karl Byrne, Holly Daly and Fiona Eguare discuss the effects of AI on software engineering and how it has affected graduates in particular. Read more: How are software engineering graduates adjusting to AI?
'Fearful' Gen Z Employees Intentionally Sabotage AI Adoption Over Job Security Concerns
FOBO, the fear of becoming obsolete, is driving Gen Z workers to actively sabotage AI integration in the modern workplaces.
Project Glasswing: Securing critical software for the AI era
Anthropic introduces Project Glasswing, an initiative aimed at securing critical software infrastructure in the age of AI.
A new Anthropic model found security problems ‘in every major operating system and web browser’
Anthropic's latest AI model has identified significant security vulnerabilities across all major operating systems and web browsers.
Anthropic withholds Mythos model
Anthropic is rolling out a preview of its new Mythos model only to a select group of tech and cybersecurity companies due to concerns about its ability to find and exploit security flaws.
Again, if you care about computer security, read the red team report: https://red.anthropic.com/2026/mythos-preview/
Again, if you care about computer security, read the red team report: https://red.anthropic.com/2026/mythos-preview/
Project Glasswing
Project Glasswing introduces Anthropic’s gated Claude Mythos Preview for defensive cybersecurity, aimed at remediating software vulnerabilities in critical infrastructure.
Generative AI as a weapon of war in Iran | Brookings
Generative AI as a weapon of war in Iran | Brookings #### Generative AI as a weapon of war in Iran - Share - Bluesky Streamline Icon: https://streamlinehq.com Sections Sections - Share - Bluesky Streamline Icon: https://streamlinehq.com ### Subscribe to the Center for Middle East Policy Newsletter Research # Generative AI as a weapon of war in Iran A woman displays an AI-generated image of Iran's new Supreme Leader, Ayatollah Mojtaba Khamenei, in an Islamic Revolutionary Guard Corps (IRGC) military uniform on her cellphone sc
Phrasing of prompts can shift chatbots’ agreement levels, UK testing finds
UK government testing found that reframing prompts from statements to questions reduced agreement levels by 24 percent across OpenAI and Anthropic models.
DC: Security, AI, DevOps.com, Java, Cisco, Varonis, Proofpoint, Cloudflare, Palo Alto Networks, Webinars (Wed. Apr 8th PM)
AI agents are no longer experimental. They’re running production workloads, calling APIs, querying databases, provisioning infrastructure, and making decisions across cloud environments. Ironically these agents often end up with more access than the developers who built them.
Shaky ceasefire unlikely to stop cyberattacks from Iran-linked ...
Shaky ceasefire unlikely to stop cyberattacks from Iran-linked hackers for long - The Washington Post Democracy Dies in Darkness By David Klepper | AP WASHINGTON — Hackers backing Tehran say an uncertain ceasefire between Iran and the United States and Israel won’t end their retaliatory cyberattacks, a warning that American cybersecurity experts say potential targets in the U.S. and Israel should take seriously.
Appeals court rebuffs Anthropic in latest round of its AI battle with the ...
Appeals court rebuffs Anthropic in latest round of its AI battle with the Trump administration - The Washington Post Democracy Dies in Darkness By Associated Press WASHINGTON — A federal appeals court on Wednesday refused to block the Pentagon from blacklisting artificial intelligence laboratory Anthropic in a decision that differed from the conclusions reached in another judge’s ruling on the same issues. The U.S. Court of Appeals in Washington, D.C., rejected Anthropic’s request for an order that would shield the San Francisco company from the fallout stemming from a dispute over how the Pentagon could deploy its Claude chatbot in fully autonomous weapons and potential surveillance of Americans while the panel is still collecting evidence about the case.
China drafts law regulating 'digital humans' and banning addictive virtual services for children
China has proposed new legislation to regulate digital human technology and restrict addictive virtual services for minors.
Sam Altman says AI superintelligence is so big that we need a "New Deal."
Sam Altman argues for a new regulatory framework for AI, though critics suggest his proposals are a cover for regulatory nihilism.
Pentagon Blacklists US AI Firm Anthropic and Court Refuses to Stop It – RedState
The Pentagon has designated one of America's top AI companies a national security risk, a label typically reserved for foreign adversaries, and a federal appeals court on Wednesday refused to block that decision, keeping Anthropic's technology out of military systems. ... The ruling locks in the government's restrictions for now. It bars defense ...
The Politics of AI Regulation: Federal Government v. the States | Polsinelli - JDSupra
Key Takeaways - On December 11, 2025, President Trump signed Executive Order 14365, “Ensuring a National Policy Framework for Artificial Intelligence.” The EO addressed key elements including: 1)...
Anthropic’s Glasswing initiative raises questions for US cyber operations - Nextgov/FCW
Intelligence officials and industry are weighing how Claude Mythos Preview could reshape hacking and cyberdefense. The company has also briefed senior o...
Anthropic loses bid for emergency stay of US supply chain designation
Anthropic lost a bid to have a US appeals court issue an emergency stay blocking its designation as a supply chain risk by the Trump Administration.
Anthropic denied emergency stay of DoD supply chain risk designation
A three-judge panel of the US Court of Appeals for the DC Circuit has denied Anthropic's motion for an emergency stay of the government's designation of the company as a supply chain risk.
OpenAI made economic proposals — here’s what DC thinks of them
An analysis of how Washington D.C. policymakers are reacting to recent economic policy proposals put forward by OpenAI.
Two AI Strategies, One day: Challenging what we think about East ...
Two AI Strategies, One day: Challenging what we think about East vs West | Global Policy Journal By Adam Chalmers and Elise Antoine - 08 April 2026 ### Adam Chalmers and Elise Antoine introduce research that challenges the tidy story of western democracies converging in opposition to Chinese AI exceptionalism. On 20 March 2026, Scotland published its much anticipated and updated national AI strategy. A few hours later, the United States released a National Policy Framework for Artificial Intelligence under the Presidential seal. Two anglophone democracies, the same technology, the same day. We ran both documents through x.Machina(our
AI regulation is stalling as governments clash over control
Artificial intelligence is advancing faster than lawmakers can regulate it, while global AI governance fragments in real time
AI governance will decide cloud strategy in India — not just cost or performance | CIO
As AI moves from experimentation into core business processes, Indian enterprises are discovering that the traditional logic of cloud decision‑making no longer holds. AI systems behave differently from enterprise applications, depend far more deeply on data, and introduce new forms of operational ...
AI data centers a boon for US lifestyle and technology | Opinion - AOL
AI data centers a boon for US lifestyle and technology | Opinion - AOL Wed, April 8, 2026 at 9:06 AM UTC 0 As AI heats up globally, several state legislatures and U.S. Sen. Bernie Sanders are calling for a pause on AI data center development, citing risks of job loss, superintelligence concerns and risks to working people. Their concerns aren’t baseless, but a moratorium is not a solution. Such a proposal would put the U.S. at a disadvantage to China. If the United States is to win the AI race, global adoption is critical. While hardware and model innovation for top performance is significant, it is arguably more important that the development ecosystem is accessible and widespread enough to encourage AI developers to choose U.S.-based platforms. In the same way Microsoft’s Windows Phone arrived to the smartphone
China rolls out first comprehensive AI ethics framework with review rules
China rolled out trial administrative measures to establish the country's first comprehensive framework for the ethical review of artificial intelligence.
Industrial policy for the Intelligence Age
This paper lays out OpenAI’s early policy ideas for keeping people first as AI advances, focusing on prosperity, institutions, and social impact.
EU AI simplification package risks 'loophole' in AI Act scope, civil society warns
A coalition of organizations warned that proposed changes to the EU's AI omnibus package could exclude key systems from high-risk rules and weaken consumer protection.
OpenAI Launches Blueprint to Combat AI-Driven Child Exploitation
OpenAI has launched a Child Safety Blueprint proposing updated laws, enhanced detection, and better reporting systems to combat AI-driven child exploitation.
AI Policy at the Frontier: A Conversation with Nathan Calvin (Encode ...
AI Policy at the Frontier: A Conversation with Nathan Calvin (Encode) and Fin Moorhouse (Forethought) - Harvard Law School | Harvard Law School Back to events list What does it take to govern a technology that might reshape the world within the decade? Answering that requires both big-picture thinking about where AI is heading and close engagement with the policy fights shaping it in the present. This event brings together two speakers who sit on opposite ends of that spectrum — one (Nathan Calvin) is shaping frontier AI legislation in statehouses today, the other (Fin Moorhouse) is thinking through what rapid AI progress could mean for the century ahead. Each will give a short talk, followed by a joint Q&A. In-person event (Harvard ID holders only). Lunch will be served. Part of the AI Governance Speaker Series co-sponsored by AISST and the HLS A
Could AI ads be exposing your business to risks? - Lexology Pro
AI-generated ads are often cheaper and faster to produce, but the legal, regulatory and reputational risks cannot be ignored. Here’s how companies can ensure their AI-enabled marketing is responsible and complaint.
I guess we will find out - the giant uncontrolled experiment of developing superpowerful AI continues regardless.
I guess we will find out - the giant uncontrolled experiment of developing superpowerful AI continues regardless.
South Korea to require disclosure of AI characters in advertisements
The Korea Fair Trade Commission is proposing new rules requiring businesses to clearly label AI-generated virtual characters in advertisements, with public feedback open until April 28.
Experts to advise on AI-automated contract templates sought by EU Commission
The European Commission has opened a call to set up an expert group to assist the development of model contract terms and guidance on AI-driven automated contracts.
Technology & Infrastructure
Peace President's Iran war piles more pain on already battered PC market
Memory costs were already through the roof - now freight's spiking too, and budget systems face extinction America's war with Iran is jacking up the pressure on computing markets already struggling with memory shortages and component cost inflation, meaning buyers should brace themselves for even higher prices this year.…
The chips chokehold that could end the AI investment boom
Taiwan’s control of leading-edge silicon chips lies on a geostrategic faultline
FinancialContent - Broadcom's 3nm Revolution: How Custom Silicon for Meta and ByteDance Fueled a Historic Breakout
However, challenges remain, particularly in the form of supply chain constraints for HBM4 memory and the geopolitical volatility surrounding fab capacity in East Asia. Broadcom’s breakthrough in 3nm custom silicon represents a structural shift in the semiconductor industry. The key takeaway for investors is that the "AI hardware...
SiMa.ai Secures Strategic Investment from Micron to Scale High-Performance, Power-Efficient Physical AI
/PRNewswire/ -- SiMa.ai, a leader in Physical AI, today announced a strategic investment from Micron Technology, Inc. (Nasdaq: MU), strengthening its ability...
OpenAI puts Stargate UK on ice, blames energy costs and red tape
Sam Altman's datacenter dreams hit a wall of watts and wonkery, cooling Britain's AI ambitions OpenAI is pausing its planned Stargate datacenter project in the UK just months after announcing it, citing the regulatory environment and cost of energy as reasons for putting it on hold.…
Anthropic builds Managed Agents to handle agent infrastructure automatically
Anthropic launched Claude Managed Agents, an API suite that removes the need to build agent infrastructure from scratch. It handles tool calling, context updates, and error recovery automatically.
Infrastructure First: Enabling Embodied AI for Science in the Global South
arXiv:2604.06722v1 Announce Type: new Abstract: Embodied AI for Science (EAI4S) brings intelligence into the laboratory by uniting perception, reasoning, and robotic action to autonomously run experiments in the physical world. For the Global South, this shift is not about adopting advanced automation for its own sake, but about overcoming a fundamental capacity constraint: too few hands to run too many experiments. By enabling continuous, reliable experimentation under limits of manpower, power, and connectivity, EAI4S turns automation from a luxury into essential scientific infrastructure. The main obstacle, however, is not algorithmic capability. It is infrastructure. Open-source AI and foundation models have narrowed the knowledge gap, but EAI4S depends on dependable edge compute, energy-efficient hardware, modular robotic systems, localized data pipelines, and open standards. Without these foundations, even the most capable models remain trapped in well-resourced laboratories. This article argues for an infrastructure-first approach to EAI4S and outlines the practical requirements for deploying embodied intelligence at scale, offering a concrete pathway for Global South institutions to translate AI advances into sustained scientific capacity and competitive research output.
Intel Wins Google Promise to Keep Using Xeon in Data Centers
Intel Corp., trying to promote the use of its technology in data centers, said Alphabet Inc.’s Google has committed to using future generations of its Xeon processors and other chips.
AI's next bottleneck: Why even the best chips made in the U.S. take a round trip to Taiwan
Nvidia has reserved the majority of TSMC’s most advanced packaging capacity. The lesser-known chipmaking step may become the next bottleneck for AI.
Reuters Reuters | Breaking International News & Views
Pentagon's ouster of Anthropic opens doors for small AI rivals · ago · Meta, CoreWeave deepen AI cloud partnership with fresh $21 billion deal · ago · Patients scramble to find estrogen patches as shortage worsens after US FDA champions use · ago · Amazon to stock Lilly's new weight-loss pill at US kiosks, offer same-day delivery ·
Microsoft hints at bit bunkers for war zones
President Brad Smith tells an interviewer that Microsoft is reconsidering datacenter design in light of the Iran war.
How AI Data Centers Are Shaping Politics | Lawfare
How AI Data Centers Are Shaping Politics | Lawfare --- Meet The Authors Across the United States, the rapid buildout of hyperscale data centers to support artificial intelligence (AI) infrastructure is no longer just a technological or economic development, but a political flashpoint with intense bipartisan pushback from local communities. The scale of the backlash has escalated a sense of urgency to act from both ends of the ideological spectrum. President Trump’s recent deal with major technology companies, also included in the White House’s National AI Legislative Framework—aimed at protecting American consumers from rising electricity costs tied to the AI data center boom—and the [A
CATL invests in China's Zhongheng Electric as AI demand surges
As artificial intelligence (AI) accelerates the buildout of data centers, the race to supply their enormous energy demands is drawing new alliances between China's technology and industrial giants.
Anthropic's Gigawatt-Scale TPU Deal with Broadcom Creates a Structural Advantage
Anthropic TPU expansion with Google and Broadcom show compute scale, custom silicon partnerships, and AI infrastructure competition dynamics.
Intel will help build Elon Musk’s Terafab AI chip factory
Intel has partnered with Elon Musk to assist in the construction of the Terafab AI chip manufacturing facility.
AI Scale Hinges on Infrastructure, Cybersecurity, and IT/OT Collaboration: Cisco Report Reveals Key Insights
Cisco’s State of Industrial AI Report reveals that infrastructure readiness, including connectivity and cybersecurity, is crucial for scaling AI in industrial settings.
Nvidia Leads $505M Funding for AI Data Center Expansion in Asia-Pacific
Nvidia co-led a $505 million funding round for Firmus Technologies to develop AI hardware facilities across the Asia-Pacific region.
Browser Arena
Open-source benchmarks for cloud browser infrastructure.
Meta unveils first AI model from costly superintelligence team - Reuters
Meta unveils first AI model from costly superintelligence team | Reuters Exclusive news, data and analytics for financial market professionalsLearn more aboutRefinitiv Meta logo is seen in this illustration taken June 18, 2025. REUTERS/Dado Ruvic/Illustration Purchase Licensing Rights, opens new tab - Summary - Companies - Muse Spark is Meta's first AI model in a year - Company is trying to re-join the frontier AI race after its Llama 4 model disappointed - Independent tests show Muse Spark matches rivals in some areas, lags in coding and reasoning April 8 (Reuters) - Meta Platforms [(META.O), opens new tab](https://www.reuters.com/markets/companies/MET
Anthropic rolls out cyber AI model days after source code leak
Claude Mythos opened to Amazon, Microsoft and Apple to detect hidden software vulnerabilities
Meta Unveils New A.I. Model, Its First From the Superintelligence Lab - The New York Times
The model, Muse Spark, performed better than Meta’s previous A.I. models but lags rivals on coding ability.
Anthropic keeps latest AI tool out of public’s hands for fear of enabling widespread hacking
AI company says purpose of its Claude Mythos model is to bolster defenses against hacking in common applications Anthropic on Tuesday said its yet-to-be-released artificial intelligence model called Claude Mythos has proven keenly adept at exposing software weaknesses. Mythos has laid bare thousands of vulnerabilities in commonly used applications for which no patch or fix exists, prompting the San Francisco-based AI startup to form an alliance with cybersecurity specialists to bolster defenses against hacking and withhold wide distribution. Continue reading...
Introducing Muse Spark: Scaling Towards Personal Superintelligence
Meta introduced Muse Spark, a high-performance LLM featuring multimodal, reasoning, and agentic capabilities, including innovative thought compression to reduce token usage.
System Card: Claude Mythos Preview
A technical system card document detailing the capabilities and safety considerations of the Claude Mythos preview model.
Microsoft developer chief Julia Liuson is logging off
Departure may accelerate further AI-centric moves for programming tools Julia Liuson, president of Microsoft's developer division (DevDiv), will resign at the end of June, though she will continue in an advisory role.…
AI, Mythos and Mythmaking in 2026 - by Daniel Jeffries - Future History
AI, Mythos and Mythmaking in 2026 - by Daniel Jeffries # Future History SubscribeSign in # AI, Mythos and Mythmaking in 2026 ### On AI, predictions for the next year and the increasingly insane dialogue and myth making around AI Apr 08, 2026 7 1 Share The Mythos paper dropped yesterday and set my X feed on fire. I decided to take a night to read most of the paper and think about it before writing anything. As always my goal is to bring a little sanity to the discussion. Increasingly, I’m losing hope this is possible in today’s increasingly insane discussions around AI but I have to try. Let’s start with: this is an impressive freaking model. Its benchmarks are a massive leap forward. This isn’t a surprise. The Anthropic team is impressive and they make fantastic models. They’ve revolutionized codin
The next phase of LLM development: Why the future of sovereign AI will be multilingual by design | TechRadar
AI’s next phase prioritizes linguistic diversity
GLM-5.1: Towards Long-Horizon Tasks
An overview of the GLM-5.1 model, which focuses on improving performance for long-horizon AI tasks.
Adoption & Impact
White-collar industries bet on a secret weapon against AI: trust
Executives across finance and cyber defence map where Anthropic’s Claude “plug-ins” will — and won’t — change work
Amid the 'SaaSpocalypse,' CIOs and CTOs take a harder line with their vendors | Fortune
As investors debate the threat that AI poses to enterprise software companies, CIOs are on the font lines of the changes affecting the industry.
~77% of all new "Success" self-help books on Amazon are likely written by AI
A study suggests that a vast majority of new self-help books on Amazon are AI-generated, with some authors publishing at a rate of over one book per day.
Agentic commerce and purchase disputes: Did you mean to buy that?
As agentic shopping becomes more commonplace, how do you dispute a purchase your AI made? Read more: Agentic commerce and purchase disputes: Did you mean to buy that?
Household Use of AI Yields Bigger Productivity Gains Than Seen by Business - UCLA Anderson Review
While their study produces a range ... bigger productivity gains than reported by businesses, where users are more likely to be specialists with less to gain from pinging a bot for advice. The findings underscore concerns about the survival of online information providers that AI users are abandoning to complete tasks. Adopters’ decisions about what to do with this found time — more work or more ...
ActivTrak Launches AI Insights to Give Leaders a Clear View of How AI Changes Work
ActivTrak Launches AI Insights to Give Leaders a Clear View of How AI Changes Work Accessibility Statement Skip Navigation New capabilities connect AI usage to productivity, capacity and business performance — helping organizations understand what's actually working AUSTIN, Texas, April 8, 2026 /PRNewswire/ -- ActivTrak today announced AI Insights, a new set of work intelligence capabilities that gives organizations a clear, data-driven view of how AI is used across the business — and whether it's driving meaningful results. As companies rapidly deploy AI tools, most still rely on fragmented, t
ODMs erase seasonal slump as notebooks and AI servers drive March surge
Electronic ODM makers' surge in March revenue, driven by accelerated notebook orders and booming AI server demand, erased the typical first-quarter slump, with implications for global device supply chains and enterprise AI deployments. As major assemblers outperform expectations and reshape ...
Industrial AI and Security Trends: 2026 Cisco Report
Cisco’s 2026 State of Industrial AI Report shows 61% of organizations have moved AI into live operations, prioritizing automation while facing security hurdles.
AI tools that tried to remove human judgment keep failing
An examination of why AI tools designed to automate human decision-making processes continue to encounter significant failures.
Claude Managed Agents: Get To Production 10x Faster
Anthropic has launched a fully hosted platform for building, deploying, and managing AI agents at scale with integrated memory, orchestration, and analytics.
Some approaches: 1) Multiple reviews. Some papers already show having many AI team members review a problem reduces errors 2) Building in tests and checkpoints 3) Multiple independent answers that are cross-checked 4) Clear processes that escalate to smarter systems for review
Some approaches: 1) Multiple reviews. Some papers already show having many AI team members review a problem reduces errors 2) Building in tests and checkpoints 3) Multiple independent answers that are cross-checked 4) Clear processes that escalate to smarter systems for review
ATANT: An Evaluation Framework for AI Continuity
arXiv:2604.06710v1 Announce Type: new Abstract: We present ATANT (Automated Test for Acceptance of Narrative Truth), an open evaluation framework for measuring continuity in AI systems: the ability to persist, update, disambiguate, and reconstruct meaningful context across time. While the AI industry has produced memory components (RAG pipelines, vector databases, long context windows, profile layers), no published framework formally defines or measures whether these components produce genuine continuity. We define continuity as a system property with 7 required properties, introduce a 10-checkpoint evaluation methodology that operates without an LLM in the evaluation loop, and present a narrative test corpus of 250 stories comprising 1,835 verification questions across 6 life domains. We evaluate a reference implementation across 5 test suite iterations, progressing from 58% (legacy architecture) to 100% in isolated mode (250 stories) and 100% in 50-story cumulative mode, with 96% at 250-story cumulative scale. The cumulative result is the primary measure: when 250 distinct life narratives coexist in the same database, the system must retrieve the correct fact for the correct context without cross-contamination. ATANT is system-agnostic, model-independent, and designed as a sequenced methodology for building and validating continuity systems. The framework specification, example stories, and evaluation protocol are available at https://github.com/Kenotic-Labs/ATANT. The full 250-story corpus will be released incrementally.
Why AI Implementation Fails Without Structured Visibility | Press Releases | norfolkdailynews.com
Why AI Implementation Fails Without Structured Visibility | Press Releases | norfolkdailynews.com You have permission to edit this article. --- --- Why AI Tools Fail Without the Right Business Structure Victoria, Canada - April 8, 2026 / Chief AI Advisors /"/> Why AI Tools Fail Without the Right Business Structure Victoria, Canada - April 8, 2026 / Chief AI Advisors / Businesses across industries are accelerating investments in artificial intelligence, adopting tools designed to automate workflows, improve efficiency, and enhance customer engagement. Yet despite this momentum, many organizations are seeing limited return on these efforts. The issue is not the technology itself. It is the lack of foundational structure required to support it. AI implementation is often approached as a tool decision rather than a systems decision. Businesses deploy automation platforms, integrat
Democratizing Marketing Mix Models (MMM) with Open Source and Gen AI
A practical system design combining open-source Bayesian MMM and GenAI for transparent, vendor independent marketing analytics insights.
India is racing ahead in AI adoption, but falling behind where it matters - BusinessToday
According to a Deloitte report, India Inc is leading major economies in AI adoption, with nearly 40% of companies using it at scale compared to a 28% global average. However, it still lags in deep technical expertise, ranking among the lowest.
Geopolitics
Japan relaxes privacy laws to make itself the ‘easiest country to develop AI’
Opting out of personal data use won't be an option because the Minister says that's a 'very big obstacle' to AI adoption.
AI & Tech Brief: The power of Mythos - The Washington Post
The geopolitical and national security consequences of Anthropic’s new model.
The ATOM Project
The ATOM Project aims to accelerate U.S. open-source AI development to compete with international models, focusing on transparency and open-weight innovation.
South Korea, US launch working group to implement technology-prosperity deal
Seoul and Washington have launched a bilateral technology-prosperity working group to implement a partnership memorandum signed last October, focusing on AI, biotech, and other sectors.
Geopolitical shock to execution risk: Why Gulf investors need human intelligence - ION Analytics
Now, the critical challenge for investors in the Gulf is distinguishing headline‑driven geopolitical noise from genuine operational risk.
Get the full executive brief
Receive curated insights with practical implications for strategy, operations, and governance.