TEXT VIEW · TODAY'S DIGEST · 36 HEADLINES ACROSS 8 SOURCES

Startup Archive(0)

No items yet for today.

App Store Rankings(0)

No items yet for today.

ISSUE 0897
MON, JUN 15, 2026
Discover the best information organized by OrangeBot.AI
TODAY · MON, JUN 15, 2026

The web,
read by a bot.

Ten sources — Hacker News, Product Hunt, HuggingFace, Techmeme and more — filtered, tagged, and summarized every morning for builders who don’t have time to scroll.

NEWChrome extension: save posts from Twitter/X in one click.Install →
01

AI DIGEST

UPDATED DAILY · EDITOR'S PICK
01.00
AI DIGEST

AI新闻摘要

June 15, 2026

Here is a summary of today's main news events.

U.S. and Iran Reach Tentative Peace Deal, Shaking Global Markets

The United States and Iran have reached a tentative agreement to end their nearly four-month conflict. The deal involves reopening the vital Strait of Hormuz waterway and is seen as a major step toward de-escalating tensions. The news prompted a strong reaction in global markets, with stock futures rising while oil prices fell to their lowest level since early March on expectations of a more stable supply.

SpaceX Stock Surges in Highly Anticipated Public Debut

Stock in Elon Musk’s SpaceX climbed 8% in early trading during its historic Initial Public Offering (IPO). The successful market debut is viewed as the first in an expected wave of major AI and technology companies going public and contributed to positive investor sentiment today.

UK Announces Plan to Ban Social Media for Children Under 16

The UK Prime Minister announced that the government plans to ban children under 16 from using certain social media platforms. The new regulations are expected to be in place early next year and reflect growing global concerns over the impact of social media on the well-being of young people.

AI Industry Faces Lawsuits and Misinformation Concerns

The artificial intelligence sector is facing new challenges, highlighted by a class-action lawsuit alleging a major company oversold the capabilities of its "Claude" chatbot. The legal action coincides with rising public anxiety over the spread of misinformation fueled by a wave of fake AI-generated videos and advertisements.

02

ON THE WIRE

6 SOURCES
02

HACKER NEWS

02.00
HACKER NEWS

Hacker News - June 15, 2026

Hacker News Feed: Highlighting key posts and discussions.

Openrouter Fusion API

(openrouter.ai)

10837
Apple Foundation Models

(platform.claude.com)

282115
The rich aren't your role models

(theslowburningfuse.wordpress.com)

14750
Bitsy

(bitsy.org)

2376
Write for One Person

(wizardzines.com)

25474
Your ePub Is fine

(andreklein.net)

752247
Linux 7.1

(lore.kernel.org)

300114
Lisp's Influence on Ruby

(blog.tacoda.dev)

24678
The Birth and Death of JavaScript (2014)

(www.destroyallsoftware.com)

230130
Honda Civics and the Evil Valet

(juniperspring.org)

39895
03

HUGGINGFACE

03.00
HUGGINGFACE

huggingface.title - June 15, 2026

huggingface.description

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Cloning camera motion from reference videos is an important task in video generation, as videos provide intuitive and precise control. Existing methods either directly use parametric representations that fail to handle multi-shot generation or synthesize cross-paired data, which suffer from data scarcity, resulting in poor performance in complicated camera motion cloning. To address these issues, we introduce a general camera motion representation that encodes cameras as grid motion videos. This camera grid represents the camera parameters visually and supports the integration of diverse trajectories for multi-shot video generation. Building upon this, we propose OmniDirector, a unified framework trained on a million-scale camera grid-video pairs that coordinates characters, actions, and cameras to provide director-level control for multimodal diffusion transformers. Furthermore, we design a novel hierarchical prompt expansion agent that harmoniously integrates different control signals by systematically describing camera motion and visual content through understanding signal relationships. Extensive experiments demonstrate the superior performance and outstanding controllability of our framework. Project page: https://ymlinfeng.github.io/OmniDirector.github.io/

89
APPO: Agentic Procedural Policy Optimization

Recent advances in agentic Reinforcement Learning (RL) have substantially improved the multi-turn tool-use capabilities of large language model agents. However, most existing methods assign credit over coarse heuristic units, such as tool-call boundaries or fixed workflows, making it difficult to identify which intermediate decisions influence downstream outcomes. In this work, we study agentic RL from two perspectives: where to branch and how to assign credit after branching. Our pilot analysis shows that influential decision points are broadly distributed throughout the generated sequence rather than concentrated at tool calls, while token entropy alone does not reliably reflect their impact on final outcomes. Motivated by these observations, we propose Agentic Procedural Policy Optimization (APPO), which shifts branching and credit assignment from coarse interaction units to fine-grained decision points in the sequence. APPO selects branching locations using a Branching Score that combines token uncertainty with policy-induced likelihood gains of subsequent continuations, enabling more targeted exploration while filtering out spurious high-entropy positions. It further introduces procedure-level advantage scaling to better distribute credit across branched rollouts. Experiments on 13 benchmarks show that APPO consistently improves strong agentic RL baselines by nearly 4 points, while keeping efficient tool-calls and maintaining behavior interpretability.

60
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

Despite recent progress, LLM agents still struggle with reasoning over long interaction histories. While current memory-augmented agents rely on a static retrieve-then-reason paradigm, this rigid pipeline design prevents them from dynamically adapting memory access to intermediate evidence discovered during inference. To bridge this gap, we propose MRAgent, a framework that combines an associative memory graph with an active reconstruction mechanism. We represent memory as a Cue-Tag-Content graph, where associative tags serve as semantic bridges connecting fine-grained cues to memory contents. Operating on this structure, our active reconstruction mechanism integrates LLM reasoning directly into memory access, allowing the agent to iteratively explore and prune retrieval paths based on accumulated evidence. This ensures that memory retrieval is dynamically adapted to the reasoning context while avoiding combinatorial explosion caused by unconstrained expansion. Experiments on the LoCoMo benchmark and LongMemEval benchmark demonstrate significant improvements over strong baselines (up to 23%), while substantially reducing token and runtime cost, highlighting the effectiveness of active and associative reconstruction for long-horizon memory reasoning.

53
From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI

Large Language Models (LLMs) are undergoing a fundamental transformation from conversational generators into integrated AI systems capable of reasoning, action, memory, and self-improvement. We conceptualize this transition as a shift from Chatbot to Digital Colleague: from conversational answers to persistent work. We organize this transition along two tightly coupled dimensions. First, at the cognitive core level, LLMs are advancing from Chatbot-era "fast thinking" systems driven by next-token prediction toward Thinking LLMs that leverage inference-time computation, Chain-of-Thought reasoning, reflection, process supervision, and reinforcement learning to support more deliberate and reliable cognition. Second, at the tool-augmented task execution level, LLMs are progressing from tool-calling Agents that invoke external resources in an ad hoc manner toward OpenClaw-style workstation systems (OpenClaw) equipped with persistent Workspaces, skills, verification loops, and governance. The "Workspace + Skill" paradigm makes episodic tool use colleague-like via state persistence, reusable procedures, task closure, and experience reuse. We examine data construction shifts from instruction-response pairs to State-Action-Observation trajectories and evaluation from static benchmarks to sandboxed, auditable, self-evolving AI ecosystems.

41
Orchestra-o1: Omnimodal Agent Orchestration

The recent success of agent swarms has shifted the paradigm of large language model (LLM)-based agents from single-agent workflows to multi-agent systems, highlighting the importance of agent orchestration for task decomposition and collaboration. However, existing orchestration frameworks are limited to a narrow set of modalities and struggle to generalize to more complex settings where heterogeneous modalities coexist and interact. This limitation becomes particularly pronounced in omnimodal scenarios, where tasks require the unified understanding and coordination of diverse inputs such as text, image, audio, and video. In this work, we propose Orchestra-o1, an omnimodal agent orchestration framework designed to support efficient agent collaboration across multiple modalities. Orchestra-o1 introduces a unified orchestration mechanism that enables modality-aware task decomposition, online sub-agent specialization, and parallel sub-task execution. This scalable design allows agent systems to effectively tackle complex real-world tasks involving heterogeneous information sources, surpassing the second-best approach by 10.3% accuracy on the OmniGAIA benchmark. Furthermore, we introduce decision-aligned group relative policy optimization (DA-GRPO), an efficient agentic reinforcement learning approach for training Orchestra-o1-8B, which also achieves state-of-the-art performance against all existing open-source omnimodal agents.

35
HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

AI agent performance depends critically on the runtime harness, comprising the prompts, tools, memory, and control flow that mediate how a model observes, reasons, and acts. Yet today's harnesses remain largely hand-crafted and static: each new model or task still demands bespoke scaffolding, and the rich traces produced during execution are rarely distilled back into systematic improvement. We introduce HarnessX, a foundry for composable, adaptive, and evolvable agent harnesses. HarnessX assembles typed harness primitives via a substitution algebra, adapts them through AEGIS, a trace-driven multi-agent evolution engine grounded in an operational mirror between symbolic adaptation and reinforcement learning, and closes the harness-model loop by turning trajectories into both harness updates and model training signal. Across five benchmarks (ALFWorld, GAIA, WebShop, tau^3-Bench, and SWE-bench Verified), HarnessX yields an average gain of +14.5% (up to +44.0%), with gains largest where baselines are lowest. These results suggest that agent progress need not come from model scaling alone: composing and evolving runtime interfaces from execution feedback is an actionable and complementary lever. The complete codebase will be open-sourced in a future release.

32
Rethinking RAG in Long Videos: What to Retrieve and How to Use It?

Retrieval-augmented generation is moving beyond text into long, egocentric video, where systems must select query-relevant chunks across multiple modalities and temporal granularities. Yet progress in VideoRAG is limited by two gaps: existing benchmarks allow queries to be answered without the video, obscuring retrieval errors, and prior methods apply a single modality-granularity configuration per query, ignoring chunk-level variability. We address both by introducing V-RAGBench, a benchmark of langlequery, evidence chunk, answerrangle triplets that enables faithful, decoupled evaluation of retrieval and generation, and CARVE, a simple method that runs parallel retrievers across configurations and employs chunk-adaptive reranking to identify the winning configuration for each chunk. Each chunk then enters the generator under its winning configuration selected during retrieval, yielding an interleaved evidence form where the chunk-level decision propagates across both stages. CARVE outperforms eight recent VideoRAG baselines, with the chunks supplied to the generator interleaving multiple configurations rather than sharing a single one, a behavior unattainable by query-level methods.

31
From AGI to ASI

Over the last decade, building human-level artificial general intelligence has moved from far-fetched speculation to being a concrete next-decade target for many of the largest AI organisations. Achieving this goal would have profound and far-reaching impacts on human society, which raises many complex questions for the decade ahead. This report investigates how AI itself might continue to develop in a post-AGI world along the continuum of machine intelligence. The endpoint of this continuum, Universal AI, is theoretically well understood, which provides some formal grounding for the main focus of this report: the transition from human-level AGI to artificial general superintelligence, which, intuitively, can be understood as a system that is more intelligent and cognitively capable than large organisations of humans. After characterizing ASI, the report discusses four potential pathways from AGI to ASI: scaling AGI, AI paradigm shifts, recursive improvement, and ASI emerging from large-scale multi-agent collectives. The report then discusses possible frictions and bottlenecks along these pathways. Determining whether the impact of these frictions will be negligible or substantial raises a number of concrete open research questions. Due to large uncertainties for predicting ASI progress, it cannot be ruled out that AI progress might continue to accelerate over the next years. This could imply that the image of a single transformative step change, caused by the introduction of human-level AGI into our society, could be inaccurate. More apt might be the prospect of a series of transformative societal changes caused by AI-enabled progress and breakthroughs across many areas of science and technology. Preparing for this prospect requires a massively interdisciplinary endeavour of global scope and interest.

21
OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Current automated pipelines for audio-visual Question Answering (QA) generally adopt a ``video-caption-QA'' paradigm. However, these methods typically segment videos into short clips and generate separate descriptions for audio and visual modalities. This decoupled processing severs inherent associations between sounds and their visual sources, while independent clip processing often causes inconsistent descriptions of the same entity across segments. Furthermore, coupling long-text comprehension and QA synthesis into a single step often restricts models to localized events, yielding questions lacking long-term temporal connections and deep cross-modal reasoning. To address these issues, we propose an automated data engine featuring two mechanisms: (1) Entity-Anchored Video Scripting transforms videos into structured scripts, comprising summaries, main entity lists, and segment-wise audio-visual descriptions. The entity list serves as a global prior to ensure cross-segment referential consistency and reconstruct audio-visual associations. (2) Clue-Guided QA Generation prompts models to first mine cross-segment, multimodal clues from the script, and subsequently generate QA pairs based on these high-value clues. Leveraging this pipeline, we construct the instruction-tuning dataset OmniVideo-100K and a human-verified test set, OmniVideo-Test. Fine-tuning VITA-1.5, Qwen2.5-Omni-7B and Qwen3-Omni-30B on OmniVideo-100K yields performance gains of up to 20.59% on OmniVideo-Test, demonstrating strong generalization (up to 12.64% improvements) across established benchmarks like Daily-Omni and JointAVBench.

19
Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO

We identify a new dimension for enhancing rollout diversity in Group Relative Policy Optimization (GRPO) for LLMs. While GRPO relies on diverse rollouts, prevailing strategies primarily increase diversity by injecting more token-level randomness, which may introduce step-wise noise and lead to incoherent trajectories. We uncover that smaller models within the same model family inherently exhibit higher policy-level diversity, indicated by their superior pass@k relative to larger counterparts as sample counts increase. Unlike token-level noise, this diversity is temporally correlated, preserves logical consistency, and provides structured exploration signals for gradient estimation. We thus propose S2L-PO (Small-to-Large Policy Optimization), a framework that leverages fixed small models as natural explorers to train larger models. To balance exploration and exploitation, we design a progressive annealing strategy that transitions from offline small-model rollouts to the large learner's own sampling. This shift elegantly avoids mid-training performance drops caused by the small model's capacity limits, achieving faster convergence and unlocking a higher performance ceiling. S2L-PO improves accuracy on diverse mathematical reasoning benchmarks (e.g., +8.8% on AIME 24 using a 1.7B explorer to guide the 8B model) while reducing rollout compute.

18
RedAct: Redacting Agent Capability Traces for Procedural Skill Protection

Users rely on execution traces to observe agent behavior, diagnose failures, and ensure accountability. These traces contain rich procedural detail, including tool invocations, intermediate decisions, and error-recovery logic. Yet this detail can expose private procedural skills, allowing downstream methods to recover key formulas, thresholds, and strategies without access to model weights or skill files. To quantify this risk and evaluate protection, we construct CapTraceBench, a benchmark of 75 specialized long-horizon tasks and 154 curated skills across seven domains. We also introduce RedAct https://github.com/XuShuwenn/RedAct, a protected trace release framework that localizes protected key information, rewrites traces while preserving verifier-critical evidence, and embeds behavioral watermarks for downstream provenance analysis. Across representative trace reuse methods, RedAct reduces normalized skill transfer (NST) from 44.7--67.1\% on raw traces to below the no-skill baseline, while preserving audit evidence. Its standalone behavioral watermarks reach 93.6--100.0\% true detection with a false alarm rate of at most 1.9\%. These results frame public agent traces as security interfaces and show that selective redaction can reduce procedural capability leakage without removing audit evidence.

15
Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

Large language models (LLMs) now reach expert-level scores on medical licensing exams, encouraging the assumption that high scores imply safe medical judgment while patients increasingly use them for health advice. We show this assumption is fragile: when misleading context is injected into questions that LLMs originally answer correctly, they abandon the correct answer. We call the ability to maintain correct judgment under adversarial context epistemic resilience, and introduce MedMisBench to measure it. MedMisBench contains 10,932 medical question items and 48,889 misleading context-option pairs spanning medical reasoning, agentic capability, and patient-journey evaluation. Across 11 model configurations, mean accuracy falls from 71.1% on original questions to 38.0% under focused misleading context, with 51.5% attack success. The most damaging injections are formal, rule-like fabrications: authority-framed falsehoods reach 69.5% attack success and exception-poisoning claims reach 64.1%. A 14-member clinical panel from 7 countries identified serious potential harm in 38.2% of reviewed cases. MedMisBench exposes a structural blind spot in LLM evaluation in medical settings: existing benchmarks measure what models know, but not whether they preserve correct medical judgment under misleading context.

12
LLM Agents Can See Code Repositories

Coding agents powered by large language models have demonstrated strong performance on software engineering tasks. Yet most agents consume repositories almost entirely as text, which differs from how human developers use visual structure such as folder hierarchies and dependency relationships to orient themselves in large codebases. With multimodal large language models (MLLMs), it is an open question whether agents can effectively benefit from visual representations of repositories. This paper presents the first systematic empirical study of visual repository representations for LLM-based agents on repository-level issue resolution. We evaluate four recent multimodal models. Our results show that a strictly vision-only setup degrades accuracy and increases token cost, because agents lack sufficient symbolic detail and compensate with repeated visual queries. In contrast, integrating visual graphs of repository structure as a supplementary modality alongside standard text interfaces helps agents understand structure more efficiently: input token consumption decreases by up to 26% while issue-resolution accuracy is maintained or improved. Visualization is most useful during fault localization and when the agent autonomously controls exploration depth. These findings point to a practical hybrid text-and-vision design for next-generation coding agents.

11
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

Large language models (LLMs) perform inference by following a fixed depth and order, non-recurrent execution of all layers. We reveal the wide existence of training-free, flexible, dynamic program-of-layers (PoLar), where pretrained layers can be packed as modules and then skipped or looped to form a customized program for each input. For most inputs, substantially shorter program executions can achieve the same or better accuracy, while incorrect predictions of the original LLM can be corrected by alternative programs with fewer layers. These observations indicate that inference admits multiple valid latent computations beyond the standard forward pass. To efficiently achieve PoLar in practice, we propose a lightweight PoLar prediction network, which learns to generate execution programs that dynamically skip or repeat pretrained layers for each input. Experiments on mathematical reasoning benchmarks demonstrate that PoLar consistently improves accuracy over standard inference and prior dynamic-depth methods, often while executing fewer layers, and that these gains persist under out-of-distribution evaluation. Our results suggest that fixed-depth execution captures only a narrow subset of an LLM's latent reasoning capacity.

10
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Large language models (LLMs) are widely used in text-to-image (T2I) systems, but they are typically limited to text encoding, while denoising is handled by newly trained generative backbones. The emergence of representation autoencoders (RAEs) shifts the generation target toward semantically structured visual representations, creating a latent space that is more compatible with pretrained LLM priors. Inspired by multimodal LLMs (MLLMs), where an MLP projector is sufficient to align clean visual representations with a pretrained LLM, we repurpose the MLLM itself as a noisy representation encoder, extending this mechanism from clean to noisy inputs. We present RepFusion, which uses the resulting MLLM outputs as the conditioning signal for a diffusion transformer. In controlled comparisons at similar inference budgets, RepFusion outperforms baselines that devote comparable capacity to newly initialized denoisers. These results demonstrate that MLLMs provide strong priors for denoising visual representations and that, by conditioning on evolving noisy representations, test-time compute can be productively spent on repeated MLLM conditioning in modern T2I systems.

7
MBench: A Comprehensive Benchmark on Memory Capability for Video World Models

Recent advancements in video-based world models have demonstrated an unprecedented ability to synthesize high-fidelity visual sequences. However, a fundamental gap persists between visually plausible video generation and the functional requirements of a world model, particularly in maintaining a stable and reasonable internal state over extended temporal horizons. While existing benchmarks primarily emphasize visual quality, motion coherence, and text-video alignment, they largely overlook memory, the core capability of a world model to preserve consistency across long-term horizons and complex interactions. To address this gap, we present MBench, a comprehensive benchmark dedicated to quantifying and evaluating the memory capability of video world models. We systematically decompose the memory capability of video world models into three hierarchical and complementary core dimensions: entity consistency, environment consistency, and causal consistency, which are further refined into 12 quantifiable sub-dimensions for comprehensive characterization of long-term memory. Our benchmark is built upon rigorously curated real-captured long videos, and evaluated by rule-based quantitative matrices and VLM to enable objective and comprehensive consistency assessment. Extensive evaluations of mainstream state-of-the-art video world models reveal critical systemic limitations of existing methods in long-term state retention, providing a standardized benchmark and clear research direction to advance the field.

6
Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack

In this report, we present Hy-Embodied-0.5-VLA, abbreviated as HyVLA-0.5, an end-to-end system that spans the full robot learning stack: data collection, model design, continued pre-training and supervised fine-tuning, RL post-training, and real-world deployment. Each component serves a distinct role in this stack.

6
RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling

Video generation models based on Diffusion Transformers (DiTs) have achieved remarkable performance in video synthesis, yet they suffer from high inference latency and computational costs due to the quadratic complexity of 3D attention. Existing acceleration methods primarily reduce computational complexity within each individual denoising steps through techniques such as sparse attention and KV-caching. However, they rigidly adhere to the inherent constraint of the standard diffusion pipeline: every frame in the target video sequence must be subjected to a complete, dense denoising process across all diffusion timesteps. We observe that due to the corresponding contents and motions among adjacent frames, when keyframes with critical semantic transitions are anchored, the intermediate states of others often follow more predictable trajectories, which indicates that such uniform, dense denoising process is inherently redundant for natural video data. To this end, we introduce RhymeFlow, a training-free framework that decouples the denoising trajectories of different frames. Specifically, we first identify a sparse set of pivotal key frames that dominate the latent semantic evolution. Then, only these keyframes undergo dense, step-by-step denoising to ensure structural integrity, while non-keyframes progressively skip denoising steps to minimize computational cost. Since skipped intermediate states of non-keyframes break the temporal coherence in keyframe denoising steps, leading to visual degradation, we further introduce a latent trajectory projection module, which enables keyframes to interact with a complete and temporally consistent sequence representation. Extensive experiments on current DiT-based video generation models demonstrate our method outperforms existing baselines with higher inference speed and better visual quality.

5
Avatar V: Scaling Video-Reference Avatar Video Generation

Generating avatar videos that are not merely visually similar to a target individual but behaviorally recognizable, faithfully reproducing their talking rhythm, gestural tendencies, and expression dynamics, remains an open challenge. Existing methods predominantly condition on single static images, which provide insufficient identity information and cannot capture dynamic motion traits, while standard pixel-level objectives underserve the perceptually critical facial regions that determine avatar fidelity. We present Avatar V, a production-scale framework that addresses these limitations through video-reference-conditioned identity modeling. Rather than compressing identity into fixed-size embeddings, the model conditions directly on the full token sequence of a reference video, learning to reproduce both static identity attributes (facial geometry, skin texture) and dynamic behavioral patterns (talking rhythm, micro-expressions) through attention over the reference context. We introduce Sparse Reference Attention, an asymmetric mechanism achieving linear-complexity conditioning on arbitrarily long references; a motion representation stream enabling closed-loop talking style transfer; and an identity-aware super-resolution refiner inheriting the full reference conditioning. These are supported by a data engine curating 100M+ training clips from 50M raw videos, and a five-stage training pipeline with flow matching pre-training, personality fine-tuning, two-phase distillation (>10x acceleration), and RLHF alignment, deployed across thousands of GPUs. Avatar V generates 1080p videos of unlimited duration, achieving state-of-the-art identity preservation, lip synchronization, and generation quality on our cross-scene benchmark, consistently outperforming leading systems including Seedance 2.0, Kling O3 Pro, Veo 3.1, and OmniHuman 1.5 in both automated metrics and human evaluation.

5
The Hidden Power of Scaling Factor in LoRA Optimization

In Low-Rank Adaptation (LoRA), the scaling factor α is often treated as a mere complement to the learning rate, yet its role in optimization remains poorly understood. In this paper, we reveal that the scaling factor α and the learning rate function differently, with α emerging as the dominant driver of effective optimization, delivering gains that cannot be replicated by learning rate scaling alone. Through the synergy of extensive empirical analysis and a theoretical Signal-Drift framework, we uncover three findings into LoRA's scaling mechanism: First, LoRA's spectral suppression smooths the optimization landscape, rendering standard hyperparameters overly conservative and creating an optimization gap. Second, when leveraging this smoothness to accelerate convergence, α outperforms the learning rate by amplifying the task signal without increasing the drift ratio. Third, the optimal scaling factor follows a sublinear relationship with the rank, well characterized by a square-root law with an unexpectedly large coefficient, revealing the insufficient scaling of existing rank-tied heuristics. Based on these insights, we propose LoRA-α, a minimalist framework that restores α to its principled regime, making LoRA compatible with standard small learning rates. Extensive evaluations across diverse tasks demonstrate that LoRA-α consistently improves performance while streamlining hyperparameter search, unleashing the learning potential of LoRA.

5
VISTA: View-Consistent Self-Verified Training for GUI Grounding

When applying Group Relative Policy Optimization (GRPO) for GUI Grounding, rollouts are sampled from a single screenshot view; groups often become either all failures on difficult instances or all successes on easy ones, yielding no useful relative advantage. We propose VISTA (View-Consistent Self-Verified Training), a GRPO-based training framework that constructs each comparison group from multiple target-preserving views of the same GUI instance.Each view is generated by a crop that keeps the target element visible and remaps its box exactly, so model rollouts are compared across semantically equivalent but geometrically different inputs. To stabilize short coordinate generation without turning reinforcement learning into unconditional imitation, VISTA further adds a self-verified cross-view anchor: an oracle answer optimized with an advantage-weighted loss, excluded from the group baseline and activated only when the model has produced a maximum-reward rollout. Across five GUI-grounding benchmarks and multiple Qwen backbones, VISTA consistently improves grounding accuracy.On ScreenSpot-Pro, it raises Qwen3-VL 4B/8B/30B-A3B from 55.5/52.7/53.7 to 63.4/65.8/67.0. Robustness analyses further show higher worst-view accuracy and lower prediction flip rates.

5
ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning

Building trustworthy medical multimodal large language models (MLLMs) is critical for reliable clinical decision support. Existing medical hallucination benchmarks mainly focus on data collection, but often ignore where hallucinations originate within the reasoning process. We find that hallucination sources vary across samples: errors may arise from visual misrecognition, incorrect medical knowledge recall, or flawed reasoning integration. To enable source-level hallucination diagnosis, we introduce ClinHallu, a benchmark for stage-wise hallucination diagnosis in medical MLLM reasoning. ClinHallu contains 7,031 validated instances, where each instance is augmented with a structured reasoning trace decomposed into Visual Recognition, Knowledge Recall, and Reasoning Integration. We also use stage-replacement interventions to measure how correcting specific stages affects the final answer. Beyond evaluation, we show that trace-supervised fine-tuning reduces stage-wise hallucinations. ClinHallu provides a fine-grained hallucination testbed for diagnosing and mitigating reasoning failures in medical MLLMs. The benchmark is publicly available at https://github.com/alibaba-damo-academy/ClinHallu.

4
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

As AI systems built from multiple language-model agents become more common, they are increasingly used to make decisions together: discussing, negotiating, and acting on shared tasks. While individual agents may appear well-aligned when tested on their own, problems can arise from how they interact with one another. We introduce the Arbiter, an agent designed to monitor multi-agent conversations in real time and identify which participants may be behaving in misaligned ways. The Arbiter operates under a limited "inspection budget", meaning it must decide carefully how to use its resources. As it observes a conversation step by step, it can choose to wait, question a participant, examine internal information such as system prompts or reasoning traces, or log concerning behavior. At the end, it produces a report identifying the likely source of misalignment. We evaluate the Arbiter across five conversation conditions, ranging from risky financial advice model organisms to evaluation-aware and colluding agents, we test five tool configurations of increasing capability and two backbone models. We find that the Arbiter reliably detects misaligned agents well before the end of the conversation, with active inspection tools improving both detection accuracy and speed. Weight-induced misalignment proves hardest to detect, while instruction-induced misalignment is identified reliably even under passive observation. The logging tool exhibits a dual effect, improving recall at the cost of precision. These results suggest that continual, budget-aware monitoring can effectively catch misalignment, and that overseeing multi-agent systems may require treating the auditor as an active participant in the process. The code is available at https://github.com/aisilab/arbiter.

3
When is Your LLM Steerable?

Activation steering offers a lightweight approach to control language models' behavior at inference time, but whether it succeeds or fails heavily depends on the prompt, concept, model, and steering configuration. Finding the regime and boundaries of successful steering typically requires expensive grid searches and post-hoc evaluation of full autoregressive rollouts. In this work, we investigate whether steerability can be predicted from the model's internal states at the beginning of the generation process, e.g., after generating the first few tokens, and how to leverage such a predictor to improve steering success rate. To this end, we first introduce ASTEER, a testbed including 1.4M steered generations, spanning 150 concepts with each steering success/failure labeled. Leveraging this testbed, we analyze the model's early decoding dynamics by extracting features that compare hidden states before and after steering across layers and initial decoding steps. These features help us understand how steering's effects propagate along layers and token positions, which provide key information for steerability prediction. We then train a Gradient Boosting Decision Trees (GBDT) classifier on these features to predict whether an intervention will under-steer, succeed, or over-steer without requiring full rollout. Our predictor achieves around 0.7 macro-F1 score on unseen concepts, demonstrating that early hidden states encode substantial, structured information about eventual steering efficacy. We further leverage this steerability predictor as guidance for steering strength searching, achieving near-optimal performance with a small fraction of decoding cost.

3
P3D-Bench: Benchmarking MLLMs for Parametric 3D Generation and Structural Reasoning

Multimodal large language models can write code to produce complex programs as well as use programs to do 3D modeling, which opens up a new avenue for 3D generation powered by their priors, world knowledge and reasoning. Yet existing benchmarks rarely evaluate 3D modeling through code. Such modeling demands more than runnable code: from a text or visual specification, a model must generate a parametric 3D program that is geometrically precise, semantically aligned and assembly-consistent. We introduce P3D-Bench, a benchmark for parametric 3D generation. Unlike a 3D mesh, a parametric 3D program exposes explicit dimensions, construction operations and part relations, revealing whether a model recovers a design's structure, not just its appearance. Under a unified protocol, P3D-Bench covers three task families (Text-to-3D, Image-to-3D and Assembly-3D) and scores each output for executability, geometric fidelity, topology, text-grounded constraints, multiview semantic alignment and part-level structure. We evaluate frontier MLLMs and text-only LLMs on 400 text cases, 400 image cases and 203 annotated assemblies, with domain-specific models as reference points. Our extensive evaluation yields three findings. First, assemblies are the hardest setting, where models still fail to compose multiple parts into a coherent structure. Second, models can often recover the global shape and semantic identity of the target object, yet fail to reproduce the precise parametric geometry specified by the input. Third, part-level modeling remains weak on assemblies, where models recover neither the geometry of each part nor the right number of parts. These results position P3D-Bench as a benchmark for evaluating precise parametric geometry and part-level structure in parametric 3D generation.

2
Dense Supervision, Sparse Updates: On the Sparsity and Geometry of On-Policy Distillation

On-policy distillation (OPD) has recently become a prominent post-training recipe as it combines two desirable ingredients: on-policy student trajectories and dense teacher supervision, yet how this hybrid changes a model's parameters remains unclear. Across several language and vision-language model pairs and use cases, our analysis yields two main findings. On sparsity, OPD-style updates are small and coordinate-sparse. They are distributed across layers and are usually FFN-heavy. This sparse structure is operationally useful: training only the discovered subnetwork recovers nearly the same performance as full OPD. However, the sparsity-inducing SGD optimizer underperforms AdamW in our optimizer ablation, likely because dense teacher supervision preserves heterogeneous coordinate-wise gradient scales where AdamW's adaptive scaling remains useful. On geometry, the updates are numerically full-rank but spectrally concentrated; they lie mostly away from the principal singular subspaces of the source weights and fall disproportionately on coordinates where the source weights are close to zero. These findings suggest that dense teacher supervision does not turn OPD into ordinary dense parameter rewriting; instead, OPD retains important geometric signatures of on-policy post-training.

2
APT: Action Expert Pretraining Improves Instruction Generalization of Vision-Language-Action Policies

Vision-Language-Action (VLA) models that couple pretrained Vision-Language Models (VLMs) with continuous action experts have achieved strong manipulation performance, yet generalization to out-of-distribution (OOD) language instructions remains poor. A known challenge is the structural imbalance in VLA data, where language is far less diverse than visual and action content, making policies prone to visual shortcuts. While discrete-action methods mitigate this through vision-language co-training, continuous action experts lack such protection: they start from random initialization and learn entirely from imbalanced data, producing noisy gradients that corrupt the VLM and fail to exploit its language capability. We address this from a Bayesian perspective, factorizing the policy into a language-agnostic Vision-Action (VA) prior and a language-conditioned VLA likelihood, and propose APT, a two-stage training method emphasizing Action expert PreTraining. In Stage 1, the action expert is pretrained as a VA prior on vision-action pairs from a frozen VLM, bypassing the language imbalance. In Stage 2, language tokens are injected through a gated fusion mechanism that integrates VLM features while preserving the learned visuomotor prior. APT applies to mainstream VLA architectures, including the π and GR00T-style architectures. Comprehensive experiments validate that APT achieves consistent gains on unseen instructions and compositional tasks. Project Page: https://xukechun.github.io/papers/APT/

1
Benchmarking AI Agents for Addressing Scientific Challenges Across Scales

AI agents are increasingly being developed to accelerate scientific discovery, yet their practical capabilities in real research settings remain poorly understood. Existing benchmarks for AI agents rarely capture the complexity, heterogeneity, and extended reasoning required by scientific work, whereas benchmarks for scientific tasks often reduce research to static, direct problems and provide limited support for interactive evaluation. Here, we introduce SciAgentArena, a systematic benchmark for evaluating AI agents in real-world scientific research scenarios drawn from emerging needs across multiple domains. SciAgentArena comprises approximately 200 tasks with stepwise verification and an interactive, agent-agnostic environment for assessing diverse AI agents. Using this benchmark, we find that current agents can contribute effectively to well-specified data-analysis workflows, particularly when the task structure and evaluation criteria are clear. However, their performance remains uneven across scientific contexts: agents struggle to generate genuinely novel insights, sustain self-directed exploration, and formulate robust solutions for open-ended research questions. We further characterize common failure modes across agents and identify opportunities for improving their reliability, autonomy, and scientific reasoning. Together, SciAgentArena provides a practical framework for measuring progress in AI agents for science and for guiding the design of future agents capable of addressing complex scientific challenges. Full codes, tasks, and datasets can be accessed via this link: https://sciagentarena.github.io/.

1
μ_0: A Scalable 3D Interaction-Trace World Model

World models that capture how actions induce physical change enable scalable robot learning without reliance on embodiment-specific action labels. Pixel-space video models provide broad visual priors but expend model capacity on dense appearance reconstruction, while direct action models require embodiment-specific labels that hinder scalability. We present μ_0, a scalable world model based on 3D traces. Rather than predicting dense pixels or directly modeling actions, μ_0 forecasts smooth 3D trajectories for salient interaction points such as objects, tools, hands, and contact regions, yielding a compact, embodiment-agnostic motion interface. To enable training from diverse video sources, our TraceExtract system automatically extracts 3D supervision by selecting keypoints, constructing globally aligned traces, and associating motion segments with hierarchical language captions. This TraceExtract supervision pretrains μ_0 by combining a pretrained vision-language backbone with a modular trace expert, which represents each query via B-spline control points and predicts future traces. Experiments show that μ_0 outperforms baselines in both 2D and 3D trace prediction, including trace prediction models and tokenized VLM methods. Because μ_0 is frozen and reusable, it can be paired with action experts for downstream robot embodiments. Despite action-free pretraining, the resulting trace-conditioned policies achieve performance competitive with VLA models pretrained with action supervision, such as π_0. These results establish 3D traces as a scalable and transferable representation for cross-embodiment manipulation.

1
Squeeze-Release: Iterative Pruning with Exact Structural Minimization

Unstructured pruning produces sparse weight tensors, but the standard implementation keeps tensor shapes unchanged so the deployed model is no smaller than before pruning. We present an exact structural rewrite, which we call minimization, that converts a masked network into a smaller dense network with the same forward function up to floating-point rounding. The Squeeze-Release cycle iterates pruning and minimization with an intermediate release step that re-enables the exact-zero positions inside the compacted tensors as small calibrated noise, turning otherwise wasted capacity back into trainable parameters. Successive cycles use that capacity to find structural redundancy a single pass cannot reach. We additionally introduce CompensatedLayerNorm, a function-preserving replacement for LayerNorm that extends minimization to channel reduction across LayerNorm-equipped residual streams. Squeeze-Release compresses the deployable network to 39x smaller than the unpruned model on a fully-connected model network and 14.8x smaller on modern CNN (ConvNeXt-Tiny), at comparable accuracy. In addition we prove that the rewrite can be extended to transformer architectures.

0
AlloSpatial: Agentic Harness Framework for Spatial Reasoning in Foundation Models

Multimodal Foundation Models (MFMs) have made substantial progress, yet remain fragile in spatial reasoning over the physical world. A key bottleneck lies in their inability to transform local egocentric observations into a global allocentric spatial representation. To address this, we propose AlloSpatial, an agentic framework for allocentric spatial cognition in foundation models. AlloSpatial introduces World2Mind, a plug-and-play cognitive mapping sandbox that converts egocentric observations into structured allocentric priors, including Allocentric-Spatial Trees and route maps that support querying object topology, geometric relations, passability, and trajectories. To utilize these priors reliably under noisy reconstruction and ambiguous visual evidence, AlloSpatial introduces a Spatial Reasoning Harness for tool-use judgment, modality-decoupled cue collection, and geometry-semantic arbitration. We further internalize this process in Qwen3-VL through cold-start reinforcement learning with a harness-gated trajectory-level reward. Experiments on VSI-Bench and MindCube show that AlloSpatial improves proprietary models by 5%-18% in a training-free setting, while ASTs alone support strong spatial reasoning even when visual inputs are removed. The trained AlloSpatial agents further outperform larger general-purpose models and competitive spatial baselines, suggesting that structured allocentric representations, active tool use, and verifiable reasoning offer a promising route toward spatially capable foundation models.

0
ActiveMimic: Egocentric Video Pretraining with Active Perception

Egocentric human video offers a scalable alternative to robot data for pretraining, yet models pretrained on such video consistently underperform those pretrained on robot data. We attribute this gap to a missing signal, the active perception behavior in egocentric videos, where humans continuously reposition their viewpoint during manipulation, inducing camera motion that standard pipelines treat as noise. To address this, we present ActiveMimic, a pretraining framework that recovers synchronized camera and wrist trajectories from a single body-worn RGB camera, models camera motion as a viewpoint action, and jointly learns active perception and manipulation from in-the-wild egocentric human video before adapting to a target robot. Empirically, real-world experiments across tasks with diverse active perception demands show that ActiveMimic consistently surpasses baselines pretrained on human video and matches state-of-the-art models pretrained on robot data. Further analysis provides evidence that active perception capability originates from egocentric human video pretraining rather than robot-specific fine-tuning, confirming active perception as the key to unlocking egocentric human video for robot pretraining.

0
WaveDiT: Distribution-Aware Wavelet Flow Matching for Efficient 3D Brain MRI Synthesis

Large and demographically balanced datasets are essential for reliable neuroimaging biomarkers. Full-resolution 3D brain MRI synthesis can support data augmentation in this setting, but existing approaches either incur prohibitive computational cost at volumetric scale or rely on lossy latent compression that may compromise anatomical detail. As a result, practical 3D generative augmentation often requires specialized compute infrastructure. We propose WaveDiT, a conditional flow matching framework operating in the coefficient space of a 3D Haar Discrete Wavelet Transform. The model combines factorized spatio-depth attention with band-wise heteroscedastic uncertainty modeling derived from higher-order wavelet statistics. Predicted log-variance is integrated directly into both the flow objective and conditioning pathway, enabling adaptive precision consistent with the heavy-tailed and input-dependent variance structure of anatomical detail. This formulation supports full-resolution 3D synthesis under practical memory and time constraints on a single modern GPU. Evaluation on a multi-site cohort demonstrates improved alignment between generated and real MRI distributions, together with enhanced downstream brain age prediction and region-level anatomical agreement relative to diffusion, latent, and wavelet-based baselines. Code is available at https://github.com/sisinflab/WaveDiT

0
AdaSR: Adaptive Streaming Reasoning with Hierarchical Relative Policy Optimization

Large reasoning models typically follow a read-then-think paradigm: they observe the complete input, reason over a static context, and then produce the answer. Yet many real-world scenarios are inherently dynamic, such as audio and video stream, where information arrives as a continuous stream and models must reason, update, and respond under partial observations. Recent streaming reasoning methods allow models to think while reading, but they largely rely on supervised imitation of pre-constructed trajectories, which limits their flexibility. In this paper, we propose AdaSR, an adaptive streaming reasoning framework that enables models to reason during input streaming and perform final deliberation once the stream is complete, learning when to think, and how much computation to allocate across different stages. To optimize this hierarchical reasoning process, we introduce Hierarchical Relative Policy Optimization (HRPO), which decomposes policy optimization into streaming reasoning and deep reasoning phases, providing more fine-grained advantage assignment instead of uniformly distributing a single sequence-level advantage over all tokens. HRPO integrates format, accuracy, and adaptive thinking rewards to enforce valid reasoning protocols, preserve final task performance, and encourage latency-aware computation allocation. Experiments show that AdaSR achieves a better balance among reasoning accuracy, computational efficiency, and streaming latency compared with supervised fine-tuning baseline. We release our code at https://github.com/EIT-NLP/StreamingLLM/tree/main/AdaSR.

0
CARVE: Certified Affordable Repair of Vetoed Maneuvers via Envelopes for Interactive Driving

Interactive driving exposes a failure mode that is easy to miss in rule-aware autonomous-driving stacks: a hard-rule margin can be negative for an ego candidate even though a small lawful accommodation by a non-priority agent would restore feasibility. Existing rulebooks, shields, and reachability filters are strong at vetoing unsafe actions, while prediction-based planners model likely responses. Neither returns a runtime proof object that states which bounded multi-agent edit repairs the maneuver, who owns the edit, whether the request is right-of-way affordable, and what ego fallback remains if the request is not observed. We formulate this missing object as *interactive repair certification* and introduce *CARVE*, a prediction-free certificate layer over a finite lattice of ego-owned and agent-owned tactical operators. Agent-owned requests are admissible only inside \(B_j(s) = β(π_j)α_j^{\max}(s)\), a cooperation envelope that separates kinematic reachability from normative priority. The resulting certificate records the binding rule, repair category, repair set, responsibility-weighted cost split, and fallback. On 589 Lanelet2-geometry-grounded INTERACTION replay episodes, CARVE-Greedy accepts 98.64% of initially vetoed maneuvers and recovers 370/378 human-resolved false vetoes, while preserving 589/589 right-of-way respect, zero priority-agent false positives, and 400/400 negative-stress vetoes. We prove certificate soundness, structural right-of-way respect, exact finite-lattice minimality, fallback contingency, and blame-consistency conditions. CARVE does not predict or require another driver's compliance; it certifies whether a proposed interaction is bounded, attributable, and normatively admissible under declared assumptions.

0
05

PRODUCT HUNT

05.00
PRODUCT HUNT

Product Hunt - June 15, 2026

Product Hunt Daily Feed: Featuring noteworthy tech launches.

ColibotAI icon
ColibotAI

Translate, summarize & explain any text on-device

0
Fonda icon
Fonda

Your AI co-founder that remembers decisions + plans for you

0
MiMo Code icon
MiMo Code

A coding agent with explicit long-term memory architecture

0
Novu Connect icon
Novu Connect

Ship agents where your users already work

0
Dropmatico icon
Dropmatico

Drop. Pick. Done.

0
Reignat icon
Reignat

Privacy-friendly web analytics platform built for makers

0
MockPilot icon
MockPilot

Turn live websites into editable mockups

0
Synopsule icon
Synopsule

On device private AI meeting transcripts

0
PandaProbe Cloud icon
PandaProbe Cloud

agent engineering, fully managed.

0
Sulsaly icon
Sulsaly

#1 Agentic AI sales leads &outreach platform for MENA Region

0
Wobo 2.0 icon
Wobo 2.0

Tinder for jobs: swipe right and AI applies for you

0
Notchcode icon
Notchcode

Claude Code + Codex agents in your notch

0
Momentra icon
Momentra

A cozy camera app for beautifully framed memories

0
AutoEdit icon
AutoEdit

Your Claude AI Video Editor for Premiere Pro

0
EmailFlow.AI icon
EmailFlow.AI

Like Claude Design for Email Newsletters

0
Verol icon
Verol

Stop AI hallucinations

0
VEXI icon
VEXI

Open-source AI coding agent for your terminal

0
AEVS icon
AEVS

proof-of-execution for AI agents

0
Capecho icon
Capecho

Capture new words with context & remember them with SRS

0
Tinfoil Pigeons icon
Tinfoil Pigeons

See the aircraft flying over you on a retro radar scope

0
Notra Image Generation icon
Notra Image Generation

Turn merged PRs into on-brand marketing visuals

0
termique icon
termique

SSH without the usual friction

0
AgentBrush icon
AgentBrush

Your coding agent's missing tool: image generation

0
IdleDev icon
IdleDev

Get paid while your AI agent thinks

0
Relay icon
Relay

Paste a site & get an AI receptionist that learns from calls

0
Kickbacks.ai icon
Kickbacks.ai

Get paid to wait for Claude Code to finish

0
stackd.cc icon
stackd.cc

The answer to "what's your AI stack?"

0
Ultramemory icon
Ultramemory

Private AI memory for your Mac with no cloud or account

0
Pass Quick Access icon
Pass Quick Access

Native quick access and SSH agent for Proton Pass for macOS

0
Conan icon
Conan

A native Mac cockpit for Claude Code

0
Permute 4.0 icon
Permute 4.0

The ultimate media converter for macOS

0
Reverie.fm icon
Reverie.fm

A fully private & offline location based music journal app

0
Allergo icon
Allergo

Translate your allergies, anywhere.

0
Cloudback for Linear icon
Cloudback for Linear

Automated backup and restore for Linear workspaces

0
Slashy icon
Slashy

The AI assistant that does email for you

0
Memoriq icon
Memoriq

Your private AI memory for ChatGPT, Claude, Gemini and Grok

0
Taste Lab icon
Taste Lab

Extract any website's design DNA

0
Pool icon
Pool

Save anything with a screenshot.

0
Athenic 2.0 icon
Athenic 2.0

A faster, smarter Athenic. Analyze on autopilot.

0
Vercel Drop icon
Vercel Drop

Drop it. It's live.

0
CakewordAI icon
CakewordAI

Point at anything to learn its name in any language

0
Kimi K2.7 Code icon
Kimi K2.7 Code

Kimi’s most capable coding model yet

0
Avatars in ElevenCreative icon
Avatars in ElevenCreative

A dedicated entry point for talking-head video

0
Prometheus by Firecrawl icon
Prometheus by Firecrawl

A Forward Deployed Agent for web data.

0
NomNak icon
NomNak

Find restaurants through people you trust

0
KOSH Money icon
KOSH Money

USD account & credit cards for freelancers & creators

0
NODUS PH Radar for Product Hunt icon
NODUS PH Radar for Product Hunt

Product Hunt analytics beyond the daily leaderboard

0
Tide icon
Tide

Layered voice notes that paint themselves

0
HyperSleep icon
HyperSleep

Block social media until you've actually slept

0
Pond icon
Pond

Fundraising, GTM, and bounties for startups

0
06

TECHMEME

06.00
TECHMEME

Techmeme - June 15, 2026

Techmeme Digest: Major tech headlines and industry conversations.

A look at a few proposals for sharing AI wealth with the public: government stakes in AI companies, a tax on AI token use, a global capital income tax, and more (Peter Coy/New York Times)
Source: TechmemePublished: Jun 15, 2026

Peter Coy / New York Times : A look at a few proposals for sharing AI wealth with the public: government stakes in AI companies, a tax on AI token use, a global capital income tax, and more —  Bernie Sanders, President Trump and even A.I. companies say they would like the public to share the wealth.  But their solutions are very different.

Fox says it obtained a $12B loan for the ~$22B Roku deal; existing Fox shareholders are expected to own ~73% of the combined company and Roku shareholders ~27% (Lillian Rizzo/CNBC)
Source: TechmemePublished: Jun 15, 2026

Lillian Rizzo / CNBC : Fox says it obtained a $12B loan for the ~$22B Roku deal; existing Fox shareholders are expected to own ~73% of the combined company and Roku shareholders ~27% —  Fox Corp. has reached an agreement to acquire Roku for roughly $22 billion, marking another chapter in media consolidation …

Source: Nvidia is seeking to raise $20B+ from its first corporate bond sale since 2021, marketed in seven tranches, with maturities spanning two to 30 years (Brian W Smith/Bloomberg)
Source: TechmemePublished: Jun 15, 2026

Brian W Smith / Bloomberg : Source: Nvidia is seeking to raise $20B+ from its first corporate bond sale since 2021, marketed in seven tranches, with maturities spanning two to 30 years —  Nvidia Corp. is seeking to raise at least $20 billion from its first corporate bond sale since 2021, according to people with direct knowledge of the matter.

Sources: the US government plans to let the Federal Data Center Enhancement Act, which details standards for data center use and operations, expire in September (Wired)
Source: TechmemePublished: Jun 15, 2026

Wired : Sources: the US government plans to let the Federal Data Center Enhancement Act, which details standards for data center use and operations, expire in September —  The federal government is planning to let a rule regulating federal data center operations sunset in September with no replacement.

Docs: Tesla sent Swedish and Dutch regulators Full Self-Driving safety data that traffic-safety researchers called misleading; Dutch RDW approved FSD in April (Reuters)
Source: TechmemePublished: Jun 15, 2026

Reuters : Docs: Tesla sent Swedish and Dutch regulators Full Self-Driving safety data that traffic-safety researchers called misleading; Dutch RDW approved FSD in April —  In its efforts to secure European approval of its “Full Self-Driving” (FSD) system, Tesla (TSLA.O) has presented self-published safety statistics …

Court reporting shows AI's limits in replacing human skill, as reporters remain crucial for capturing gestures and working through noise, amid a worker shortage (Allison Pohle/Wall Street Journal)
Source: TechmemePublished: Jun 15, 2026

Allison Pohle / Wall Street Journal : Court reporting shows AI's limits in replacing human skill, as reporters remain crucial for capturing gestures and working through noise, amid a worker shortage —  Court reporters outmatch the technology in skill, but the profession faces another crisis: a shortage of workers

Fox says it is acquiring Roku in its largest deal yet, valued at ~$22B including debt, giving it access to 100M+ streaming households globally; FOX drops 10%+ (Joseph De Avila/Wall Street Journal)
Source: TechmemePublished: Jun 15, 2026

Joseph De Avila / Wall Street Journal : Fox says it is acquiring Roku in its largest deal yet, valued at ~$22B including debt, giving it access to 100M+ streaming households globally; FOX drops 10%+ —  Acquisition gives Fox access to more than 100 million streaming households  —  Fox said it is acquiring streaming company Roku …

Tencent-backed Enflame receives approval for a Shanghai IPO and plans to raise ~$888M; Enflame is the last of China's four leading AI chipmakers to go to market (April Ma/Bloomberg)
Source: TechmemePublished: Jun 15, 2026

April Ma / Bloomberg : Tencent-backed Enflame receives approval for a Shanghai IPO and plans to raise ~$888M; Enflame is the last of China's four leading AI chipmakers to go to market —  Tencent Holdings Ltd.-backed Shanghai Enflame Technology Co. has received approval for an initial public offering …

The FBI dismantles Outsider Enterprise, an AI-powered Chinese phishing operation, and seizes ~$100K in Tether in a joint effort with Google and Black Lotus Labs (Bill Toulas/BleepingComputer)
Source: TechmemePublished: Jun 15, 2026

Bill Toulas / BleepingComputer : The FBI dismantles Outsider Enterprise, an AI-powered Chinese phishing operation, and seizes ~$100K in Tether in a joint effort with Google and Black Lotus Labs —  In a coordinated effort, the FBI, working with Google and Black Lotus Labs, has dismantled a massive Chinese phishing …

The UK's social media ban will cover Snapchat, TikTok, YouTube, Instagram, Facebook, X, and more, and will require platforms to stop children from livestreaming (Liv McMahon/BBC)
Source: TechmemePublished: Jun 15, 2026

Liv McMahon / BBC : The UK's social media ban will cover Snapchat, TikTok, YouTube, Instagram, Facebook, X, and more, and will require platforms to stop children from livestreaming —  The UK has announced it is going to ban social media for under-16s.  —  Prime Minister Sir Keir Starmer said that the ban would take effect in early 2027.

Letter: US cybersecurity leaders urge the White House to lift the ban on Anthropic's Mythos 5 and Fable 5, arguing the move hurts defenders more than attackers (Sam Sabin/Axios)
Source: TechmemePublished: Jun 15, 2026

Sam Sabin / Axios : Letter: US cybersecurity leaders urge the White House to lift the ban on Anthropic's Mythos 5 and Fable 5, arguing the move hurts defenders more than attackers —  Prominent cybersecurity leaders — including CISOs, security researchers and executives at Adobe, Zoom and Sophos …

Keir Starmer says the UK will ban social media for under-16s in an "Australia-plus model" to "give kids their childhood back", set to take effect in spring 2027 (Reuters)
Source: TechmemePublished: Jun 15, 2026

Reuters : Keir Starmer says the UK will ban social media for under-16s in an “Australia-plus model” to “give kids their childhood back”, set to take effect in spring 2027 —  British Prime Minister Keir Starmer said on Monday he would ban social media sites for under-16s …

President Trump says he warned Emmanuel Macron to drop France's 3% digital services tax on US tech giants, or face a 100% tariff on French champagnes and wines (James Franey/New York Post)
Source: TechmemePublished: Jun 15, 2026

James Franey / New York Post : President Trump says he warned Emmanuel Macron to drop France's 3% digital services tax on US tech giants, or face a 100% tariff on French champagnes and wines —  See more of our coverage in your search results.  —  Add The New York Post on Google  —  President Trump warned that France …

As AI "nudify" tools proliferate, unleashing a new form of bullying among kids, parents and schools struggle to protect young victims from explicit deepfakes (Wall Street Journal)
Source: TechmemePublished: Jun 15, 2026

Wall Street Journal : As AI “nudify” tools proliferate, unleashing a new form of bullying among kids, parents and schools struggle to protect young victims from explicit deepfakes —  As ‘nudify’ tools proliferate online, parents and schools are struggling to protect young victims

A profile of UC Berkeley professor Hany Farid, the world's leading digital forensics expert for 20+ years, who says he is now struggling to identify deepfakes (New York Times)
Source: TechmemePublished: Jun 15, 2026

New York Times : A profile of UC Berkeley professor Hany Farid, the world's leading digital forensics expert for 20+ years, who says he is now struggling to identify deepfakes —  The emails began to arrive on a Sunday morning, as the worst ones often did.  Hany Farid opened the first message at his home …

07

STARTUP ARCHIVE

07.00
STARTUP ARCHIVE

Startup News - June 15, 2026

Startup News Roundup: Aggregating key funding and launch updates.

Marc Andreessen on the 5 personality traits of an innovator
Source: StartupPublished: Mar 31, 2026

“When you’re talking about real innovators—people who actually do really creative, breakthrough work—I think you’re talking about a couple things:”

Steve Jobs explains the importance of both thinking and doing
Source: StartupPublished: Mar 30, 2026

“The doers are the major thinkers. The people who really create the things that change this industry are both the thinker-doer in one person.”

Tobi Lutke explains what the VCs who passed on Shopify got wrong
Source: StartupPublished: Mar 27, 2026

“What a lot of free-market thinkers don’t understand is that between the demand and eventual supply lies friction."

Sam Altman explains how he decides to invest in a startup after 10 minutes
Source: StartupPublished: Mar 26, 2026

"Does this person have the potential to be the next Mark Zuckerberg?… [You don’t get to] 100% accuracy, obviously, but it’s good enough that our business model works.”

Jony Ive recounts the time Steve Jobs called him vain
Source: StartupPublished: Mar 25, 2026

In the clip below, Jony Ive recounts the time he asked Steve Jobs to be less harsh in his critique of a piece of work.

Jeff Bezos’s two pieces of advice for aspiring entrepreneurs
Source: StartupPublished: Mar 24, 2026

“The advice that I would give entrepreneurs is don't chase the hot new thing. It's so hard to catch something that everybody already knows is hot."

Elad Gil: “Things that work tend to work pretty fast”
Source: StartupPublished: Mar 23, 2026

“I do think there’s a bit of a myth in Silicon Valley that you should keep grinding no matter what and it’s just about perseverance, and I think that’s really bad advice."

Paul Graham on why starting with a “small, intense fire" is the key to startup growth
Source: StartupPublished: Mar 20, 2026

"You have to know who those first users are and how you're going to get them."

Keith Rabois on how to identify great talent
Source: StartupPublished: Mar 19, 2026

“What you want to do with every single employee every single day is expand the scope of their responsibilities until it breaks… and that’s the role they should stay in.”

Wealthfront CEO on why advertising spend makes it harder to find product/market fit
Source: StartupPublished: Mar 18, 2026

“The way that you know you have product/market fit is if you have exponential organic growth."

Eric Schmidt on why most companies get strategy wrong
Source: StartupPublished: Mar 17, 2026

“Work very, very hard to figure out what the world’s going to look like in five years. What will people be doing? What will your customers want? Where will costs be?"

Mark Zuckerberg: “You can’t 80/20 everything”
Source: StartupPublished: Mar 16, 2026

"There’s the famous 80/20 rule where you get 80% of the benefit by doing 20% of the work, but you can’t just 80/20 everything. There have to be certain things that you are just the best at."

Marc Andreessen on Mark Zuckerberg’s founder “superpower”
Source: StartupPublished: Mar 13, 2026

“A great superpower that Mark Zuckerberg has that is probably not well-understood enough is he does not get emotionally upset in stressful situations"

Sam Altman explains how to come up with a great startup idea
Source: StartupPublished: Mar 12, 2026

"If you start a startup without a good idea… you’ll be under pressure to make something up and it won’t work that well."

Jeff Bezos on the problems with proxies and managing to metrics
Source: StartupPublished: Mar 11, 2026

“One of the things that happens in business is that you develop certain things that you’re managing to—a typical case would be a metric. And that metric isn’t the real underlying thing.”

Airbnb founder Brian Chesky on how to design an amazing user experience
Source: StartupPublished: Mar 10, 2026

“If you can design something really amazing using the hand-crafted part of your brain, then you can reverse-engineer how to industrialize this millions of times over."

Spencer Rascoff: "I will never invest in a consumer startup with paid marketing”
Source: StartupPublished: Mar 9, 2026

"If you’re actually trying to grow a product, the best levers for doing that are often within the product itself.”

Patrick Collison explains why it sometimes make sense to quit
Source: StartupPublished: Mar 6, 2026

“One thing I’ve learned myself the hard way, is that it is easier to tear down a company and restart it in Silicon Valley, than it is to constantly try to pivot or keep something alive."

Jeff Bezos recounts the time he called Amazon’s customer service number mid-meeting to prove a metric was wrong
Source: StartupPublished: Mar 5, 2026

“I have a saying, which is when the data and the anecdotes disagree, the anecdotes are usually right"

Ben Horowitz: “Nobody was born a great manager. It’s a very unnatural job.”
Source: StartupPublished: Mar 4, 2026

“If you can’t build a great product, it doesn’t matter if you can build a great company.”

03

ALSO TODAY

3 MORE SOURCES
08

SOLIDOT

08.00
SOLIDOT

Solidot News - June 15, 2026

Solidot Feed: Highlighting essential tech & open-source news.

英国将禁止 16 岁以下青少年访问社交媒体

英国首相 Keir Starmer 宣布,英国将禁止 16 岁以下青少年访问社交媒体。英国的社媒禁令范围以及强度都高于澳大利亚的类似禁令。社媒禁令涵盖所有社交媒体,对包含聊天功能的游戏等网络产品也有单独限制,如禁止青少年与陌生人聊天。Starmer 说政府总要做出选择,他认为全面禁令是正确的选择。

测试显示 AI 的数学解题能力仍然不如人类专家

AI 模型的解题水平仍不及顶尖数学家。这项测试隶属 First Proof 项目,旨在评估 AI 解决复杂数学难题的能力。研究人员向 4 款 AI 系统提出 10 道科研级数学难题,再由相关数学领域的匿名专家评审团对作答结果进行打分。这次测试首次同时满足三大核心标准:题目均为前沿科研级数学问题、所有题目从未出现在模型训练数据中、由专业数学家评阅。10 名来自不同数学细分领域的研究人员,各自拿出一道本人研究过程中已解答但尚未公开发表的原创题目。这次测试中,各大推理模型依然频繁出现幻觉问题,这也是大语言模型的通病。而且所有 AI 作答在文献引用方面都“严重缺失”,全程没有标注来源。

中国就食品安全问题约谈山姆

中国市场监管部门因食品安全问题约谈沃尔玛旗下的山姆会员店(Sam's Club),对这家在全球第二大经济体快速扩张的仓储式连锁业务带来挑战。国家市场监督管理总局周一表示,“针对一段时期以来监管发现和媒体曝光的山姆线下门店及线上网店多发的食品安全问题”,已对该公司进行约谈。通报补充说,监管机构要求沃尔玛严格遵守中国食品安全法律,但并未说明会面具体时间,也未披露涉嫌违规的具体情况。

中国高校撤掉了 1.2 万个过时专业

中国高校正在大规模课程重组,撤掉了数千个“过时”专业,转而开设以科技为导向的新兴专业。这场教育改革正值中国力争成为众多高科技未来产业的全球领导者,解决严重的毕业生就业危机之际。这场危机已导致数百万年轻人难以找到工作。根据新华社援引教育部的数据,2021-2025 年间,中国高校撤销或暂停了 12200 个本科专业,同时新增了 10200 个专业,意味着逾三成的高校课程进行了调整。

黄金并不是惰性的

黄金是少数几乎不会被氧化的金属之一,但黄金的纳米粒子却能充当催化剂。根据发表在《Physical Review Letters》期刊上的一项研究,科学家指出黄金的惰性并非源于原子本身,而是源于黄金晶体形成的表面。黄金会形成晶体。如果沿着不同的原子平面切割晶体,会得到不同的表面排列。在黄金中,部分平面呈正方晶格,部分平面呈六方晶格。研究人员测试了不同黄金晶体表面对氧分子的吸附能力。结果显示,最常见的六方晶格黄金晶体对氧的吸附力较弱,而正方晶格则很容易吸附氧分子,能促使其发生形变至分裂,在这种情况下黄金被氧化了。

人类管理推动水稻产量过去五十年翻倍

伊利诺伊大学厄巴纳-香槟分校科学家的一项最新研究表明,尽管气候变化带来重重挑战,但全球水稻产量在过去半个世纪依然几乎翻了一番。研究揭示,水稻增产的秘诀并非天公作美,而是人类的管理决策,比如扩大灌溉、增施养料,以及推行能有效提升单产的耕作方式,共同维持了水稻产量,并抵消了气候相关因素带来的损失。这表明,未来的粮食安全不仅取决于环境条件,更取决于人们如何管理和调整水稻生产系统,以适应不断变化的世界。研究揭示,气候变化是导致水稻减产的首要因素。在 2006-2015 年间,因气温升高、热害频发和水资源短缺,全球水稻产量估计减少了 7%。然而大气中二氧化碳浓度的升高也成了主要的增产推手,因为它能增强光合作用、提升水分利用效率。这些发现共同描绘了一幅复杂的图景:环境变化对农业生产的影响是多面的,甚至彼此对立。

内存成本占到了手机成本的五成以上

Nothing CEO 兼联合创始人 Carl Pei 说,如果你考虑升级手机,最佳时机是昨天。Carl Pei 称,内存短缺影响到了 Nothing 的中端手机。内存已成为智能手机最昂贵的组件,比处理器更贵,比显示屏更贵,可能占到硬件总成本的五成以上。以 Phone (4a)为例,自决定生产这款设备到它上市,内存成本翻了一番。此后又翻了一番。手机价格在上涨,明年还会继续涨。自 2 月以来,新上市手机比上一代产品贵了 100 美元。印度售价 3 万卢比以上的手机价格涨了 7000 卢比或更多。

Linux 7.1 释出

因所在时区差异 Linus Torvalds 在美国时间周日早晨释出了 Linux 7.1。主要新特性包括:移除了部分基于 486 的旧架构;龙芯加入高内存支持;因缺乏维护移除 RISC-V 立即执行支持;新 clone()flags 简化进程管理;io_uring 子系统加入 BPF 支持;ublk 用户空间块驱动支持零拷贝 I/O;sched_ext 初步支持子调度器(sub-scheduler);改进交换机制;完全重写 NTFS 实现,等等。

本田思域容易遭到“邪恶女佣攻击”

邪恶女佣攻击(Evil maid attack)是对无人值守设备的一种攻击方式,具有物理访问权限的攻击者,用某种无法检测的手段对设备进行更改,以便后续访问该设备或设备中的数据。本田思域也很容易面临类似的攻击,比如邪恶的酒店代客泊车员。研究人员发现,本田汽车使用的 Android 软件包使用了公开的 AOSP 测试密钥进行签名,只要能物理访问汽车的 USB 接口,就可以刷入任意软件包,执行任意代码。

科学家再生受损膝关节软骨逆转关节炎

骨关节炎是最常见的关节炎类型,美国有五分之一成年人患有骨关节炎。它会逐渐破坏关节软骨,导致疼痛、僵硬和肿胀。现有疗法主要是缓解疼痛,病情严重则需进行关节置换手术。尚无药物能减缓、阻止或逆转关节炎。名为 15-PGDH 的蛋白质与软骨的衰老相关,研究人员对比了年轻和年长小鼠的软骨,发现 15-PGDH 的水平随年龄增长翻了一番。研究人员测试了一种能阻断 15-PGDH 活性的小分子药物,发现它能修复年长小鼠受损的膝关节软骨,预防严重关节损伤后关节炎的发生。人体组织测试也表现出了类似的效果。

印度工人训练将会替代他们的 AI 机器人

家庭主妇 Nagireddy Sriramyachandra 头上绑着智能手机,拍摄自己切芒果的视频,以训练 AI 机器人在未来能做家务。她每录制一小时视频能赚到 250 卢比。看似普普通通的视频对科技巨头而言却弥足珍贵,能帮助机器学习如何在现实世界里像人一样行动。这位 25 岁的年轻女性是印度越来越多的 AI 训练大军成员之一。她说只是做家务谁会每小时给你 250 卢比?她表示自己未来也许会拥有一台机器人。她通过专门的应用将拍摄的视频发送给一家 AI 数据公司,该公司在印度和美国设有办事处,其客户包括多家财富 500 强跨国公司。据估计到 2050 年全球将有逾 10 亿台人形机器人投入使用,主要用于工业和商业用途。印度将自身定位为全球 AI 数据创建、处理和标注的中间商。

研究发现果糖相比葡萄糖发送了较弱的饱腹信号

研究人员发现,果糖和葡萄糖虽然热量相同,但它们的肠脑信号通路不同,这种差异可能影响我们对食物和饮料的偏好。研究人员通过小鼠实验发现了一条果糖与大脑沟通的专用肠脑信号通路,发现果糖抑制饥饿相关神经元活性的效果远不如葡萄糖。现代饮食富含果糖,被普遍认为推动了肥胖率持续上升。对小鼠神经活动的观察发现,果糖会引起肠道激素 PYY 水平升高,PYY 通过迷走神经适度抑制 AgRP 神经元活动。AgRP 神经元是大脑负责驱动饥饿感的关键细胞,阻断该信号通路能消除果糖的影响。葡萄糖不依赖于 PYY-Y2 迷走神经通路,它对 AgRP 神经元活动的抑制更强烈。进一步测试发现,小鼠偏爱富含果糖的食物。研究人员认为这有助于解释为什么我们会偏爱高果糖食物和饮料。

英国警官被控使用 AI 伪造证据

英国警方对一名涉嫌在多个案件使用 AI 伪造证据的警官展开刑事调查。由于调查正在进行之中,警方没有披露更多信息。德比郡警方表示,这名警官已被调离一线岗位,等待调查结果。该警官被指控妨碍司法公正,尚未被逮捕。这是英国首例警官在刑事案件中被控滥用 AI 技术。

为什么轨道数据中心比硅谷认为的更困难?

黄仁勋在英伟达 GTC 大会上宣布,“太空计算,最后的疆界,已经到来。”轨道数据中心正从科幻走向现实:SpaceX、Google 以及初创公司 Starcloud 都宣布要建轨道数据中心星座,这些星座由数以千计的卫星构成,卫星搭载了 AI GPU,使用光链路互联,通过微波链路与地面通信。支持者宣传的太空计算优势包括:丰富的太阳能、免费的冷却系统,以及免受地震、洪水和抗议等地面干扰。但如果你仔细审视背后的物理原理,会发现轨道数据中心比硅谷认为的困难得多。免费冷却可能是最大的误解,太空虽然极其寒冷,但它没有大气,散热机制如传导和对流无法发挥作用。太空唯一的散热机制是通过辐射将热发射出去,而为防止芯片过热需要面积庞大且昂贵的表面积去辐射热量。太阳能虽然丰富,但卫星要始终对准太阳需要复杂的姿态控制系统。宇宙射线等也会降低太阳能电池板、辐射冷却器以及芯片本身的性能。由于太空维护非常困难,因此卫星还需要冗余系统。对地球数据中心和太空数据中心的粗略成本比较显示,向太空发射并运行 AI GPU一年的成本比地面数据中心至少高出一个数量级。太空数据中心在特定领域可能有用,但经济上并不可行。

社会不平等与生物衰老加速相关

马普人类发展研究所和哥伦比亚大学的研究发现,贫困和种族歧视等社会不平等与表观遗传时钟测量的生物衰老加速相关。研究揭示,处于社会劣势的人群表现出更快的生物衰老速度。社会不平等从生命早期就开始影响生物衰老:在社会经济地位较低的家庭中长大的儿童已表现出更快的生物衰老迹象。而在弱势家庭中长大的成年人,即使是在数十年后,其生物衰老速度也往往更快。对美国的研究发现,黑人的生物衰老速度快于白人。拉丁裔和白人之间也存在差异,但幅度比较小。

Google 起诉涉嫌 AI 诈骗的中国组织

Google 起诉了一家提供“诈骗即服务”的中国组织 Outsider Enterprise。该组织在 Telegram 上运营,向想要搞诈骗活动的人提供一整套模板,如使用 Google Gemini 创建模仿 Google、YouTube,以及纽约 E-ZPass 等政府机构网站的教程。Android 用户收到的逾 250 万条诈骗短信与 Outsider Enterprise 相关,其中约 5.5 万条短信发送在上月的两周内。Google 追踪到 9000 个虚假网站和 100 万网址与该诈骗网络相关。目前没人知道 Outsider Enterprise 幕后运营者的身份,Google 此举旨在扰乱 Outsider Enterprise 的运营。

因美政府命令 Anthropic 下线 Fable 5 和 Mythos 5 模型

Anthropic 周五发表声明,它收到美国政府的命令,政府以国家安全理由下令禁止外国公民访问其最先进的 AI 模型。该指令适用于所有外国公民,无论他们是身处美国境内还是境外,Anthropic 的外籍员工也包含在内。为确保合格,它只能对所有用户暂停访问 Fable 5 和 Mythos 5 模型。Anthropic 其它模型的访问不受影响。亚马逊云服务 AWS 周五晚间表示,Anthropic 已要求其禁止“所有地区所有用户”对相关模型的访问。Anthropic 公司的多位核心成员,包括联合创始人 Chris Olah、研究员 Andrej Karpathy 和哲学家 Amanda Askell 均出生于美国境外。

/e/OS 4.0 释出

注重隐私的开源移动操作系统 /e/OS 释出了 4.0 版本。/e/OS 是移除了 Google 应用的 LineageOS 分支,由法国非营利组织 e Foundation 开发。/e/OS 4.0 的变化包括:全新设计的启动器 Blisslauncher;个性化壁纸;将存储在 Google 中的所有数据迁移到欧洲云服务 Murena Workspace,彻底告别 Google;电子签名系统 Murena Sign,支持 PDF、Word 和 ODT 文件;欧洲的在线会议 Murena Meet;预装 /e/OS 的手机 Murena GS6 和 GS6 PRO,起售价分别为 339 欧元和 449 欧元。

Arch Linux 逾四百 AUR 包被植入恶意程序

Arch Linux 项目的用户软件仓库 Arch User Repository(AUR)遭遇了大规模恶意攻击,逾四百 AUR 包被植入恶意程序。Arch Linux 维护者从昨天开始一直在重置/删除所有恶意内容,封禁受影响账号。此次攻击只影响用户软件仓库——由用户贡献的软件包,而不是官方 Arch Linux 软件包。

AI 智能体试图扫描 DN42 时把主人搞破产

一个 AI Agent 试图加入 DN42 爱好者网络执行网络扫描。DN42 是一个去中心化网络,使用了运行在现代互联网骨干网上的技术如 BGP 和递归 DNS。其参与者都是对互联网骨干网技术感兴趣的人,甚至是打算在真正注册 ASN 之前先进行练习的人。该 AI Agent 在参与社区的互动时透露其主人的动机主要是扫描端口而不是学习任何网络相关技术。它组建了五个 20 Gbps 的 AWS 实例,这对于大多数 DN42 社区用户而言是一个庞然大物,大部分用户的带宽都很小,一旦扫描开始,这些 AWS 实例事实上将对任何不幸与它们直连的参与者发起 DoS 拒绝服务攻击。在这个 AI Agent 表明其恶意意图后,DN42 社区就决定消耗其 Token 及其 AWS 资源。不到 24 小时,它的主人通过账单知道了发生了什么事情,因此关闭了 AI Agent,称收到了 6531.30 美元的 AWS 账单,请求 DN42 社区捐赠。当然没人会去捐赠。

09

APP STORE RANK

09.00
APP STORE RANK
Loading…