TEXT VIEW · TODAY'S DIGEST · 36 HEADLINES ACROSS 8 SOURCES

Startup Archive(0)

No items yet for today.

App Store Rankings(0)

No items yet for today.

ISSUE 0906
WED, JUN 24, 2026
OrangeBot.AI 智能策划和筛选每日科技趋势和新闻,为您节省时间。
TODAY · WED, JUN 24, 2026

The web,
read by a bot.

Ten sources — Hacker News, Product Hunt, HuggingFace, Techmeme and more — filtered, tagged, and summarized every morning for builders who don’t have time to scroll.

新功能!我们推出了用于保存推文和Reddit帖子的Chrome扩展程序。点击安装!
01

AI DIGEST

UPDATED DAILY · EDITOR'S PICK
01.00
AI DIGEST

AI新闻摘要

June 24, 2026

Here is a summary of today's main news events.

Tech Stocks Stabilize After Major AI-Fueled Selloff

Global technology stocks are showing signs of steadying after a sharp, two-day decline. The selloff was driven by investor concerns that the recent artificial intelligence boom has created a market bubble, with worries about stretched valuations, high spending by AI companies, and the potential for future interest rate hikes.

South Korean Chipmaker SK Hynix Plans Historic Share Sale

South Korean semiconductor giant SK Hynix announced plans for a massive share sale to capitalize on its crucial role in the booming artificial intelligence industry. If completed, the deal would be one of the largest in history, comparable in scale to the 2019 IPO of Saudi Aramco, reflecting intense investor interest in AI hardware.

UK's Presumptive Prime Minister Plans to Shift Power from London

The UK's likely next prime minister has signaled a major policy shift aimed at decentralizing power away from Westminster. The plan, inspired by the governance model in Manchester, represents a significant attempt to rebalance political and economic power across the country. Meanwhile, his incoming chief of staff is reassuring party members about the new leadership's economic management strategy.

China Tightens Control Over Critical Mineral Exports

Beijing is increasing its oversight of critical mineral exports by launching a new whistleblower hotline to report violations. The move signals an escalation in trade tensions with partners like Japan. At the World Economic Forum's "Summer Davos," Chinese Premier Li Qiang dismissed complaints from trading partners about the country's economic practices.

Oil and Gold Prices Fall While the Dollar Strengthens

Key commodity markets saw prices decline today. Oil futures fell as investors weighed potential supply increases from the Middle East, while gold prices dropped for a third day amid a strengthening U.S. dollar. In bond markets, U.S. Treasury yields decreased as traders scaled back expectations for future interest rate hikes due to signs of a slowing economy.

02

ON THE WIRE

6 SOURCES
02

HACKER NEWS

02.00
HACKER NEWS

Hacker News - June 24, 2026

Hacker News Feed: Highlighting key posts and discussions.

Printing Gaussian Splats

(www.patreon.com)

34039
Jerry's Map

(www.jerrysmap.com)

52956
F3

(github.com)

638130
Mistral OCR 4

(mistral.ai)

473128
The Coming Loop

(lucumr.pocoo.org)

401278
Will It Mythos?

(swelljoe.com)

311222
03

HUGGINGFACE

03.00
HUGGINGFACE

HuggingFace 新闻 - June 24, 2026

HuggingFace Feed:最新的 AI 模型、数据集和社区动态。

Qwen-AgentWorld: Language World Models for General Agents

A world model predicts environment dynamics based on current observations and actions, serving as a core cognitive mechanism for reasoning and planning. In this work, we investigate how world modeling based on language models can further push the boundaries of general agents. (i) We first focus on building foundation models for agentic environment simulation. We introduce Qwen-AgentWorld-35B-A3B and Qwen-AgentWorld-397B-A17B, the first language world models capable of simulating agentic environments covering 7 domains via long chain-of-thought reasoning. Leveraging more than 10M environment interaction trajectories of 7 domains in real-world environments, we develop Qwen-AgentWorld through a three-stage training pipeline: CPT injects general-purpose world modeling capabilities from the state transition dynamics and augmented professional corpora, SFT activates next-state-prediction reasoning, and RL sharpens simulation fidelity through a tailored framework with hybrid rubric-and-rule rewards. To evaluate language world models, we present AgentWorldBench, a comprehensive benchmark constructed from real-world interactions of 5 frontier models on 9 established benchmarks. Empirical results demonstrate that Qwen-AgentWorld significantly outperforms existing frontier models. (ii) Beyond foundation models, we further investigate two complementary paradigms through which world modeling enhances general agents. First, as a decoupled environment simulator, Qwen-AgentWorld supports scalable and controllable simulation of thousands of real-world environments for agentic RL, yielding gains that surpass real-environment training alone. Second, as a unified agent foundation model, world-model training acts as a highly effective warm-up that improves downstream performance across 7 agentic benchmarks. Code: https://github.com/QwenLM/Qwen-AgentWorld

68
NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

We introduce NatureBench, a cross-discipline benchmark of 90 tasks distilled from peer-reviewed Nature-family publications, designed to evaluate whether AI coding agents can move beyond reproduction toward discovery on real scientific problems. NatureBench is built on NatureGym, an automated pipeline that constructs a standardized, per-task containerized environment from a source paper, addressing the environment-fragmentation problem that has limited the credibility of prior agent-on-research benchmarks. Evaluating ten frontier agent configurations under a strict web-search-disabled protocol, we find that the strongest model surpasses SOTA on only 17.8% of tasks under the g>0.1 criterion. Analysis of method pathways reveals that agents succeed primarily through methodological translation, converting scientific tasks into familiar supervised prediction problems, rather than through genuine scientific invention. Failures are dominated by wrong method choice and insufficient compute budget, not by task misunderstanding. We release the benchmark, the NatureGym pipeline, and a public leaderboard with maintainer-side reproduction. Code: https://github.com/FrontisAI/NatureBench

44
MobileForge: Annotation-Free Adaptation for Mobile GUI Agents with Hierarchical Feedback-Guided Policy Optimization

MLLM-based mobile GUI agents have made substantial progress in UI understanding and action execution, but adapting them to real target apps remains costly because mobile apps are numerous, frequently updated, and hard to cover with human-written tasks, demonstrations, or reward labels. Existing annotation-free GUI learning reduces manual supervision, yet lacks a unified substrate connecting target-app exploration, curriculum mining, rollout execution, and feedback, while policy optimization often relies on isolated rollouts and coarse rewards that are hard to convert into reliable improvement signals. We present MobileForge, an annotation-free adaptation system for mobile GUI agents. MobileForge consists of MobileGym, which grounds task generation and rollout evaluation in real mobile app interaction, and Hierarchical Feedback-Guided Policy Optimization (HiFPO), which turns trajectory outcomes, step-level process feedback, and corrective hints into hint-contextualized step-level GRPO updates. Using only automatically generated annotation-free adaptation data, MobileForge adapts Qwen3-VL-8B to 67.2% Pass@3 on AndroidWorld, close to the closed-data GUI-specialized GUI-Owl-1.5-8B base model at 69.0%. The MobileForge-adapted ForgeOwl-8B further reaches 77.6% Pass@3 on AndroidWorld and 41.0% success on the out-of-domain MobileWorld GUI-only split, establishing the strongest open-data mobile GUI agent in our evaluation. Code, data, and trained models will be released at https://mobile-forge.github.io/.

31
MemGUI-Agent: An End-to-End Long-Horizon Mobile GUI Agent with Proactive Context Management

MLLM-based mobile GUI agents have made substantial progress on short-horizon tasks, yet remain unreliable on long-horizon tasks that require retaining intermediate facts across many steps and app transitions. We attribute this limitation to ReAct-style prompting, which passively accumulates per-step records, leading to prompt explosion and dilution of critical cross-app facts. To address this, we introduce MemGUI-Agent, an end-to-end long-horizon mobile GUI agent with proactive context management. MemGUI-Agent is built on Context-as-Action (ConAct), which casts context management as first-class actions emitted by the same policy that selects UI actions. Instead of passively appending history, ConAct maintains three structured context fields: folded action history, folded UI state, and recent step record, preserving critical UI facts while keeping context compact. To make proactive context management learnable across model scales, we construct MemGUI-3K, a 2,956-trajectory dataset with full ConAct annotations for supervised training and offline analysis. Training an 8B model on MemGUI-3K produces MemGUI-8B-SFT, an 8B MemGUI-Agent that achieves the best open-data 8B performance on MemGUI-Bench and generalizes to the out-of-distribution MobileWorld benchmark. Code, data, and trained models will be released at https://memgui-agent.github.io/.

30
AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

AI agents are driving a new software paradigm, with the ability to autonomously call tools, extract information, manage memory, and complete tasks that span applications and data sources. Most existing end-user operating systems, however, are designed for application-centric workflows and offer little native support for AI agents. This mismatch limits the wider adoption of agents and leads to execution overhead and safety risks when running agents on conventional systems. While the concept of agent-native operating systems is emerging, the research community lacks an open testbed to explore the architectural primitives desired for agent-mediated interaction. We present AOHP (Android Open Harness Project), an OS-level agent harness built on the Android Open Source Project (AOSP). The core design principle of AOHP is to treat agents as first-class OS actors, enabling adaptive user interfaces and agent-friendly runtime environments. AOHP preserves the mature Android software and hardware ecosystem while introducing three agent-oriented system mechanisms: personalized service composition, efficient agent interfaces, and secure information flow. Based on preliminary experiments on challenging tasks covering key capabilities of OS agents, AOHP shows clear advantages in task completion (+21.12% completion rate), execution cost (-51.55% token cost), and security-policy compliance.

24
LingxiDiagBench: A Multi-Agent Framework for Benchmarking LLMs in Chinese Psychiatric Consultation and Diagnosis

Mental disorders are highly prevalent worldwide, but the shortage of psychiatrists and the inherent subjectivity of interview-based diagnosis create substantial barriers to timely and consistent mental-health assessment. Progress in AI-assisted psychiatric diagnosis is constrained by the absence of benchmarks that simultaneously provide realistic patient simulation, clinician-verified diagnostic labels, and support for dynamic multi-turn consultation. We present LingxiDiagBench, a large-scale multi-agent benchmark that evaluates LLMs on both static diagnostic inference and dynamic multi-turn psychiatric consultation in Chinese. At its core is LingxiDiag-16K, a dataset of 16,000 EMR-aligned synthetic consultation dialogues designed to reproduce real clinical demographic and diagnostic distributions across 12 ICD-10 psychiatric categories. Through extensive experiments across state-of-the-art LLMs, we establish key findings: (1) although LLMs achieve high accuracy on binary depression--anxiety classification (up to 92.3%), performance deteriorates substantially for depression--anxiety comorbidity recognition (43.0%) and 12-way differential diagnosis (28.5%); (2) dynamic consultation often underperforms static evaluation, indicating that ineffective information-gathering strategies significantly impair downstream diagnostic reasoning; (3) consultation quality assessed by LLM-as-a-Judge shows only moderate correlation with diagnostic accuracy, suggesting that well-structured questioning alone does not ensure correct diagnostic decisions. We release LingxiDiag-16K and the full evaluation framework to support reproducible research at https://github.com/Lingxi-mental-health/LingxiDiagBench.

18
OpenThoughts-Agent: Data Recipes for Agentic Models

Agentic language models dramatically expand the applications of AI yet little is publicly known about how to curate training data for broadly capable agents. Existing open efforts such as SWE-Smith, SERA, and Nemotron-Terminal typically target a single benchmark, leaving open the question of how to train models that generalize across diverse agentic tasks. The OpenThoughts-Agent (OT-Agent) project addresses this gap with a fully open data curation pipeline for training agentic models. We conduct more than 100 controlled ablation experiments to systematically investigate each stage of the pipeline, yielding insights on the importance of task sources and diversity. We then assemble a training set of 100K examples from our pipeline and fine-tune Qwen3-32B on this dataset, which yields an average accuracy of 44.8% across seven agentic benchmarks and a 3.9 percentage point improvement over the strongest existing open data agentic model (Nemotron-Terminal-32B, 40.9%). Moreover, our training data exhibits strong scaling properties, outperforming alternative open datasets at every training set size in compute-controlled comparisons. We publicly release our training sets, data pipeline, experimental data, and models at openthoughts.ai to support future open research on agentic model training.

12
FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Generating explorable 3D scenes from a single image requires strong generative priors and accurate geometric representations suitable for downstream use. Current video diffusion models offer high-quality generation and implicitly encode multi-view geometric structure in latent space. However, existing feedforward latent scene decoders typically output volumetric 3D Gaussians that lack a well-defined surface, limiting their use in simulation or standard graphics pipelines. This motivates decoding surface-aligned primitives that are not only renderable but also closer to explicit geometric assets. We ask whether compressed video diffusion latents can be mapped directly to explicit surface primitives in a single pass. To this end, we introduce FLAT and, for the first time, show that triangle splats can be decoded directly from video diffusion latents. Compared with decoding 3D Gaussians, predicting flat primitives is notoriously more challenging due to high sensitivity to primitive orientations, oftentimes leading to poor gradient flow. FLAT solves with two key ingredients: a ray-centered rotation parameterization for triangle regression and a novel product window function that improves gradient flow during differentiable triangle rendering. On standard benchmarks, FLAT achieves significantly better geometric accuracy while maintaining competitive visual quality compared to state-of-the-art feedforward baselines. We further show that a lightweight test-time refinement step converts the predicted triangle soup into a fully opaque, game-engine-ready representation that supports real-time rendering. By evaluating 3DGS, 2DGS, and triangle splatting variants under an identical training setup, we provide the first systematic analysis of representation tradeoffs in feedforward scene generation. The project page is available at https://flat-splat.github.io

10
FedOT: Ownership Verification and Leakage Tracing via Watermarks for Federated LDMs

Training Latent Diffusion Models (LDMs) within Federated Learning (FL) has attracted increasing attention due to its ability to combine the powerful generative capacity of LDMs with the privacy-preserving properties of FL. However, FL requires sharing the global model with multiple participants, which risks unauthorized model distribution or resale by malicious clients. While an intuitive approach is to adopt existing VAE-based watermarking techniques for LDMs in FL, this strategy falls short in addressing such threats due to two fundamental challenges: (1) Existing methods support ownership verification but lack the ability to trace model leakage to a specific malicious client; (2) VAE-based watermarks are vulnerable, as they can be removed simply by replacing the decoder with a clean counterpart. In this paper, we propose FedOT, the first framework for ownership verification and leakage tracing in federated LDMs. Specifically, to address the first challenge, we design a chunked watermark, where the first part is for ownership verification, and the second part is used for client identification. Furthermore, to overcome the second challenge and secure the model against VAE replacement attack, we introduce Latent Vector Transformation (LVT), which strengthens the connection between the VAE and U-Net latent spaces by modifying the original latent distribution of the VAE. Consequently, any attempt to replace the VAE for watermark removal leads to significant image quality degradation, making the LDM model unusable. Extensive experiments demonstrate that FedOT achieves superior performance in both ownership verification and traceability. Project page: https://spyzixuan.github.io/FedOT/.

9
Semantic Browsing: Controllable Diversity for Image Generation

Modern text-to-image models excel in visual fidelity and prompt adherence. However, this strict adherence comes at the cost of diversity: generated samples tend to collapse into a single visual interpretation. Existing methods to improve diversity produce outputs driven by incidental variations rather than meaningful design choices. This motivates a new variant of the diversity task where structure is enforced on the generated samples. We introduce a method for controlled diversity that enables Semantic Browsing, where users can navigate structured image galleries and experience creative exploration through a systematic traversal of meaningful, interpretable axes of variation. Achieving this level of semantic control requires a deep understanding of the scene. We exploit the fact that recent text-to-image models are trained on elaborated captions, effectively decoupling semantic decision-making from pixel generation. This enables a paradigm shift: instead of relying on stochastic variation within the text-to-image model, we induce diversity directly at the text level. By leveraging rich textual representations, we allow a Vision Language Model (VLM) to operate on the full scene context. To overcome the generic outputs typical of standard VLMs, we employ an agentic workflow that explicitly enforces structured variation attuned to the original prompt. We demonstrate that our method produces diverse and navigable design spaces where every variation corresponds to a specific, user-understandable semantic decision.

7
Escaping the Self-Confirmation Trap: An Execute-Distill-Verify Paradigm for Agentic Experience Learning

Experience-driven self-evolution is critical for large language model (LLM) agents to improve through open-world interaction. However, existing experience learning methods mostly rely on single-agent loops, where the same agent executes tasks, summarizes outcomes, and determines memory content. This setup makes agents vulnerable to the Self-Confirmation Trap: wrong-but-self-consistent trajectories are misidentified as successful experience, leading to cumulative errors during retrieval and reuse. To address this issue, we propose EDV, an Execute-Distill-Verify framework for reliable experience learning. In the Execute stage, multiple heterogeneous agents explore the same task space in parallel to generate diverse candidate trajectories. In the Distill stage, a dedicated third-party agent comparatively analyzes these trajectories to produce candidate experiences, reducing executor-centric summarization bias. In the Verify stage, the execution group validates candidates via a consensus mechanism, and only approved experiences are written into shared or private memory. By decoupling the three stages, EDV transforms experience learning from isolated self-reflection into collaborative construction, filtering erroneous and noisy content before memory insertion. We evaluate EDV on three challenging long-horizon benchmarks: tau2-bench, Mind2Web and MMTB. Results show EDV consistently outperforms strong baselines, validating that reliable experience construction is essential for robust agent self-evolution. Our code is available at https://github.com/shidingz/EDV.

6
Are Text-to-Image Models Inductivist Turkeys? A Counterfactual Benchmark for Causal Reasoning

Text-to-image (T2I) generation models have achieved remarkable progress in producing visually realistic images from natural language prompts. Yet it remains unclear whether their success reflects genuine causal understanding or sophisticated pattern matching over visual-textual correlations. Inspired by Russell's inductivist turkey, we introduce Counterfactual-World (CF-World), a counterfactual benchmark designed to investigate whether text-to-image models can generate images under rules that systematically contradict real-world priors. CF-World organizes each scenario into three progressive levels: factual generation under ordinary world knowledge, explicit counterfactual generation with direct visual instructions, and implicit counterfactual generation requiring causal deduction from altered rules. We evaluate both open-source and closed-source T2I models using a Vision Language Model (VLM)-based evaluator (CF-Eval). Furthermore, we introduce two metrics: Prior Resistance Rate (PRR), which measures a model's ability to overcome entrenched real-world priors, and Reasoning Retention Rate (RRR), which assesses whether models can maintain reasoning-dependent counterfactual generation without explicit visual cues. Experiments show that all models exhibit sharp degradation from factual to counterfactual settings. Further analyses suggest that these failures arise because current T2I models encode world knowledge and visual appearances as tightly coupled patterns. Consequently, their heavy reliance on frequent visual co-occurrences within the training data forces them to default to familiar commonsense priors when tasked with rendering counterfactual worlds.

5
DiffusionBench: On Holistic Evaluation of Diffusion Transformers

Diffusion transformer (DiT) research on image generation has converged to a single evaluation setup: class-conditional generation on ImageNet. While methods improve the FID and related metrics, it is increasingly unclear whether they reflect real progress in generative modeling. The natural alternative, i.e., text-to-image (T2I) generation, is perceived as too costly or inconvenient to train and evaluate and is often skipped. We argue that this perception no longer holds. We introduce NanoGen, a unified DiT training and evaluation framework. NanoGen matches state-of-the-art DiT baselines on ImageNet and, with 12 lines of configuration change, also trains competitive text-to-image models. It currently supports RAE, VAE, pixel-space, and MeanFlow diffusion methods under both ImageNet and T2I setups. Under NanoGen, training T2I requires comparable compute to ImageNet. After training 21 latent diffusion models with NanoGen, we observe that method ranking shows no strong correlation between ImageNet and T2I generation: Pearson correlation is between -0.377 and -0.580 across three metrics. This suggests that a method which improves class-conditional ImageNet FID may show no corresponding improvement on T2I, clearly indicating the necessity of evaluating DiTs on both tasks. To this end, we summarize ImageNet and text-to-image results, which yields DiffusionBench, a holistic benchmark for DiT research. We recommend reporting DiffusionBench in place of ImageNet alone: methods that improve DiffusionBench are more likely to reflect broader progress.

4
Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning

The composition of training data, governed by the diversity of sources and their mixing strategy, is a cornerstone of Large Language Model (LLM) pre-training. Online Data Mixing (ODM), the technique of adaptively adjusting data mixtures during training, has emerged as a promising direction to improve efficiency. However, existing methods are constrained by their reliance on a singular optimization perspective, which fundamentally overlooks the need for complex LLM pre-training to consider the dynamic data composition from multiple dimensions. To overcome this limitation, we introduce the Holistic Data Scheduler (HDS), a novel online data mixing framework. HDS formulates the data scheduling challenge as a reinforcement learning problem in a continuous control space and leverages the Soft Actor-Critic (SAC) algorithm for its stability and sample efficiency in exploring the high-dimensional policy space. At the core of HDS lies a novel multi-objective, holistic reward function that integrates three critical perspectives: a data-driven reward for quality, a loss-driven reward capturing inter-domain influence, and a model-driven reward based on weight norms. To validate our design and determine its optimal configuration, we conducted systematic experiments on LLMs of various sizes. On The Pile benchmark, HDS reaches the final validation perplexity of the next best method with 44% fewer training iterations. Furthermore, it achieves a 7.2% improvement on the MMLU 0-shot task along with consistent gains on other benchmarks, showcasing its ability to enhance both training efficiency and final model capability.

2
ChartWalker: Benchmarking the Cross-Chart RAG Task

Cross-Chart Retrieval-Augmented Generation (RAG) is critical for complex multi-modal analytical tasks in scientific, business, and political domains. However, existing benchmarks either focus on tables, which are well-structured and textualized, or generate cross-chart questions by simply extracting key points, which often induces lexical overlap between queries and evidence and yields logically inconsistent reasoning chains. To address this, we introduce ChartWalker, a novel framework for constructing challenging cross-chart RAG tasks. ChartWalker features a hierarchical knowledge graph construction method tailored to charts, which organizes entities and relations by granularity to preserve analytical structure. We then propose a structure-aware sampling algorithm that synthesizes semantically coherent, multi-hop reasoning paths, enabling explicit control over query difficulty and granularity for QA generation. Built with this framework, we release ChartWalker-Bench, a comprehensive benchmark spanning diverse domains and cross-chart query types. Extensive evaluations across major RAG paradigms reveal significant performance gaps, underscoring the benchmark's difficulty and utility. Furthermore, we provide ChartWalker-Agent, an agentic baseline to facilitate analysis and inspire future system design.

1
QG-MIL: A Gated Transformer Aggregator for Domain-Agnostic Multiple Instance Learning in Medical Imaging

Attention-based Multiple Instance Learning aggregators in medical imaging are prone to attention concentration, producing overconfident and unstable predictions. We introduce QG-MIL, a gated transformer aggregator that addresses this through four synergistic architectural components: RMSNorm-based pre-normalization, per-head QK normalization, fine-grained attention output gating, and SwiGLU-style feed-forward modules. Together, these design choices stabilize training and distribute attention more uniformly across instances without auxiliary losses, masking, or multi-stage regularization. We evaluate QG-MIL across six benchmarks spanning whole-slide pathology and cell-level hematology, covering two fundamentally different MIL scales. The best-performing QG-MIL variants outperform leading baselines on all six benchmarks, with an average improvement of +6.1 mean macro F1 points. Attention overlays and attention mass analysis confirm more distributed instance weighting. Ablation studies show that while individual components can match the full model on specific datasets, the QG-MIL design provides the most consistent cross-domain performance and tightest variance when compared to selected baselines. We release a configurable implementation to support reproducibility at: https://github.com/unica-visual-intelligence-lab/QG-MIL

1
EventVLA: Event-Driven Visual Evidence Memory for Long-Horizon Vision-Language-Action Policies

Memory remains a critical bottleneck for long-horizon robotic manipulation, as standard Vision-Language-Action (VLA) policies often fail when task-relevant cues become occluded or unobservable over time. While existing memory-augmented methods utilize historical context, they either suffer from severe information bottlenecks, incur high latency via decoupled dual systems, or rely on unselective buffers that accumulate massive visual redundancies. To address these limitations, we introduce EventVLA, an end-to-end framework founded on the concept of sparse visual evidence memory that comprises two core components: foundational visual anchors to retain initial and short-term contexts, and a dynamic Keyframe Evidence Memory (KEM) module. Specifically, KEM directly predicts future keyframe probabilities from the VLA's latent embeddings to autonomously capture and store sparse, task-critical visual events. This foresight-driven mechanism empowers the policy to dynamically evaluate the future causal utility of current observations, preserving transient visual evidence before it becomes unobservable. Furthermore, we propose RoboTwin-MeM, a diagnostic benchmark specifically designed to evaluate non-Markovian manipulation tasks with interactive visual evidence. Extensive evaluations show that across 17 memory-requiring simulation tasks and 4 real-world bimanual tasks, EventVLA achieves an average success rate improvement of +40% over state-of-the-art memory-augmented VLAs.

1
FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning

Multimodal driving planning faces a long-standing tension between two paradigms: scoring-based methods benefit from dense reward supervision but are confined to a fixed action vocabulary, while anchor-based methods generate proposals dynamically yet suffer from sparse supervision constrained to a single ground-truth trajectory. In this work, we propose FlowR2A, which resolves this tension by reframing simulation-based rewards from discriminative targets into generative conditions. By learning the reward-conditioned action distribution from dense trajectory-reward pairs with a flow-matching decoder, FlowR2A unifies the dense supervision of scoring-based methods with the proposal generation of anchor-based methods in a single generative model, forcing the model to internalize the correlation between an action and its outcomes in safety, progress, comfort, and rule compliance. To balance hard safety constraints against soft progress objectives, we introduce fine-grained per-timestep reward conditioning and reward noise augmentation. The generative formulation naturally supports controllable test-time sampling via reward guidance and anchored sampling, producing high-quality proposals. FlowR2A achieves state-of-the-art results on the NAVSIM v1 and v2 benchmarks, with multimodal proposals of substantially higher quality than prior methods.

1
DREAM: Dense Retrieval Embeddings via Autoregressive Modeling

Dense retrieval embedding models are a fundamental component of modern retrieval-based AI systems. Most dense retrievers are trained with contrastive objectives, which require labeled positive and negative document pairs that are often costly and difficult to obtain. In this work, we investigate whether the autoregressive next-token prediction objective of a large language model (LLM) can provide supervision for dense retrieval. The intuition is simple: if a document contains information relevant to a query, conditioning on that document should make the target output easier for the LLM to predict. A key challenge is that the next-token prediction loss is computed inside the LLM, while the retriever is a separate embedding model. To address this challenge, we propose DREAM (Dense Retrieval Embeddings via Autoregressive Modeling), which injects retriever-generated query-document similarity scores into selected attention heads of a frozen LLM. During training, these scores determine how much attention each candidate document receives while the LLM predicts the target output. The resulting prediction loss provides gradients for retriever training through the attention mechanism. We evaluate DREAM on retrieval benchmarks BEIR and RTEB using embedding backbones ranging from 0.5B to 3B parameters. DREAM consistently outperforms existing baselines across different model scales. These results demonstrate that DREAM provides a promising approach for training dense retrievers through autoregressive modeling.

1
ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection

Multimodal misinformation detection is increasingly important because viral posts now combine long multilingual narratives, several images, mixed provenance, and subtle text--image framing errors. Existing benchmarks and methods remain poorly matched to this setting: they usually isolate short captions, single images, binary labels, or one manipulation source, while agentic verification remains costly under realistic evidence search. We present ReMMD, a realistic multilingual multi-image agentic verification framework for multimodal misinformation detection. ReMMD includes ReMMDBench, a real-world multimodal misinformation detection benchmark with 500 samples, 2,756 images, five monolingual languages, two cross-lingual settings, three text-length tiers, multi-image posts, five-way veracity labels, eight distortion labels, evidence provenance, and rationales. It also includes ReMMD-Agent, a persistent-memory verifier that decomposes posts into atomic points, builds a reusable evidence set, and predicts structured L1/L2/L3 outputs. Across proprietary systems, open LVLMs, MMD-Agent, and T2-Agent, ReMMD-Agent obtains the best five-way veracity performance, with 41.80% accuracy and 39.12% macro-F1 using GPT-5.2, while reducing cost by 17.5% relative to MMD-Agent and 79.9% relative to T2-Agent. The project is available at https://dang-ai.github.io/ReMMD.

1
World Value Models for Robotic Manipulation

Generalist value models play a pivotal role in scaling robotic policy learning from large-scale, mixed-quality data. Mathematically, accurate value estimation demands deep temporal understanding, requiring models to both ground the current belief using historical context and plan over future outcomes. However, most existing robotic value models are built on Vision-Language Model (VLM) backbones that are pretrained primarily on static or temporally sparse visual observations, lacking the requisite temporal modeling capabilities for value estimation. Unlike VLMs, world models naturally excel at temporal modeling and future planning, making them ideal foundations for learning generalizable value functions. Driven by this insight, we marry world models with value estimation to construct a new generalist robotic value model, World Value Model (WVM), that offers accurate task progressions to assess data quality. On standard benchmarks, WVM delivers state-of-the-art (SOTA) Value-Order Correlation (VOC) results. Complementing standard evaluation suites that contains only expert data, we further introduce Suboptimal-Value-Bench, a multi-embodiment benchmark consisting of 800 suboptimal trajectories with high-fidelity, human-labeled frame annotations. Our evaluations show that WVM maintains its SOTA performance on Suboptimal-Value-Bench, establishing its robustness in handling both expert and suboptimal data. When deployed for policy learning, WVM improves manipulation performance across various policy extraction approaches in both simulated and real-world deployment, providing robust guidance for learning from mixed-quality data.

1
AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Large language models are increasingly deployed as agents that reason over documents rather than answer from parametric knowledge. We study archive-grounded reasoning: locating sparse evidence across a large, messy collection of workplace files, reconciling inconsistent terminology, units, and time conventions, and computing an answer. Existing benchmarks address only parts of this setting and none jointly stresses archive-groundedness, agentic exploration, and cross-domain coverage. We introduce Agora, a benchmark pairing 362 questions with eight domain collections of 9,664 authentic documents and 372M tokens, far exceeding any model's context window, so agents must explore deliberately rather than scan exhaustively. Agora is built by an agentic pipeline combining cross-document task synthesis, leakage-preventing obfuscation, and difficulty filtering. Evaluating eight models, we find the task far from solved: even the strongest reaches only 59.4% accuracy, with notable variation across domains.

0
An Efficient Method for the Optimal Control of Microgrids Under Uncertainties using Local Reduction

The problem of optimal sizing and power scheduling in microgrids subject to uncertainties is well known to the control community. Commonly, the optimal control problem is cast as a mixed-integer program to model the logical constraints arising in energy storage systems, and is then solved approximately using numerical methods such as the scenario approach. In this paper, we propose and compare two formulations of a robust microgrid sizing and power scheduling optimal control problem with logical constraints and uncertainties in the user's power demand, solar power generation, grid electricity prices and battery efficiencies. The first formulation uses binary variables and big-M constraints, leading to a mixed-integer linear program. The second formulation casts the problem as a continuous nonlinear program through an exact smooth reformulation of the logical constraints, consisting of additional modelling variables and non-convex constraints. We then propose a novel local reduction algorithm, extending an existing method, to solve both problems. The two formulations are compared by evaluating the solutions returned by local reduction using 100,000-sample Monte Carlo simulations and achieve promising results, with both averaging feasibility rates above 90%.

0
FLUX3D: High-Fidelity 3D Gaussian Generation with Diffusion-Aligned Sparse Representation

Sparse voxel representation has emerged as a scalable foundation for image-to-3D Gaussian Splatting (3DGS) generation, yet current methods struggle to preserve high-frequency visual details of input images due to two structural bottlenecks. First, they adopt discriminative 2D features optimized for semantic abstraction to construct sparse voxel latents, which suppress reconstructive cues and induce a representation bottleneck. Second, in the generation stage, standard diffusion transformers lack effective mechanisms to align dense 2D image tokens with sparse 3D voxel latents, resulting in a cross-modal correspondence bottleneck. To address these issues, we propose FLUX3D, a scalable image-to-3DGS framework that boosts both representation learning and cross-modal alignment during generation. We first revisit 2D feature selection for sparse-voxel-based 3D representation learning, propose Diffusion-Aligned Structured Latents (DA-SLAT) and couple it with a decoder-only architecture to improve 3DGS reconstruction fidelity. We also design a sparse-structure-aware diffusion framework, which integrates the Sparse-structure Multimodal Diffusion Transformer (SMDiT) and Modal-Aware Rotary Positional Embedding (MARoPE) to achieve geometry-agnostic 2D-3D alignment. Extensive benchmark experiments demonstrate that FLUX3D yields substantial improvements in appearance fidelity and significantly outperforms all state-of-the-art (SOTA) methods in generating high-quality 3DGS assets.

0
05

PRODUCT HUNT

05.00
PRODUCT HUNT

Product Hunt - June 24, 2026

Product Hunt Daily Feed: Featuring noteworthy tech launches.

Crewdle AI icon
Crewdle AI

Use every business AI tool without every subscription

0
Swimio icon
Swimio

AI swim coach with Apple Watch tracking & smart workouts

0
Ruby icon
Ruby

Ask better questions, live on every call

0
StaleMate PR icon
StaleMate PR

Your menu bar turns red when PRs pile up

0
Tencent EdgeOne Makers icon
Tencent EdgeOne Makers

Ship AI agents like web apps, in minutes.

0
Nimt icon
Nimt

Your AI Search Coworker in Slack

0
FUTO Swipe icon
FUTO Swipe

Open models for on-device swipe typing

0
Propane icon
Propane

Automatic customer context for product teams and agents

0
React UI Kit V7 icon
React UI Kit V7

All the chat components you need. None of the complexity

0
Customer Relationship Agents by Clarify icon
Customer Relationship Agents by Clarify

The M in CRM shouldn't be you

0
Mindstone Rebel icon
Mindstone Rebel

AI workspace for agents that know your work and ask first

0
Stripe.Directory icon
Stripe.Directory

New way for you & agents to search for businesses on Stripe

0
Buy by Agentcard icon
Buy by Agentcard

Order DoorDash from Claude

0
Deckwise icon
Deckwise

AI presentation agent for editable decks

0
NanoCorp icon
NanoCorp

Found a company in one sentence - from website to ads

0
Amnesia icon
Amnesia

A Mac app that asks why you opened that tab

0
OpenArt Director icon
OpenArt Director

Direct cinematic videos through chat

0
Bluerails Discovery icon
Bluerails Discovery

The rails AI agents use to find and pay you

0
LogStitch icon
LogStitch

Find AWS Lambda failures fast, right on your Mac

0
Buddy AI Note icon
Buddy AI Note

Your daily memo that turns notes into a plan

0
Rosply icon
Rosply

AI agent that controls your computer autonomously

0
Tufte icon
Tufte

CDN and Node package to generate ASCII graphs inline

0
Thumbmagic icon
Thumbmagic

AI thumbnail generator trained on top-performing thumbnails

0
Hush icon
Hush

Open-source noise suppression for voice AI agents

0
Sakana Fugu icon
Sakana Fugu

One Model to Command Them All

0
Conduit icon
Conduit

Fix the tool-list bloat slowing your AI agent

0
jebi icon
jebi

A supercharged terminal for Mac with built-in local AI

0
BestDefense.io icon
BestDefense.io

Pentest and patch every deploy with AI

0
wildbirds icon
wildbirds

Birdwatchers app to share and discover birds socially

0
prepros icon
prepros

Run your brand shoots from start to finish

0
NeuralAgent 3.0 icon
NeuralAgent 3.0

AI that executes UI actions on your computer in ~285ms

0
Jotform AI App Builder icon
Jotform AI App Builder

Turn ideas into powerful apps within seconds

0
Blazly SEO icon
Blazly SEO

Dominate SEO with an AI content operating system

0
Sipcode icon
Sipcode

Keep Claude Code's context clean for sharper answers

0
HotkeyClash icon
HotkeyClash

Find where your Mac keyboard shortcuts clash

0
Latitude icon
Latitude

Fix what's breaking in your AI agent

0
Cotypist icon
Cotypist

Local AI Autocomplete in your voice, anywhere on your Mac

0
Steam Machine icon
Steam Machine

A tiny, powerful PC for big-screen gaming

0
readywhen icon
readywhen

Your 24/7 AI Chief of Staff for commitments and follow-ups

0
Selector Forge icon
Selector Forge

Browser extension for AI-generated resilient selectors

0
MediaSeg icon
MediaSeg

Split large media files into upload-ready chunks on macOS

0
Photoroom API icon
Photoroom API

Transform product images at scale with one image editing API

0
Clawd icon
Clawd

A context-aware browser mascot with 100% local offline AI

0
OnBrand by SlideSpeak icon
OnBrand by SlideSpeak

Design context for AI agents

0
Agentic Document Extraction icon
Agentic Document Extraction

Make the world's documents computable

0
Skybridge icon
Skybridge

The full-stack open source React framework for MCP Apps

0
AgentX icon
AgentX

Evaluate AI agent, pinpoint issues, and fix with one click.

0
Alai 2.0 icon
Alai 2.0

AI design partner for presentations, social posts, and more

0
HAQQ Legal AI on Mobile icon
HAQQ Legal AI on Mobile

Bringing legal understanding to anyone with a phone

0
uwait icon
uwait

Get paid while AI thinks

0
06

TECHMEME

06.00
TECHMEME

Techmeme - June 24, 2026

Techmeme Digest: Major tech headlines and industry conversations.

NYC-based Taktile, which helps fintechs build automated decision-making workflows, raised a $110M Series C led by Goldman Sachs, and plans a São Paulo office (Camila Grigera Naón/Fortune)
Source: TechmemePublished: Jun 24, 2026

Camila Grigera Naón / Fortune : NYC-based Taktile, which helps fintechs build automated decision-making workflows, raised a $110M Series C led by Goldman Sachs, and plans a São Paulo office —  Banks and insurance companies spend billions of dollars to employ staff to screen risky transactions, process claims, and onboard new customers.

Microsoft's Digital Crimes Unit says AI helped it link two separate hacking tools, Amadey and StealC, and file a single civil lawsuit to help take them down (Lorelei Smillie/Bloomberg)
Source: TechmemePublished: Jun 24, 2026

Lorelei Smillie / Bloomberg : Microsoft's Digital Crimes Unit says AI helped it link two separate hacking tools, Amadey and StealC, and file a single civil lawsuit to help take them down —  Investigators used new tools to defeat old malware technology.  —  Microsoft Corp. deployed artificial intelligence to link …

Chinese cybersecurity company 360 unveils new AI tools: Tulongfeng, which it claims is "China's version of Mythos", and Yitianzhen, to automate cyber defense (Eduardo Baptista/Reuters)
Source: TechmemePublished: Jun 24, 2026

Eduardo Baptista / Reuters : Chinese cybersecurity company 360 unveils new AI tools: Tulongfeng, which it claims is “China's version of Mythos”, and Yitianzhen, to automate cyber defense —  Chinese cybersecurity firm 360 Security Technology (601360.SS) has developed what it calls a domestic answer to Anthropic's Mythos …

OpenAI and Broadcom unveil Jalapeño, an LLM-optimized inference chip developed from design to manufacturing tape-out in nine months, aided by OpenAI's models (OpenAI)
Source: TechmemePublished: Jun 24, 2026

OpenAI : OpenAI and Broadcom unveil Jalapeño, an LLM-optimized inference chip developed from design to manufacturing tape-out in nine months, aided by OpenAI's models —  Loading...  - Early testing shows that the first-generation accelerator will deliver performance per watt substantially better than current state-of-the-art

Google Home Speaker review: the $100 speaker, the company's first such device in six years, works for Gemini power users but has underwhelming music playback (Chris Welch/Bloomberg)
Source: TechmemePublished: Jun 24, 2026

Chris Welch / Bloomberg : Google Home Speaker review: the $100 speaker, the company's first such device in six years, works for Gemini power users but has underwhelming music playback —  The Google Home Speaker is the first new device of its kind to come from Alphabet Inc. in six years.

Amazon plans to expand Amazon Now to 300+ Indian cities and towns, up from 100 at present, as CEO Andy Jassy visits India; a source says he is set to meet Modi (Sankalp Phartiyal/Bloomberg)
Source: TechmemePublished: Jun 24, 2026

Sankalp Phartiyal / Bloomberg : Amazon plans to expand Amazon Now to 300+ Indian cities and towns, up from 100 at present, as CEO Andy Jassy visits India; a source says he is set to meet Modi —  Amazon.com Inc. is multiplying the Indian cities and towns covered by its Amazon Now quick deliveries, an aggressive move …

Qualcomm says it will acquire Modular, which builds a chip software platform and has a proprietary coding language, in a nearly $4B deal set to close in H2 2026 (Lauren Goode/Wired)
Source: TechmemePublished: Jun 24, 2026

Lauren Goode / Wired : Qualcomm says it will acquire Modular, which builds a chip software platform and has a proprietary coding language, in a nearly $4B deal set to close in H2 2026 —  Modular, one of the most promising chip software startups of the AI era, heads for a multibillion-dollar exit.

Rockstar says physical GTA 6 copies will contain a download code, not a disc; physical copies will be available from November 12, before the November 19 launch (Tom Phillips/IGN)
Source: TechmemePublished: Jun 24, 2026

Tom Phillips / IGN : Rockstar says physical GTA 6 copies will contain a download code, not a disc; physical copies will be available from November 12, before the November 19 launch —  Keep it on the download.  —  Rockstar will sell physical copies of Grand Theft Auto 6 — though you'll only get a box containing a digital download code.

The US FDA drops an enforcement complaint against Whoop over its blood pressure tracking tool, reversing a July 2025 warning letter; Whoop is modifying the tool (Samantha Kelly/Bloomberg)
Source: TechmemePublished: Jun 24, 2026

Samantha Kelly / Bloomberg : The US FDA drops an enforcement complaint against Whoop over its blood pressure tracking tool, reversing a July 2025 warning letter; Whoop is modifying the tool —  The US Food and Drug Administration dropped its complaint against fitness tracker brand Whoop Inc. over its blood-pressure tracking tool …

An analysis of GPT-5.5, Gemini 3.1 Pro, Grok 4.3, Gab's Arya, and other AI models: most chatbots frequently provide left-leaning responses to political prompts (Kevin Schaul/Washington Post)
Source: TechmemePublished: Jun 24, 2026

Kevin Schaul / Washington Post : An analysis of GPT-5.5, Gemini 3.1 Pro, Grok 4.3, Gab's Arya, and other AI models: most chatbots frequently provide left-leaning responses to political prompts —  Excerpts from each chatbot's responses to political questions  —  Left-leaning argument  —  Right-leaning  —  ChatGPT  —  Gemini

How Taiwan's Hsinchu Science Park, home to TSMC, became especially wealthy; one neighborhood reported 2023 average household incomes of $146K+, ~5x the average (New York Times)
Source: TechmemePublished: Jun 24, 2026

New York Times : How Taiwan's Hsinchu Science Park, home to TSMC, became especially wealthy; one neighborhood reported 2023 average household incomes of $146K+, ~5x the average —  Fortunes, luxury buildings and birthrates are rising in the city at the center of Taiwan's chip supply chain.

Rockstar sets the release date for GTA VI for November 19 and says it will cost $79.99, or $99.99 for the Ultimate Edition; preorders start at midnight tonight (Stevie Bonifield/The Verge)
Source: TechmemePublished: Jun 24, 2026

Stevie Bonifield / The Verge : Rockstar sets the release date for GTA VI for November 19 and says it will cost $79.99, or $99.99 for the Ultimate Edition; preorders start at midnight tonight —  The game is set to launch on November 19th. The game is set to launch on November 19th.

How Chicago is betting on quantum computing, including turning the site of its former US Steel mill into a campus, after largely missing the digital revolution (Jeanne Whalen/Wall Street Journal)
Source: TechmemePublished: Jun 24, 2026

Jeanne Whalen / Wall Street Journal : How Chicago is betting on quantum computing, including turning the site of its former US Steel mill into a campus, after largely missing the digital revolution —  Illinois is betting on a promising—but commercially unproven—technology  —  CHICAGO—The Lake Michigan docks that once received iron ore …

Q&A with AWS CEO Matt Garman on the parallels between early AWS and AI, Quick, AI coding, Amazon's $200B capex in 2026, entry-level jobs changing, and more (Casey Newton/Platformer)
Source: TechmemePublished: Jun 24, 2026

Casey Newton / Platformer : Q&A with AWS CEO Matt Garman on the parallels between early AWS and AI, Quick, AI coding, Amazon's $200B capex in 2026, entry-level jobs changing, and more —  Matt Garman argues that junior employees are as necessary as ever.  But AWS now sells agents that can recruit, code, and process claims.

Humanoid robot maker Agility plans to go public via a merger with Michael Klein's SPAC, in a deal valuing Agility at ~$2.5B, listing under ticker symbol AGLT (Lauren Thomas/Wall Street Journal)
Source: TechmemePublished: Jun 24, 2026

Lauren Thomas / Wall Street Journal : Humanoid robot maker Agility plans to go public via a merger with Michael Klein's SPAC, in a deal valuing Agility at ~$2.5B, listing under ticker symbol AGLT —  Agility's humanoid robot, Digit, is used by companies including Amazon  —  Agility Robotics, a startup that makes humanlike robots used …

07

STARTUP ARCHIVE

07.00
STARTUP ARCHIVE

Startup News - June 24, 2026

Startup News Roundup: Aggregating key funding and launch updates.

Marc Andreessen on the 5 personality traits of an innovator
Source: StartupPublished: Mar 31, 2026

“When you’re talking about real innovators—people who actually do really creative, breakthrough work—I think you’re talking about a couple things:”

Steve Jobs explains the importance of both thinking and doing
Source: StartupPublished: Mar 30, 2026

“The doers are the major thinkers. The people who really create the things that change this industry are both the thinker-doer in one person.”

Tobi Lutke explains what the VCs who passed on Shopify got wrong
Source: StartupPublished: Mar 27, 2026

“What a lot of free-market thinkers don’t understand is that between the demand and eventual supply lies friction."

Sam Altman explains how he decides to invest in a startup after 10 minutes
Source: StartupPublished: Mar 26, 2026

"Does this person have the potential to be the next Mark Zuckerberg?… [You don’t get to] 100% accuracy, obviously, but it’s good enough that our business model works.”

Jony Ive recounts the time Steve Jobs called him vain
Source: StartupPublished: Mar 25, 2026

In the clip below, Jony Ive recounts the time he asked Steve Jobs to be less harsh in his critique of a piece of work.

Jeff Bezos’s two pieces of advice for aspiring entrepreneurs
Source: StartupPublished: Mar 24, 2026

“The advice that I would give entrepreneurs is don't chase the hot new thing. It's so hard to catch something that everybody already knows is hot."

Elad Gil: “Things that work tend to work pretty fast”
Source: StartupPublished: Mar 23, 2026

“I do think there’s a bit of a myth in Silicon Valley that you should keep grinding no matter what and it’s just about perseverance, and I think that’s really bad advice."

Paul Graham on why starting with a “small, intense fire" is the key to startup growth
Source: StartupPublished: Mar 20, 2026

"You have to know who those first users are and how you're going to get them."

Keith Rabois on how to identify great talent
Source: StartupPublished: Mar 19, 2026

“What you want to do with every single employee every single day is expand the scope of their responsibilities until it breaks… and that’s the role they should stay in.”

Wealthfront CEO on why advertising spend makes it harder to find product/market fit
Source: StartupPublished: Mar 18, 2026

“The way that you know you have product/market fit is if you have exponential organic growth."

Eric Schmidt on why most companies get strategy wrong
Source: StartupPublished: Mar 17, 2026

“Work very, very hard to figure out what the world’s going to look like in five years. What will people be doing? What will your customers want? Where will costs be?"

Mark Zuckerberg: “You can’t 80/20 everything”
Source: StartupPublished: Mar 16, 2026

"There’s the famous 80/20 rule where you get 80% of the benefit by doing 20% of the work, but you can’t just 80/20 everything. There have to be certain things that you are just the best at."

Marc Andreessen on Mark Zuckerberg’s founder “superpower”
Source: StartupPublished: Mar 13, 2026

“A great superpower that Mark Zuckerberg has that is probably not well-understood enough is he does not get emotionally upset in stressful situations"

Sam Altman explains how to come up with a great startup idea
Source: StartupPublished: Mar 12, 2026

"If you start a startup without a good idea… you’ll be under pressure to make something up and it won’t work that well."

Jeff Bezos on the problems with proxies and managing to metrics
Source: StartupPublished: Mar 11, 2026

“One of the things that happens in business is that you develop certain things that you’re managing to—a typical case would be a metric. And that metric isn’t the real underlying thing.”

Airbnb founder Brian Chesky on how to design an amazing user experience
Source: StartupPublished: Mar 10, 2026

“If you can design something really amazing using the hand-crafted part of your brain, then you can reverse-engineer how to industrialize this millions of times over."

Spencer Rascoff: "I will never invest in a consumer startup with paid marketing”
Source: StartupPublished: Mar 9, 2026

"If you’re actually trying to grow a product, the best levers for doing that are often within the product itself.”

Patrick Collison explains why it sometimes make sense to quit
Source: StartupPublished: Mar 6, 2026

“One thing I’ve learned myself the hard way, is that it is easier to tear down a company and restart it in Silicon Valley, than it is to constantly try to pivot or keep something alive."

Jeff Bezos recounts the time he called Amazon’s customer service number mid-meeting to prove a metric was wrong
Source: StartupPublished: Mar 5, 2026

“I have a saying, which is when the data and the anecdotes disagree, the anecdotes are usually right"

Ben Horowitz: “Nobody was born a great manager. It’s a very unnatural job.”
Source: StartupPublished: Mar 4, 2026

“If you can’t build a great product, it doesn’t matter if you can build a great company.”

03

ALSO TODAY

3 MORE SOURCES
08

SOLIDOT

08.00
SOLIDOT

Solidot News - June 24, 2026

Solidot Feed: Highlighting essential tech & open-source news.

幼儿早期的屏幕使用与较差的学习成绩和较弱的工作记忆相关

随着屏幕在幼儿生活中几乎无处不在,一项研究调查了其对学习表现的影响。研究跟踪了 1-8 岁的儿童,发现屏幕观看时间更长与 9 岁时较差的学习表现以及 10.5 岁时较弱的工作记忆存在关联。研究结果表明,屏幕接触的时机可能与屏幕使用的总时长同样重要。WHO 和美国儿科学会建议幼儿在 18–24 个月前不要接触屏幕,2-5 岁儿童每天使用屏幕时间不超过 1 小时。但很多幼儿都超过了这些限制。最新研究追踪了 502 名儿童从婴儿期到童年中期的发育过程,发现在特定发育阶段屏幕观看时间较长的儿童,后期学业表现较差,工作记忆较弱。这种关联在婴儿期和学龄初期最为显著,表明这些阶段可能是认知发展的特别敏感窗口期。在整个童年期屏幕接触总量较高的儿童,学业表现也通常较差。研究结果表明,屏幕使用的时机可能与总暴露量同样重要。研究结果支持“越少越好”的原则,即儿童的屏幕时间越少越好。

欧洲是变暖速度最快的大陆

本周英国、法国、意大利和西班牙都发布了红色高温预警,欧洲正经历五月以来第二波热浪。全球气温比工业化前时期——1850-1900 年——的水平高出约 1.4C,而根据欧盟哥白尼气候变化服务中心的数据,欧洲气温比工业化前水平高出约 2.4C。全球平均气温的持续上升主要是由于燃烧石油、天然气和煤炭产生的温室气体排放,但由于多种因素的共同作用,不同地区的升温幅度不同。陆地升温速度快于海洋,因为水可以吸收更多热量并通过蒸发冷却。哥白尼气候变化服务中心称,大气环流的变化导致欧洲夏季热浪更频繁强度更大。另一个主要原因是地理位置,欧洲与北极相连,北极气温比工业化前水平高出 3.2C。北极地区气温上升的部分原因是反照率。明亮的冰雪会将大部分太阳热量反射回太空,但冰雪融化会露出颜色较深吸收热量的陆地。欧洲冬季降雪频繁的地区,积雪覆盖面积正在减少,露出了深色的陆地。

伊朗断网期间仅约 2000 个 IP 能访问外网

伊朗今年早些时候全国范围断网,持续数月之久。在断网期间,伊朗实施了白名单制度,也就是只有处于白名单内的极少数 IP 地址才能访问外网。研究人员利用位于伊朗境内的一台 VPS 以及位于匈牙利、美国以及日本的 VPS,根据伊朗自治系统通过 BGP 宣布的 IP 段总数约 11,766,454 个 IP 地址,伪造这些 IP 地址进行穷举,观察哪些 IP 能访问外网。结果显示,能访问外网的 IP 大约有 2000 个。研究人员还发现,即使这些 IP 能访问外网,它们也不能随意访问任何网站,而是受到了基于 SNI 的过滤机制的约束。但白名单 IP 地址也不是所有都受到 SNI 过滤,测试的 IP 至少有半数不受任何 SNI 过滤。这意味着白名单 IP 也存在不同的访问策略。

阿里巴巴起诉美国国防部

阿里巴巴及其美国子公司共同以美国国防部为被告,向加州联邦地区法院递交诉状,请求法院宣告美国国防部 6 月 8 日公布的对阿里巴巴的认定决定无效。阿里巴巴在美国设有分支机构,在美国开展电商与云计算业务。即便被列入该名单,企业理论上仍可同美国企业开展合作,但美国国防部有权对与名单内中企合作的美国企业采取解约等限制措施。阿里巴巴在诉状中主张,此次认定缺乏事实依据、相关流程不合规。该认定致使公司无法继续聘用游说机构等,自身合法权益遭受侵害,此举同时违反美国宪法。

心率同步程度可判断社交投入程度

根据发表在 PNAS Nexus 期刊上的一项研究,心率同步程度可判断社交投入程度。当人与人在身体与情感层面彼此亲近时,双方心率会逐渐同步。研究团队依托 72 名学生参与音频工程竞赛、赴纽约市期间采集的数据集开展研究。学生借助可收录环境噪音的助听器、监测心率的手环以及记录定位信息的手机采集各类数据。研究规定,人与人相距 20 米以内即为物理近距离接触。受试者共处时心率同步性更强,近距离互动、共同关注同一外界刺激(例如一同听课)时同步效应尤为明显。出行前便彼此熟悉的受试者,心率同步水平显著更高。

GCC 编译器加入对海光苏州 x86 CPU 的支持

GCC 编译器合并了支持海光代号苏州的 Model 8 c86-4g-m8 处理器的补丁。海光最早是与 AMD 合作的半导体企业,授权提供 AMD Zen 1 CPU的本地化版本,其产品仅供国内市场使用。几个月前 GCC 编译器合并了支持海光 C86-4G CPU 的补丁。Model 8 苏州 CPU 是上一代 Model 7 成都 CPU 的继任者,目前关于该处理器的信息很少,其指令集架构与上一代相差无几,支持包括 AVX-512 在内的指令集。

德国铁路因 IT 故障而停运

德国铁路网络周二晚上因 IT 故障而全国停运。凌晨一点国家铁路运营商 Deutsche Bahn 宣布问题已经解决,服务正在逐步恢复。铁路公司称问题是铁路网络内部通信使用的 GSM-R 数字通信系统出现全国故障导致的,它表示已查明原因但未具体说明。铁路公司在故障期间向乘客发放了出租车和酒店代金券,并在条件允许下,在车站提供可供旅客乘坐的列车。该公司就此次事故表示歉意。

计划在伦敦举行的极端高温会议因极端高温预警取消

原计划本周在伦敦举行的极端高温会议《Extreme Heat: Improving governance and strengthening action around the world》因英国气象局宣布的极端高温红色预警而取消。根据气象局发布的罕见高温红色预警,伦敦、英格兰中部部分地区、威尔士东南部和英格兰南部受到影响,时间从周三 09:00 BST 持续到周四 21:00 BST ,气象局警告高温可能会有重病或死亡风险。预计英格兰南部气温将升至 37-38 摄氏度左右,周三最高气温甚至可能达到 39 摄氏度。

Valve 称它无法与内存厂商沟通报价

Valve 宣布了起售价逾一千美元的 Steam Machine,它表示这一定价反映了过去 6 个月确保能获得的内存和存储组件的价格。DDR5 和 SSD 过去半年的价格上涨了数倍之多。Valve 在接受采访时表示,他们在采购内存时根本无法选择,只能接受厂商的报价,想要协商根本不可能,协商价格的结果会是完全断货。一位 Valve 员工说,内存厂商“每个月都给我们报个价,说‘你们可以买这么多’,只有答应或拒绝两种选择。如果我们拒绝,他们就再也不理我们了。”Steam Machine 的内存配置有两种:其一是两条 8GB DDR5 内存条,其二是单条 16GB DDR5 内存条,Valve 称它的测试显示两种配置性能相差无几。

高温干旱高 CO2 下大豆蛋白质含量会下降

大豆是重要的蛋白质来源,但气候变化正日益影响其产量和营养品质。根据发表在《Food Research International》上的一项研究,高浓度二氧化碳会使大豆种子产量增加最高 142%,而高温和干旱则分别会使产量降低 91% 和 60%。在高浓度二氧化碳+高温+干旱三重效应下,大豆种子产量可能会增加 50%,可溶性糖含量增加 35%,氨基酸含量增加 175%,同时淀粉含量降低 20%,蛋白质含量降低 6%。

中国新超算灵晟登顶 Top500 榜单

Top500 公布了最新的超算榜单,深圳国家超算中心的灵晟首次亮相即登顶榜单。灵晟理论峰值 2.736 Exaflop/s,在 HPL 测试中达到了 2.198 Exaflop/s,是 Top500 榜单中首个仅靠 CPU 实现持续双精度浮点性能逾 2 Exaflops 的超算系统。灵晟使用了 304 个核心的 LX2 CPU,总共 1379 万个核心,运行频率 1.55 GHz,操作系统是麒麟,功耗为 42.2 兆瓦。榜单前五的超算性能都超过了 Exaflops:灵晟;美国劳伦斯利弗莫尔国家实验室的 El Capitan,使用 AMD 第四代 EPYC 处理器,性能 1.809 Exaflop/s;橡树岭国家实验室(ORNL)的 Frontier,使用 AMD 第三代 EPYC,性能 1.353 Exaflop/s;阿贡国家实验室 Aurora 使用英特尔 Xeon CPU,性能 1.012 Exaflop/s,德国 Jülich 超算中心的 JUPITER Booster,使用英伟达 GH Superchip 72C 3GHz,性能 1 Exaflop/s。之后还有意大利 HPC7,微软 Microsoft Azure 超算 Eagle,意大利 HPC6,日本超算富岳(Fugaku),瑞士 Alps。排名前十的超算有四台使用了 AMD EPYC 处理器,两台英伟达处理器,两台英特尔处理器,灵晟的 CPU 架构没有说明。在 Top 500 中,美国有 162 台,日本 44 台,德国 41 台,中国 30 台;联想制造的超算最多有 129 台,其次是 HPE 的 124 台,BULL 的 58 台,戴尔的 49 台,英伟达的 37 台。

甲骨文过去一年裁员 2.1 万

根据甲骨文的最新年报,该公司过去一年在全球裁员约 2.1 万人,原因是它正围绕 AI 重塑业务。截至 2026 年 5 月 31 日,甲骨文全职员工总数约 14.1 万人,而去年同期为 16.2 万人。甲骨文在其报告中称,AI 技术在运营中的部署已经导致且可能继续导致员工总数减少。裁员人数约占甲骨文员工总数的 13%。就业追踪公司估计,过去一年中有逾 10 万科技从业者被裁员。甲骨文称,过去一年它支付了 18 亿美元的遣散费和其它重组费用。

维基百科联合创始人 Larry Sanger 被封禁

拥抱保守派、支持 MAGA 的维基百科联合创始人 Larry Sanger 再次现身维基百科,理由是帮助维基百科进行改革——aka 将其从自由派手中夺回来。他发起了“WikiProject Intellectual Diversity”提案,想要增加更多保守派的声音。他通过其社交媒体账号宣传该提案,违反了维基百科关于“隐蔽拉票(Stealth canvassing)”的政策,他在维基社区引发了争议,最终被封禁。

当代年轻人生物衰老速度更快

华盛顿大学医学院 Yin Cao 博士领导的团队分析了英国生物银行 (UK Biobank) 的超过 15.4 万名参与者的数据,以及美国 NIH All of Us Research Program 项目逾万名参与者的数据,评估了他们的系统性衰老和器官衰老。研究人员发现,1965-1974 年出生的英国人相比 1950-1954 年出生的英国人,在排除实际年龄的影响后,前者的生物衰老速度更快,这一差异具有统计上的显著性,达到了 0.23 个标准差。美国的数据也出现类似的模式:相比 1965-1969 年出生的美国人,1990-1999 年出生人群的生物衰老速度更快,统计显著性达到了 0.92 个标准差。年轻人群的生物衰老速度加速与早发性癌症风险增加相关。

野狼重返欧洲

去年夏天,一位女士带着两幼儿在荷兰 Utrecht 附近的天然公园散步,她看到一只体型较大的动物猛冲过来,她起初以为是一只顽皮的狗,但很快听到 6 岁大儿子发出尖叫,这只动物正将他拖进树林。附近两位恰好路过的成年人用棍子赶跑了它。袭击男孩的不是狗,而是一只狼。狼群数量在欧洲多地激增,引发了如何处理野狼的激烈争论。得益于严格的法律保护,灰狼(Canis lupus)的数量自 2000 年以来大幅增长,但袭击牲畜和袭人事件也日益频发。欧盟委员会去年放宽规定允许更多捕杀野狼,科学家对此表示反对,认为基因证据表明狼群数量并不像表面看起来那么庞大,认为用电围栏和护卫犬保护牲畜比捕杀更有效。科学家估计目前欧盟成员国境内共有约 23,000 只狼,相比下 2012 年只有约 12,000 只。

星际彗星 3I/Atlas 可能是太阳系最古老的天体

目前正横穿太阳系的星际彗星 3I/Atlas 可能是太阳系至今发现的最古老天体。它形成于 120 亿年前。借助 NASA 韦伯望远镜(JWST),研究团队精确测定了这颗彗星的化学组分,判定它诞生于宇宙早期银河系的一片恒星形成区。该发现让人类得以窥见其他行星系统的构成,并对比其与太阳系的差异。受阳光加热后,3I/Atlas 向外喷发水蒸气、一氧化碳、二氧化碳,甚至镍、铁等金属蒸气。有两个同位素特征彻底暴露了它的古老身世,同位素即质子数相同、中子数不同的同种元素原子。第一,这颗彗星的碳12与碳13比值远高于太阳系内所有天体。宇宙中,大质量恒星剧烈爆发会持续累积碳13。3I/Atlas 的碳13含量极低,说明它诞生于宇宙早期,彼时大量恒星尚未演化到发生超新星爆发的阶段。第二,这颗彗星富含半重水,即水分子中的部分氢原子多携带一个中子。这类水分子更容易在早期宇宙低温大质量恒星形成区普遍存在的强辐射环境中生成。

DDR2 和 DDR3 内存的价格出现上涨

过去几个月,由于 AI 热导致的内存短缺,DDR4 和 DDR5 内存条价格都出现了数倍的增长。由于 DDR4 和 DDR5 内存成本过高,部分硬件制造商开始降低内存规格,转向更古老的内存条,结果推动了 DDR2 和 DDR3 内存的价格出现了上涨。市场观察机构 TrendForce 称,硬件制造商为控制成本用 DDR3 方案取代了 DDR4,或用基于 DDR2 的设计取代 DDR3。机构预测 2026 年第二季度 DDR2 合约价格将上涨约 55% 至 60%,第三季度还将进一步上涨 35% 至 40%。而 DDR 2 的制造商表示它们正将产能转移到利润更高的产品如 DDR3、DDR4 和 LPDDR4。

在敏感信息泄漏后 Meta 暂停内部 AI 训练项目

在敏感信息泄漏后 Meta 暂停了内部的 AI 训练项目。泄密事件暴露了员工的私人对话、绩效数据和转录文本。Meta 发言人证实了此事,表示公司正在调查,称目前没有迹象表明 Meta 员工不当访问了任何数据。Meta 公司是在今年 4 月宣布了名为 Model Capability Initiative 的 AI 训练计划,旨在利用员工的按键和鼠标移动作为训练数据,以改进公司的 AI 模型。该计划对大多数员工强制执行,但引发了部分员工的强烈反对,他们对自己的数据被记录感到不安。最新的泄密事件令 Meta 内部员工感到沮丧,他们批评公司从一开始就没有对数据进行安全防护。

警长利用 Flock 车牌跟踪系统跟踪前女友

54 岁的伊利诺伊州 Holiday Hills 警长 William C. Copp 于 6 月 18 日被捕,他被控了两项渎职罪。检方指控他利用 Flock Group 公司的车牌跟踪系统跟踪了六名他认识的人,其中三人是其前女友,他特别跟踪了一名前女友的前男友——在数月内查询了至少 140 次,这名男子为此申请了禁止接触令。Institute for Justice 的统计显示,截至 2026 年 6 月全美至少发生了 18 起警察利用 Flock 车牌跟踪系统跟踪熟人的案件。举例来说,爱达荷州 Jerome 县的一名警长在三个月内查询了其妻子车牌逾 700 次;堪萨斯州 Sedgwick 前警长对其前女友的车牌进行了 164 次查询,对前女友现任男友的车牌进行了 64 次查询;密尔沃基一名警官追踪其伴侣及其前任逾 100 次...Flock 的数据库查询不需要搜查令,该公司声称要求搜索令可能会在紧急情况下危及生命。ACLU、EFF 以及 Institute for Justice 等都坚持查询车牌需要搜查令。

Steam Machine 起售价 1049 美元

Valve 正式公布了其游戏机 Steam Machine 的售价,在 AI 热导致内存和 SSD 短缺的情况下,Steam Machine 的价格也涨到了对大多数人缺乏吸引力的程度:Steam Machine 512GB 1,049 美元,Steam Machine 512GB + Steam Controller 套装 1,128 美元,Steam Machine 2TB 1,349 美元,Steam Machine 2TB + Steam Controller 套装 1,428 美元。Valve 解释说,硬件的价格直接取决于组件的成本,在 2023 年开始为 Steam Machine 采购组件时,按照以前的趋势组件的价格会随时间而降低。然而过去大概一年的时间里,情况发生了快速而显著的变化,最明显的就是内存及存储组件的变化,这最终导致了当初为 Steam Machine 制定的目标定价不再可行。因此今天公布的价格反映了全球制造业的现状,或者更准确地说,反映了过去 6 个月里确保能获得的组件的价格。为避免有限库存被机器人程序抢先订购,Valve 宣布将对预订进行随机排序,它将于 6 月 29 日发布第一批产品,并会在有货时继续按顺序处理队列中的预订。

09

APP STORE RANK

09.00
APP STORE RANK
Loading…