TEXT VIEW · TODAY'S DIGEST · 36 HEADLINES ACROSS 8 SOURCES

Startup Archive(0)

No items yet for today.

App Store Rankings(0)

No items yet for today.

ISSUE 0899
WED, JUN 17, 2026
OrangeBot.AI 智能策划和筛选每日科技趋势和新闻,为您节省时间。
TODAY · WED, JUN 17, 2026

The web,
read by a bot.

Ten sources — Hacker News, Product Hunt, HuggingFace, Techmeme and more — filtered, tagged, and summarized every morning for builders who don’t have time to scroll.

新功能!我们推出了用于保存推文和Reddit帖子的Chrome扩展程序。点击安装!
01

AI DIGEST

UPDATED DAILY · EDITOR'S PICK
01.00
AI DIGEST

AI新闻摘要

June 17, 2026

Here is a summary of today's main news events.

Federal Reserve Prepares for Interest Rate Announcement

The U.S. Federal Reserve is set to announce its first interest rate decision under new Chairman Kevin Warsh today. While rates are widely expected to remain unchanged, markets are anticipating future hikes this year, causing Treasury yields to rise and U.S. stocks to gain in anticipation.

SpaceX Acquires AI Firm Cursor in a $60 Billion Deal

Fresh off a historic IPO that briefly saw its market value surpass Amazon's, SpaceX has announced it is buying the AI coding agent Cursor for $60 billion. The move signals the newly public company's major push into artificial intelligence to expand its enterprise customer base.

U.S.-Iran Peace Deal Details Emerge, Impacting Markets

Details are being released about an interim peace agreement between the U.S. and Iran, which is scheduled to be signed on Friday. The news is causing ripples in global markets, stabilizing oil prices and contributing to a decline in U.S. Treasury yields, though the deal is reportedly facing political backlash in Washington.

Italian Bank UniCredit Advances on Takeover of German Rival Commerzbank

Italy's UniCredit is reportedly nearing a deal to acquire Germany's Commerzbank. The successful takeover would represent a landmark merger in the European banking sector, creating a new pan-European financial giant.

Brokerage Firm Robinhood Announces Job Cuts

The financial brokerage Robinhood announced today that it is cutting its workforce, eliminating approximately 290 positions as part of a corporate restructuring.

02

ON THE WIRE

6 SOURCES
02

HACKER NEWS

02.00
HACKER NEWS

Hacker News - June 17, 2026

Hacker News Feed: Highlighting key posts and discussions.

U.S. Science Is in Chaos

(www.scientificamerican.com)

198229
Calvin and Hobbes and the price of integrity

(therepublicofletters.substack.com)

490210
Stop Using JWTs

(gist.github.com)

444258
Is Meta destroying its engineering organization?

(newsletter.pragmaticengineer.com)

608555
But yak shaving is fun (2019)

(parksb.github.io)

28185
SpaceX Is Buying Cursor

(www.bbc.com)

2893
Mechanical Watch (2022)

(ciechanow.ski)

711122
03

HUGGINGFACE

03.00
HUGGINGFACE

HuggingFace 新闻 - June 17, 2026

HuggingFace Feed:最新的 AI 模型、数据集和社区动态。

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Looped Transformers scale latent computation by repeatedly applying shared blocks, but sequential looping increases latency and KV-cache memory with the loop count. Parallel loop Transformers (PLT) alleviate this cost through cross-loop position offsets (CLP) and shared-KV gated sliding-window attention, making loop count a practical design choice. We therefore study PLT loop-count selection through a gain--cost view: an extra loop may refine representations, but CLP also introduces a positional mismatch at each loop boundary. We instantiate this study by training LoopCoder-v2, a family of 7B PLT coders with different loop counts, from scratch on 18T tokens, followed by matched instruction tuning and evaluation. Empirically, the two-loop variant delivers broad gains over the non-looped baseline across code generation, code reasoning, agentic software engineering, and tool-use benchmarks, improving SWE-bench Verified from 43.0 to 64.4 points and Multi-SWE from 14.0 to 31.0 points. In contrast, variants with three or more loops regress, revealing a strongly non-monotonic loop-count effect. Our diagnostics show that loop 2 provides the main productive refinement, while later loops yield diminishing, oscillatory updates and reduced representational diversity. Because the CLP-induced mismatch remains roughly fixed as refinement gains shrink, the offset cost increasingly dominates. This gain--cost trade-off explains PLT's saturation at two loops and provides diagnostics for loop-count selection.

88
ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining

Vision-Language-Action (VLA) models benefit from large-scale and diverse embodied data, yet scaling robot trajectory collection is costly and labor-intensive. Recent advances show that large-scale egocentric human videos provide complementary real-world supervision in pretraining. However, joint training on human and robot data remains challenging due to divergences in action spaces, embodiment structures, temporal dynamics, and supervision quality. We introduce ACE-EGO-0, a unified VLA pretraining framework jointly leveraging heterogeneous data sources. To extract large-scale pretraining supervision from egocentric human videos, we build a scalable egocentric video-to-action pipeline that converts raw human videos into robot-format pseudo-action trajectories. To make these labels comparable with robot demonstrations, ACE-EGO-0 uses a unified action representation based on camera-space actions, morphology conditioning, and time-aligned action chunking. To robustly leverage noisy pseudo-action supervision from egocentric human videos, we formulate a reliability-aware training objective with a human auxiliary loss that concentrates supervision on reliable signals. We instantiate ACE-EGO-0 on 4.53K hours of robot and simulation data, together with 1.48K hours of pseudo-action-labeled egocentric human data. Experiments show that incorporating large-scale human supervision under reliability-aware weighting consistently improves both unified joint pretraining and supervised fine-tuning. ACE-EGO-0 achieves state-of-the-art performance on RoboCasa GR1 TableTop and RoboTwin 2.0, while demonstrating strong transfer to real-world bimanual manipulation.

38
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Knowledge distillation transfers a teacher's competence to a small student but is brittle in the small-student regime: forcing the student to imitate logits from a much larger teacher concentrates it on the teacher's sharpest modes, hurting generalization on benchmark families beyond the training corpus. Reinforcement learning (RL) avoids logit imitation by training on the student's own rollouts. However, on questions where every rollout fails-yielding zero advantage and being silently discarded-injecting a stronger teacher's response into the policy gradient breaks the on-policy assumption and induces drift. We introduce Zone of Proximal Policy Optimization (ZPPO), inspired by Vygotsky's zone of proximal development, which keeps the teacher inside the prompt rather than the policy gradient. On hard questions, ZPPO constructs two reformulated prompts: a Binary Candidate-included Question (BCQ) pairs one correct teacher response with one incorrect student response as anonymized candidates the student must discriminate, and a Negative Candidate-included Question (NCQ) aggregates the student's wrong rollouts into a single prompt to surface their shared failure modes. A prompt replay buffer recirculates each hard question until it either graduates-the student's mean rollout accuracy on it reaches half- or is FIFO-evicted under finite capacity, amplifying BCQ and NCQ inside the student's current zone of proximal development. On the Qwen3.5 family at four student scales (0.8B-9B) with a 27B teacher, post-trained as vision-language models and evaluated on a 31-benchmark suite (16 VLM, 10 LLM, 5 Video), ZPPO outperforms off/on-policy distillation and GRPO, with the largest gains at the smallest scale.

38
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Game generation is an emerging application of coding agents, requiring models to transform natural-language specifications into playable interactive systems. Unlike traditional coding tasks, game generation takes place within a game engine, where scripts, scenes, assets, rendering, and runtime interactions must jointly produce coherent gameplay. We formalize end-to-end game generation as the problem of producing a complete game artifact that realizes a specification through observable player-game interaction in a target environment. We argue that evaluating this setting requires three desiderata: Engine Grounding, Artifact Completeness, and Interactive Verification. We propose an interaction-grounded evaluation framework that assesses executable gameplay through replayed demonstrations and rubric-guided multimodal judging. We instantiate this framework as GameCraft-Bench, a benchmark comprising 140 Godot tasks across 15 game families. Evaluations of frontier coding agents show that end-to-end game generation remains highly challenging: the strongest agent achieves only 41.46%, and most agents score below 40%. Further analysis reveals that while agents often implement recognizable mechanics, they struggle to deliver complete games with sufficient content, functional visual feedback, and coherent presentation. See https://tongxuluo.github.io/gamecraft-bench-website for demos, code, and data.

35
LectūraAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning and Embodied Teaching

Effective personalized AI-assisted learning demands systems that can not only generate accurate learner-specific educational materials, but also dynamically adapt their instruction to diverse learners. However, existing educational agents have primarily focused on lecture content automation and simulations, which often fall short of modelling multimodal and embodied instructional methods tailored for the individual learner. To this end, we propose LectūraAgents - a multi-agent framework that enables personalized learning through end-to-end adaptive embodied teaching. At its core, LectūraAgents mirrors a professor-student relationship, in which a ProfessorAgent leads a collaborative team of specialized subordinate agents through research, planning, review, and embodied delivery of lecture contents that adapt to a learner's needs. The framework offers three main contributions: (1) a hierarchical multi-agent architecture for end-to-end personalized learning; (2) an adaptive embodied teaching mechanism, wherein the ProfessorAgent executes visible and pedagogically motivated teaching actions (e.g., handwrite, highlight, underline, etc.) over contents in a teaching environment; and (3) a Teaching Action-Speech Alignment (TASA) algorithm that employs salience-based heuristics and temporal semantic segmentation to generate coherent teaching action sequences aligned with learner profiles. We evaluate LectūraAgents on diverse courses at high school, undergraduate, and graduate levels using sample-specific rubric-based analysis; with generated lecture materials and teaching actions assessed and validated by expert educators. Experimental results show consistent gains in lecture content quality, embodied teaching quality, assessment, and personalization over existing approaches, positioning LectūraAgents as a pedagogically well-grounded framework for personalized learning at scale.

30
TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs

Clinical early warning systems built on electronic health records, in which clinical observations are recorded as irregularly sampled medical time series (ISMTS), must deliver both calibrated risk scores for patient triage and interpretable rationales that clinicians can verify. Large Language Models (LLMs) have been explored for this task, yet they collapse graded clinical risk into overconfident binary predictions. This risk polarization undermines both calibration and cross-patient comparability. To address this, we propose TRIAGE, a framework that trains an LLM to generate dialectical reasoning over competing clinical outcomes by eliciting outcome-specific rationales. This dialectical formulation mitigates risk polarization, enabling a single LLM to yield continuous risk scores grounded in explicit clinical reasoning. Evaluated on three ISMTS benchmarks, TRIAGE achieves an average AUPRC improvement of 3.3% and reduces calibration error by 81% compared to the competitive baselines. An LLM-as-a-judge assessment further shows that our rationales surpass post-hoc explanations from the baseline by 20% in clinical reasoning quality. The source code is available at https://github.com/HyeongWon-Jang/TRIAGE .

25
Learning from the Self-future: On-policy Self-distillation for dLLMs

On-policy self-distillation (OPSD) has proven effective for post-training large language models (LLMs), yet its application to diffusion LLMs (dLLMs) remains unexplored. Existing OPSD methods are inherently autoregressive-centric. They inject privileged information via left-to-right prefix conditioning with token-level divergence supervision, a design that fundamentally conflicts with the arbitraryorder generation of dLLMs. We introduce d-OPSD, the first OPSD framework tailored for dLLMs. Our approach makes two core contributions. First, we reframe self-teacher construction by using self-generated answers as suffix conditioning, enabling the student model to learn from "self future-experience" rather than privileged prefixes. Second, we shift supervision from token-level to step-level, aligning training with the iterative denoising process of dLLMs. Experiments across four reasoning benchmarks show that d-OPSD consistently outperforms RLVR and SFT baselines with superior sample efficiency, requiring only around 10% of the optimization steps by RLVR and opening a promising pathway for dLLM posttraining. The code is available at https://github.com/xingzhejun/d-OPSD.

21
OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation

Memory has become a standard substrate for self-evolving agents, yet retaining experience is not the same as learning how to evolve through it. Existing memory agents can store trajectories, retrieve reflections, or accumulate skills, but often lack the holistic competence to select useful experience, act on it, write reusable knowledge, and maintain a growing repository. We introduce OPD-Evolver, a slow-fast co-evolution framework that cultivates such an agent evolver through on-policy self-distillation. In the fast loop, OPD-Evolver interacts with a four-level memory hierarchy to read, use, write, and maintain experience for rapid test-time evolution. In the slow loop, outcome-calibrated memory attribution and privileged hindsight distill these four abilities into the deployable policy. Across multi-domain benchmarks, OPD-Evolver surpasses memory systems such as ReasoningBank by up to 11.5%, and training-based methods such as Skill0 by ~5.8%. Further analysis shows that OPD-Evolver internalizes high-value experience and memory management, enabling OPD-Evolver-9B to challenge giant counterparts such as Qwen3.5-397B-A17B and Step-3.5-Flash, pointing beyond memory-augmented agents toward genuinely qualified agent evolvers.

20
Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Pixel-space diffusion models are trained on full-bandwidth noisy images, yet the useful signal available to the denoiser is strongly frequency dependent. Under rectified-flow diffusion and natural-image power-law spectra, the per-band data-to-noise contour k^{*}(t) = (1-t)^{-2/α} separates a signal-bearing low-frequency region from a noise-dominated high-frequency region at each time t. We show that this implicit coarse-to-fine structure is not merely descriptive: it induces a capacity-allocation problem. A standard pixel-space denoiser must discover the moving bandwidth boundary internally and can spend computation on frequency-time regions where the optimal prediction collapses to deterministic baselines rather than data-distribution modeling. To make this boundary explicit, we introduce Spectral Forcing, a parameter-free, time-conditional 2D-DCT low-pass operator applied to the noisy input before the patch embedder. Its cutoff expands monotonically with the diffusion time and becomes the identity at the data endpoint. Through controlled synthetic experiments, we identify the regime in which the operator is beneficial: coarse patch tokenization and data whose high-frequency content is predominantly noise rather than essential signal. On ImageNet-256 with JiT-700M/32, Spectral Forcing consistently improves both FID and Inception Score across different training epochs, demonstrating robust gains throughout training; at finer tokenization, the spectral forcing is still competitive. We further insert the unchanged operator into SenseNova-U1, a unified text-to-image model, where it improves DPG-Bench and GenEval, showing that the input-side spectral prior transfers beyond class-conditional generation. These results suggest a route to capacity-efficient pixel-space diffusion by showing the signal and hiding the noise.

15
Rethinking the Role of Efficient Attention in Hybrid Architectures

Modern language models increasingly adopt hybrid architectures that combine full attention with efficient attention modules, such as sliding-window attention (SWA) and recurrent sequence mixers. However, how these efficient modules shape model capabilities remains poorly understood. To address this gap, we conduct a systematic analysis across hybrid architectures from three perspectives: scaling behavior, mechanism analysis, and architecture design. First, from a scaling perspective, we find that efficient-attention design primarily affects how fast long-context capability emerges, while different hybrids eventually converge to comparable long-context performance under sufficient training. Second, mechanistically, we show that long-range retrieval is mainly carried by full attention, whereas efficient attention shapes its optimization trajectory. This explains a counter-intuitive phenomenon we call Large-Window Laziness: larger SWA windows can delay the formation of retrieval heads in full-attention layers. Third, guided by this mechanism, we show that applying NoPE to only the full-attention layers of a small-window SWA hybrid substantially improves long-context performance with negligible impact on short-context performance.

8
Text-Vision Co-Instructed Image Editing

Existing image editing methods can be generally categorized into textual instruction-based and visual prompt-based ones. Textual instructions are semantically expressive, but are limited by the coarse granularity of spatial control of the editing results. In contrast, visual prompts such as drag and point can provide precise spatial guidance, but are limited by the inherent ambiguity in semantic intent. To unify the strength of textual and visual prompts, we present Text-Vision Co-Instructed Image Editing, which jointly models textual instructions as semantic intent and sparse visual instructions as spatial guidance, aiming to achieve precise and intent-faithful image manipulation. To this end, we first construct a textual-visual instruction paired dataset with more than 23K samples derived from dynamic videos, enabling aligned supervision for cross-modal instruction. We then propose TV-Edit, a Textual-Visual instruction unified Editing framework to contextualize drag or point-based visual instructions with image-text semantics and lift them into semantic-aware control representations for pretrained editing backbones. By integrating semantic intent and spatial constraints, TV-Edit leads to more precise spatial control, less instruction ambiguity, and stronger structural consistency than text-only or drag-based alternatives. Finally, we establish TV-Edit-Bench, a deliberately designed benchmark to evaluate semantic faithfulness, spatial alignment, and visual consistency with ground-truth references and controlled textual-visual variations for reliable assessment. Our experiments across multiple editing backbones demonstrate that TV-Edit consistently yields more precise and intent-faithful edits, significantly outperforming state-of-the-art instruction-based and drag-based baselines.

7
ChLogic: Evaluating Robustness of Logical Reasoning in Chinese Expressions

Large language models perform increasingly well on standardized logical reasoning benchmarks, but whether this ability remains robust beyond English is unclear. We introduce ChLogic, an English--Chinese aligned benchmark that tests whether models preserve logical reasoning performance when the same latent logical structure is expressed in English and diverse Chinese surface realizations. Built from formal logical templates, the benchmark contains three data sets: (i) the General aligned set, derived from 60 General Propositions across nine template families; (ii) the Difficult aligned set, derived from 40 Difficult Problems; and (iii) the Chinese-only set, covering 15 language-specific phenomenon types. Each aligned item pairs one English reference expression with five Chinese realizations. Experiments on Qwen3, Ministral, and GLM models reveal a persistent English--Chinese performance gap. Back-translation from standard Chinese into English often improves performance on the General aligned set, but produces mixed effects on the Difficult aligned set, where Qwen3-32B and GLM-5.1 perform worse after translation. These results indicate that Chinese surface realization, translation artifacts, and model-specific behavior jointly affect multilingual logical reasoning. Overall, ChLogic provides a useful stress test for the robustness of multilingual reasoning.

6
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization

Reinforcement learning with verifiable rewards (RLVR) improves language-model reasoning, but GRPO-style optimization remains prone to collapse. We analyse this instability through token-level gradient dynamics, deriving a taxonomy that predicts how updates affect next-token probabilities and entropy. The taxonomy shows that stability depends jointly on the advantage sign and token distribution under the current policy. Motivated by this finding, we propose Winner Advantage Policy Optimization (WAPO), a simple online clipped policy-gradient objective that updates only on positive-advantage completions. Across mathematical reasoning and multi-hop QA benchmarks, WAPO improves training stability and matches or outperforms baselines across multiple model families. Full code can be found at https://github.com/layer6ai-labs/wapo.

6
Looped World Models

Current world models face a fundamental tension: faithful long-horizon simulation demands deep computation, but deeper models are expensive to deploy and prone to compounding errors. We resolve this by introducing Looped World Models (LoopWM), which are the first looped architectures for world modelling. Our method iteratively refines latent environment states through a parameter-shared transformer block. This yield up to 100x parameter efficiency over conventional approaches with adaptive computation that automatically scales depth to match the complexity of each prediction step. Orthogonal to scaling model size and training data, LoopWM establishes iterative latent depth as a new scaling axis for world simulation, which might significantly push the community forward.

5
Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

Unified Multimodal Modeling aims to integrate visual understanding and generation within a single system. However, existing approaches typically rely on two disparate visual tokenizers, which splits the representation space and hinders truly unified modeling. We propose UniAR, a unified autoregressive framework where a single discrete visual tokenizer serves as the key bridge between understanding and generation, enabling a shared context in which the model can directly interpret its own generated visual tokens without additional re-encoding. UniAR adapts a pretrained vision encoder with multi-level feature fusion and a lookup-free bitwise quantization scheme, preserving both high-level semantics and low-level details while scaling the effective visual vocabulary at minimal cost. Building on this, the unified autoregressive model adopts parallel-bitwise-prediction to jointly predict spatially grouped, multi-level visual codes, substantially reducing visual sequence length and accelerating generation. Finally, a diffusion-based visual decoder operates on discrete visual tokens to decode high-fidelity images. Through large-scale pre-training, followed by supervised fine-tuning and reinforcement learning, UniAR achieves state-of-the-art performance on image generation and image editing while remaining competitive on multimodal understanding benchmarks. The project page is available at https://sharelab-sii.github.io/uniar-web.

5
ProCUA-SFT Technical Report

Training computer-use agents (CUAs) -- models that interact with graphical desktops through screenshots and keyboard/mouse actions -- requires large-scale, diverse trajectory data collected in full desktop environments. The largest public resource, AgentNet (22.5K human trajectories), leads to negative transfer when used for supervised fine-tuning (SFT): continuing training UI-TARS 7B on AgentNet causes OSWorld success rate to fall from 26.3% to 8-10%. We present ProCUA-SFT, a dataset of 3.1M step-level SFT samples distilled from 93K synthetic trajectories across 2,484 application combinations. The dataset is produced by a fully automated pipeline that (i) synthesizes grounded tasks on live desktops seeded with real-world content -- 912 spreadsheets from SpreadsheetBench, approximately 10K permissively-licensed presentations from Zenodo10K, and multi-application OSWorld configs -- and (ii) verifies each task's feasibility through binary precondition checking before rollout. A single VLM (Kimi-K2.5) serves as goal generator, precondition judge, and trajectory executor, eliminating planner-actor capability gaps. Each trajectory is expanded into step-prefix samples that exactly reproduce the context layout seen at inference time. Fine-tuning UI-TARS 7B on ProCUA-SFT for one epoch yields 45.0% on OSWorld -- an 18.7 percentage-point improvement over the base model and over 35% above AgentNet-trained counterparts. A subset of ProCUA was incorporated into the training data for the Nemotron 3 Nano Omni model, contributing to its computer-use capabilities.

4
ActWorld: From Explorable to Interactive World Model via Action-Aware Memory

Interactive world models aim to simulate environment dynamics under real-time user actions. However, their action vocabulary is largely confined to navigation: most actions correspond to motion (e.g., walk, turn, look around), while interaction with objects in the scene (e.g., pick up plates, open doors, or trigger physical responses) is either absent, restricted to game domains, or relegated to prompt-to-full-video scenarios. The resulting worlds are visually explorable but not truly actionable. In this work, we present ActWorld, an interactive world model that extends prior navigation-centric generators to support mid-rollout object interaction within a chunk-autoregressive framework. We argue that the navigation-interaction gap stems from two bottlenecks. First, a data bottleneck: the lack of human-object interaction data with accurate, dense labels. Second, a memory bottleneck: recency-biased history compression in existing world models discards the event-transition frames that causally determine subsequent object states, leading to an action-forgetting pathology. On the data side, we construct a 100K interaction video dataset, each annotated with per-chunk captions via chain-of-thought reasoning. On the model side, we introduce a hierarchical action-aware memory design that routes history compression by interaction importance, complemented by a persistent memory bank that maintains event-update and object-identity tokens across long rollouts. Experiments show that ActWorld supports both flexible navigation and rich object interaction within a single model, substantially improving interaction fidelity over navigation-only baselines without sacrificing viewpoint control. Project page is available at https://interactwm.github.io/ActWorld.

4
Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

Agentic search over large corpora relies on retriever-mediated interfaces (e.g., BM25 or ColBERT) for scalable candidate discovery. While effective at ranking relevant documents, these interfaces expose evidence only as ranked results or bounded document views, limiting agents' ability to reorganize material and verify constraints across documents. Direct Corpus Interaction (DCI) addresses this limitation by exposing shell-executable corpus operations for flexible search, filtering, comparison, and verification. However, full-corpus terminal commands become slow and unstable as the corpus grows, degrading performance and efficiency. We introduce DR-DCI, a retriever-steered DCI framework that treats retrieval as an agent-callable action for expanding a local workspace. Rather than operating directly over the full corpus, the agent dynamically pulls relevant documents into an evolving workspace and conducts DCI operations within it. This design combines retriever-level recall with DCI-style precision: retrieval keeps exploration scalable, while DCI preserves the local operations needed for effective evidence resolution. Experiments show that DR-DCI is both effective and efficient across scales. On Browsecomp-Plus, DR-DCI reaches 71.2\% accuracy, improving over raw DCI and ablated variants by up to 8.3 points while reducing tool usage, wall time, and estimated cost. With workspace-preserving context reset, accuracy further improves to 73.3\%. In corpus-scaling experiments, DR-DCI remains effective from 100K to 10M documents, whereas raw DCI becomes unstable and BM25 performs substantially worse. DR-DCI also scales to a 20M-scale file-per-document Wiki-18 QA setting, achieving an average score of 63.0 across six benchmarks and outperforming retrieval-based and trained search-agent baselines. Ablation analysis further shows that ranked previews and inter-document DCI are key to performance.

3
Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning

Multimodal large language models (MLLMs) have demonstrated impressive capabilities in many visual tasks, but they often struggle with factual grounding when confronted with complex, open-world scenarios. While recent multimodal deep search agents attempt to address this issue by utilizing external tools, the visual-native search paradigm remains underexplored. Existing methods primarily rely on simple images with explicit semantics and text-only evidence trajectories, limiting the agent's ability to perform multi-hop, cross-modal reasoning and search. To address these limitations, we propose Visual-Seeker, a visual-native multimodal deep search agent via active visual reasoning. Rather than treating vision as a static input, our agent actively attends to fine-grained visual details, dynamically harvests visual evidence throughout the search process. To unlock its visual-native potential, we design an active visual reasoning data pipeline and synthesize 5K high-quality multimodal trajectories for model training. Extensive experiments demonstrate the state-of-the-art performance across five challenging multimodal search benchmarks, even surpassing several proprietary models, validating robust visual-native reasoning and search in real-world web environments. The code and data can be accessed at: https://github.com/ZhengboZhang/Visual-Seeker.

3
EgoCS-400K: An Egocentric Gameplay Dataset for World Models

The shift from video generation to interactive world modeling places new demands on data: beyond captioned videos, world models require temporally aligned video-action-language trajectories grounded in the actions, camera motion, states, and events that drive future scene changes. However, such data is difficult to obtain at scale. Web video datasets offer broad visual coverage but lack executable actions and reliable states; robotic datasets provide action and state supervision but are costly and limited in scene diversity; and existing simulators often lack large-scale human-driven interaction trajectories. In this paper, we introduce EgoCS-400K, a large-scale replay-grounded egocentric Counter-Strike dataset for world models, built from public professional CS and CS2 match demos that preserve human gameplay trajectories and enable parsing, replaying, rendering, and temporal alignment. We extract player states, view directions, movements, keyboard/button inputs, view-angle changes, weapon usage, game events, and round-level context, and render clean first-person videos from the same trajectories. EgoCS-400K contains over 400,000 first-person videos and 10,000 hours of gameplay from more than 1,000 matches and 40,000 rounds, covering 13 maps and 10 player viewpoints per round. It supports a range of interactive visual modeling tasks, including action-conditioned future prediction, state- and event-aware scene rollout, replay-grounded captioning, and agent egocentric action understanding. By connecting visual observations with human actions, camera motion, game states, and events at scale, EgoCS-400K serves as a practical bridge between passive web videos, controllable game simulation, and costly real-world embodied data.

3
Variable-Width Transformers

Scaling model size, specifically depth and width, has driven significant progress in transformer-based language models. However, most architectures maintain a constant width across all layers, allocating a fixed parameter and computation budget evenly despite different layers potentially playing distinct computational roles. In this work, we empirically investigate nonuniform capacity allocation across network depth by proposing a times-shaped > <former architecture. This design maintains wider early and late layers while narrowing the middle layers, utilizing a parameter-free residual resizing mechanism. Across decoder-only language models ranging from 200M to 2B parameters (dense) and 3B parameters (MoE), our > <former consistently outperforms parameter-matched uniform baselines on language modeling loss. By reducing the average layer width, this architecture also requires fewer overall FLOPs (22% reduction under fitted loss-matched scaling curves) and smaller KV cache memory and I/O cost (15% reduction). In analysis, we show that this bottleneck structure results in qualitatively different representations in residual streams. Overall, our results demonstrate that nonuniform width allocation can result in more resource-optimal scaling of language models.

3
RepSelect: Robust LLM Unlearning via Representation Selectivity

Making large language models (LLMs) deeply forget specific knowledge and values without sacrificing general capabilities remains a central challenge in unlearning. However, current methods are easily reversed by fine-tuning or few-shot prompting, suggesting their forgetting is only shallow. We identify the root cause. Existing methods target representations shared with both the retain set and the subspace recovered by a fine-tuning attacker, making unlearning both disruptive to general capabilities and easy to reverse. We propose RepSelect (Representation Selectivity), isolates forget-set-specific representations by collapsing top principal components of weight gradients before each update, leaving general capabilities intact while limiting what fine-tuning can recover. We evaluate across two forget categories, biohazardous knowledge and abusive tendencies, and four model families spanning dense and Mixture-of-Experts architectures (Llama 3, Qwen 3.5, Gemma 4 E4B, DeepSeek V2 Lite). Compared to five popular baselines (GradDiff, NPO, SimNPO, RMU, UNDIAL), RepSelect achieves a 4-50x larger reduction in post-relearning answer accuracy than the strongest baseline, and is near-perfectly robust to few-shot prompting attacks. Targeting selective representations is thus an important step towards deep and robust LLM forgetting.

2
Aligning Quantum Operators with Large Language Models

Can Large Language Models (LLMs) understand and reason about quantum operators? Despite their remarkable capabilities in mathematics and symbolic reasoning, LLMs remain inherently blind to quantum representations such as unitary matrices. In this work, we take a step toward bridging this gap by introducing an approach that maps unitary operators into the latent space of an LLM, enabling unified modeling over quantum and linguistic inputs. We instantiate this idea on Clifford+T circuit synthesis over a Pauli rotation gate set, where our model achieves results competitive with state-of-the-art methods and scales consistently with training data, with no signs of saturation. Our approach further enables language-conditioned synthesis, allowing gate constraints unseen during training to be specified directly in natural language. This work suggests a path toward quantum--aware foundation models that can natively interpret and reason about quantum operations, which could have broader implications reaching across quantum compilation and algorithm discovery.

2
RefGC-SR^2: Reference-guided Generated Content Super-Resolution and Refinement

Reference-guided generation (e.g., object compositing, customization) has progressed rapidly, yet current pipelines share a fundamental limitation: the object-centric high-resolution reference image (HRRI) provided by users is downsampled to a fixed low-resolution (LR) before being fed into the model, so the fine-grained details are discarded before the output is even produced. In addition, the generation step then introduces its own artifacts (e.g., identity distortion) on top of this loss. Existing reference-guided generated content refinement (RefGCR) methods can correct some of these artifacts but still operate in the LR domain; reference-guided super-resolution (RefSR) methods recover resolution but assume natural-image degradations and ignore the artifact distribution of generative pipelines. To address both gaps in a single formulation, we introduce a new task: reference-guided generated content super-resolution-refinement (RefGC-SR^2), where the original HRRI is reused at the post-processing stage to recover lost details, refine generative artifacts, and upscale the output simultaneously. We construct the first real-world triplet data generation pipeline for this RefGC-SR^2 task, training a diptych-conditioned generator to synthesize paired low-quality anchors that public pretrained models cannot provide. We further present a frequency-aware diffusion transformer model for RefGC-SR^2 that selectively injects fine details from the HRRI while removing generative artifacts. Extensive experiments demonstrate that our RefGC-SR^2 model successfully (i) refines the object identity faithfully with respect to the reference, and (ii) recovers high-resolution details, so that the final result is significantly higher quality and practically more usable compared to existing RefGCR and RefSR baselines.

1
MotionVLA: Vision-Language-Action Model for Humanoid Motion

Generating realistic humanoid motion from scene images and text involves both low-frequency pose semantics and high-frequency physical dynamics. However, many existing methods tokenize motion with a single shared codebook, forcing heterogeneous motion signals into the same quantization space. Our frequency-domain analysis of human motion data reveals a clear mismatch between single-codebook quantization and motion statistics: five DCT coefficients capture 93% of joint-position energy but only 37% of joint-velocity energy, which can bias quantization toward pose statistics and under-represent high-frequency velocity components. A second challenge lies in adapting a standard autoregressive model to effectively model high-frequency physical signals in motion sequences. Therefore, we propose DSFT, a dual-stream frequency tokenizer that separates motion into Base and physical streams and compresses them independently with DCT truncation and BPE. Furthermore, we present MotionVLA, a Qwen3.5-based model that arranges Base and physical tokens in a unified sequence, where Phys tokens are predicted after Base tokens. Experiments on HumanML3D and MBench show that, despite using a lightweight 2B backbone, MotionVLA reduces the Diversity gap to real data by over 50% on HumanML3D and improves Motion-Condition Consistency by 3.8% on MBench, supporting frequency-aware dual-stream decoupling as an effective formulation for autoregressive motion generation. Code: https://github.com/AIGeeksGroup/MotionVLA. Website: https://aigeeksgroup.github.io/MotionVLA.

1
Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

Deep research agents are increasingly evaluated on their ability to search for evidence, reason over retrieved sources, and produce grounded answers. Existing browsing benchmarks, however, largely assume that the user's query and the supporting evidence are written in the same language, leaving open whether agentic search systems can operate when relevant evidence appears in another language. We introduce XBCP (Cross-lingual BrowseComp-Plus), a controlled benchmark that preserves the English question-and-answer space of BrowseComp-Plus but varies the languages of the supporting documents. XBCP instantiates two complementary settings: in the cross-lingual setting, each query is paired with evidence in a single assigned language. In the multilingual setting, the full evidence corpus is distributed equally and randomly across 12 languages spanning high-resource and low-resource regimes. We evaluate four deep research agents using sparse and dense multilingual retrievers, measuring answer accuracy, evidence recall, search behavior, calibration, citation fidelity, and oracle retrieval. Results reveal substantial degradation when evidence is translated. Even strong, dense retrievers lose evidence recall, and agents become less calibrated and cite evidence less reliably. Notably, accuracy remains lower even when all gold evidence is supplied directly. These findings suggest that cross-lingual deep research exposes both retrieval failures and an independent, agent-side difficulty in integrating language-mismatched evidence.

1
The Price of Anarchy in Disaggregated Inference

Disaggregated inference architectures physically separate prefill and decode phases onto distinct GPU pools, creating competing "agents" that share a fixed hardware budget. We provide, to our knowledge, the first formal game-theoretic analysis of this architecture, using NVIDIA Dynamo as a concrete case study. We model disaggregated serving as three coupled games: a two-player resource game between prefill and decode pools, a selfish caching game over the hierarchical KV cache, and a congestion game with positive externalities for request routing. We empirically validate the latter two; the P/D resource game is treated analytically (Section 9.2). We characterize how GPU saturation induces regime transitions that shift the game's payoff structure: below saturation, selfish behavior has bounded Price of Anarchy (PoA); at saturation, superlinear latency and cache externalities drive our empirical estimator PoA-hat (defined in Section 6.4) upward. Based on this analysis, we design an adaptive controller that detects saturation transitions in real time and adjusts routing parameters accordingly, shifting from cache-affinity exploitation to load-balanced congestion avoidance. We instantiate our framework on a 3-node NVIDIA B200 cluster running Dynamo with two models, Nemotron-4-340B (TP=8, full-node workers with cross-InfiniBand KV transfers) and Llama-3.1-70B (TP=4), and find the same three-regime PoA-hat structure with the same first post-knee grid point (C=128) on both models. Adaptive routing shifts each model to a better operating point. Our strongest result is on the 70B 1P/5D topology, where PoA-hat drops 3.1x (66.4 to 21.5) in the saturated phase at a 13% throughput cost. On the 70B 1P/2D, PoA-hat drops 2.2x and TTFT P99 drops 7.6x (see Section 8.5).

0
05

PRODUCT HUNT

05.00
PRODUCT HUNT

Product Hunt - June 17, 2026

Product Hunt Daily Feed: Featuring noteworthy tech launches.

Typerino icon
Typerino

Screenwriting and playwriting for movies, television, stage.

0
Wolfram Language 15 icon
Wolfram Language 15

Computational language built for humans and AI agents

0
Snapchat SPECS icon
Snapchat SPECS

Powerful computer built into lightweight see-through glasses

0
Quartz icon
Quartz

AI email client built for focus. Runs locally on your Mac

0
ClipDone icon
ClipDone

Automatic short-form video editing

0
Daemons by Charlie Labs icon
Daemons by Charlie Labs

Keep PRs, issues, CI, and docs moving with AI agents

0
ClawEase icon
ClawEase

An AI business operator that books appointments for SMBs.

0
Wilson icon
Wilson

AI coworker in Slack that builds reports, work tools, + more

0
Cilantro icon
Cilantro

Watches your accounts and tells you what changed

0
Tyto by ai-coustics icon
Tyto by ai-coustics

Audio insight that predicts voice AI performance

0
Infinite icon
Infinite

OS runtime unifying GA4, PostHog, + Stripe into a local db

0
Deep Work Plan icon
Deep Work Plan

Models matter. Context matters more. Give your agent a plan.

0
Framer 3.0 icon
Framer 3.0

With Agents, Branching, Community, and an all-new design

0
Mirlo icon
Mirlo

Social media for real connections. No likes, no algorithm.

0
PaneFlow icon
PaneFlow

Let AI agents build real animated slideshows

0
memi icon
memi

The AI agent harness for product design teams

0
Dualora icon
Dualora

Record in both 16:9 and 9:16 at the same time

0
AudienceCue icon
AudienceCue

Turn YouTube comments into cited insights and a report

0
Swytchcode CLI icon
Swytchcode CLI

Give agents reliable access to 2,000+ APIs w/ durable state

0
Android 17 icon
Android 17

Android becomes an intelligence system

0
Tapfree for Chrome icon
Tapfree for Chrome

Voice dictation that adapts to what’s on your screen

0
Dopami icon
Dopami

Household chores without the mental load for ADHD

0
Docfarm icon
Docfarm

Host, share, and track everything your AI builds

0
Locus Founder icon
Locus Founder

Text an AI agent and it builds + runs your business

0
Henji icon
Henji

AI replies that are trained to sound like you

0
Vitrine icon
Vitrine

Turn any photo into a beautiful wallpaper

0
idmly icon
idmly

Turn any HTML/CSS design into an editable InDesign file

0
StickerWords icon
StickerWords

Learn new words from the world around you

0
Redactify icon
Redactify

Automated profanity censoring for video & audio

0
Parano.ai icon
Parano.ai

Never miss a competitor's move

0
Polygram Coding Agent icon
Polygram Coding Agent

AI-native coding assistant that helps developers in any IDE

0
Spanly icon
Spanly

See what AI agents do inside your MCP server

0
Restaurant Menu Visualizer icon
Restaurant Menu Visualizer

Share the menu with Ava and see what each dish may look like

0
NotchSpace icon
NotchSpace

Turn your Mac notch into an intelligent active workspace

0
TapSign icon
TapSign

Send, sign & manage documents easily

0
SuperGoal icon
SuperGoal

World cup in your menu bar

0
MCP 2000 icon
MCP 2000

AI Drum Machine MPC in your browser

0
SolonGate icon
SolonGate

Zero-trust security gateway for AI agents

0
Wario Synth icon
Wario Synth

Transform songs into retro Gameboy-style game console music

0
BrainFlow icon
BrainFlow

Turn your rambling thoughts into coherent notes

0
Tendlet icon
Tendlet

Know, plan, and share care for pets and plants

0
Ledgerly icon
Ledgerly

Available on the App Store

0
FableWatch icon
FableWatch

Know the second Fable 5 is back

0
Notum icon
Notum

AI-powered research and document intelligence for law firms.

0
agentbrowse icon
agentbrowse

Give your AI coding agent the web as a command line

0
Subotiz icon
Subotiz

Smarter Monetization. Subscribe. Sell. Scale.

0
Avocado icon
Avocado

AI-native content operations for any Next.js website

0
Mood icon
Mood

A quiet record of how you feel

0
Human in the Love icon
Human in the Love

The world's first MCP dating app inside Claude.

0
Tadka icon
Tadka

Ship 10x more ad creative, without hiring a design team.

0
06

TECHMEME

06.00
TECHMEME

Techmeme - June 17, 2026

Techmeme Digest: Major tech headlines and industry conversations.

Google's $99 Google Home Speaker, which is built for Gemini for Home and works as a Matter controller, will ship on June 29, nine months after it was announced (Jennifer Pattison Tuohy/The Verge)
Source: TechmemePublished: Jun 17, 2026

Jennifer Pattison Tuohy / The Verge : Google's $99 Google Home Speaker, which is built for Gemini for Home and works as a Matter controller, will ship on June 29, nine months after it was announced —  The Google Home Speaker is designed for Gemini for Home and its more conversational smart home assistant.

UK says reports that Prime Minister Keir Starmer sought a carve-out to US export controls imposed on Mythos 5 are "categorically untrue" (Politico)
Source: TechmemePublished: Jun 17, 2026

Politico : UK says reports that Prime Minister Keir Starmer sought a carve-out to US export controls imposed on Mythos 5 are “categorically untrue” —  EVIAN-LES-BAINS, France — Prime Minister Keir Starmer has not sought a carve-out to U.S. export controls imposed on Anthropic's most advanced AI models …

Andera, which uses AI to automate corporate audit and compliance testing, raised a $37M Series A led by Lightspeed (Ryan Lawler/Axios)
Source: TechmemePublished: Jun 17, 2026

Ryan Lawler / Axios : Andera, which uses AI to automate corporate audit and compliance testing, raised a $37M Series A led by Lightspeed —  Andera, which uses AI to automate corporate audit and compliance testing, raised $37 million in Series A funding, CEO Aryo Patel tells Axios exclusively.

EigenQ, which makes cybersecurity systems to protect from future attacks by quantum computers, plans to go public via a SPAC merger at a $3B valuation (Reuters)
Source: TechmemePublished: Jun 17, 2026

Reuters : EigenQ, which makes cybersecurity systems to protect from future attacks by quantum computers, plans to go public via a SPAC merger at a $3B valuation —  EigenQ will go public through a merger with blank-check company Silicon Valley Acquisition (SVAQ.O) in a deal valuing the quantum tech firm …

Conduct, which uses AI to let companies maintain, change, and modernize their legacy IT systems, raised a $60M Series A, bringing its total funding to $72M (Chris Metinko/Axios)
Source: TechmemePublished: Jun 17, 2026

Chris Metinko / Axios : Conduct, which uses AI to let companies maintain, change, and modernize their legacy IT systems, raised a $60M Series A, bringing its total funding to $72M —  Enterprise IT startup Conduct raised a $60 million Series A co-led by Index Ventures and ICONIQ, CEO Jan Philipp Haas tells Axios Pro exclusively.

Estonia says it will assign personal ID numbers to AI agents to give them "limited, controllable, and auditable authorizations" as they take actions for humans (Ott Tammik/Bloomberg)
Source: TechmemePublished: Jun 17, 2026

Ott Tammik / Bloomberg : Estonia says it will assign personal ID numbers to AI agents to give them “limited, controllable, and auditable authorizations” as they take actions for humans —  Estonia plans to assign personal identification numbers to AI assistants that would give the bots legal rights …

Twenty, which uses AI to help US military hackers penetrate adversary computer networks, raised a $100M Series B led by Accel at a $1B valuation (Colin Demarest/Axios)
Source: TechmemePublished: Jun 17, 2026

Colin Demarest / Axios : Twenty, which uses AI to help US military hackers penetrate adversary computer networks, raised a $100M Series B led by Accel at a $1B valuation —  Twenty, a cyber warfare startup, raised $100 million and is now valued at $1 billion. … - Even rarer is Twenty's unabashed pursuit and advertisement of offensive cyber tools.

Nasdaq Private Market sues Hiive, claiming Hiive stole trade secrets and infringed on its IP, poaching two employees in part to access confidential information (Yazhou Sun/Bloomberg)
Source: TechmemePublished: Jun 17, 2026

Yazhou Sun / Bloomberg : Nasdaq Private Market sues Hiive, claiming Hiive stole trade secrets and infringed on its IP, poaching two employees in part to access confidential information —  Trading platform Nasdaq Private Market alleged that its Vancouver-based competitor Hiive has stolen trade secrets and infringed on its intellectual property in a lawsuit.

OpenAI's CSO Jason Kwon messaged staff that the company "strongly" told the US government that building AI "requires the best talent from around the world" (The Information)
Source: TechmemePublished: Jun 17, 2026

The Information : OpenAI's CSO Jason Kwon messaged staff that the company “strongly” told the US government that building AI “requires the best talent from around the world” —  The U.S. government's latest battle with Anthropic has revived long-simmering concerns throughout the AI industry …

The US government awards $500M under the CHIPS Act to SandboxAQ to use AI to develop new chemicals and materials for domestic semiconductor manufacturing (Stephen Nellis/Reuters)
Source: TechmemePublished: Jun 17, 2026

Stephen Nellis / Reuters : The US government awards $500M under the CHIPS Act to SandboxAQ to use AI to develop new chemicals and materials for domestic semiconductor manufacturing —  The U.S. government awarded $500 million on Wednesday to startup SandboxAQ to develop new chemicals and materials …

Uber plans to expands its premium robotaxi service from SF to Houston by mid-2027, in partnership with EV maker Lucid and autonomous vehicle startup Nuro (Kirsten Korosec/TechCrunch)
Source: TechmemePublished: Jun 17, 2026

Kirsten Korosec / TechCrunch : Uber plans to expands its premium robotaxi service from SF to Houston by mid-2027, in partnership with EV maker Lucid and autonomous vehicle startup Nuro —  Uber plans to launch a premium robotaxi service in Houston by mid-2027, making it the second U.S. market under its partnership …

Nauk Nauk, which runs an AI video app that turns photos of toys into short animated videos, raised $20M and is coming out of beta; it claims to have 1M+ users (Ina Fried/Axios)
Source: TechmemePublished: Jun 17, 2026

Ina Fried / Axios : Nauk Nauk, which runs an AI video app that turns photos of toys into short animated videos, raised $20M and is coming out of beta; it claims to have 1M+ users —  Nauk Nauk has raised $20 million to expand its AI video app that turns photos of toys into short animated videos, co-founder Daniel Liu tells Axios.

Sources: Trump officials discussed how to structure government equity stakes in AI companies, with Commerce Secretary Lutnick preferring a sovereign wealth fund (Eleanor Mueller/Semafor)
Source: TechmemePublished: Jun 17, 2026

Eleanor Mueller / Semafor : Sources: Trump officials discussed how to structure government equity stakes in AI companies, with Commerce Secretary Lutnick preferring a sovereign wealth fund —  Senior Trump administration officials had weighed how to structure potential government equity stakes in major AI companies …

Sources describe Huawei's efforts to rebuild its chip business seven years after the US cut Huawei off from advanced chips, as it bets on logic-stacking tech (Financial Times)
Source: TechmemePublished: Jun 17, 2026

Financial Times : Sources describe Huawei's efforts to rebuild its chip business seven years after the US cut Huawei off from advanced chips, as it bets on logic-stacking tech —  Seven years after being written off the Chinese tech giant is making technical advances that appear to sidestep Washington curbs.

World models startup Odyssey raised $310M from Amazon and others at a $1.45B valuation and will use AWS as its preferred cloud partner, deploying Trainium chips (Financial Times)
Source: TechmemePublished: Jun 17, 2026

Financial Times : World models startup Odyssey raised $310M from Amazon and others at a $1.45B valuation and will use AWS as its preferred cloud partner, deploying Trainium chips —  Company joins investment arms of Nvidia and AMD in $310mn funding round for Odyssey ML  —  Amazon is partnering with a start …

07

STARTUP ARCHIVE

07.00
STARTUP ARCHIVE

Startup News - June 17, 2026

Startup News Roundup: Aggregating key funding and launch updates.

Marc Andreessen on the 5 personality traits of an innovator
Source: StartupPublished: Mar 31, 2026

“When you’re talking about real innovators—people who actually do really creative, breakthrough work—I think you’re talking about a couple things:”

Steve Jobs explains the importance of both thinking and doing
Source: StartupPublished: Mar 30, 2026

“The doers are the major thinkers. The people who really create the things that change this industry are both the thinker-doer in one person.”

Tobi Lutke explains what the VCs who passed on Shopify got wrong
Source: StartupPublished: Mar 27, 2026

“What a lot of free-market thinkers don’t understand is that between the demand and eventual supply lies friction."

Sam Altman explains how he decides to invest in a startup after 10 minutes
Source: StartupPublished: Mar 26, 2026

"Does this person have the potential to be the next Mark Zuckerberg?… [You don’t get to] 100% accuracy, obviously, but it’s good enough that our business model works.”

Jony Ive recounts the time Steve Jobs called him vain
Source: StartupPublished: Mar 25, 2026

In the clip below, Jony Ive recounts the time he asked Steve Jobs to be less harsh in his critique of a piece of work.

Jeff Bezos’s two pieces of advice for aspiring entrepreneurs
Source: StartupPublished: Mar 24, 2026

“The advice that I would give entrepreneurs is don't chase the hot new thing. It's so hard to catch something that everybody already knows is hot."

Elad Gil: “Things that work tend to work pretty fast”
Source: StartupPublished: Mar 23, 2026

“I do think there’s a bit of a myth in Silicon Valley that you should keep grinding no matter what and it’s just about perseverance, and I think that’s really bad advice."

Paul Graham on why starting with a “small, intense fire" is the key to startup growth
Source: StartupPublished: Mar 20, 2026

"You have to know who those first users are and how you're going to get them."

Keith Rabois on how to identify great talent
Source: StartupPublished: Mar 19, 2026

“What you want to do with every single employee every single day is expand the scope of their responsibilities until it breaks… and that’s the role they should stay in.”

Wealthfront CEO on why advertising spend makes it harder to find product/market fit
Source: StartupPublished: Mar 18, 2026

“The way that you know you have product/market fit is if you have exponential organic growth."

Eric Schmidt on why most companies get strategy wrong
Source: StartupPublished: Mar 17, 2026

“Work very, very hard to figure out what the world’s going to look like in five years. What will people be doing? What will your customers want? Where will costs be?"

Mark Zuckerberg: “You can’t 80/20 everything”
Source: StartupPublished: Mar 16, 2026

"There’s the famous 80/20 rule where you get 80% of the benefit by doing 20% of the work, but you can’t just 80/20 everything. There have to be certain things that you are just the best at."

Marc Andreessen on Mark Zuckerberg’s founder “superpower”
Source: StartupPublished: Mar 13, 2026

“A great superpower that Mark Zuckerberg has that is probably not well-understood enough is he does not get emotionally upset in stressful situations"

Sam Altman explains how to come up with a great startup idea
Source: StartupPublished: Mar 12, 2026

"If you start a startup without a good idea… you’ll be under pressure to make something up and it won’t work that well."

Jeff Bezos on the problems with proxies and managing to metrics
Source: StartupPublished: Mar 11, 2026

“One of the things that happens in business is that you develop certain things that you’re managing to—a typical case would be a metric. And that metric isn’t the real underlying thing.”

Airbnb founder Brian Chesky on how to design an amazing user experience
Source: StartupPublished: Mar 10, 2026

“If you can design something really amazing using the hand-crafted part of your brain, then you can reverse-engineer how to industrialize this millions of times over."

Spencer Rascoff: "I will never invest in a consumer startup with paid marketing”
Source: StartupPublished: Mar 9, 2026

"If you’re actually trying to grow a product, the best levers for doing that are often within the product itself.”

Patrick Collison explains why it sometimes make sense to quit
Source: StartupPublished: Mar 6, 2026

“One thing I’ve learned myself the hard way, is that it is easier to tear down a company and restart it in Silicon Valley, than it is to constantly try to pivot or keep something alive."

Jeff Bezos recounts the time he called Amazon’s customer service number mid-meeting to prove a metric was wrong
Source: StartupPublished: Mar 5, 2026

“I have a saying, which is when the data and the anecdotes disagree, the anecdotes are usually right"

Ben Horowitz: “Nobody was born a great manager. It’s a very unnatural job.”
Source: StartupPublished: Mar 4, 2026

“If you can’t build a great product, it doesn’t matter if you can build a great company.”

03

ALSO TODAY

3 MORE SOURCES
08

SOLIDOT

08.00
SOLIDOT

Solidot News - June 17, 2026

Solidot Feed: Highlighting essential tech & open-source news.

恶意墙纸瞄准中俄 Steam 用户窃取其账号

俄罗斯安全公司卡巴斯基对中俄 Steam 用户发出警告,恶意墙纸正在 Steam 创意工坊快速扩散,其目的是劫持他们的账号。攻击者利用了热门墙纸应用 Wallpaper Engine 创意工坊分享功能的漏洞,恶意程序隐藏在分享的壁纸包中。运行被感染的壁纸会导致 Steam 账号被盗,或者系统被植入后门或加密货币挖矿程序。安全研究人员在创意工坊发现了数十款恶意壁纸,每一款都被下载了数千次,甚至数万次。黑客主要针对中国 Steam 用户,墙纸的艺术风格和标题都专门针对中国玩家量身定制,中国玩家的下载量最多,占到了总下载量的  89.4%,其次是俄罗斯的 5.5%,新加坡 (1.4%)、香港 (0.9%)、德国 (0.9%)、越南 (0.9%)、印度 (0.5%) 和加拿大 (0.5%)。Steam 目前已经移除了包含恶意程序的墙纸。

Firefox 用 Zlib 的 Rust 语言版本替代了 C 语言版本

Firefox 浏览器从 v151 开始,Gzip 压缩/解压缩就依赖于 zlib-rs 库,用 Rust 语言开发的版本替代了 C 语言版本改进了性能,提供了更好的内存安全性,以及带来了英特尔第 13 代/第 14 代酷睿 CPU 不稳定导致的崩溃问题。致力于用 Rust 语言重写关键库的非盈利组织 Trifecta Tech Foundation 在 2024 年夏天就与 Mozilla 讨论在浏览器中集成 zlib-rs,但从测试到落地花了两年时间,一个重要原因就是 zlib-rs 触发了臭名昭著的英特尔 CPU bug。测试中 zlib-rs 中的一些代码导致英特尔 Raptor Lake CPU 频繁崩溃,开发者最终发现问题与 Huffman 编码写入内存的一个特定指令相关,识别问题之后解决起来就容易了,开发者通过加入一段“不安全代码”修复了该问题。

泄漏财务数据显示 2025 年 OpenAI 净亏损约 80 亿美元

泄漏财务数据显示 2025 年 OpenAI 净亏损约 80 亿美元。数据显示,OpenAI 的营收从 2024 年的 37 亿美元增至 2025 年的 130.7 亿美元。研发支出从 2024 年的 78.1 亿美元飙升至 2025 年的 191.8 亿美元,其中仅支付给微软的研发费用就高达 105.9 亿美元。产品生产和分销支出从 2024 年的 26.5 亿美元增至 2025 年的 75 亿美元。销售和市场营销支出从 2024 年的 11.1 亿美元增至 2025 年的 57.3 亿美元。OpenAI 的运营亏损从 2024 年的 87.8 亿美元增至 2025 年的 209.2 亿美元,净亏损从 2024 年略高于 50 亿美元飙升至 2025 年的近 390 亿美元。但其中包含了一笔大约 300 亿美元的从非盈利结构转为盈利性结构的估值相关会计支出,如果不计入这笔费用,OpenAI 在 2025 年净亏损约为 80 亿美元。OpenAI 披露 ChatGPT 周活跃用户逾 9 亿,但付费用户只有 5000 万。

GLP-1 减肥药有助于提高男性睾酮水平和精子质量

根据内分泌学会年会上发表的报告,多项研究显示 GLP-1 减肥药有助于提高男性睾酮水平和精子质量。一项研究对 1600多 名开具减肥药处方的男性患者的电子健康记录进行了分析,发现在接受 GLP-1 药物或双重激素受体激动剂治疗后,参与者的睾酮水平增加了约 30%。另一项回顾性研究同样分析了 215 名接受减肥药物治疗男性的记录,发现治疗后他们的平均睾酮水平比治疗前高出约 20%。睾酮是精子产生和维持生育能力不可或缺的激素,而肥胖会降低睾酮水平已是医学界的共识。脂肪细胞中含有高水平的酶,能将睾酮转化为主要的女性性激素雌二醇。此外肥胖引起的代谢变化和体内炎症水平升高也会直接影响睾酮的产生。当 GLP-1 药物帮助患者有效减重时,这些负面因素也随之减弱,从而促使生殖激素网络恢复正常。

地下真菌网络长度超过 10 万万亿公里

根据发表在《科学》期刊上的一项研究,地下真菌网络长度达到 11 万万亿公里(或 110 京公里,1 京等于 1 千万亿),是地日距离的 7.5 亿倍。丛枝菌根真菌(Arbuscular mycorrhizal fungi)是由被称为菌丝的管状细胞构成的网络。它们通过与逾七成的植物建立共生关系维系着地球上的生命。这种网络已存在约 4.75 亿年,它们通过向植物提供养分和水分换取植物产生的碳,它们还通过将碳吸收到土壤中帮助调节气候。Society for the Protection of Underground Networks(Spun)组织的研究团队利用机器学习模型,结合世界各地逾 16000 个土壤样本的数据,绘制出第一张丛枝菌根真菌网络的全球地图。研究人员称,仅仅一茶匙土壤就可能存在长达 10 米的菌根网络。研究还发现,农耕会破坏真菌网络,农田菌根网络密度平均比野生生态系统低 47.3%。草原地区拥有最密集的菌丝系统,但这些地区缺乏保护,正日益退化。

Mozilla 公布 Firefox 路线图

Mozilla 在宣布 Firefox 152 的同时,公布了将在未来推出的一系列新功能,其中包括:更新 UI 的 Project Nova;自定义快捷键;改进 PDF 编辑功能——支持在浏览器上直接拆分、合并和重组 PDF 文档;Multi-Account Containers 从扩展变成原生功能;移动版本将内置免费 VPN(可能只限于少数国家);通过语音向浏览器提问获得 AI 生成答案的 Quick Answers;隐私 AI 浏览 Smart Window;省电模式(Power Saving Mode)识别手机上消耗资源最多的标签页,自动降低其资源占用,从而延长电池续航时间,等等。

ChatGPT 市场份额首次跌破 50%

根据 Sensor Tower 的《State of AI Report for 2026》报告,在 ChatGPT 发布三年半之后,其市场份额首次跌破 50%,而用户正在 Google Gemini、Anthropic Claude 等不同 AI 助手之间切换。ChatGPT 是最快达到 10 亿月活用户的应用,它的月活用户目前超过 11 亿,之后是 Gemini 的 6.62 亿 和 Claude 的 2.45 亿。ChatGPT 在今年 1 月市场份额还超过 50%,但到了 5 月底降至 46.4%,Gemini 占 27.7% 和 Claude 占 10.3%,Grok、Perplexity、DeepSeek 和 Meta AI 都低于 5%。 Sensor Tower 估计,2026 年上半年,AI 应用下载量预计将接近 23 亿次,用户支出将超过 42 亿美元。相比之下 2025 年上半年的 AI 支出为 18.3 亿美元——这表明 AI 行业正将重心从增长转向盈利。但下载量和支出增长率均已放缓,表明即使绝对数量在继续攀升,市场可能正走向成熟。中国和印度的 AI 应用下载量出现了下滑,2026 年第一季度亚洲下载量下降了 3.3%。

微软考虑使用 DeepSeek 的开源模型降低成本

最大化词元使用(tokenmaxxing)对微软的 AI 工具 Copilot 产生了不利影响,软件巨人正在考虑使用 DeepSeek 的开源模型以降低成本。微软考虑使用的是 DeepSeek-V4 自托管版本的修改版,它将作为一种低成本选项用于驱动微软的 Copilot Cowork。Copilot Cowork 目前运行在 Anthropic 和 OpenAI 的模型上,两家公司不断涨价,Copilot 也从无限量使用切换到了基于使用量的定价模式,此举招致了用户的强烈不满。更便宜的型号有助于降低成本让用户满意,但可能会让特朗普政府不满意。

Peter Thiel 的秘密社交网络曝光

黑客行动主义者曝光了 Peter Thiel 于 2006 年创办的秘密组织 Dialog 的成员信息和内部记录。Dialog 每年都会组织非公开的活动,邀请美国官员、外国政府官员和硅谷高管参加。2026 年活动定于 8 月 12 日至 16 日在爱尔兰都柏林的 Powerscourt Hotel 举行,注册的与会者共 222 人,并标明了他们是活跃会员还是特邀嘉宾(guest),会议讨论的主题包括“金钱能买到幸福吗?”“恢复核能”“应对第三次世界大战”“性生活”“建立邪教(Build-a-Cult)”“建立一个党(Build-a-Party)”等等。2025 年上任的北约欧洲盟军指挥官 Alexus Grynkewich 将军自 2021 年起就参与了 Dialog 活动。名单中的名人还有 Palantir 联合创始人 Joe Lonsdale、美财长 Scott Bessent、参议员 Ted Cruz、陆军部长 Dan Driscoll、参与监管 Pantir 的众议员 Jim Hime,以及 Google 和 DeepMind 高管、经济学诺奖得主 Roger Myerson 等。Dialog 还扮演了某种媒人的角色,询问与会者是否在寻找爱情,还提供了约会应用,该应用的口号是“为杰出人士建立有意义的联系”。Dialog 在 2014 年的活动邀请了 Jeffrey Epstein,但并不知道他有没有出席。

垂直绿化给城市降温

气候变化和城市化加剧了热岛效应,城市地区的温度显著高于农村地区,而更高的温度又推动了制冷需求和加剧了电网压力,形成某种恶性循环。日本大阪府大学 Jihui Yuan 副教授领导的团队调查了垂直绿化等城市降温策略。他们的研究显示,朝南绿墙可将室内热条件改善最多 1.7°C;低反照率外表面能改善室外热舒适度最多 1.5°C;高反照率外表面则有助于降低室内温度。

GLP-1 减肥药在降低体重的同时也降低了骨折率

GLP-1 减肥药如 Ozempic、Wegovy、Rybelsus 能快速降低体重,此前有担忧认为快速的体重下降可能导致骨质疏松,增加骨折风险。然而最新研究发现,相比其它起效较慢的减肥药,GLP-1 减肥药能将骨折风险降低 15%。研究人员承认需要更多研究去证实相关性。研究人员分析了逾 59,000 名患者,其中 26,324 名服用了 GLP-1 减肥药,对照组的 33,555 人服用的是非 GLP-1 减肥药。结果显示,实验组发生 794 例骨折,对照组则发生 1045 例。

亚马逊数据中心 2025 年使用了 25 亿加仑的水

根据亚马逊公布的数据,它的数据中心在 2025 年使用了 25 亿加仑的水。电商巨人声称它的用水量远低于主要竞争对手。亚马逊称,其数据中心用水量为每千瓦时 0.12 升(L/kWh),称微软在 2025 年的用水量为每千瓦时 0.27 升,Meta 在 2024 年的用水量为每千瓦时 0.19 升,Google 最糟糕达到每千瓦时 1.15 升。亚马逊表示,其设施约 90% 的时间都采用“自然空气冷却”,即引入室外空气使其流经服务器吸收热量,无需用水——但在最炎热的天气里会使用水蒸发降温。

Commodore 宣布反社交网络的翻盖手机

曾经的家用 PC 巨人 Commodore 又回来了,它宣布了一款翻盖手机 Callback 8020,运行基于 Linux 的 Sailfish OS 操作系统,不支持任何社交媒体、浏览器或工作应用如电子邮件,但支持地图、播客、拼车、以及流行的消息应用如 WhatsApp、Signal、Telegram 和微信(WeChat)——因为对很多人而言没有这些应用手机什么也不是。这款手机是 Commodore 公司推出的,当然也有 Commodore 模拟器。选择翻盖手机是因为它是作为一种多用途工具,你打开翻盖就是为了用它。这款手机不便宜,6 月 30 日开放预购,售价 499 美元,主要配置是 4GB 内存,64 GB SSD,索尼 4800 万像素相机,显示屏分辨率 480 x 640,电池可移除,容量为 1550mAh。

禁止使用科技产品提升了学生的阅读能力

在数字化时代,一名教师的低科技实验显示学生的阅读能力有了显著提升。明尼阿波利斯 Washburn 高中的 AP 文学和英语教师 Maureen Mulvaney 在学生抄袭、注意力不集中以及阅读能力下降等问题之后开始了低科技实验,在家长的支持下,她禁止学生使用手机和笔记本电脑,要求所有作业都必须用纸笔完成。尽管学生一开始有抵触,但效果立竿见影:实验前的 2025 年 9 月只有 46% 的学生对阅读能力有信心,到了今年 2 月该比例飙升至 95%。大多数学生能写至少两页部分学生甚至能写五六页英文文章。79% 的学生表示在纸上写作和组织思路比在屏幕上更容易。

开源模型能否战胜 OpenAI?

中美两国的 AI 公司采取了不同的发布策略:中国侧重于开源权重模型,美国公司如 OpenAI 和 Anthropic 则采用闭源策略。Hugging Face 前亚太生态系统高管 Tiezhen Wang 表示,OpenAI 和 Anthropic 指责中国 AI 公司蒸馏其模型,他认为蒸馏是中性的,美国 AI 公司是通过抓取互联网上的信息训练模型,它们并非知识的创造者,却试图阻止其他人重复利用知识,有点讽刺。所有 AI 生成的内容都应该没有版权,否则拥有算力的人能滥用权力,生成各种组合内容然后对所有内容都申请版权。他发现中国公司和美国公司在最大化使用 token 上有明显差异,因为中国有很多开源权重模型,其使用成本没有美国大,因此中国互联网公司都鼓励员工最大化使用 token,鼓励员工成为 AI 原生开发者,甚至禁止他们手动完成撰写文档之类的日常工作。

curl 暂停一个月接受漏洞报告

curl 项目维护者 Daniel Stenberg 宣布,curl 将于 7 月 1 日至 8 月 3 日期间暂停接收漏洞报告,除非提交者拥有付费支持合同。他称之为“curl 的极乐夏日”。Stenberg 称过去四个月一直承受着巨大的压力,他们需要休息一下。GitHub 上的项目 issue 和 pull-request 保持开放,curl 8.22.0 的发布日期推迟两周至 2026 年 9 月 2 日发布。

朱雀二号火箭解体产生上百碎片

蓝箭航天于 6 月 9 日在东风发射场发射了朱雀二号改进型遥六运载火箭(ZQ-2E Y6),将搭载的千帆 DTC 01 星和中国移动 02 星送入预定轨道。但火箭上面级随后在太空发生解体事件,碎片散落在近地轨道,其中部分与国际空间站和 SpaceX Starlink 宽带网络的轨道重叠。LeoLabs 的高级技术研究员 Darren McKnight 表示此次事件可能产生了 100-150 块碎片。其中一块碎片是火箭的第二级,长约 8 米直径约 3.35 米。火箭上面级的主体在距离地球 335-424 公里的轨道上运行,轨道倾角 54.5 度。好消息是轨道高度足够低,空气阻力将使大部分火箭碎片在几个月内重返大气层烧毁。如果轨道高度超过 650 公里,那么碎片将需要数十年甚至更长时间才能重返大气层。

Firefox 152 释出

Mozilla 释出了 Firefox 152。主要变化包括:默认编译了 JPEG-XL 支持代码,但默认仍然没有启用,用户需要去 Firefox Labs 调整设置启用,JPEG-XL 是新的免专利图像格式,相关编解码器使用了 Rust 语言开发;重新设计了设置界面、在 Windows 不同硬件配置下支持 HDR 视频、支持 CSS 的 field-sizing 属性,以及一系列面向开发者的新功能,等等。

女性头部摄药量与经期相关

小鼠和人类实验显示,头部通过经鼻给物法摄入的药量存在性别差异,其中雌性摄药量与经期密切相关。研究人员通过小鼠实验发现,在动情前期与动情期,即雌激素奔涌至最高的阶段,雌性头部区域摄取的药物显著多于雄性;随着周期进入动情后期,雌激素跌向其最低值,两性之间基本不存在差异。研究人员在人类身上观察到了相似的现象:女性趋于更高的峰浓度,女性最高峰值逾男性最高峰值的两倍。男性则保留药物更久。

微软求助于 AWS 以满足 GitHub 上 AI 驱动的负载增长需求

微软旗下的代码托管平台 GitHub 最近一段时间宕机事件频发,它正将服务迁移到微软的云计算平台 Azure,但仍然无法满足不断增长的需求,因此它正求助于最大的竞争对手亚马逊 AWS,以确保平台能正常运行。亚马逊表示不会对个别客户发表评论。对微软而言,GitHub 宕机的风险已经超过了向 AWS 付费所带来的负面影响。GitHub 首席运营官 Kyle Daigle 此前表示,该平台用户的提交(Commits)数将从 2025 年的 10 亿飙升到 2026 年 140 亿次。GitHub 在 2025 年 10 月计划将平台容量提高 10 倍,但到了 2026 年 2 月它发现需要提高 30 倍,原因是 AI 编程导致平台工作负荷大幅提升。

09

APP STORE RANK

09.00
APP STORE RANK
Loading…