TEXT VIEW · TODAY'S DIGEST · 36 HEADLINES ACROSS 8 SOURCES

Startup Archive(0)

No items yet for today.

App Store Rankings(0)

No items yet for today.

ISSUE 0878
WED, MAY 27, 2026
Discover the best information organized by OrangeBot.AI
TODAY · WED, MAY 27, 2026

The web,
read by a bot.

Ten sources — Hacker News, Product Hunt, HuggingFace, Techmeme and more — filtered, tagged, and summarized every morning for builders who don’t have time to scroll.

NEWChrome extension: save posts from Twitter/X in one click.Install →
01

AI DIGEST

UPDATED DAILY · EDITOR'S PICK
01.00
AI DIGEST

AI新闻摘要

May 27, 2026

Here is a summary of today's main news events.

Global Markets Rally on Hopes of U.S.-Iran Deal

Stock markets, including the S&P 500 and Nasdaq, reached new records while oil prices fell significantly. The moves were driven by investor optimism that a potential agreement between the U.S. and Iran could ease geopolitical tensions and secure global oil supplies.

Semiconductor Stocks Fuel Major Market Gains

A powerful rally in chip and memory stocks, led by companies like Micron and South Korea's SK Hynix, propelled the S&P 500 and Nasdaq to record highs. The surge highlights strong investor confidence in the technology sector, particularly companies essential for the growth of artificial intelligence.

UK Faces New Economic and Social Pressures

British households are set to face higher energy bills after the regulator, Ofgem, increased its price cap. Simultaneously, the British Medical Association has announced another doctors' strike over pay, continuing a long-running dispute that has repeatedly disrupted healthcare services.

Indian Court Sentences Edtech Founder Byju Raveendran

An Indian judge has ordered a six-month jail term for Byju Raveendran, the founder of the embattled educational technology company Byju's. This marks a dramatic turn for the once high-flying startup, which has faced numerous legal and financial challenges.

Lululemon Reaches Truce with Founder Chip Wilson

The athleisure company agreed to add two board members nominated by its founder, Chip Wilson. This deal ends a period of public agitation from Wilson in exchange for an 18-month standstill agreement, resolving a significant corporate governance dispute.

02

ON THE WIRE

6 SOURCES
02

HACKER NEWS

02.00
HACKER NEWS

Hacker News - May 27, 2026

Hacker News Feed: Highlighting key posts and discussions.

I'm Tired of Talking to AI

(orchidfiles.com)

756426
The Melancholy of Slaying Monsters

(thereader.mitpress.mit.edu)

17273
The Forgotten Art of the LAN Party (2023)

(www.superjumpmagazine.com)

15278
Cloudflare Flagship

(developers.cloudflare.com)

275148
GitHub Actions was down

(www.githubstatus.com)

6501
03

HUGGINGFACE

03.00
HUGGINGFACE

huggingface.title - May 27, 2026

huggingface.description

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Vision-language models (VLMs) commonly formulate visual grounding and detection as a coordinate-token generation problem, serializing each 2D box into multiple 1D tokens that are learned and decoded largely independently. This token-by-token decoding mismatches the coupled structure of box geometry and creates a practical inference bottleneck due to strictly sequential generation. We introduce LocateAnything, a unified generative grounding and detection framework based on Parallel Box Decoding (PBD). By decoding geometric elements such as bounding boxes and points as atomic units in a single step, LocateAnything preserves intra-box geometric coherence and unlocks substantial parallelism. We show that PBD improves both decoding throughput and localization accuracy. We further develop a scalable data engine and curate LocateAnything-Data, a large-scale dataset with more than 138 million training samples, substantially increasing data diversity for high-precision localization. Extensive evaluations show that LocateAnything advances the speed-accuracy frontier, achieving significantly higher decoding throughput while improving high-IoU localization quality across diverse benchmarks. The results highlight the complementary benefits of Parallel Box Decoding and large-scale training data in enabling efficient and precise unified visual grounding and detection.

82
SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

While spatial foundation models have demonstrated impressive performance on standard datasets, a critical question remains: are they truly all-round players capable of generalizing robustly across diverse downstream tasks, arbitrary viewpoints, shifting scene domains, varying input densities, and specific hardware constraints? Answering this overarching question requires a holistic assessment, yet current models are mainly evaluated on specific domains for which they were specifically designed or trained. Such evaluations are intrinsically limited by narrow paradigm coverage, limited scene domains, and arbitrary frame sampling, making it fundamentally difficult to assess their true generalization capabilities. To address this gap, we present SpatialBench, a cross-paradigm, domain-diverse benchmark for spatial foundation models with deterministic sampling. SpatialBench features unprecedented scale and rigorous deterministic design, comprising 19 datasets and 546 scenes across 5 diverse spatial domains. It comprehensively evaluates 41 models across 6 paradigms on 5 task suites under 4 different input density settings. Our extensive evaluation reveals that current models are not yet all-round players, and uncovers crucial insights for future advancement. Specifically, we demonstrate that full-context attention maximizes accuracy while bounded-memory strategies unlock long-sequence scalability. Moreover, our empirical evaluations in challenging embodied and egocentric tasks demonstrate that strict domain alignment and high data quality are far more critical to performance than simple dataset scaling. Furthermore, to address the largest data gap identified in our analysis, we go beyond evaluation by introducing a large-scale dataset, DA-Next-5M, and a strong baseline model, DA-Next, pushing the boundaries of spatial representation learning.

51
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

We present MobileGym, a browser-hosted, lightweight, fully controllable environment for everyday mobile use, targeting interaction fidelity without replicating proprietary backends. It enables two capabilities previously out of reach for everyday apps: verifiable outcome signals through deterministic state-based judging over structured JSON state, and scalable online RL through low-cost parallel rollouts. The full environment state is captured, configured, forked, and compared as structured JSON, and a single server can host hundreds of parallel instances, with about 400 MB memory per instance and about 3 s cold start. A layered state model and a declarative task-definition framework keep state programmability and task creation practical at scale, and a single programmatic judging mechanism delivers both deterministic evaluation verdicts and dense RL rewards. The accompanying MobileGym-Bench provides 416 parameterized task templates, including 256 test and 160 train templates, over 28 apps, with deterministic judges and a structured AnswerSheet protocol that avoids free-text matching failures. In a Sim-to-Real case study, GRPO on Qwen3-VL-4B-Instruct gains +12.8 percentage points on the 256-task test set, and on a 59-task real-device signal subset, real-device execution retains 95.1% of the simulation-side training gain. Project page: https://mobilegym.github.io.

45
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

The rapid evolution of generative video foundation models has propelled the field toward professional-grade cinematic synthesis. To achieve such demanding quality, the community transitions towards Reinforcement Learning (RL) and agentic workflows. However, reliable evaluation has emerged as a critical bottleneck. Existing benchmarks predominantly evaluate ''whether it is right'' (basic prompt-following) while fundamentally neglecting ''whether it is good'' (cinematic quality, acting, and aesthetics). Furthermore, current automated metrics lack the domain-specific rigor required to provide trustworthy signals, creating a severe credibility gap between human aesthetic perception and machine scoring. To bridge this gap, we introduce EvalVerse, a comprehensive, pipeline-aware, and expert-calibrated evaluation framework. We treat video generation assessment not merely as an engineering task, but as a core scientific problem: the systematic digitization of subjective cinematic expertise. First, we organize domain knowledge into an evaluation taxonomy aligned with the professional filmmaking workflow (pre-production, production, and post-production). Second, we distill human expert judgments into a curated dataset with large-scale human annotations. Third, we inject this knowledge into Vision-Language Models (VLMs) through an expert-calibrated fine-tuning strategy, enabling the VLM to perform explicit Chain-of-Thought reasoning. Compared to previous works, EvalVerse not only retains compatibility with foundational ''rightness'' metrics, but also significantly expands the criteria to ''goodness'' and broaden the task coverage to complex multi-shot sequencing and audio-visual integration. Consequently, by providing granular diagnostic signals, EvalVerse transcends a static leaderboard and establishes a fundamental infrastructure for future work, such as reward models and evaluator agent.

35
Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

Multi-view 3D reconstruction has achieved remarkable progress with the advent of feed-forward 3D reconstruction models. However, these models are typically trained and evaluated under ideal, degradation-free imaging conditions, whereas real-world observations often contain degradations that differ significantly from such settings. Improving robustness for multi-view 3D reconstruction under degraded conditions therefore remains an important challenge. We present Geometry-Aware Representation Denoising (GARD), a novel framework that performs diffusion-based multi-view restoration directly in the feature space of a feed-forward 3D reconstruction model. This design exploits the geometry-aware feature representations of the 3D reconstructor to effectively recover accurate scene geometry. Furthermore, by employing an additional RGB image decoder, the refined representations can also be used to restore high-quality RGB images, thereby enabling the simultaneous recovery of 3D scene geometry and high-quality imagery. Comprehensive experiments on the Depth Anything 3 (DA3) benchmark demonstrate the effectiveness of the proposed GARD framework.

32
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing

Despite the emergence of diffusion large language models (D-LLMs) as an alternative to autoregressive large language models (AR-LLMs), safety monitoring for D-LLMs remains largely unexplored. Unlike AR-LLMs, D-LLMs generate text through a multi-step denoising process, exposing intermediate hidden representations that may contain safety-relevant information unavailable in standard single-step monitoring setups. Motivated by the suitability of lightweight probes for always-on monitoring, we analyze which trajectory-level signals best indicate when such probes are likely to struggle. We find that the most informative signal is safety hesitation: intermediate hidden states repeatedly falling within a small margin of the probe's decision boundary. The number of such hesitation steps in D-LLM's trajectory predicts probe failure effectively, providing a proxy of sample difficulty. Building on this analysis, we propose D^2-Monitor, a bi-level safety monitor for D-LLMs. D^2-Monitor adopts a lightweight probe as an always-on monitor to jointly estimate hesitation and perform base classification. When the hesitation level exceeds a threshold, a more expressive but computationally heavier probe is activated. This dynamic routing mechanism allocates monitoring resources efficiently at test time. Evaluated on 3 datasets (WildguardMix, ToxicChat, OpenAI-Moderation) across 4 D-LLMs, D^2-Monitor achieves state-of-the-art performance with a compact parameter footprint (leq 0.85M parameters), and exhibits the best trade-off between effectiveness and efficiency relative to 8 baselines.

31
LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Audio-visual generation is rapidly advancing from short clips to minute-long content, while existing evaluation protocols remain largely confined to short-form settings. Existing benchmarks primarily focus on 5--10 second text-conditioned generation and rarely support unified evaluation across text, image, and video conditioning modalities. Moreover, they provide limited insight into how identity consistency, narrative coherence, and audio-visual alignment degrade over extended temporal horizons. To bridge this gap, we introduce LongAV-Compass, a systematic benchmark for minute-long audio-visual generation. LongAV-Compass contains 284 curated test cases spanning text-to-audio-video (T2AV), image-to-audio-video (I2AV), and video-to-audio-video (V2AV), organized by application scenario and generation complexity. The benchmark combines taxonomy-guided benchmark construction with a unified evaluation framework that integrates MLLM-assisted assessment with complementary perceptual and multimodal metrics, including DINO-v2, ArcFace, CLIP, and ImageBind. The framework evaluates more than 20 fine-grained dimensions covering within-segment quality, cross-segment consistency, global narrative coherence, semantic alignment, and audio-visual synchronization. Through experiments on 11 representative models together with human-alignment validation, LongAV-Compass provides a diagnostic testbed for analyzing the limitations of current systems in sustaining coherent, semantically aligned, and temporally consistent minute-scale audio-visual generation across diverse input modalities.

31
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Test-Time Scaling (TTS) enhances the reasoning capabilities of large language models by allocating additional inference compute to explore the solution space. However, existing parallel TTS methods typically keep branches isolated during search: intermediate discoveries remain branch-private and cannot guide other branches in time. This information isolation causes substantial redundant exploration, as branches repeatedly rediscover information already found elsewhere and require more search steps to collect complete decision information needed to reach correct answers. To bridge this gap, we propose Collaborative Parallel Thinking (CPT), a training-free inference framework that enables search-time information sharing across parallel branches. CPT extracts compact intermediate information from ongoing branches, maintains a deduplicated query-level information pool, and broadcasts pool entries through the input context, allowing each branch in subsequent search steps to reuse discoveries made by other branches rather than rediscover the same information. Empirically, experiments on HMMT and AIME benchmarks show that CPT establishes a stronger accuracy--latency Pareto frontier than strong baselines across rollout budgets and model scales, highlighting search-time collaboration as an effective direction for efficient parallel TTS.

19
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

We introduce the MiniMax-M2 series, a family of Mixture-of-Experts language models built around the principle that mini activations can unleash maximum real-world intelligence. The flagship M2 contains 229.9B total parameters with only 9.8B activated per token. Designed end-to-end for agentic deployment, the M2 series rests on three components: (i) agent-driven data pipelines producing large-scale, verifiable trajectories across agentic coding and agentic cowork, each grounded in an executable workspace and an artifact-aligned reward; (ii) Forge, a scalable agent-native RL system that adapts to long-horizon agent trajectories, paired with windowed-FIFO scheduling, prefix-tree merging, inference optimization, and a clean training-inference-agent decoupling that supports both white-box and black-box agents; (iii) the latest M2.7 checkpoint takes an early step toward self-evolution -- autonomously debugging training runs and modifying its own scaffold. Across M2 through M2.7, this combination translates a mini-activation footprint into frontier-tier performance on agentic coding, deep search, office-task, and reasoning benchmarks.

19
Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

We study series-level cinematic remaking, a long-horizon video-to-video generation problem that localizes full episodes or films via stylization or actor replacement while strictly preserving narrative structure, motion choreography, and character identity across hundreds of shots. Existing video generation and editing pipelines often break down in this regime due to compounding identity drift, background mutation, and semantic erosion under large camera motions and viewpoint changes. We propose Soap2Soap, a multi-agent framework that enforces long-term language-visual consistency through a Dual-Bridge Consistency mechanism: a scene-aware JSON screenplay serving as a persistent semantic backbone, and dynamically allocated visual reference anchors at both scene and shot levels. To suppress drift before video synthesis, we introduce batch keyframe consistency, jointly generating multiple keyframes in a shared latent context via a grid-based formulation. A closed-loop verification agent further audits identity, stability, and alignment to trigger selective regeneration. Experiments on SoapBench demonstrate strong improvements over commercial video generation APIs in long-term consistency and narrative fidelity.

17
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

We introduce LLaVA-OneVision-2 (LLaVA-OV-2), the most capable vision-language model in the LLaVA-OneVision series to date, achieving superior performance across a broad range of multimodal benchmarks. The model builds on a native OneVision-Encoder and incorporates Windowed Attention for efficient local computation while maintaining native resolution. Its key advance is codec-stream tokenization: it treats compressed video as a continuous bit-cost stream, where bit-cost dynamics determine adaptive temporal groups, and motion-residual cues select salient spatial evidence into compact visual canvases. This allocation concentrates a limited token budget on event-bearing content, enabling more stable long-video token compression than fixed groups of pictures. A shared 3D RoPE further places codec canvases, sampled frames, and images in a unified spatiotemporal coordinate system. Furthermore, we build the LLaVA-OV-2 data and training stack around large-scale open supervision: approximately 8M re-captioned video samples for pretraining, a 4M-sample spatial corpus for fine-tuning. We also introduce JumpScore, a temporal-localization benchmark targeting fine-grained grounding in high-frequency, densely repeated motion, a regime underrepresented by existing video evaluations. A standout capability of LLaVA-OV-2 is its unified perception across video understanding, temporal grounding, spatial grounding, and manipulation-trace reasoning. On JumpScore, LLaVA-OneVision-2-8B reaches 74.9 JumpScore mAP, surpassing Qwen3-VL-8B (30.1) by +44.8 points; under matched visual-token budgets on the same benchmark, codec-stream inputs improve temporal grounding over frame sampling by +9.7 points. Across standard benchmarks, LLaVA-OneVision-2-8B further outperforms Qwen3-VL-8B by +4.3 average points on video tasks, +5.3 on spatial tasks, and +15.6 average J&F on tracking tasks.

14
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Large language models (LLMs) have evolved into interactive agents that collaborate with users in real-world tasks. Effective collaboration in such settings increasingly depends on understanding the user beyond what is explicitly stated, as user intent is often reflected in fragmented daily interactions and requires both personalized modeling and proactive interaction. However, existing agent benchmarks primarily evaluate reasoning and tool use, largely overlooking the challenges of inferring and leveraging user preferences in realistic scenarios. To address this gap, we introduce VitaBench 2.0, a benchmark for evaluating personalized and proactive agent behavior in long-term user interactions. In VitaBench 2.0, tasks are organized as temporally ordered sequences for individual users, where preferences are embedded in fragmented and heterogeneous interactions. Successful completion of tasks requires the agent to continuously extract, utilize, and update user preferences from these interactions. We further evaluate proactiveness through tasks that require agents to recognize missing information and actively acquire it from users or environments before making decisions. To support systematic analysis, we provide an extensible memory interface that enables controlled comparison across different memory architectures. We benchmark a diverse set of frontier proprietary and open-source LLMs. Results show that real-world personalization remains highly challenging even for state-of-the-art models, revealing a substantial gap between current capabilities and practical requirements. Extensive analysis further reveals the failure modes and capability bottlenecks of current agents in real-world personalized decision-making, providing insights for future model improvements.

9
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning

Visual reasoning through reinforcement learning with verifiable rewards (RLVR) has achieved remarkable progress. However, when dealing with multi-source inputs, existing approaches tend to treat them as a mere accumulation of information, lacking explicit mechanisms to distinguish whether integrating additional sources yields information gain or introduces interference. Therefore, they struggle to effectively model dynamic interaction when integrating multiple sources, particularly when they differ significantly in physical properties and semantics, e.g., infrared and depth, leading to inferior performance to mono-source reasoning when a certain source holds the dominant signal. To address this issue, we propose MARS, a novel mono-anchored multi-source reasoning framework that models each visual modality as an independent information source. Specifically, by treating mono-source rewards as dynamic anchors, our method explicitly incorporates the information gain introduced by multi-source fusion into advantage normalization and adaptively emphasizes mutual promotion between sources while suppressing potential noise or conflicts during RLVR. From theoretical analysis, our method effectively quantifies information gain introduced by multi-source integration in gradient estimation, enabling consistent modality regulation. Empirical results also show impressive 3.2% and 4.9% performance gains on GRPO and DAPO across diverse datasets, confirming effectiveness of our method.

9
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Activation oracles aim to make the activations of other models legible to humans and yield promising results compared to white-box interpretability techniques. However, uncertainty quantification (UQ) for the natural-language outputs of such activation oracles is so far understudied. Here, we investigate 6 different methods for estimating the confidence of activation oracles and evaluate how well-calibrated their confidence scores are. Our experiments on 6,000 samples per oracle (varying verbalizer and context prompts) reveal that bootstrap mode frequency is the best-calibrated method among those tested (ECE 5.7% vs. 25.5% for the answer-word log-probability on Qwen3-8B; 10.3% vs. 13.1% on Qwen3.6-27B), and that the log-prob baseline can serve as a fast triage signal at a fraction of the cost. Code and the patched trainer are available at https://github.com/federicotorrielli/probabilistic_activation_oracles.

8
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

Agentic reinforcement learning (RL) has proven effective for training LLM-based agents with external tool-use capabilities. However, we identify that agentic RL training induces increasing redundant tool calls and blurs the model's intrinsic knowledge boundary, where the model fails to distinguish when tools are needed versus when parametric knowledge suffices. Existing solutions based on reward shaping create coarse-grained optimization targets that tend to incentivize indiscriminate tool-call suppression, leading to reward hacking. In this paper, we propose AKBE (Agentic Knowledge Boundary Enhancement), an on-policy method that dynamically probes the model's intrinsic knowledge boundary through dual-path (with-tool and no-tool) rollouts during training. We define the knowledge boundary as the per-instance determination of whether tools are required and the minimum tool calls necessary. By comparing correctness across paths, AKBE categorizes trajectories and constructs targeted supervisory signals that guide efficient tool-use patterns for each question. These signals are integrated seamlessly into the agentic RL training loop. Experiments on seven QA benchmarks demonstrate that AKBE improves task accuracy by +1.85 on average and reduces tool calls by 18% over standard agentic RL, yielding 25% higher tool productivity without any accuracy-efficiency trade-off. Further analysis suggests its plug-and-play compatibility across different RL algorithms and the mechanism of each signal category. Our code is available at https://github.com/CuSO4-Chen/AKBE.

8
Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models

Normalization layers in modern large language models (LLMs) consist of a deterministic normalization operation and a learnable scale vector. While the normalization operation has been extensively studied, the scale vector remains poorly understood despite its ubiquitous use. In this work, we present a systematic study of scale vectors in LLMs from the perspectives of expressivity, optimization, and architectural structure. First, we show empirically that although scale vectors constitute only a negligible fraction of model parameters, removing them substantially degrades LLM pre-training. Our theory further shows that, in Pre-Norm architectures, scale vectors do not increase expressivity; instead, they improve optimization through a self-amplifying preconditioning effect on subsequent linear mappings. Second, we investigate the role of weight decay for scale vectors. By distinguishing Input-Norm and Output-Norm layers, we theoretically show that weight decay is beneficial for the former but harmful for the latter, due to their distinct roles in optimization and expressivity. Third, motivated by this understanding, we propose three lightweight and complementary improvements to scale vectors: branch-specific heterogeneity, improved placement around linear mappings, and magnitude-direction reparameterization. Both theory and experiments show that each improvement yields consistent gains. Finally, we combine these improvements into a unified scale-vector strategy and evaluate it through extensive LLM pre-training experiments on dense and mixture-of-experts models ranging from 0.12B to 2B parameters, across multiple optimizers and learning rate schedules, under industrial-scale token budgets. The unified strategy consistently achieves lower terminal loss than well-tuned baselines and exhibits more favorable scaling behavior, while adding negligible parameter and computational overhead.

8
Rethinking VLM Representation for VLA Initialization

Vision-Language-Action (VLA) models widely adopt pretrained Vision-Language Models (VLMs) as policy backbones, yet it remains unclear what kind of pretrained VLM representation is useful as a VLA initialization. In this paper, we study VLA initialization as a controlled representation-design problem along three axes: capability-level embodied VQA supervision, parameter-update strategy, and robot-data pretraining. Our experiments show that the original pretrained VLM representation is a key source of action performance. However, embodied VQA adaptation does not yield uniform gains: its benefit depends on downstream bottlenecks, and gains from different capability domains are not simply additive. For update strategy, LoRA provides a more reliable initialization than Full Finetune, indicating that overly reshaping the pretrained representation can weaken VLA initialization. Robot-data pretraining further improves VLA initialization, with the strongest variant obtained by staged LoRA-based training. Together, these findings suggest that effective VLM-to-VLA adaptation should inject action-relevant embodied and robot-trajectory signals while preserving the pretrained VLM representation that remains useful for action learning.

7
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation

Large language model (LLM) agents rely on reusable skills to solve complex tasks. However, existing skill creation approaches treat skills as isolated and static artifacts, limiting their reusability, reliability, and long-term improvement. We propose MUSE-Autoskill Agent (Memory-Utilizing Skill Evolution), a skill-centric agent framework that lets agents continuously improve their task-solving capability by creating, reusing, and refining skills under a unified lifecycle (creation, memory, management, evaluation, and refinement). Our framework enables agents to create skills on demand, store and reuse them across tasks, organize and select them efficiently, and evaluate them through unit tests and runtime feedback for continuous refinement. We further introduce skill-level memory that accumulates experience for each skill across tasks, enabling more effective reuse and adaptation over time. Experiments on SkillsBench provide initial evidence that lifecycle-managed skills can improve task success, efficiency, reuse, and cross-agent transfer, highlighting the importance of treating skills as long-lived, experience-aware, and testable assets.

6
Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows

Large Language Models (LLMs) are increasingly deployed as autonomous agents that reason, use tools, and act over multiple steps. Yet most hallucination benchmarks still evaluate only the final output, missing failures that originate in intermediate Thought-Action-Observation steps. We present Trajel, a dataset and evaluation framework for auditing trajectory-level hallucinations in multi-agent industrial workflows. Trajel introduces a five-type hallucination taxonomy (factual, referential, logical, procedural, and scope-based) over expert-annotated agent traces from AssetOpsBench. We benchmark supervised detection models at the subtask, trajectory, and long-context levels. Our results show that the most common failure modes are missed by existing benchmarks, that nearly half of hallucinated trajectories involve multiple types at once, and that automated detectors with high binary accuracy still misclassify the subtlest types. Trajectory-aware detection significantly outperforms standard post-hoc verification, making taxonomy-grounded evaluation necessary for safer agentic deployment.

5
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

We introduce Gemini Embedding 2, a native multimodal embedding model that allows embedding video, audio, image, and text modalities in a unified representation space. We leverage the multimodal capabilities of Gemini to produce embeddings for arbitrary combinations of interleaved inputs across all these modalities that generalize well across a wide variety of tasks. Applying large-scale contrastive learning in a multi-task multi-stage training setup, we achieve state-of-the-art performance on key embedding benchmarks including unimodal, cross-modal, and multimodal retrieval spanning a diverse set of tasks. We show that our embedding model demonstrates strong performance (with a score of 62.9 R@1 on MSCOCO, 68.8 NDCG@10 on Vatex, 69.9 on MTEB multilingual and 84.0 on MTEB Code) across a variety of tasks surpassing the performance of specialized models. These unified capabilities make Gemini Embedding 2 a promising candidate for downstream use cases such as RAG, recommendation and search. Furthermore, its robust zero-shot performance across distinct fields - from astronomy and bioscience to fine arts and the culinary arts - establishes it as a highly reliable, out-of-the-box representation even for specialized domains.

4
MobileMoE: Scaling On-Device Mixture of Experts

Mixture-of-Experts (MoE) has become the de facto architecture for hundred-billion-parameter language models, yet its advantages at sub-billion scales for on-device deployment remain largely unexplored. To close this gap, we present MobileMoE, a family of on-device MoE language models with sub-billion active parameters (0.3-0.9B active and 1.3-5.3B total) that establish a new Pareto frontier for on-device LLMs. We first formulate an on-device MoE scaling law that jointly optimizes MoE architecture under mobile memory and compute constraints, identifying an on-device sweet spot - moderate sparsity with fine-grained and shared experts - that is simultaneously memory and compute-optimal. Building on the derived architectures, we train MobileMoE with a four-stage recipe covering pre-training, mid-training, instruction fine-tuning, and quantization-aware training, all on open-source datasets. Across 14 benchmarks, MobileMoE matches or exceeds leading on-device dense LLMs with 2-4times fewer inference FLOPs, and matches or surpasses the state-of-the-art MoE OLMoE-1B-7B with up to 60% fewer parameters. To bridge the last mile to mobile deployment, we provide the first efficient MoE inference on commodity smartphones with comprehensive on-device profiling. At comparable INT4 weight memory, MobileMoE-S delivers 1.8-3.8times faster prefill and 2.2-3.4times faster decode than the dense baseline MobileLLM-Pro.

4
Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

Recent advances in large language models (LLMs) have facilitated the widespread deployment of LLMs as interactive agents capable of reasoning, planning, and tool use. Despite strong performance on existing benchmarks, such agents often exhibit notable degradation when deployed in real-world settings, where environments are inherently stochastic and imperfect. We argue that this discrepancy arises from a fundamental mismatch between idealized training settings and real-world interaction dynamics, where current paradigms rely on carefully curated task instructions and stable, well-controlled environments. To address this gap, we propose NoisyAgent, an agentic training framework that explicitly incorporates environmental imperfections into the agent learning process. We identify two major sources of interaction noise in real-world scenarios: user noise, which captures ambiguity and variability in user interaction, and tool noise, which reflects failures and anomalies in tool execution. We introduce such perturbations into the training pipeline by modifying user interaction patterns and simulating tool execution results within the training environment. To stabilize training while encouraging agents to handle increasingly challenging imperfections, noise is applied to only a subset of rollouts and progressively increased in difficulty as the model adapts to the current noise level. Extensive experiments demonstrate that our approach consistently improves agent robustness under noisy and dynamic environments. Our analysis reveals that training under noise conditions also yields performance gains on idealized benchmarks, suggesting that controlled exposure to environmental noise promotes more generalizable reasoning and decision-making behaviors. Our findings highlight the importance of modeling interaction imperfections for bridging the gap between agent training and real-world deployment.

3
Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation

Subject-driven image generation aims to synthesize new images that preserve the identity of the given subject while following textual instructions. Existing approaches often encode text and reference images separately. This limits cross-modal reasoning abilities and causes copy-paste artifacts. Recent frameworks that connect multimodal models and diffusion models improve instruction following, but largely overlook identity preservation. To address these limitations, we condition diffusion models on Multimodal Large Language Models (MLLMs) that jointly encode text and reference images, and augment it with VAE-based identity conditioning. A novel Dual Layer Aggregation (DLA) module is designed to aggregate multi-level MLLM features for optimal conditioning, and a multi-stage denoising strategy is applied to progressively balance the semantic information from MLLM and fine-detail identity from VAE during inference. Extensive experiments demonstrate that our approach harmonizes multimodal understanding with identity preservation, mitigates copy-paste issues, and achieves superior performance regarding human preference on subject-driven image generation. Our project website is available at https://zsh2000.github.io/squeeze-mllm-subject-gen/.

3
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models

Large language models inevitably retain sensitive information, defined as inputs that may induce harmful generations, due to training on massive web corpora, raising concerns for privacy and safety. Existing machine unlearning methods primarily rely on retraining or aggressive fine-tuning, which are either computationally expensive or prone to degrading related knowledge and overall model utility. In this work, we reformulate machine unlearning as a precise knowledge re-mapping problem via model editing. We propose ZeroUnlearn, a few-shot unlearning framework. It overwrites sensitive inputs by mapping them to a neutral target state and removing their original representations. ZeroUnlearn enforces representational orthogonality through a multiplicative parameter update with a closed-form solution, enabling efficient and targeted unlearning. We further extend ZeroUnlearn to a gradient-based variant for multi-sample unlearning. Experiments demonstrate that our approach outperforms existing baselines while preserving general model utility. Our code is available at the github: https://github.com/XMUDeepLIT/ZeroUnlearn.

2
Understanding Data Temporality Impact on Large Language Models Pre-training

Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we study the impact of pre-training dynamics on the acquisition of time-sensitive factual knowledge, focusing specifically on data ordering. Our main contributions are twofold. First, we introduce a comprehensive benchmark of over 7,000 temporally grounded questions and an evaluation protocol that enables analysis of whether models correctly associate facts with their corresponding time periods. Second, we pretrain 6B-parameter models on temporally ordered Common Crawl snapshots and compare them against standard shuffled pre-training. Our results show that sequentially trained models match shuffled baselines on general language understanding and common knowledge while consistently exhibiting more up-to-date and temporally precise knowledge. Temporally ordered pre-training yields improved factual freshness, while shuffled pre-training peaks on older data, possibly due to increased factual repetition. These findings, along with the release of our code at https://github.com/kyutai-labs/kairos , checkpoints, and datasets at https://huggingface.co/collections/kyutai/kairos provide a foundation for future research on continual learning for LLMs.

2
SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent

Long-horizon agentic reasoning requires large language models to act over long interaction histories containing thoughts, tool calls, observations, and partial conclusions. The challenge is not merely that these histories grow long, but that information needed for the current decision may be scattered across distant steps and only become relevant later. Existing approaches address this difficulty by truncating the interaction history, compressing it into shorter surrogates, or retrieving selected parts of it for reuse, but they do not explicitly model how access to past interaction should adapt to the agent's evolving state. We instead cast long-horizon reasoning as a problem of state-adaptive memory. To this end, we propose State-Adaptive Memory~(SAM), a standalone framework that consolidates ongoing interaction into compact memory cues while preserving raw trajectory pages for intent-driven recall. These cues are not treated as replacements for history; rather, they serve as lightweight handles that allow the agent to reconstruct temporally distant information according to its current needs, without retraining the underlying backbone. We further optimize the memory module through expert-guided supervision and reinforcement learning, aligning it with trajectory-level utility. Across BrowseComp, BrowseComp-ZH, WideSearch, and HLE, SAM consistently outperforms strong baselines over diverse agent backbones. Our results suggest that explicit memory modeling provides a simple and effective foundation for long-horizon agentic reasoning.

2
MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale

Layered image generation and editing is a fundamental capability that enables layer-wise reuse, editing, and composition of generated visual content, analogous to word-level editing in natural language. Despite its importance, this remains an underexplored area at scale. To address this gap, we present MRT, a 20B-parameter masked region diffusion model tailored for multi-layer transparent image generation and editing, trained on over 10M multilingual design samples spanning diverse aspect ratios and textual prompts. To fully leverage this scale, we make two key technical contributions. First, we unify three complementary tasks including text-to-layers, image-to-layers, and layers-to-layers within a shared masked region diffusion framework, where selective token masking enables flexible layer-wise generation and editing. Second, to enable overflow layer generation, we introduce an overflow-aware canvas layer that handles boundary inconsistencies and supports semi-transparent background synthesis, enabling complete editable layers extending beyond visible canvas boundaries. Additionally, we apply diffusion distillation to achieve 8-step, real-time multi-layer generation with minimal quality degradation. Extensive experiments demonstrate that our framework substantially outperforms prior state-of-the-art approaches, including various commercial systems, across all three tasks, establishing a new benchmark for multi-layer transparent image generation. Notably, our model significantly outperforms the concurrent Qwen-Image-Layered model in image-to-layers quality according to user-study results, while achieving 10-100\times faster inference and reducing activation GPU memory consumption by 50-90\% during image-to-layer inference.

2
RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

Diffusion Transformers (DiT) achieve strong performance in image generation but incur substantial inference costs. While prior work has reduced this cost via quantization and distillation, semi-structured sparsity, which can nearly halve FLOPs, remains underexplored. A key reason is that most existing approaches focus on weight sparsification, and pruning 50% of the weights can remove critical model capacity and degrade generation quality. Our study, however, shows that DiT activations are intrinsically sparse and significantly more robust to N:M semi-structured sparsification than weights. Motivated by this observation, we advocate a paradigm shift from weight sparsification to activation sparsification. We propose RT-Lynx, which applies N:M sparsification to activations and incorporates error-compensation techniques to mitigate accuracy loss. We further implement highly optimized CUDA kernels tailored to this setting, achieving up to a 1.55x speedup on average in linear layers. Extensive experiments across multiple diffusion models demonstrate that our method preserves the generation quality of the original models while substantially accelerating inference.

2
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

Prior work establishes that controlled contrastiveness between self-generated responses from large language models, set via reward scores, improves downstream preference tuning in English. We extend this method to multiple languages and evaluate two models across a total of 14 high and low-resource languages on a diverse set of tasks. Our central finding is that cross-lingual contrastive preference tuning on self-generations (CroCo) transfers without language-specific preference annotation. A reward model trained on English preferences (atop a multilingual base) produces useful within-language rankings across most languages, and pairing in either a monolingual or multilingual setting improves over each model on the majority of setups while preventing the catastrophic forgetting of supervised fine-tuning. We observe that the gains require on-policy data. Off-policy responses reduce the benefit and online preference optimization fails to improve over the offline variant. Specifically, on structured tasks, our method matches or exceeds the base in 6/7 languages for EuroLLM-9B and 4/7 settings for Aya-3B. On open-ended generation, both tuned models win against their respective base across 11 evaluated languages. Overall, we show promising directions for multilingual preference tuning.

1
Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents

Agentic systems are becoming more capable: agents define strategies, take actions, and interact with different environments. This autonomy poses serious challenges for overseeing and assessing agent behavior. Most current tools are limited, focusing on observability with basic evaluation capabilities or imposing static, hand-crafted error taxonomies that cannot adapt to new domains. To address this gap, we present Agentic CLEAR, an automatic, dynamic, and easy-to-use evaluation framework. It produces textual insights into the agent behavior on three levels of granularity: system, trace, and node. Agentic CLEAR operates above the observability layer, enabling seamless integration and featuring an intuitive UI that makes agent evaluation highly accessible. In our experiments on four benchmarks, seven agentic settings, and tens of thousands of LLM calls, we show that Agentic CLEAR produces high-quality, data-driven, insightful feedback. Our analysis shows strong alignment with human-annotated errors and the ability to predict task success rate.

1
Learning High-Frequency Continuous Action Chunks in Latent Space

Modern robotic policies increasingly rely on action chunking to execute complex tasks in the physical world. While action chunking improves temporal consistency at moderate action frequencies, it becomes insufficient when the action frequency is further increased (e.g., to 60~Hz). At such high frequencies, policies often fail to generate actions that are both temporally smooth and spatially consistent. We address this challenge by shifting high-frequency action learning from the action space to a latent space with variational autoencoder (VAE). This formulation significantly improves both temporal and spatial consistency of high-frequency control. To enable smooth real-time execution, we further introduce Reuse-then-Refine, a chunk-level refine strategy that improves continuity between adjacent action chunks under asynchronous inference. As a result, robots controlled by our policy can execute complex contact-rich tasks continuously, with less pauses and jerky motions. Experiments on three real-world contact-rich robotic tasks show that our approach consistently completes tasks with smooth motions. Our code and data are available at https://github.com/tars-robotics/RTR.

1
DarkForest: Less Talk, Higher Accuracy for Multi-Agent LLMs

Multi-agent LLM systems improve reasoning by combining outputs from multiple agents, but interaction-heavy methods can introduce error propagation and high communication overhead. When agents exchange raw responses or reasoning traces, incorrect intermediate reasoning may be adopted and amplified, leading to confident but wrong consensus; multi-round communication also increases token consumption, latency, and inference cost. In this paper, we propose a controlled-communication coordination framework named DarkForest. DarkForest first keeps agents independent, so each agent produces an answer without seeing the others' outputs. It then parses the raw responses into structured candidate records, groups semantically equivalent candidates into clusters, and estimates a calibrated belief distribution over these clusters using agent reliability, confidence, parse quality, support-pattern reliability, and independence corrections. A coordinator receives only policy-permitted evidence from this belief state with controlled communication. Experiments on six reasoning benchmarks show that DarkForest achieves leading overall quality, improves the strongest baseline by up to 30.7\% on benchmark metrics, and reduces token consumption by up to 6.5times compared with communication-heavy baselines.

1
STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media

Large language models for vertical domains are bottlenecked by the scarcity of complex, domain-specific task-oriented dialogues. Existing data acquisition pipelines face a persistent trilemma: expert annotation is expensive, real-world service conversations are constrained by privacy and commercial restrictions, and static corpora quickly become temporally stale. We propose Stream, a data-centric framework that leverages publicly available streaming media (live streams and short videos) to synthesize high-value service dialogues at scale. Stream mines authentic interaction signals from noisy streams and synthesizes conversations by integrating role-grounded persona construction with Conversational Blueprint construction; it further adopts retrieval-augmented generation (RAG) to support knowledge-aware responses. Based on Stream, we release StreamDial, a large-scale multi-domain dataset covering Automotive, Restaurant, and Hotel. StreamDial contains 87,498 dialogue sessions and 1,497,320 turns in total, with an average of 17.11 turns per session and a comparable scale across domains. Each session is organized as a structured quadruplet langle P_u, P_a, B, H rangle that pairs dialogue history with explicit user/agent personas and a Conversational Blueprint, capturing realistic service behaviors such as requirement mining, constraint conflicts, negotiation, and recovery. Evaluations with automatic judges and downstream tasks show that StreamDial improves intrinsic dialogue quality over strong baselines, and models trained with StreamDial improve Dialogue State Tracking across backbones; we further report a completed human-evaluation set and encouraging multilingual transfer on Qwen3-8B under a controlled training budget. The data is released in https://github.com/hitxueliang/DialogDataSetBySTREAM.

0
05

PRODUCT HUNT

05.00
PRODUCT HUNT

Product Hunt - May 27, 2026

Product Hunt Daily Feed: Featuring noteworthy tech launches.

Pawse.ai icon
Pawse.ai

An acoustic regulation system for dogs

0
Layers icon
Layers

Create beautiful animated code snippet videos for free

0
Bluedot 2.1 icon
Bluedot 2.1

Record on Apple Watch. Sync with Claude

0
Mojito icon
Mojito

Type to search for any emoji or symbol in seconds

0
Oasis Browser for Mac icon
Oasis Browser for Mac

A privacy-first AI browser you can train anonymously

0
baz.studio icon
baz.studio

Skills library & video editor for AI Agents

0
Phasr icon
Phasr

Run 100+ workflows simultaneously without losing context

0
QuickSheet v1.2 icon
QuickSheet v1.2

Instantly create and edit spreadsheets from your menu bar

0
Netfox icon
Netfox

A native local macOS network monitor

0
Archi-Flow icon
Archi-Flow

Visualize cloud architecture with live traffic simulations

0
Krater icon
Krater

All the AI tools you use, one subscription

0
Calling Skills for AI Agents icon
Calling Skills for AI Agents

Add voice and video calling via your coding agent

0
Extend icon
Extend

Parse any PDF layout with SOTA accuracy for AI pipelines

0
AgenticCalling AI icon
AgenticCalling AI

Give your AI the power to make phone calls

0
Local Panel icon
Local Panel

Local SSH server manager with no subscriptions or installs

0
CircadiaOS icon
CircadiaOS

Sleep optimization, minus the $3,000 mattress pod

0
Chunk sidecars icon
Chunk sidecars

Validate agent-generated code before it ever reaches CI

0
Curlo icon
Curlo

Local AI search to find SFX and music by describing it

0
GenGo icon
GenGo

Transform selected text anywhere on macOS

0
BaseBuddy icon
BaseBuddy

Turn your Supabase database into a WordPress-like editor

0
BobCA icon
BobCA

A sovereign agent that learns to code with your preferences

0
Octolane icon
Octolane

Self-driving AI CRM that you can talk to

0
Powabase icon
Powabase

Build AI apps with Postgres, RAG, and agents

0
Coworker AI icon
Coworker AI

More AI for less spend with context-aware model routing

0
zero.xyz icon
zero.xyz

Give your AI agent access to 8k tools, APIs and services

0
Harbor icon
Harbor

CLI + companion App to spin up complete local LLM stacks

0
Aviquill icon
Aviquill

A calm canvas for visual thinkers with messy minds

0
Jott icon
Jott

Capture quick written or voice notes from your Mac's notch

0
BankStatementLab icon
BankStatementLab

Turn any bank statement PDF into Excel, CSV or JSON with AI

0
Studio Practice icon
Studio Practice

Preview any URL on every Mac screen at once

0
marpy.io icon
marpy.io

AI coding platform built specifically for the Python stack

0
ReplylessAI Sequences icon
ReplylessAI Sequences

Outbound email sequences without the sales-tool bloat

0
AVTR-1 Real-Time Open Weights Model icon
AVTR-1 Real-Time Open Weights Model

Generating uncanny AI avatars is now open source

0
Kept icon
Kept

Your AI chats, saved as Markdown locally with no cloud

0
Hayley: Your Thinking Companion icon
Hayley: Your Thinking Companion

An insight, a pattern, one question worth sitting with.

0
Bond icon
Bond

Outbound campaigns powered by real buying signals

0
Trace icon
Trace

No-frills offline meeting transcripts with context

0
Parrot Speech-to-text API icon
Parrot Speech-to-text API

Fast, accurate STT for production-grade voice agents

0
Willow Scribe icon
Willow Scribe

Tell Scribe what to say. It writes the rest.

0
QuakPit icon
QuakPit

Meeting reminders that actually make you smile.

0
Ferrari Luce icon
Ferrari Luce

The first electric Ferrari designed by LoveFrom

0
MiniCPM5-1B icon
MiniCPM5-1B

A new SOTA for compact open models on the edge

0
blokdots 3.0 icon
blokdots 3.0

Prototype hardware visually, export real C++ for engineering

0
SelectPrism icon
SelectPrism

Agents that screen and interview so you can hire faster

0
LikePulse icon
LikePulse

See exactly where YouTube audiences react — instantly

0
Ormedo icon
Ormedo

Let AI agents handle your entire outbound pipeline

0
AI Shadowing icon
AI Shadowing

Turn any YouTube video into a language shadowing lesson

0
DNSimple CLI icon
DNSimple CLI

Manage Your DNS from the Command Line with DNSimple CLI

0
crunr icon
crunr

Launch and run any compute job on AWS with 1 command

0
LangPanda icon
LangPanda

Learn languages from watching your favorite shows

0
06

TECHMEME

06.00
TECHMEME

Techmeme - May 27, 2026

Techmeme Digest: Major tech headlines and industry conversations.

DC-based Airis Labs, which uses AI to convert visual data to law enforcement intelligence, emerges from stealth with a $31M Series B, for $60M in total funding (Chris Metinko/Axios)
Source: TechmemePublished: May 27, 2026

Chris Metinko / Axios : DC-based Airis Labs, which uses AI to convert visual data to law enforcement intelligence, emerges from stealth with a $31M Series B, for $60M in total funding —  Airis Labs, an AI video platform for defense, has come out of stealth with a $31 million Series B led by PSG Equity, co-founder and CEO Noam Friedman tells Axios Pro.

Polymarket is making it harder to use VPNs to access its service, blocking some IPs and suspicious accounts, and is asking some customers to identify themselves (Michael Roddan/The Information)
Source: TechmemePublished: May 27, 2026

Michael Roddan / The Information : Polymarket is making it harder to use VPNs to access its service, blocking some IPs and suspicious accounts, and is asking some customers to identify themselves —  In Russia, making bets on Polymarket isn't permitted by the popular online prediction market's rules.

YouTube makes its AI content labels more prominent on desktop and mobile, and will apply them automatically if it detects "significant photorealistic AI use" (Todd Spangler/Variety)
Source: TechmemePublished: May 27, 2026

Todd Spangler / Variety : YouTube makes its AI content labels more prominent on desktop and mobile, and will apply them automatically if it detects “significant photorealistic AI use” —  Is that YouTube video clip you're watching real or was it made with AI?  —  YouTube wants to make it easier for viewers …

Robinhood launches a feature to let users link AI agents, such as Claude or Cursor, to separate, dedicated investment accounts for trading stocks autonomously (Hannah Erin Lang/Wall Street Journal)
Source: TechmemePublished: May 27, 2026

Hannah Erin Lang / Wall Street Journal : Robinhood launches a feature to let users link AI agents, such as Claude or Cursor, to separate, dedicated investment accounts for trading stocks autonomously —  New feature links artificial-intelligence tools to investment and credit-card accounts  —  AI agents were already dispensing advice …

Biohub, a Mark Zuckerberg- and Priscilla Chan-funded institute, releases "a world model of protein biology" to researchers for prediction, design, and discovery (Ina Fried/Axios)
Source: TechmemePublished: May 27, 2026

Ina Fried / Axios : Biohub, a Mark Zuckerberg- and Priscilla Chan-funded institute, releases “a world model of protein biology” to researchers for prediction, design, and discovery —  Biohub, the Mark Zuckerberg and Priscilla Chan-funded institute, on Wednesday released what it says amounts to “a world model of protein biology.”

The Shanghai Stock Exchange says memory maker CXMT cleared a listing review for the exchange's Nasdaq-like STAR Board, in what could be China's top IPO in 2026 (Wall Street Journal)
Source: TechmemePublished: May 27, 2026

Wall Street Journal : The Shanghai Stock Exchange says memory maker CXMT cleared a listing review for the exchange's Nasdaq-like STAR Board, in what could be China's top IPO in 2026 —  The listing could become China's largest IPO this year  —  China's securities regulator has cleared an approximately $4 billion share offering …

Source: TSMC CEO C.C. Wei told staff that they will see a 30%+ bump in their profit-sharing payouts in 2026 on average, after some staff voiced concerns online (Bloomberg)
Source: TechmemePublished: May 27, 2026

Bloomberg : Source: TSMC CEO C.C. Wei told staff that they will see a 30%+ bump in their profit-sharing payouts in 2026 on average, after some staff voiced concerns online —  Taiwan Semiconductor Manufacturing Co. chief C.C. Wei told staff they'll see more than a 30% bump in their profit-sharing payouts …

Temu owner PDD reports Q1 revenue up 11% YoY to ~$15.7B, below ~$16.2B est., net profit down 15% to ~$1.85B, below ~$3.4B est., amid fierce competition in China (Wall Street Journal)
Source: TechmemePublished: May 27, 2026

Wall Street Journal : Temu owner PDD reports Q1 revenue up 11% YoY to ~$15.7B, below ~$16.2B est., net profit down 15% to ~$1.85B, below ~$3.4B est., amid fierce competition in China —  The company is rolling out more support initiatives to prevent merchants from defecting to other platforms

Sources: ByteDance is discussing up to $70B of 2026 capex as it builds out data centers and other AI infrastructure, underwritten by its ~$50B profit in 2025 (Bloomberg)
Source: TechmemePublished: May 27, 2026

Bloomberg : Sources: ByteDance is discussing up to $70B of 2026 capex as it builds out data centers and other AI infrastructure, underwritten by its ~$50B profit in 2025 —  ByteDance Ltd., the developer of TikTok and a leading force in artificial intelligence, is planning to sharply increase …

Demis Hassabis says he still broadly expects AGI around 2030, though he now sees 2029 as a possibility, and 2026's "agentic era" is a "bit like a practice run" (Ina Fried/Axios)
Source: TechmemePublished: May 27, 2026

Ina Fried / Axios : Demis Hassabis says he still broadly expects AGI around 2030, though he now sees 2029 as a possibility, and 2026's “agentic era” is a “bit like a practice run” —  Google DeepMind CEO Demis Hassabis said at Google's developer conference last week that humanity is standing in the …

Milan-based social travel startup WeRoad raised a $58M Series C led by Airbnb, taking its total funding to ~$100M, to expand into the US, starting in Austin (Lauren Forristal/TechCrunch)
Source: TechmemePublished: May 27, 2026

Lauren Forristal / TechCrunch : Milan-based social travel startup WeRoad raised a $58M Series C led by Airbnb, taking its total funding to ~$100M, to expand into the US, starting in Austin —  WeRoad, the Milan-based group travel startup, has raised a $58 million Series C round led by Airbnb as it prepares for its first major expansion outside Europe.

Sources: Germany and Spain are leading opposition to the EU's plan to ban Chinese suppliers like Huawei from telecom networks as part of new cybersecurity rules (Bloomberg)
Source: TechmemePublished: May 27, 2026

Bloomberg : Sources: Germany and Spain are leading opposition to the EU's plan to ban Chinese suppliers like Huawei from telecom networks as part of new cybersecurity rules —  Germany and Spain are leading opposition to European Commission plans to ban Chinese technology suppliers from telecom networks …

Jensen Huang says Nvidia spends $100B-$150B per year on its Taiwan supply chain, up from $10B-$15B in 2022-2023, and will boost its 1,000 staff there to 4,000 (Nikkei Asia)
Source: TechmemePublished: May 27, 2026

Nikkei Asia : Jensen Huang says Nvidia spends $100B-$150B per year on its Taiwan supply chain, up from $10B-$15B in 2022-2023, and will boost its 1,000 staff there to 4,000 —  TAIPEI — Nvidia is now spending up to $150 billion a year on its supply chain partners in Taiwan and plans to increase its headcount in the …

Datacurve releases the DeepSWE coding benchmark, a 113-task test across 91 open-source repositories: GPT-5.5 leads at 70%, GPT-5.4 got 56%, and Opus 4.7 got 54% (Michael Nuñez/VentureBeat)
Source: TechmemePublished: May 27, 2026

Michael Nuñez / VentureBeat : Datacurve releases the DeepSWE coding benchmark, a 113-task test across 91 open-source repositories: GPT-5.5 leads at 70%, GPT-5.4 got 56%, and Opus 4.7 got 54% —  For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same.

Xreal launches a new sub-brand, X by Xreal, and $299 a01 display glasses with micro OLED displays, a 50° FOV, and a 62g weight, set for a July release in the US (Scott Stein/CNET)
Source: TechmemePublished: May 27, 2026

Scott Stein / CNET : Xreal launches a new sub-brand, X by Xreal, and $299 a01 display glasses with micro OLED displays, a 50° FOV, and a 62g weight, set for a July release in the US —  X by Xreal is arriving in July for $299, and they look to compete with TCL's budget glasses.

07

STARTUP ARCHIVE

07.00
STARTUP ARCHIVE

Startup News - May 27, 2026

Startup News Roundup: Aggregating key funding and launch updates.

Marc Andreessen on the 5 personality traits of an innovator
Source: StartupPublished: Mar 31, 2026

“When you’re talking about real innovators—people who actually do really creative, breakthrough work—I think you’re talking about a couple things:”

Steve Jobs explains the importance of both thinking and doing
Source: StartupPublished: Mar 30, 2026

“The doers are the major thinkers. The people who really create the things that change this industry are both the thinker-doer in one person.”

Tobi Lutke explains what the VCs who passed on Shopify got wrong
Source: StartupPublished: Mar 27, 2026

“What a lot of free-market thinkers don’t understand is that between the demand and eventual supply lies friction."

Sam Altman explains how he decides to invest in a startup after 10 minutes
Source: StartupPublished: Mar 26, 2026

"Does this person have the potential to be the next Mark Zuckerberg?… [You don’t get to] 100% accuracy, obviously, but it’s good enough that our business model works.”

Jony Ive recounts the time Steve Jobs called him vain
Source: StartupPublished: Mar 25, 2026

In the clip below, Jony Ive recounts the time he asked Steve Jobs to be less harsh in his critique of a piece of work.

Jeff Bezos’s two pieces of advice for aspiring entrepreneurs
Source: StartupPublished: Mar 24, 2026

“The advice that I would give entrepreneurs is don't chase the hot new thing. It's so hard to catch something that everybody already knows is hot."

Elad Gil: “Things that work tend to work pretty fast”
Source: StartupPublished: Mar 23, 2026

“I do think there’s a bit of a myth in Silicon Valley that you should keep grinding no matter what and it’s just about perseverance, and I think that’s really bad advice."

Paul Graham on why starting with a “small, intense fire" is the key to startup growth
Source: StartupPublished: Mar 20, 2026

"You have to know who those first users are and how you're going to get them."

Keith Rabois on how to identify great talent
Source: StartupPublished: Mar 19, 2026

“What you want to do with every single employee every single day is expand the scope of their responsibilities until it breaks… and that’s the role they should stay in.”

Wealthfront CEO on why advertising spend makes it harder to find product/market fit
Source: StartupPublished: Mar 18, 2026

“The way that you know you have product/market fit is if you have exponential organic growth."

Eric Schmidt on why most companies get strategy wrong
Source: StartupPublished: Mar 17, 2026

“Work very, very hard to figure out what the world’s going to look like in five years. What will people be doing? What will your customers want? Where will costs be?"

Mark Zuckerberg: “You can’t 80/20 everything”
Source: StartupPublished: Mar 16, 2026

"There’s the famous 80/20 rule where you get 80% of the benefit by doing 20% of the work, but you can’t just 80/20 everything. There have to be certain things that you are just the best at."

Marc Andreessen on Mark Zuckerberg’s founder “superpower”
Source: StartupPublished: Mar 13, 2026

“A great superpower that Mark Zuckerberg has that is probably not well-understood enough is he does not get emotionally upset in stressful situations"

Sam Altman explains how to come up with a great startup idea
Source: StartupPublished: Mar 12, 2026

"If you start a startup without a good idea… you’ll be under pressure to make something up and it won’t work that well."

Jeff Bezos on the problems with proxies and managing to metrics
Source: StartupPublished: Mar 11, 2026

“One of the things that happens in business is that you develop certain things that you’re managing to—a typical case would be a metric. And that metric isn’t the real underlying thing.”

Airbnb founder Brian Chesky on how to design an amazing user experience
Source: StartupPublished: Mar 10, 2026

“If you can design something really amazing using the hand-crafted part of your brain, then you can reverse-engineer how to industrialize this millions of times over."

Spencer Rascoff: "I will never invest in a consumer startup with paid marketing”
Source: StartupPublished: Mar 9, 2026

"If you’re actually trying to grow a product, the best levers for doing that are often within the product itself.”

Patrick Collison explains why it sometimes make sense to quit
Source: StartupPublished: Mar 6, 2026

“One thing I’ve learned myself the hard way, is that it is easier to tear down a company and restart it in Silicon Valley, than it is to constantly try to pivot or keep something alive."

Jeff Bezos recounts the time he called Amazon’s customer service number mid-meeting to prove a metric was wrong
Source: StartupPublished: Mar 5, 2026

“I have a saying, which is when the data and the anecdotes disagree, the anecdotes are usually right"

Ben Horowitz: “Nobody was born a great manager. It’s a very unnatural job.”
Source: StartupPublished: Mar 4, 2026

“If you can’t build a great product, it doesn’t matter if you can build a great company.”

03

ALSO TODAY

3 MORE SOURCES
08

SOLIDOT

08.00
SOLIDOT

Solidot News - May 27, 2026

Solidot Feed: Highlighting essential tech & open-source news.

Google 转型 AI 搜索之后 DuckDuckGo 安装量上涨最高三成

Google 上周宣布将大幅更改搜索功能,把搜索框改为 AI 聊天机器人的对话框,此举立即在用户中间引发了强烈反对。一部分批评者认为这将杀死开放 Web,一部分人担心 AI overviews 会展示错误的答案,且剥夺了不想要 AI 的用户的控制权。部分用户因此转向了替代搜索 DuckDuckGo。DuckDuckGo 称,其美国应用在 5 月 20 日-25 日期间的安装量周环比平均增长 18.1%,安装量增势持续了六天,5 月 25 日达到最高的 30.5%。而在 iOS 平台上,安装量周环比平均增长 33%,最高 69.9%。不展示 AI 结果的 noai.duckduckgo.com 访问量周环比平均增长 22.7%,5 月 24 日最高 27.7%。DuckDuckGo 高管 Kamyl Bazbaz 称用户想要选择权。

Dropbox 创始人卸任 CEO 一职

Dropbox 创始人 Drew Houston 周二通知员工他将卸任 CEO 一职改任执行董事长,联席 CEO Ashraf Alkarmi 将成为唯一的 CEO。Houston 是在 24 岁创办了 Dropbox,担任 CEO 长达 19 年,帮助开创了云存储市场,与巨头 Google 和苹果展开直接竞争。但他领导下的 Dropbox 未能走向巅峰,其市值比上市时的峰值跌去了一半。Dropbox 在最新的季度财报中表示其付费用户逾 1800 万,其云存储服务仍然深受媒体专业人士、平面设计师、建筑师以及其他日常工作中需要共享文件和照片的人士的欢迎。Dropbox 2017 年年收入突破 10 亿美元,四年后突破 20 亿美元,但过去两年收入基本持平,2025 年略有下降。

奇怪的语言错误或有助于识别论文工厂的论文

Medical Evidence Project 项目的 James Heathers 在世界科研诚信大会上报告称,一种简单的寻找语言错误的方法,有助于识别出由“论文工厂”炮制出来的虚假研究论文。Heathers 是在去年萌生的这一想法。当时有人给他发来十几篇看起来极为相似的医学论文,希望他能够找出其中的问题所在。Heathers 花了两天时间阅读这些论文,并注意到一些奇怪但常见的拼写错误、语法错误和用词。例如“Kolmogorovor 信息复杂度”拼写错了数学家 Andrey Kolmogorov 的姓氏;还有多篇论文出现不规范表述,如“5毫升含凝胶生物化学试管”,Heathers 形容这种表达“像是外星人写的”。这类语言错误可能只是非英语母语作者的失误,本身不足以判定论文造假。但Heathers 在 Google 学术平台检索上述特殊表述后,又发现了约 200 篇论文与最初那十几篇论文具有相同的特征——不仅主题一致,研究设计、图表样式等细节特征也高度重合。他认为,从统计学角度看,这种情况几乎不可能发生,除非它们都来自同一源头。Heathers 推测,这些论文都是同一篇论文的不同版本,由论文工厂批量伪造、翻新后,出售给那些急于增加论文发表数量的科学家。

荷兰阻止美国公司收购其重要数字供应商

针对美国 IT 巨头 Kyndryl 拟收购荷兰云服务商 Solvinity 的交易,荷兰政府最终决定阻止收购。Solvinity 托管了荷兰的在线身份平台 DigiD,因此交易引发了 DigiD 数据被美国控制和索取的担忧。荷兰数字经济国务秘书 Willemijn Aerdts 周二致函荷兰议会,负责审查投资的机构认为此次收购“可能对公共利益构成风险”,建议政府阻止此次收购。政府随后采纳了建议。Kyndryl 在一份声明中对荷兰政府的决定表示极度失望。

教宗的首份通谕被怀疑部分是在 AI 帮助下撰写的

教宗良十四世发布了其首道通谕《伟大的人类(Magnifica Humanitas)》,谈论了在 AI 时代守护人类。但这篇 通谕被质疑部分是在 AI 帮助下撰写的。AI 检测工具 Pangra 的分析显示,部分段落有 40% 到 100% 的概率是由 AI 撰写的,大部分段落则没有使用 AI。以前发布的通谕没有发现使用 AI 的痕迹。根据文本和间接证据判断,所使用的 AI 很可能是 Anthropic 的 Claude。而这份通谕的一位顾问是 Anthropic 联合创始人 Christopher Olah。

维基媒体基金会解雇工会组织者引发社区抗议

维基媒体基金会在五月中旬解雇了 MediaWiki 资深首席开发者 Brooke Vibber,5 月 21 日解散了 Community Tech 团队,五名工程师和一名经理全部离职。他们多数人都是工会组织者。Brooke Vibber 于 2003 年初担任 MediaWiki 项目的首席开发者,维基百科就运行在 MediaWiki 之上,她是维基媒体基金会聘用的第一位全职员工,也是首位 CTO,她被认为是少数深入理解系统技术底层的资深开发者。而 Community Tech 团队旨在通过 Community Wishlist 实现社区志愿者们想要的功能。维基媒体基金会此举立即引发了志愿者的抗议,社区志愿者准备采取罢工等集体行动。这是首次志愿者与基金会员工联合发起声援行动。名叫 Femke 的管理员认为一个致力于造福社会的组织,不应该在没有工会的情况下运作。维基媒体基金会拥有 2.966 亿美元的储备金,足以支付 17.1 个月的运营支出。而工会 Wiki Workers United 只要求:领导层对员工和社区保持透明和负责;决策前倾听员工对年度规划的建议;告别朝令夕改的招人、辞退与晋升乱象,等等,相当的温和。

伊朗逐步恢复全球联网

在切断网络近三个月之后,伊朗逐步恢复全球联网。伊朗第一副总统 Mohammad Reza Aref 周二通过其 X 账号宣布了这一消息。网络监视组织 Netblocks 和 Kentik 都报告伊朗网络从 13:00 GMT 开始逐步恢复,但大部分网络尚未恢复。这次断网始于 2 月 28 日,是全球历史上持续时间最长的断网事件之一。Netblocks 的研究主管 Isik Mater 称,有迹象表明伊朗对互联网的过滤比之前更严格,WhatsApp 等消息应用被额外过滤。

美国 14 州实施堕胎禁令后妊娠相关死亡增加 9.2%

2021 年美国德州通过法案禁止孕妇在妊娠约 6 周后堕胎。2022 年美国最高法院在 Dobbs v. Jackson Women’s Health Organization 一案中裁决宪法未赋予公民堕胎权,因此推翻了 1973 年的 Roe v. Wade 案。截至 2026 年初美国有 13 个州全面禁止堕胎,7 个州禁止孕妇妊娠 22 周后堕胎。严格堕胎禁令被认为会增加妊娠相关死亡率。发表在《American Journal of Public Health》期刊上的一项研究调查了严格堕胎禁令对孕妇健康的影响。结果显示,在 14 个严格禁止堕胎和禁止妊娠 6 周后堕胎的州,妊娠相关死亡比预期高 9.2%。

在内存天价时代 Meta 更新了 CacheLib 项目

Meta 在 2021 年开源了缓存引擎 CacheLib,该项目旨在利用非易失性存储器作为缓存去扩展服务,以抵消不断上涨的 DRAM 成本。该项目在 2024 年 6 月之后就停止了更新,但在 2026 年 5 月 25 日 Meta 再次释出了更新——而今天由于 AI 热 DRAM 价格相比 2021 年几乎是天价。

座头鲸迁徙距离超过 1.5 万公里

科学家首次记录了一次非凡的鲸类迁徙壮举,证实两头座头鲸在澳大利亚东部和巴西的繁殖地之间,穿越了超过 1.4 万公里的海洋。研究人员通过对比数万张座头鲸尾鳍的图像来辨认这些鲸。每头鲸的尾鳍都有独特的斑纹,这使得研究人员能长期追踪并识别个体。2007 年,一头座头鲸在澳大利亚昆士兰州的赫维湾首次被拍到。2013年,它再次出现在同一海域,随后于 2019 年现身巴西圣保罗附近。这些繁殖地之间的最短直线距离约为1.42万公里。第二头座头鲸更令人惊叹。研究人员于2003年首次在巴西阿布洛霍斯礁群——该国主要的座头鲸繁殖地,拍摄到了它的身影。当时它正与由9头成年鲸组成的活跃群体一起游弋。22年后的2025年9月,同一头鲸被发现在澳大利亚赫维湾独自游弋。两次目击地之间的距离达 1.51 万公里,这创下了单头座头鲸已知最远迁徙距离的新纪录。研究基于19283张高质量的鲸照片,这些照片拍摄于1984年至2025年间,采集自澳大利亚东部和拉丁美洲。这些图像既来自专业研究人员,也来自通过全球鲸追踪平台“Happywhale”参与的公民科学家。

英国皇家医学院学会认为社媒和香烟一样不利于青少年健康

英国皇家医学院学会在递交给政府的咨询意见书中表示,社交媒体的使用与吸烟一样对年轻人的健康构成威胁。医生在接诊年轻患者时,应例行询问他们的屏幕时间和社交媒体使用情况。英国政府正在考虑的一项措施是禁止 16 岁以下儿童使用社交媒体,类似澳大利亚的做法。其它可能采取的限制包括宵禁,或禁用自动播放和无限滚动等功能。儿童精神科医生 Emily Sehmer 认为过度使用社媒的危害远甚于吸烟,因为儿童只需几秒钟就会接触到有害内容。

Uber COO 称愈来愈难以证明最大化词元花的钱是合理的

Uber 高管表示 AI 上支出并没有带来相应的回报。Uber COO Andrew Macdonald 上周六接受采访时表示愈来愈难以证明最大化 AI 词元花的钱是合理的。而在上个月的一次采访中 Uber CTO Praveen Neppalli Naga 告诉 The Information,该公司已经用完了 2026 年的 Claude Code 预算。Macdonald 称,通过与工程主管的交流,他认识到更高的 AI 词元使用量并没有转变为消费者功能的相应增加。他说 AI 带来的权衡成本愈来愈难以证明支出是合理的。

JAXA 等成功测试五马赫冲压发动机

JAXA、早稻田大学、东京大学和庆应义塾大学的工程师团队成功完成了为五马赫高超音速飞机设计的冲压式发动机的地面燃烧试验。冲压发动机利用了发动机的前向运动来压缩空气,不使用带有可旋转叶片的压气机,它无法在空速为零的时候产生推力,需要先加速到超音速。在测试中,一架实验飞机被安装在 JAXA 角田宇宙中心的风洞中,模拟约 25 公里高空的环境条件。在五马赫的飞行速度下,机头和前缘周围的空气温度会超过 1000 摄氏度,为应对高温,工程师设计了一套先进的热防护系统,使飞机内部温度保持在接近正常工作温度的范围内,保证机载航空电子设备和控制电子设备的正常运行。JAXA 接下来计划将实验飞行器搭载在探空火箭上尝试实际飞行,它的目标是到 2040 年代实现商业高超音速客运服务。

BepiColombo 计划于 11 月 21 日进入水星轨道

欧洲 ESA 和日本 JAXA 合作的水星探索项目 BepiColombo 以意大利数学家 Giuseppe Colombo 的名字命名,探测器于 2018 年 10 月发射,原计划在六次飞掠水星之后于 2025 年 12 月进入水星轨道。但第四次飞掠水星前推进器出现故障,地面任务规划人员不得不修订时间表。JAXA 通过其社交媒体账号宣布了最新的日期:BepiColombo 计划于 11 月 21 日进入水星轨道。BepiColombo 包含三个组件:ESA 的水星转移模块和水星行星轨道器,以及 JAXA 的水星磁层轨道器。JAXA 的轨道器分离时间定在 12 月 10 日。BepiColombo 是人类第三次水星探测任务,前两次是 1973 年的 Mariner 10 和 2004 年的 Messenger。水星是太阳系最小密度最高的行星,由于温度非常高,ESA 的轨道器安装了上百公斤的隔热材料。

加州年龄验证法律将豁免大部分 Linux 发行版

加州年龄验证法律的修正案将豁免大部分 Linux 发行版和自由开源软件。年龄验证法律要求操作系统提供商在设置询问用户年龄。该法案的修改版 AB-1856 缩小了适用的操作系统提供商和应用程序的范围:(2) “操作系统提供商”不包括在许可条款允许接收方复制、重新分发和修改该软件的情况下,分发操作系统或应用程序的个人或实体。(2) “应用程序”不包括其本身并未作为独立可执行应用程序、通过受监管的应用程序商店向消费者提供的软件组件。Valve 的 SteamOS 平台仍然受到影响,因为它的 Steam 客户端是受监管的应用商店。

2025 年中亚经历了创纪录的冰川损失

中亚的冰川是生活在下游地区的数百万人的重要水源。一项新研究发现 2025 年中亚经历了创纪录的冰川损失。冰川加速消融可能会在短期内增加融水,但最终由于冰量的减少融水也会减少。研究人员利用对天山和帕米尔高原 16 座冰川的实地观测数据,结合模型,估算出中亚冰川在一年内损失了约 30 立方千米的冰,相当于该地区冰川总体积的近 2%。这一结果是异常温暖的春夏季气温以及降雪频率的大幅下降造成的。16 座冰川有 9 座经历了有史以来最严重的冰川质量损失,帕米尔高原西部和天山山脉西部的冰川消融最为严重,部分冰川在一年内损失了 2%-4% 的总冰量。64% 的冰川经历了自 1991 年以来最严重的冰川质量损失。研究人员警告由于全球暖化,这种情况可能成为常态。

摩托罗拉手机劫持亚马逊应用植入联盟营销推广码

用户通过社交媒体报告,摩托罗拉手机预装的一个应用 Smart Feed 在更新之后开始劫持亚马逊应用植入联盟营销推广码获取佣金。非常奇怪的是,推广码 sramz-kff-008-20 指向的是一名时尚博主“@kirasfashionfinds”,也就是佣金给的不是 Smart Feed 而是这位博主。暂时不清楚究竟发生了什么。受影响的用户可通过禁用 Smart Feed 关闭推广码,方法是:Settings > Apps > 搜索定位到 “Smart Feed” > Disable。

教宗呼吁不可用 AI 作恶

教宗良十四世颁布了其首道通谕,呼吁世人不可用人工智能来作恶,切莫把人工智能当成「掌控、排斥或死亡的工具」。 教会长期支持核裁军,称这是「为人类大家庭和平与尊严的服务」。同样地,人工智能今天也不可用于作恶,这就「如同核能那样,必须用来为所有的人和公共福祉效劳」。「关于科技的决定绝对是与良心和责任密不可分的」。「和平不只是没有战争,更是正义伸张。然而,当科技削弱我们的批判意识时,和平本身就会陷入危险。无论如何,光是解除武装仍有所不足,我们还必须进行建设。」

欧洲执法部门黑进 VPN 服务识别勒索组织用户

欧洲刑警组织披露,他们黑进了被网络犯罪分子使用的 VPN 服务“First VPN”,访问了用户数据库,识别了数千用户身份。First VPN 的网站已经显示被执法部门扣押的信息,它过去曾在俄语网络犯罪论坛上打广告,宣称能隐藏用户的 IP 地址,加密所有通信,不记录任何日志。它还声称将拒绝与司法机关合作,其服务不受任何司法管辖,且不会存储任何用户数据。First VPN 的活动始于 2014 年,在 27 个国家/地区提供了 32 个出口节点服务器。至少有 25 个勒索软件组织利用了其基础设施进行网络侦察和入侵。警方搜查了该服务管理员在乌克兰的住所,拆除了 33 台服务器。

HBM 成本占到了 AI 芯片组件成本的三分之二

对英伟达、AMD、Google 和亚马逊四家公司的 AI 芯片的分析显示,HBM 内存芯片成本占到了 AI 芯片组件成本的三分之二(63%),逻辑芯片占 13%,先进封装占 15%,辅助组件占 9% 。四家公司在 HBM 上的支出从 2024 年的约 120 亿美元增至 2025 年的 320 亿美元,增速远超其它芯片组件。随着内存芯片供应持续紧张且价格上涨,HBM 在 2026 年的市场份额可能会进一步扩大。超大规模数据中心运营商在其资本支出预期中已经预见到这一点:微软 2026 财年 1900 亿美元的资本支出预期中,约有 250 亿美元来自组件价格上涨;Meta 将其 2026 年资本支出预期上调了 100 亿美元,理由同样是组件价格上涨。

09

APP STORE RANK

09.00
APP STORE RANK
FETCHING · APP STORE RANK