Curated by Shen Huang · 90 stories · ~14 min read
DIGEST · 2026-05-21

OrangeBot.AI Digest — 2026-05-21

90 headlines across 8 sources, aggregated for this day.

Hacker News(15)

  1. BBEdit 16 (www.barebones.com)
  2. Seattle Shield, an intelligence-sharing network operated by the Seattle police (prismreports.org)
  3. Was my $48K GPU server worth it? (rosmine.ai)
  4. Project Hail Mary – Stellar Navigation Chart (valhovey.github.io)
  5. Waymo pauses Atlanta service as its robotaxis keep driving into floods (techcrunch.com)
  6. Indexing a year of video locally on a 2021 MacBook with Gemma4-31B (50GB swap) (blog.simbastack.com)
  7. Google's Antigravity bait and switch (www.0xsid.com)
  8. A Bipartisan Amendment Would End Police License Plate Tracking Nationwide (www.wired.com)
  9. US employers spend more than $1.5B a year to fight labor unions, report finds (www.theguardian.com)
  10. AI is just unauthorised plagiarism at a bigger scale (axelk.ee)
  11. Shunning AI is the human choice (www.thehandbasket.co)
  12. Lost Images from the 1945 Trinity Nuclear Test Restored (spectrum.ieee.org)
  13. Flipper One – we need your help (blog.flipper.net)
  14. Python 3.15: features that didn't make the headlines (blog.changs.co.uk)
  15. Throwing AI-generated walls of text into conversations (noslopgrenade.com)

GitHub Trending(15)

  1. anthropics / claude-plugins-official
  2. colbymchenry / codegraph
  3. multica-ai / andrej-karpathy-skills
  4. dotnet / skills
  5. obra / superpowers
  6. HKUDS / CLI-Anything
  7. rmyndharis / OpenWA
  8. ChromeDevTools / chrome-devtools-mcp
  9. rohitg00 / ai-engineering-from-scratch
  10. teng-lin / notebooklm-py
  11. can1357 / oh-my-pi
  12. antoinezambelli / forge
  13. multica-ai / multica
  14. Imbad0202 / academic-research-skills
  15. trimstray / the-book-of-secret-knowledge

Product Hunt(15)

  1. Tycoon AI

    Run one-person companies entirely with AI agents

  2. AutoSubtitles 2.0

    AI subtitles & animated captions with faster editing

  3. Basedash Skills

    Reusable AI instructions for every Basedash surface.

  4. AlliHat

    Claude AI in your Safari sidebar

  5. Mixpanel Headless

    Programmatic access to product analytics for agents and devs

  6. Novi Notes 1.1

    A local AI memory layer for your Mac

  7. WeWeb 3.0

    Vibe-code apps with the safety net of a no-code editor

  8. CatchAll by NewsCatcher

    Build any dataset from the web. Filtered to your criteria.

  9. Visual Usability Checker

    Validate your design decisions instantly with AI insights

  10. Framed

    Turn screenshots, videos, and code into polished visuals

  11. TongueType for macOS

    Local dictation for macOS without the subscription

  12. Ente Locker

    Shared vault for your most important documents

  13. Tacet

    The brain monitor for cognitive health scores

  14. Mintlify Workflows

    Self-updating knowledge bases

  15. Slideshot

    Product demo videos, recorded by your AI agent

Hugging Face(15)

  1. Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

    Despite rapid advances in automatic speech recognition (ASR) and large audio-language models, robust recognition in real-world environments remains limited by an "acoustic robustness bottleneck": models often lose acoustic grounding and produce omissions or hallucinations under severe, compositional distortions. We propose Mega-ASR, a unified ASR-in-the-wild framework that combines scalable compound-data construction with progressive acoustic-to-semantic optimization. We introduce Voices-in-the-Wild-2M, covering 7 classic acoustic phenomena and 54 physically plausible compound scenarios, and train Mega-ASR with Acoustic-to-Semantic Progressive Supervised Fine-Tuning and Dual-Granularity WER-Gated Policy Optimization. Extensive experiments demonstrate that Mega-ASR achieves significant advantages over prior state-of-the-art systems on adverse-condition ASR benchmarks (45.69% vs. 54.01% on VOiCES R4-B-F, and 21.49% vs. 29.34% on NOIZEUS Sta-0). On complex compositional acoustic scenarios, Mega-ASR further delivers over 30% relative WER reduction against strong open- and closed-source baselines, establishing a scalable paradigm for robust ASR in-the-wild.

  2. Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

    Recent advances in multimodal large language models have driven growing interest in graphical user interface (GUI) agents, yet their generalization remains constrained by the scarcity of large-scale training data spanning diverse real-world applications. Existing datasets rely heavily on costly manual annotations and are typically confined to narrow domains. To address this challenge, we propose Video2GUI, a fully automated framework that extracts grounded GUI interaction trajectories directly from unlabeled Internet videos. Video2GUI employs a coarse-to-fine filtering strategy to identify high-quality GUI tutorial videos and convert them into structured agent trajectories. Applying this pipeline to 500 million video metadata entries, we construct WildGUI, a large-scale dataset containing 12 million interaction trajectories spanning over 1,500 applications and websites. Pre-training Qwen2.5-VL and Mimo-VL on WildGUI yields consistent improvements of 5-20% across multiple GUI grounding and action benchmarks, matching or surpassing state-of-the-art performance. We will release both the WildGUI dataset and the Video2GUI pipeline to support future research of GUI agents.

  3. Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

    Without incurring significant computational overhead, train-free long video generation aims to enable foundation video generation models to produce longer videos. Frame-level autoregressive frameworks, e.g., FIFO-diffusion, offer the advantage of generating infinitely long videos with constant memory consumption. However, the mismatch between training and inference, coupled with the challenge of maintaining long-term consistency, limits the effective utilization of foundation models. To mitigate these concerns, we propose MIGA, a novel infinite-frame long video generation method. Firstly, we propose an effective two-stage alignment mechanism that mitigates the training-inference gap by reducing the excessive noise span fed to the model. We then introduce an innovative dual consistency enhancement mechanism, where the self-reflection approach corrects early high-noise frames and the long-range frame guidance approach leverages later low-noise frames with broad coverage to steer generation, jointly improving temporal consistency. Extensive experiments on VBench and NarrLV demonstrate the state-of-the-art performance of MIGA. Our project page is available at https://xiaokunfeng.github.io/miga_homepage/.

  4. You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

    Reinforcement learning with verifiable rewards (RLVR) has become a dominant paradigm for improving reasoning in large language models (LLMs), yet the underlying geometry of the resulting parameter trajectories remains underexplored. In this work, we demonstrate that RLVR weight trajectories are extremely low-rank and highly predictable. Specifically, we find that the majority of downstream performance gains are captured by a rank-1 approximation of the parameter deltas, where the magnitude of this projection evolves near-linearly with training steps. Motivated by this, we propose a simple and compute-efficient method RELEX (REinforcement Learning EXtrapolation), which estimates the rank-1 subspace from a short observation window and extrapolates future checkpoints via linear regression, with no learned model required. Across three models (i.e., Qwen2.5-Math-1.5B, Qwen3-4B-Base, and Qwen3-8B-Base), RELEX produces checkpoints that match or exceed RLVR performance on both in-domain and out-of-domain benchmarks, requiring as few as 15% steps of full RLVR training. Remarkably, RELEX is able to extrapolate far beyond the observation window at no training cost, predicting checkpoints up to 10-20times beyond the observed prefix with continued improvement (e.g., observe only the first 50 steps and extrapolate to 1000 steps). Our ablation analysis confirms the minimalist sufficiency of RELEX: neither increasing the subspace rank nor employing non-linear modeling yields further gains in extrapolation. Finally, we show that RELEX's success stems from a "denoising" effect: by projecting updates onto the rank-1 subspace, the model discards stochastic optimization noise that would otherwise degrade performance during extrapolation. Our code is available at https://github.com/weizhepei/RELEX.

  5. IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

    Multimodal large language models (MLLMs) have shown remarkable capability in bridging visual perception and textual reasoning, enabling zero-shot understanding across diverse industrial scenarios. However, their performance in open-vocabulary industrial anomaly detection (IAD) is often limited by domain-misaligned reasoning and hallucinated structural inferences. To address these challenges, we propose IndusAgent, a tool-augmented agentic framework for open-vocabulary IAD. Specifically, we first construct Indus-CoT, a structured dataset that integrates global visual observations, high-resolution local patches, and expert normalcy priors, providing supervision for fine-tuning the model on rigorous industrial inspection trajectories. Building on this, IndusAgent dynamically orchestrates a set of external tools, including dynamic region cropping, high-frequency feature enhancement, and prior retrieval, thus enabling the agent to actively resolve visual ambiguities and disentangle subtle anomalies. Furthermore, we introduce a gated reinforcement learning objective that jointly optimizes anomaly classification, localization accuracy, anomaly type reasoning, and efficient tool usage, ensuring that tool invocation occurs only when beneficial. Extensive evaluations on five industrial anomaly benchmarks, including MVTec-AD, VisA, MPDD, DTD, and SDD, demonstrate that IndusAgent achieves state-of-the-art zero-shot performance among all existing methods, validating our robustness and generalization capacity.

  6. OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

    The rapid advancement toward long-context reasoning and multi-modal intelligence has made the memory footprint of the Key-Value (KV) cache a dominant memory bottleneck for efficient deployment. While the established per-channel quantization effectively accommodates intrinsic channel-wise outliers in Key tensors, its efficacy diminishes under extreme compression. In this work, we revisit the inherent limitations of the per-channel quantization paradigm from both empirical and theoretical perspectives. Our analysis identifies Token Norm Imbalance (TNI) as the primary bottleneck to quantization fidelity. We demonstrate that TNI systematically amplifies errors when shared quantization parameters are required to span token groups exhibiting substantial norm disparities. Instead of relying on intricate quantization pipelines (e.g., TurboQuant), we propose OScaR (Omni-Scaled Canalized Rotation), an accurate and lightweight KV cache compression framework for X-LLMs (i.e., text-only, multi-modal, and omni-modal LLMs). Advancing the per-channel paradigm, OScaR employs Canalized Rotation followed by Omni-Token Scaling to mitigate TNI-induced sequence-dimensional variance both effectively and efficiently, further supported by our optimized system design and CUDA kernels. Extensive evaluations across X-LLMs show that OScaR consistently outperforms existing methods and achieves near-lossless performance under INT2 quantization, establishing it as a robust, low-complexity, and universal framework that defines a new Pareto front. Compared with the BF16 FlashDecoding-v2 baseline, our OScaR implementation achieves a notable up to 3.0x speedup in decoding, reduces memory footprint by 5.3x, and increases throughput by 4.1x. The code for OScaR is publicly available at https://github.com/ZunhaiSu/OScaR-KV-Quant.

  7. A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook

    The foundational capabilities established by Large Language Models (LLMs) have paved the way for Multimodal Large Language Models (MLLMs), within which Large Audio Language Models (LALMs) are essential for realizing universal auditory intelligence. Despite their remarkable performance, the escalation of LALMs' capabilities has significantly outpaced the development of systemic frameworks to ensure their trustworthiness. This survey provides a comprehensive investigation into the endogenous mechanisms of LALMs, detailing the architectural innovations and alignment algorithms that facilitate emergent reasoning. Specifically, we analyze how the transition to unified end-to-end frameworks and the integration of continuous acoustic signals inherently expand the attack surface. To rigorously evaluate the risks within these paradigms, we establish a comprehensive taxonomy of trustworthiness, categorizing critical vulnerabilities such as cross-modal jailbreaking, latent acoustic backdoors, and biometric privacy leakage. We review the state-of-the-art through six analytical pillars: hallucination, robustness, safety, privacy, fairness, and authentication. The profound imbalance between a mature offensive landscape and underdeveloped defenses further validates the critical trustworthiness gaps and multidimensional risks facing audio-centric intelligence. Finally, we propose a strategic roadmap advocating for "Defense-in-Depth" architectures, causal auditory world modeling, and intrinsic representation engineering to bridge the gap between empirical performance and intrinsically trustworthy audio intelligence. Our project has been uploaded to GitHub https://github.com/Kwwwww74/Awesome-Trustworthy-AudioLLMs.

  8. It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

    Contextual Integrity (CI) defines privacy not merely as keeping information hidden, but as governing information flows according to the norms of a given context. As large language models are increasingly deployed as personal agents handling sensitive workflows, adhering to CI becomes critical. However, even frontier models remain unreliable in making disclosure decisions, and existing mitigation strategies often degrade underlying task performance. To overcome this privacy-utility trade-off, we propose SELFCI, a complementary self-distillation framework that decouples information suppression from task resolution. SELFCI jointly optimizes two independent reverse KL divergences over distinct teacher distributions derived from feedback: one encourages preserving task-relevant information for utility, while the other enforces minimal and appropriate disclosure. This complementary formulation induces a Product-of-Experts (PoE) target, aligning the policy with the intersection of capability and privacy requirements. Empirical evaluations demonstrate that SELFCI, without relying on costly external supervision, consistently outperforms competitive baselines such as online reinforcement learning algorithms (e.g., GRPO). These trends further extend to out-of-domain settings involving agentic workflows and accumulated private context, suggesting that SELFCI provides a practical path toward CI alignment.

  9. Toto 2.0: Time Series Forecasting Enters the Scaling Era

    We show that time series foundation models scale: a single training recipe produces reliable forecast-quality improvements from 4M to 2.5B parameters. We release Toto 2.0, a family of five open-weights forecasting models trained under this recipe. The Toto 2.0 family sets a new state of the art on three forecasting benchmarks: BOOM, our observability benchmark; GIFT-Eval, the standard general-purpose benchmark; and the recent contamination-resistant TIME benchmark. This report describes our experimental results and details the design decisions behind Toto 2.0: its architecture and training recipe, training data, and the u-muP hyperparameter transfer pipeline. All five base checkpoints are released under Apache 2.0.

  10. Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

    LLM agents have recently emerged as a powerful paradigm for solving complex tasks through planning, tool use, memory retrieval, and multi-step interaction. However, these agentic workflows often introduce substantial input-side overhead, making the compute-intensive prefilling stage a key bottleneck in long-context, multi-turn inference. In this work, we propose Mix-Quant, a simple and effective phase-aware quantization framework for fast agentic inference. We first investigate FP4 quantization in agentic LLM workflows and observe that quantizing the entire inference process can incur significant performance degradation. In contrast, the prefilling stage exhibits substantial quantization redundancy and can therefore be quantized with minimal accuracy loss, despite being the dominant source of computation. Based on this insight, we apply high-throughput NVFP4 quantization to the prefilling phase while preserving BF16 precision for decoding. By decoupling prefilling acceleration from decoding quality, Mix-Quant combines phase-aware algorithmic quantization with hardware-efficient NVFP4 execution to alleviate the inference bottleneck in LLM agents. Extensive experiments across long-context and agentic benchmarks demonstrate that Mix-Quant largely preserves task performance while delivering significant efficiency improvements, achieving up to a 3x speedup during prefilling.

  11. CutVerse: A Compositional GUI Agents Benchmark for Media Post-Production Editing

    While GUI agents have made significant progress in web navigation and basic operating system tasks, their capabilities in professional creative workflows remain largely underexplored. To bridge this gap, we introduce Cutverse, a benchmark designed to systematically evaluate autonomous GUI agents in realistic media post-production environments. We curate expert demonstrations across 7 professional applications (e.g., Premiere Pro, Photoshop), covering 186 complex, long-horizon tasks grounded in authentic editing workflows, involving dense multimodal interfaces and tightly coupled interaction sequences. To support scalable evaluation, we develop a lightweight parser that transforms raw screen recordings and low-level interaction logs into structured, compositional GUI action trajectories with precise grounding. Extensive evaluations reveal that existing agents achieve only 36.0\% task success on realistic media editing tasks, underscoring the challenges posed by complex, long-horizon media post-production workflows in our benchmark.While current models demonstrate promising spatial grounding, multimodal alignment, and coordinated action execution, they remain limited in long-horizon reliability and domain-specific planning.

  12. Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

    Currently, enhancing Unified Multimodal Models (UMMs) with image understanding, generation, and editing capabilities mainly relies on mixed multi-task training. Due to inherent task conflicts, such strategy requires complex multi-stage pipelines, massive data mixing, and balancing tricks, merely resulting in a performance trade-off rather than true mutual reinforcement. To break this paradigm, we propose Uni-Edit, an intelligent image editing task that serves as the first general task for UMM tuning. Unlike complex mixed pipelines, Uni-Edit improves performance across all three abilities at once using only one task, one training stage, and one dataset. Specifically, we first identify image editing as an inherently ideal general task, as it naturally demands both visual understanding and generation. However, existing editing data relies on simplistic instructions that severely underutilize a model's understanding capacity. To address this, we introduce the first automated and scalable data synthesis pipeline for intelligent editing, transforming diverse VQA data into complex and effective editing instructions with embedded questions and nested logic. This yields Uni-Edit-148k, pairing diverse reasoning-intensive instructions with high-quality edited images. Extensive experiments on BAGEL and Janus-Pro demonstrate that tuning solely on Uni-Edit achieves comprehensive enhancements across all three capabilities without any auxiliary operations.

  13. Generative Recursive Reasoning

    How should future neural reasoning systems implement extended computation? Recursive Reasoning Models (RRMs) offer a promising alternative to autoregressive sequence extension by performing iterative latent-state refinement with shared transition functions. Yet existing RRMs are largely deterministic, following a single latent trajectory and converging to a single prediction. We introduce Generative Recursive reAsoning Models (GRAM), a framework that turns recursive latent reasoning into probabilistic multi-trajectory computation. GRAM models reasoning as a stochastic latent trajectory, enabling multiple hypotheses, alternative solution strategies, and inference-time scaling through both recursive depth and parallel trajectory sampling. This yields a latent-variable generative model supporting conditional reasoning via p_θ(y mid x) and, with fixed or absent inputs, unconditional generation via p_θ(x). Trained with amortized variational inference, GRAM improves over deterministic recurrent and recursive baselines on structured reasoning and multi-solution constraint satisfaction tasks, while demonstrating an unconditional generation capability. https://ahn-ml.github.io/gram-website

  14. LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

    Evaluating large language models (LLMs) on natural-language logical reasoning is essential because rule-governed tasks require conclusions to follow strictly from stated premises. Many existing logical-reasoning benchmarks are generated by templating natural-language items from sampled formulas, provide only coarse or unaudited formal annotations, and are now quickly saturated by frontier reasoning models. We present LLMEval-Logic, a Chinese logical reasoning benchmark built from realistic situational scenarios. Its pipeline forward-authors and expert-audits natural-language items together with their reference formalizations, verifies annotated answers with Z3, constructs expert rubrics for natural-to-formal grading, and hardens selected items through a closed-loop adversarial workflow. The benchmark is released in two paired subsets: a 246-item Base subset shipped with 1,400 expert-developed rubric atoms, and a 190-item Hard subset with 938 multi-step sub-questions over closed model spaces. Evaluating 14 frontier LLMs on LLMEval-Logic reveals substantial gaps in current models: the best model reaches only 37.5% Hard Item Accuracy, and even with reference symbols the highest joint Z3+Rubric formalization score among evaluated models reaches only 60.16%. Our benchmark is publicly available at https://github.com/llmeval/LLMEval-Logic.

  15. HRM-Text: Efficient Pretraining Beyond Scaling

    The current pretraining paradigm for large language models relies on massive compute and internet-scale raw text, creating a significant barrier to foundational research. In contrast, biological systems demonstrate highly sample-efficient learning through multi-timescale processing, such as the functional organization of the frontoparietal loop. Taking this as inspiration, we introduce HRM-Text, which replaces standard Transformers with a Hierarchical Recurrent Model (HRM) that decouples computation into slow-evolving strategic and fast-evolving execution layers. To stabilize this deep recurrence for language modeling, we introduce MagicNorm and warmup deep credit assignment. Furthermore, instead of standard raw-text pretraining, we train exclusively on instruction-response pairs using a task-completion objective and PrefixLM masking. Serving as an empirical existence proof of efficient pretraining, a 1B-parameter HRM-Text model trained from scratch on only 40 billion unique tokens and $1,500 budget achieves 60.7% on MMLU, 81.9% on ARC-C, 82.2% on DROP, 84.5% on GSM8K, and 56.2% on MATH. Despite utilizing roughly 100-900x fewer training tokens and 96-432x less estimated compute than standard baselines, HRM-Text performs competitively with 2-7B parameter open models. These results demonstrate that co-designing architectures and objectives can radically reduce the compute-to-performance ratio, making pretraining from scratch accessible to the broader research community.

Techmeme(15)

  1. Source: Cursor reached $3B in annualized revenue in late April and now has 3,000+ customers paying at least $100K each for its software on an annualized basis (Rachel Metz/Bloomberg)

    Rachel Metz / Bloomberg : Source: Cursor reached $3B in annualized revenue in late April and now has 3,000+ customers paying at least $100K each for its software on an annualized basis —  Cursor's annualized revenue hit $3 billion in late April, according to a person familiar with the matter, indicating growing demand …

  2. Spotify closes up 13% after announcing new features and 2030 guidance, forecasting a compound annual growth rate in the mid-teens and gross margins of 35%-40% (Samantha Subin/CNBC)

    Samantha Subin / CNBC : Spotify closes up 13% after announcing new features and 2030 guidance, forecasting a compound annual growth rate in the mid-teens and gross margins of 35%-40% —  Spotify shares jumped 15% on Thursday after the music streaming platform laid out guidance for 2030 and reached an artificial intelligence deal …

  3. Zoom reports Q1 revenue up 5.5% YoY to $1.24B, vs. $1.22B est., AI Companion paid users up 184%, forecasts FY 2027 revenue above est.; ZM jumps 8%+ after hours (Brody Ford/Bloomberg)

    Brody Ford / Bloomberg : Zoom reports Q1 revenue up 5.5% YoY to $1.24B, vs. $1.22B est., AI Companion paid users up 184%, forecasts FY 2027 revenue above est.; ZM jumps 8%+ after hours —  Zoom Communications Inc. projected stronger-than-anticipated sales growth, suggesting that customers are adopting its expanded suite of office products.

  4. Take-Two reports Q4 bookings flat YoY at $1.58B, forecasts FY 2027 bookings below est., reiterates GTA VI's November 19 launch date; TTWO jumps 6%+ after hours (Zaheer Kachwala/Reuters)

    Zaheer Kachwala / Reuters : Take-Two reports Q4 bookings flat YoY at $1.58B, forecasts FY 2027 bookings below est., reiterates GTA VI's November 19 launch date; TTWO jumps 6%+ after hours —  Take-Two Interactive (TTWO.O) forecast annual bookings below Wall Street expectations on Thursday, but reiterated …

  5. Workday reports Q1 revenue up 13% YoY to $2.54B vs. $2.52B est., and lifts its full-year forecast, saying its AI strategy is working; WDAY jumps 9%+ after hours (Jordan Novet/CNBC)

    Jordan Novet / CNBC : Workday reports Q1 revenue up 13% YoY to $2.54B vs. $2.52B est., and lifts its full-year forecast, saying its AI strategy is working; WDAY jumps 9%+ after hours —  Workday shares surged as much as 14% in extended trading on Thursday after the finance and human resources software maker reported results …

  6. Texas AG Ken Paxton sues Meta, accusing WhatsApp of marketing its services as secure but failing to "deliver on those promises" by accessing encrypted messages (Ryan Autullo/Bloomberg Law)

    Ryan Autullo / Bloomberg Law : Texas AG Ken Paxton sues Meta, accusing WhatsApp of marketing its services as secure but failing to “deliver on those promises” by accessing encrypted messages —  WhatsApp is able to access user's encrypted messages, Texas said in a lawsuit that accuses Meta founder Mark Zuckerberg …

  7. Meta joins TikTok, Snap, and YouTube in settling with a Kentucky school district to avoid a trial over claims the platforms were designed to addict kids (Erin Mulvaney/Wall Street Journal)

    Erin Mulvaney / Wall Street Journal : Meta joins TikTok, Snap, and YouTube in settling with a Kentucky school district to avoid a trial over claims the platforms were designed to addict kids —  Meta, TikTok, Snap and YouTube reached deal to avoid first of more than 1,200 consolidated lawsuits by school districts

  8. Sources: Starbucks shut down an AI program for automating inventory counts, nine months after deploying it, after it frequently miscounted and mislabeled items (Waylon Cunningham/Reuters)

    Waylon Cunningham / Reuters : Sources: Starbucks shut down an AI program for automating inventory counts, nine months after deploying it, after it frequently miscounted and mislabeled items —  Starbucks (SBUX.O) terminated an AI program workers used for automating certain inventory counts this week …

  9. Modal Labs, which offers a serverless cloud platform to build AI apps and run AI inference, raised a $355M Series C at a $4.65B valuation, up from $1.1B in 2025 (Deepa Seetharaman/Reuters)

    Deepa Seetharaman / Reuters : Modal Labs, which offers a serverless cloud platform to build AI apps and run AI inference, raised a $355M Series C at a $4.65B valuation, up from $1.1B in 2025 —  AI startup Modal raised $355 million in a new round of financing, valuing the company at $4.65 billion, CEO Erik Bernhardsson told Reuters.

  10. Gavin Newsom signs an EO mandating state agencies work with the AI industry and others to study subsidies for companies that don't replace workers with AI (Cecilia Kang/New York Times)

    Cecilia Kang / New York Times : Gavin Newsom signs an EO mandating state agencies work with the AI industry and others to study subsidies for companies that don't replace workers with AI —  Gov. Gavin Newsom issued an executive order to explore an overhaul of labor policies to deal with potential mass job displacement from artificial intelligence.

  11. Sources: OpenAI generated ~$5.7B in revenue in Q1, ~$1B more than Anthropic; its adjusted operating income margin was -122%, and ChatGPT user growth stalled (Sri Muppidi/The Information)

    Sri Muppidi / The Information : Sources: OpenAI generated ~$5.7B in revenue in Q1, ~$1B more than Anthropic; its adjusted operating income margin was -122%, and ChatGPT user growth stalled —  OpenAI generated about $5.7 billion in revenue in the first quarter, nearly $1 billion more than archrival Anthropic generated in the same period …

  12. Sources: the EU will propose temporarily lifting sanctions, imposed in April, on a major Chinese semiconductor supplier after automakers warned of shortages (Alberto Nardelli/Bloomberg)

    Alberto Nardelli / Bloomberg : Sources: the EU will propose temporarily lifting sanctions, imposed in April, on a major Chinese semiconductor supplier after automakers warned of shortages —  The European Union will propose temporarily lifting sanctions on a major Chinese semiconductor supplier after automakers warned …

  13. August Robotics, which makes autonomous robots for construction and industrial applications, raised $30M led by Big Pi Ventures (Kyt Dotson/SiliconANGLE)

    Kyt Dotson / SiliconANGLE : August Robotics, which makes autonomous robots for construction and industrial applications, raised $30M led by Big Pi Ventures —  August Robotics Ltd., a robotics automation company for construction and industrial applications, today announced it raised $30 million in new funding led by Big Pi Ventures.

  14. Apple plans to broadcast an MLS game on Saturday shot entirely on 15 iPhone 17 Pros, the first major live sports event to be captured using only smartphones (Todd Spangler/Variety)

    Todd Spangler / Variety : Apple plans to broadcast an MLS game on Saturday shot entirely on 15 iPhone 17 Pros, the first major live sports event to be captured using only smartphones —  Apple has frequently touted the use of professional-grade iPhones by filmmakers to make shorts, commercials and even feature-length movies.

  15. Source: smart ring maker Oura filed confidentially for a US IPO, set for later in 2026; SF- and Finland-based Oura had an $11B valuation in September 2025 (Mark Gurman/Bloomberg)

    Mark Gurman / Bloomberg : Source: smart ring maker Oura filed confidentially for a US IPO, set for later in 2026; SF- and Finland-based Oura had an $11B valuation in September 2025 —  Smart Ring Maker Oura Files Confidentially for IPO  —  Video Player is loading.  —  Unmute  —  Current Time 0:00 Loaded: 0.00% Playback Rate

Solidot(15)

  1. Google 宣布在 AI 模式下加入更多广告

    Google 本周二宣布搜索框将变成 AI 聊天机器人的对话框,那么它久经时间考虑的商业模式——搜索广告——自然也会跟着进入 AI 模式。Google 周三宣布将在 AI 模式中引入更多“富有帮助的广告(helpful ads)”。搜索巨人表示在测试两类新广告,提供相关产品的细节和有用的指导。作为广告的一部分,它们都会包含一个独立的 AI 解释器。广告也都会标明“赞助”字样。两类新广告其一称之为“对话式发现广告”——广告即答案;其二称之为“高亮答案”(Highlighted Answers)——将高度相关的广告作为推荐列表的一部分提供给用户。

  2. NASA 预计中国将在 2027 年执行载人绕月飞行任务

    NASA 局长 Jared Isaacman 表示他预计中国将在 2027 年执行载人绕月飞行任务,他正以此为由要求修改阿尔忒弥斯计划,加快美国重返月球的步伐。Isaacman 称,下次全世界观看宇航员绕月飞行时——很可能是 2027 年的某个时候——他们将是中国宇航员,美国将不再是唯一能将人类送入月球环境的国家。中国尚未公布月球载人飞行的时间表。迄今所有载人绕月飞行、轨道飞行或登月任务均由 NASA 执行:包括 1968-1972 年间的九次阿波罗计划以及今年四月的阿尔忒弥斯 2 号任务。

  3. Vivaldi 8.0 释出

    基于 Chromium 的浏览器 Vivaldi 释出了 8.0 版本。Vivaldi 由 Opera 联合创始人谭咏文(Jon von Tetzchner)创办。Vivaldi 8.0 的新特性包括:被称为 Unified 的新外观,所有元素都统一在一个视觉平面上;提供了六种预设布局,其中之一是垂直标签,用户可选择垂直左侧、垂直右侧两种垂直标签布局,其它还有经典、简洁、自动隐藏以及底部四种布局。

  4. SpaceX 最大的收入来源是与 Anthropic 达成的数据中心交易

    SpaceX 周三晚上向美国证券交易委员会(SEC)递交了招股说明书,首次披露了其财务状况。根据招股说明书,在合并了马斯克(Elon Musk)旗下的 xAI 和 X/Twitter 之后,SpaceX 最大的收入来源就是今年五月与 Anthropic 达成的为期三年的数据中心交易,租用 Colossus 1 园区的算力,每月支付 12.5 亿美元。但这笔交易并非是保障性,任何一方都可以提前 90 天通知终止交易。其它数据包括:2025 年营收 187 亿美元,营业亏损 26 亿美元,净亏损 49 亿美元。其中卫星宽带 Starlink / Connectivity 业务营收 114 亿美元营业利润 44 亿美元,太空发射业务营收 41 亿美元运营亏损 6.57 亿美元,AI 以及社媒业务营收 32 亿美元营业亏损 64 亿美元。招股书数百次提及 AI。马斯克持有 12.3% 的 A 类股和 93.6% 的 B 类股,B 类股投票权十倍于 A 类股,马斯克总共控制着公司 85.1% 的投票权。如果他出售任何 B 类股,它们将自动转换为 A 类股。

  5. Google 的 AI 搜索容易被人为操纵

    Google 的 AI 搜索非常容易被人为操纵。因为以前的搜索结果是第一页给你 10 个链接然后让用户判断,现在的 AI 搜索是给你一个答案,而答案的来源可能只有一个。BBC 科技记者通过个人网站上一篇热狗文章演示了这一操纵。专家表示此类操纵正大规模系统性地发生。操纵 AI 搜索向用户提供偏见或不准确信息可能会带来严重后果。这并非一个无关紧要的问题。在全球范围内,逾 10 亿人日常使用 AI 聊天机器人,每月有 25 亿人浏览 Google 的 AI overviews。如果你能操控此类工具就能获得巨大的权力。Google 等公司也注意到了该问题。, Google 上周更新了其政策,将试图操纵 AI 回复的行为视为违反公司规定。Google 威胁对涉嫌操纵行为的公司或网站从搜索结果中移除或降低排名。

  6. RTX 5090DV2 显卡列入封禁清单

    上周五,中国海关将去年 8 月英伟达为通过美国出口管制规定而推出的 RTX 5090DV2 显卡列入封禁清单。该清单最初包括 H200 和 H20。H20 是英伟达此前在中国市场销售的另一款中国特供芯片。在京东和淘宝等主要电商平台,RTX 5090DV2 仍在销售,价格在 1.8 万-2.2 万元之间,意味着现有库存仍然能正常销售,但随着进口的消失,其数量将会越来越少。

  7. Google 意外公开了未修复 Chromium 漏洞的利用代码

    Google 周三公开了一个未修复 Chromium 漏洞的利用代码。该漏洞影响所有使用基于 Chromium 浏览器的用户。独立安全研究员 Lyra Rebane 在 2022 年底向 Google 报告了漏洞,但 29 个月后它仍然没有修复。本周三上午 Google 向 Chromium 的 bug 跟踪系统披露了漏洞,Rebane 一开始以为漏洞已经修复了,结果发现根本没有。Google 虽然之后删除了帖子,但其内容已被其它网站存档。该漏洞滥用了 Chromium 的 Browser Fetch API 打开一个持续活动的 Service Worker,恶意网站可通过 JavaScript 触发该 Service Worker 创建连接,监视用户的部分活动,它还可作为代理访问网站和发起 DDoS 攻击。安全研究人员认为这是一个严重的漏洞,它实际上相当于一个受限的后门,将浏览器变成僵尸网络的一部分。

  8. 三星电子劳资谈判达成初步协议,罢工终止

    三星电子工会在 20 日 23 时总罢工启动仅剩最后 1 个小时之际,与三星电子公司戏剧性地达成了协议,罢工终止。根据双方达成的就绩效奖金方案初步协议,负责半导体业务的设备解决方案(DS)部门员工今年有望获得最高约 6 亿韩元(约合人民币 272.3 万元)的绩效奖金。劳资商定维持既有的年终绩效奖金(OPI)制度的同时,为 DS 部门新设半导体特别绩效奖金。公司将拿出业绩的 10.5% 作为特别绩效奖金资金来源,不设上限。资金来源中的 40% 将分配给 DS 部门,其余 60% 分配给子部门,向行政部门统一发放的绩效奖金为 DS 子部门存储芯片事业部的 70% 水平。人均绩效奖金规模有望达 6 亿韩元。

  9. 安娜档案馆被判向图书出版商赔偿 1950 万美元

    Penguin Random House、Elsevier 和 HarperCollins 等 13 家大型图书出版商今年 3 月联合起诉安娜的档案(Anna’s Archive),指控该影子图书馆助长图书盗版。出版商此举旨在获得法庭禁令,对安娜的档案的域名注册商施压。安娜的档案已经深陷了多起诉讼,去年底流媒体巨头 Spotify 和唱片公司起诉安娜的档案导致其失去了 .org 主域名。本周美国地区法官 Jed S. Rakoff 签署了一项缺席判决书,完全满足了出版商的要求,安娜档案馆被判向出版商赔偿 1950 万美元。法官还发布了一项范围广泛的永久禁令,要求二十多家全球域名注册商、托管商和服务提供商立即关闭安娜的档案的其余域名。鉴于网站运营者身份匿名,赔偿金基本不可能兑现,因此它面临的影响主要是禁令,如美国公司 Cloudflare 和 OwnRegistrar 将需要遵守禁令。

  10. Firefox 将移除 asm.js 相关代码

    Mozilla 宣布 Firefox 未来将移除 asm.js 相关代码,因为它早有了后继者 WebAssembly,同时维护两者耗费时间且增加攻击面。asm.js 是 Mozilla 对 NaCl 和 PNaCl 的回应:通过选择一个严格静态的 JavaScript 子集获得类似 NaCl/PNaCl 的性能,同时代码又能直接运行在 Web 内容中。asm.js 于 2013 年随 Firefox 22 发布,获得了巨大的成功,证明只使用 Web 技术就能在 Web 上以接近原生的速度运行代码,它为 WebAssembly 的诞生铺平了道路,WebAssembly 在 2019 年成为 W3C 标准。Mozilla 从 Firefox 148 开始 JS 引擎 SpiderMonkey 默认禁用 asm.js 优化,未来版本将完全移除相关代码,使用 asm.js 的网站不会受到影响,开发者建议想要继续使用 asm.js 发布内容的网站重编译到 WebAssembly,它的执行速度更快,二进制文件更小。

  11. Google 云服务 GCP 不小心将其大客户 Railway 的账号封禁

    2024 年 Google 云服务 GCP 的错误配置导致澳大利亚退休基金管理公司 UniSuper 的数据被完全删除,幸运的是 UniSuper 在另一家公司有备份。这起事故导致 UniSuper 下线了一周多时间。2026 年 5 月 19 日 GCP 发生了一起类似的严重事故,它的自动系统将其大客户、PaaS 平台 Railway.com 的生产账号给封了,导致 Railway 的服务下线,根据 Railway 官方博客的事故报告,宕机持续了大约 8 个小时。账号封禁发生在 19 日 22:10 UTC,导致 Railway 失去了 GCP 相关的基础设施,这些基础设施支持了控制面板、API 以及部分网络基础设施。Railway 立即联系了 GCP 的客户经理,22:29 UTC 账号恢复,但计算实例、磁盘以及网络都需要逐个慢慢恢复,直到第二天 07:58 UTC 事故才完全解决。Railway 宣布将降低对 GCP 的依赖,计划将 GCP 从热路径中移除,保留作为备份/故障转移服务。

  12. 为何日本的花粉过敏如此严重

    日本的花粉过敏症是一个全国性健康问题,估计 43% 的日本人出现中度至重度症状。相比下英国是 26%,美国为 12%-18%。每年春天日本全国各地的城市街道上人人都戴上口罩,原因就是花粉引发的过敏性鼻炎。为什么日本的花粉过敏问题如此严重?原因与健康不佳、污染甚至自然环境都关系不大,而是与二战后日本政客的决策有关。战争期间,石油和天然气短缺迫使日本转向其最丰富的自然资源——森林——作为家庭和工业的燃料来源。天然森林遭到大面积砍伐,东京、大阪和神户等城市周围山林被砍伐殆尽。二战之后,由于光秃秃的山容易引发山体滑坡和洪涝灾害,政府决定开展大规模植树造林。政府选择了两种快速生长的树种:日本杉(sugi)和日本扁柏(hinoki)。今天这些杉树和柏树的种植面积占到了国土面积的五分之一。问题是杉树和柏树在生长 30 年成熟之后会产生大量轻质花粉。而几乎所有人工林的年龄都超过 30 岁了。为了缓解过敏症日本政府如今计划砍掉五分之一的杉树林,替换上新树种。

  13. Fedora 移除深度桌面环境包

    在 openSUSE 之后,Fedora 发行版移除了深度桌面环境包(Deepin Desktop)。2025 年初 SUSE 安全团队在一次例行审查中发现深度桌面环境有名叫 deepin-feature-enable 的软件包,该软件包是在 2021 年 4 月加入的,并没有咨询或通知 SUSE,它包含了一个“许可协议对话框(license agreement dialog)”,基本上说讲因为 openSUSE 的安全规定,它禁用了 deepin-api 和 deepin-daemon 需要的所有 dbus 和 polkit 功能,这可能导致 Deepin Desktop 不能正常工作,部分功能无效。如果用户不在意这些安全问题,可选择点击确认,之后会自动安装缺少的 dbus 和 polkit。安全团队的调查发现,deepin-daemon 中的核心组件从未递交进行安全审查,它们被悄悄的引入到了 openSUSE 中。鉴于 Deepin 社区过去几年多次违规,openSUSE 决定移除 Deepin Desktop。Fedora 项目随后也对深度桌面环境包展开安全审查,期间开发者发现难以联系部分深度软件包的维护者,因为安全担忧和软件包缺乏维护,它最终决定移除深度桌面环境。

  14. OpenAI 和英伟达等在模型中加入了对 SynthID 水印的支持

    Google 在三年前推出了用于标记 AI 图像的数字水印技术 SynthID,它称 SynthID 至今被用于标记了 1000 亿张图像和视频。Google 去年在 Gemini 应用中添加了 SynthID 检测功能。用户上传可疑内容,询问聊天机器人是否是 AI 生成的。Google 称至今还没有人成功破解 SynthID,宣布与多家 AI 公司合作加入对该水印技术的支持。英伟达的 Cosmos、OpenAI 的 GPT 2 图像、Kakao 和 ElevenLabs 都将在其 AI 生成内容中加入对 SynthID 的支持。

  15. 全球疫苗接种率下滑

    全球疫苗接种率下滑。在医疗体系陷入混乱的新冠疫情过去后,疫苗接种率今未能恢复至以前的水平。2024 年麻疹疫情已蔓延至 59 个国家。麻疹病毒传染性极强,如果同一空间中有感染者,没有相关免疫的人群几乎 100% 会被感染。该病的并发症有肺炎、中耳炎等,甚至可能导致脑炎,变成重症。预防麻疹必须要靠疫苗。想要维持群体免疫、防止疫情扩散,疫苗接种率需达到 95% 以上。新冠疫情期间,由于出行限制,民众普遍推迟了其他疫苗的接种。医疗机构方面,接种人员和治疗人员也侧重于应对新冠疫情。加上其他传染病的流行得到抑制,认为无需接种疫苗的人越来越多,导致全球疫苗接种率持续走低。除麻疹以外,其他传染病也呈现出类似趋势。2024 年白喉、百日咳、破伤风三联疫苗的接种率全球所有地区都低于 2010 年以后的峰值水平。

NEWSLETTER · FREE · WEEKLY

OrangeBot Weekly

5 Claude Code skills worth using each week — with my verdict on what’s actually good. No hype.