OrangeBot.AI Digest — 2026-05-18
90 headlines across 8 sources, aggregated for this day.
Hacker News(15)
- Haiku OS runs on M1 Macs now (discuss.haiku-os.org)
- Garry Tan, the CEO of YC, accused me of unethical reporting (radleybalko.substack.com)
- Elon Musk has lost his lawsuit against Sam Altman and OpenAI (techcrunch.com)
- Iran starts Bitcoin-backed ship insurance for Hormuz strait (www.bloomberg.com)
- Anthropic acquires Stainless (www.anthropic.com)
- Qwen 3.7 Preview (twitter.com)
- We stopped AI bot spam in our GitHub repo using Git's –author flag (archestra.ai)
- Project Glasswing: what Mythos showed us (blog.cloudflare.com)
- The Quiet Renovation at Bitwarden (blog.ppb1701.com)
- Actually, democracy dies in H.R. (www.nytimes.com)
- Show HN: Files.md – Open-source alternative to Obsidian (github.com)
- 'We mould trees to grow into the shape of chairs' (www.bbc.co.uk)
- AI eats the world (Spring 26) [pdf] (static1.squarespace.com)
- Linux security mailing list 'almost unmanageable' (www.theregister.com)
- Eric Schmidt speech about AI booed during graduation (www.nbcnews.com)
GitHub Trending(15)
- tinyhumansai / openhuman
- Imbad0202 / academic-research-skills
- HKUDS / CLI-Anything
- K-Dense-AI / scientific-agent-skills
- supertone-inc / supertonic
- ggml-org / llama.cpp
- ruvnet / RuView
- CloakHQ / CloakBrowser
- tech-leads-club / agent-skills
- BigBodyCobain / Shadowbroker
- humanlayer / 12-factor-agents
- NVlabs / Sana
- microsoft / ai-agents-for-beginners
- ZhuLinsen / daily_stock_analysis
- plausible / analytics
Product Hunt(15)
- Shadow
AI computer screen and voice control with custom automation
- ReactVision Studio
Build AR/VR Apps in React Native + ship directly to devices
- LobeHub
Your Chief Agent Operator for multi-agent work
- SocLeads 3.0
Scrape emails from socials and maps by location
- Moody
Your Mac wallpaper that listens to your music & weather
- AnyFrame
Sandboxes for your AI agents
- Krea 2
An image model built for style control and moodboards
- LandingHero AI
24/7 Salesperson on Your Website
- Polarity
The Self-Improvement Stack For agents
- M1 by Montage
Agentic UI that scales on demand
- Triggered Agents by Adaptive
AI agents that run automatically on business events
- Draft
Capture AI chats into your knowledge base
- Voiser AI
Human-like AI voiceovers in 140+ languages
- Agentspan
Open-source runtime for durable AI agents
- SizzleAir
Thermal assistant for fanless MacBook Airs
Hugging Face(15)
- CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
Multimodal Large Language Models (MLLMs) have significantly advanced document understanding, yet current Doc-VQA evaluations score only the final answer and leave the supporting evidence unchecked. This answer-only approach masks a critical failure mode: a model can land on the correct answer while grounding it in the wrong passage -- a critical risk in high-stakes domains like law, finance, and medicine, where every conclusion must be traceable to a specific source region. To address this, we introduce CiteVQA, a benchmark that requires models to return element-level bounding-box citations alongside each answer, evaluating both jointly. CiteVQA comprises 1,897 questions across 711 PDFs spanning seven domains and two languages, averaging 40.6 pages per document. To ensure fidelity and scalability, the ground-truth citations are generated by an automated pipeline-which identifies crucial evidence via masking ablation-and are subsequently validated through expert review. At the core of our evaluation is Strict Attributed Accuracy (SAA), which credits a prediction only when the answer and the cited region are both correct. Auditing 20 MLLMs reveals a pervasive Attribution Hallucination: models frequently produce the right answer while citing the wrong region. The strongest system (Gemini-3.1-Pro-Preview) achieves an SAA of only 76.0, and the strongest open-source MLLM reaches just 22.5. Ultimately, towards trustworthy document intelligence, CiteVQA exposes a reliability gap that answer-only evaluations overlook, providing the instrumentation needed to close it. Our repository is available at https://github.com/opendatalab/CiteVQA.
- PhysBrain 1.0 Technical Report
Vision-language-action models have advanced rapidly, but robot trajectories alone provide limited coverage for learning broad physical understanding. PhysBrain 1.0 studies a complementary route: converting large-scale human egocentric video into structured physical commonsense supervision before robot adaptation. Our data engine extracts scene elements, spatial dynamics, action execution, and depth-aware relations, then turns them into question-answer supervision for training PhysBrain VLMs. The resulting physical priors are further transferred to VLA policies through a capability-preserving and language-sensitive adaptation design. Across multimodal QA benchmarks and embodied control benchmarks, including ERQA, PhysBench, SimplerEnv-WidowX, LIBERO, and RoboCasa, PhysBrain 1.0 achieves SOTA results and shows especially strong out-of-domain performance on SimplerEnv. These results suggest that scaling physical commonsense from human interaction video can provide an effective bridge from multimodal understanding to robot action.
- MMSkills: Towards Multimodal Skills for General Visual Agents
Reusable skills have become a core substrate for improving agent capabilities, yet most existing skill packages encode reusable behavior primarily as textual prompts, executable code, or learned routines. For visual agents, however, procedural knowledge is inherently multimodal: reuse depends not only on what operation to perform, but also on recognizing the relevant state, interpreting visual evidence of progress or failure, and deciding what to do next. We formalize this requirement as multimodal procedural knowledge and address three practical challenges: (I) what a multimodal skill package should contain; (II) where such packages can be derived from public interaction experience; and (III) how agents can consult multimodal evidence at inference time without excessive image context or over-anchoring to reference screenshots. We introduce MMSkills, a framework for representing, generating, and using reusable multimodal procedures for runtime visual decision making. Each MMSkill is a compact, state-conditioned package that couples a textual procedure with runtime state cards and multi-view keyframes. To construct these packages, we develop an agentic trajectory-to-skill Generator that transforms public non-evaluation trajectories into reusable multimodal skills through workflow grouping, procedure induction, visual grounding, and meta-skill-guided auditing. To use them, we introduce a branch-loaded multimodal skill agent: selected state cards and keyframes are inspected in a temporary branch, aligned with the live environment, and distilled into structured guidance for the main agent. Experiments across GUI and game-based visual-agent benchmarks show that MMSkills consistently improve both frontier and smaller multimodal agents, suggesting that external multimodal procedural knowledge complements model-internal priors.
- FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization
Human-centric video customization, particularly at the garment level, has shown significant commercial value. However, existing approaches cannot support low-latency and interactive garment control, which is crucial for applications such as e-commerce and content creation. This paper studies how to achieve interactive multi-garment video customization while preserving motion coherence using only single-garment video data. We present FashionChameleon, a real-time and interactive framework for human-garment customization in autoregressive video generation, where users can interactively switch garment during generation. FashionChameleon consists of three key techniques: (i) Instead of training on multi-garment video data, we train a Teacher Model with In-Context Learning on a single reference-garment pair. By retaining the image-to-video training paradigm while enforcing a mismatch between the reference and garment image, the model is encouraged to implicitly preserve coherence during single-garment switching. (ii) To achieve consistency and efficiency during generation, we introduce Streaming Distillation with In-Context Learning, which fine-tunes the model with in-context teacher forcing and improves extrapolation consistency via gradient-reweighted distribution matching distillation. (iii) To extend the model for interactive multi-garment video customization, we propose Training-Free KV Cache Rescheduling, which includes garment KV refresh, historical KV withdraw, and reference KV disentangle to achieve garment switching while preserving motion coherence. Our FashionChameleon uniquely supports interactive customization and consistent long-video extrapolation, while achieving real-time generation at 23.8 FPS on a single GPU, 30-180times faster than existing baselines.
- Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation
On-policy distillation (OPD) has emerged as an efficient post-training paradigm for large language models. However, existing studies largely attribute this advantage to denser and more stable supervision, while the parameter-level mechanisms underlying OPD's efficiency remain poorly understood. In this work, we argue that OPD's efficiency stems from a form of ``foresight'': it establishes a stable update trajectory toward the final model early in training. This foresight manifests in two aspects. First, at the Module-Allocation Level, OPD identifies regions with low marginal utility and concentrates updates on modules that are more critical to reasoning. Second, at the Update-Direction Level, OPD exhibits stronger low-rank concentration, with its dominant subspaces aligning closely with the final update subspace early in training. Building on these findings, we propose EffOPD, a plug-and-play acceleration method that speeds up OPD by adaptively selecting an extrapolation step size and moving along the current update direction. EffOPD requires no additional trainable modules or complex hyperparameter tuning, and achieves an average training acceleration of 3times while maintaining comparable final performance. Overall, our findings provide a parameter-dynamics perspective for understanding the efficiency of OPD and offer practical insights for designing more efficient post-training methods for large language models.
- DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo
Achieving human-level manipulation requires dexterous robotic hands capable of complex object interactions. Advancing such capabilities further demands standardized benchmarks for systematic evaluation. However, existing dexterous benchmarks lack tasks that reflect the unique manipulation capabilities of dexterous hands over parallel grippers, as well as comprehensive evaluation pipelines. In this paper, we present DexJoCo, a benchmark and toolkit for task-oriented dexterous manipulation, comprising 11 functionally grounded tasks that evaluate tool-use, bimanual coordination, long-horizon execution, and reasoning. We develop a low-cost data collection system and collect 1.1K trajectories across these tasks, with support for domain randomization to assess robustness. We benchmark modern models under diverse settings, including visual and dynamics randomization, multi-task training, and action-head adaptation. Through extensive empirical analysis, we identify several important insights and common limitations of current policies in dexterous manipulation, highlighting key challenges for future research in dexterous hand robot learning. Project page available at: https://dexjoco.github.io
- Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding
Distilling large reasoning models is essential for making Long-CoT reasoning practical, as full-scale inference remains computationally prohibitive. Existing curation-based approaches select complete reasoning traces post-hoc, overlooking collaboration among heterogeneous teachers and lacking dynamic exploration, which leads to redundant sampling and missed complementary reasoning. We introduce CoRD, a collaborative multi-teacher decoding framework that performs step-wise reasoning synthesis guided by predictive perplexity-based scoring and beam search. This enables heterogeneous LRMs to jointly construct coherent reasoning trajectories while efficiently preserving diverse, high-potential hypotheses. Experiments show that CoRD produces higher-quality reasoning data and achieves near teacher-level student performance with fewer, structured supervision signals, without substantial efficiency overhead. CoRD further generalizes well to out-of-domain and open-ended settings. The dataset and model are available at https://github.com/DISL-Lab/CoRD{https://github.com/DISL-Lab/CoRD}.
- InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation
Text and faces are among the most perceptually salient and practically important patterns in visual generation, yet they remain challenging for autoregressive generators built on discrete tokenization. A central bottleneck is the tokenizer: aggressive downsampling and quantization often discard the fine-grained structures needed to preserve readable glyphs and distinctive facial features. We attribute this gap to standard discrete-tokenizer objectives being weakly aligned with text legibility and facial fidelity, as these objectives typically optimize generic reconstruction while compressing diverse content uniformly. To address this, we propose InsightTok, a simple yet effective discrete visual tokenization framework that enhances text and face fidelity through localized, content-aware perceptual losses. With a compact 16k codebook and a 16x downsampling rate, InsightTok significantly outperforms prior tokenizers in text and face reconstruction without compromising general reconstruction quality. These gains consistently transfer to autoregressive image generation in InsightAR, producing images with clearer text and more faithful facial details. Overall, our results highlight the potential of specialized supervision in tokenizer training for advancing discrete image generation.
- Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Group Relative Policy Optimization has emerged as essential for aligning video diffusion models with human preferences, but faces a critical computational bottleneck: training a 14B parametered model typically demands hundreds of GPU days per experiment. Existing efficiency methods reduce costs through sliding window subsampling training timesteps, but fundamentally compromise optimization, exhibiting severe instability and failing to reach full trajectory performance. We present Flash-GRPO, a single-step training framework that outperforms full trajectory training in alignment quality under low computational budgets while substantially improving training efficiency. Flash-GRPO addresses two critical challenges: iso-temporal grouping eliminates timestep-confounded variance by enforcing prompt-wise temporal consistency, decoupling policy performance from timestep difficulty; temporal gradient rectification neutralizes the time-dependent scaling factor that causes vastly inconsistent gradient magnitudes across timesteps. Experiments on 1.3B to 14B parameter models validate Flash-GRPO's effectiveness, demonstrating substantial training acceleration with consistent stability and state-of-the-art alignment quality.
- Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR
Reinforcement learning with verifiable rewards (RLVR) has emerged as a scalable paradigm for improving the reasoning capabilities of large language models. However, its effectiveness is fundamentally limited by exploration: the policy can only improve on trajectories it has already sampled. While increasing the number of rollouts alleviates this issue, such brute-force scaling is computationally expensive, and existing approaches that modify the optimization objective provide limited control over what is explored. In this work, we propose NudgeRL, a framework for structured and diversity-driven exploration in RLVR. Our approach introduces Strategy Nudging, which conditions each rollout on lightweight, strategy-level contexts to induce diverse reasoning trajectories without relying on expensive oracle supervision. To effectively learn from such structured exploration, we further propose a unified objective, which decomposes the reward signal into inter- and intra-context components and incorporates a distillation objective to transfer discovered behaviors back to the base policy. Empirically, NudgeRL outperforms standard GRPO with up to 8 times larger rollout budgets, while outperforming oracle-guided RL baseline on average across five challenging math benchmarks. These results demonstrate that structured, context-driven exploration can serve as an efficient and scalable alternative to both brute-force rollout scaling and feasibility-oriented methods based on privileged information. Our code is available at https://github.com/tally0818/NudgeRL.
- ReactiveGWM: Steering NPC in Reactive Game World Models
Current game world models simulate environments from a subjective, player-centric perspective. However, by treating the Non-Player Character (NPC) merely as background pixels, these models cannot capture interactions between the player and NPC. In that sense, they act as passive video renderers rather than real simulation engines, lacking the physical understanding needed to model action-induced NPC reactivities. We introduce ReactiveGWM, a reactive game world model that synthesizes dynamic interactions between the player and NPC. Instead of entangling all interaction dynamics, ReactiveGWM explicitly decouples player controls from NPC behaviors. Player actions are injected into the diffusion backbone via a lightweight additive bias, while high-level NPC responses (e.g., Offense, Control, Defense) are grounded through cross-attention modules. Crucially, these modules learn a game-agnostic representation of interactive logic. This enables zero-shot strategy transfer: our learned modules can be plugged directly into off-the-shelf, unannotated world models of different games. This instantly unlocks steerable NPC interactions without any domain-specific retraining. Evaluated on two Street Fighter games, ReactiveGWM maintains fine-grain player controllability while achieving robust, prompt-aligned NPC strategy adherence, paving the way for scalable, strategy-rich interaction with the NPC.
- Hölder Policy Optimisation
Group Relative Policy Optimisation (GRPO) enhances large language models by estimating advantages across a group of sampled trajectories. However, mapping these trajectory-level advantages to policy updates requires aggregating token-level probabilities within each sequence. Relying on a fixed aggregation mechanism for this step fundamentally limits the algorithm's adaptability. Empirically, we observe a critical trade-off: certain fixed aggregations frequently suffer from training collapse, while others fail to yield satisfactory performance. To resolve this, we propose HölderPO, a generalised policy optimisation framework unifying token-level probability aggregation via the Hölder mean. By explicitly modulating the parameter p, our framework provides continuous control over the trade-off between gradient concentration and variance bounds. Theoretically, we prove that a larger p concentrates the gradient to amplify sparse learning signals, whereas a smaller p strictly bounds gradient variance. Because no static configuration can universally resolve this concentration-stability trade-off, we instantiate the framework with a dynamic annealing algorithm that progressively schedules p across the training lifecycle. Extensive evaluations demonstrate superior stability and convergence over existing baselines. Specifically, our approach achieves a state-of-the-art average accuracy of 54.9% across multiple mathematical benchmarks, yielding a substantial 7.2% relative gain over standard GRPO and secures an exceptional 93.8% success rate on ALFWorld.
- Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution
Large language models (LLMs) still struggle with the rigorous reasoning demands of hard competitive programming. While recent multi-agent frameworks attempt to bridge this reliability gap, they remain fundamentally stateless: they rely on static retrieval and discard the valuable problem-solving and debugging experience gained from previous tasks. To address this, we present Solvita, an agentic evolution framework that enables continuous learning without requiring weight updates to the underlying LLM. Solvita reorganizes problem-solving into a closed-loop system of strategy selection, program synthesis, certified supervision, and targeted hacking, executed by four specialized agents: Planner, Solver, Oracle, and Hacker. Crucially, each agent is paired with a trainable, graph-structured knowledge network. As the system operates, outcome signals, such as pass/fail verdicts, test certification quality, and adversarial vulnerabilities discovered by the Hacker, are recast as reinforcement learning updates to these network weights. This allows the agents to dynamically route future queries based on past successes and failures, effectively accumulating transferable reasoning experience over time. Evaluated across CodeContests, APPS, AetherCode, and live Codeforces rounds, Solvita establishes a new state-of-the-art among code-generation agents, outperforming existing multi-agent pipelines and nearly doubling the accuracy of single-pass baselines.
- From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing
Modern image editing models produce realistic results but struggle with abstract, multi step instructions (e.g., ``make this advertisement more vegetarian-friendly''). Prior agent based methods decompose such tasks but rely on handcrafted pipelines or teacher imitation, limiting flexibility and decoupling learning from actual editing outcomes. We propose an experiential framework for long-horizon image editing, where a planner generates structured atomic decompositions and an orchestrator selects tools and regions to execute each step. A vision language judge provides outcome-based rewards for instruction adherence and visual quality. The orchestrator is trained to maximize these rewards, and successful trajectories are used to refine the planner. By tightly coupling planning with reward driven execution, our approach yields more coherent and reliable edits than single-step or rule-based multistep baselines.
- CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage
Modern 3D visual learning relies on observations sampled from metric 3D assets, yet existing scans, meshes, point clouds, simulations, and reconstructions do not directly provide a sparse, comparable, and geometry-consistent panoramic training interface. Dense trajectories duplicate nearby views, source-specific rendering policies yield heterogeneous annotations, and sparse heuristics may miss important regions or introduce depth-inconsistent observations. We study how to convert 3D assets into sparse panoramic RGB-D-pose data that preserves complete scene coverage with low redundancy and auditable provenance. We propose COVER (Coverage-Oriented Viewpoint curation with ERP Range-depth warping), a training-free ERP viewpoint curator that projects geometry observed from selected views into candidate ERP probes, scores incremental coverage, and penalizes depth conflicts. Under bounded proxy error, its greedy coverage proxy preserves the standard coverage-style approximation behavior up to an additive error term. Using COVER, we build CM-EVS (Coverage-curated Metric ERP View Set), a panoramic RGB-D-pose dataset with 36,373 curated ERP frames from 1,275 indoor scenes across Blender indoor, HM3D, and ScanNet++, complemented by outdoor panoramas from TartanGround and OB3D re-encoded into the same schema. Each frame provides full-sphere RGB, metric range depth, calibrated pose; COVER-produced indoor frames include per-step provenance logs. With a median of only 25 frames per indoor scene, CM-EVS covers all 13 unified room types while maintaining compact scene-level coverage. Experiments show that COVER improves the coverage-conflict trade-off, making CM-EVS a sparse, compact, and auditable RGB-D-pose resource for geometry-consistent panoramic 3D learning.
Techmeme(15)
- Internal memo: Meta is reassigning 7,000 workers to four new units focused on building AI tools, two days before it is set to lay off 10% of its workforce (Eli Tan/New York Times)
Eli Tan / New York Times : Internal memo: Meta is reassigning 7,000 workers to four new units focused on building AI tools, two days before it is set to lay off 10% of its workforce — The company announced the changes two days before it plans to lay off 10 percent of its work force, or about 8,000 employees.
- Sources: AI chip designer Tenstorrent has drawn takeover interest from Intel and Qualcomm; Tenstorrent could be valued at more than $5B in a potential deal (Bloomberg)
Bloomberg : Sources: AI chip designer Tenstorrent has drawn takeover interest from Intel and Qualcomm; Tenstorrent could be valued at more than $5B in a potential deal — Artificial intelligence chip startup Tenstorrent Inc. is drawing early takeover interest from prospective buyers at a moment …
- Anthropic last week began letting Mythos users share cybersecurity threats with others who may face similar vulnerabilities, modifying its previous stance (Amrith Ramkumar/Wall Street Journal)
Amrith Ramkumar / Wall Street Journal : Anthropic last week began letting Mythos users share cybersecurity threats with others who may face similar vulnerabilities, modifying its previous stance — How to restrict access while still allowing users to share threat information is a major challenge facing AI companies
- Musk v. Altman: Elon Musk says the judge and jury "never actually ruled on the merits of the case, just on a calendar technicality" and he will file an appeal (Elon Musk/@elonmusk)
Elon Musk / @elonmusk : Musk v. Altman: Elon Musk says the judge and jury “never actually ruled on the merits of the case, just on a calendar technicality” and he will file an appeal — Regarding the OpenAI case, the judge & jury never actually ruled on the merits of the case, just on a calendar technicality. There is no question to anyone following the case in detail that Altman & Brockman did in fact enrich themselves by stealing a charity. The only question
- Akamai is seeking to raise $2.6B in a convertible bond offering, and plans to use $350M of the offering to buy back its common stock from buyers of the bonds (David Morris/Bloomberg)
David Morris / Bloomberg : Akamai is seeking to raise $2.6B in a convertible bond offering, and plans to use $350M of the offering to buy back its common stock from buyers of the bonds — Akamai Technologies Inc. is seeking to raise $2.6 billion in a convertible bond offering to fund spending on cloud computing infrastructure.
- Cloudflare tests Mythos against 50+ repositories, highlights its ability to chain bugs into a single exploit, and details a vulnerability discovery harness (Grant Bourzikas/Cloudflare)
Grant Bourzikas / Cloudflare : Cloudflare tests Mythos against 50+ repositories, highlights its ability to chain bugs into a single exploit, and details a vulnerability discovery harness — For the last few months, we've been testing a range of security-focused LLMs on our own infrastructure.
- Anthropic acquires NYC-based Stainless, which generates SDKs from APIs, and plans to wind down its hosted products, after reportedly discussing a $300M+ deal (Kirsten Korosec/TechCrunch)
Kirsten Korosec / TechCrunch : Anthropic acquires NYC-based Stainless, which generates SDKs from APIs, and plans to wind down its hosted products, after reportedly discussing a $300M+ deal — Anthropic announced Monday it has acquired Stainless, a startup founded by former Stripe engineer Alex Rattray whose software …
- X quietly limits users who didn't pay for verification to "50 original posts and 200 replies per day", down from 2,400 posts per day (Jackson Chen/Engadget)
Jackson Chen / Engadget : X quietly limits users who didn't pay for verification to “50 original posts and 200 replies per day”, down from 2,400 posts per day — That's down from the previous 2,400 posts per day limit. — X has introduced some more incentive to get users to pay for “verification” …
- Uber increases its stake in Delivery Hero to 19.5%, up from 7% in April, and says it "has no intent to acquire 30% or more" of Delivery Hero's voting rights (Natalie Lung/Bloomberg)
Natalie Lung / Bloomberg : Uber increases its stake in Delivery Hero to 19.5%, up from 7% in April, and says it “has no intent to acquire 30% or more” of Delivery Hero's voting rights — Uber Technologies Inc. has increased its stake in German food delivery company Delivery Hero SE, underscoring its ambitions …
- Seagate shares drop 6.87%, leading a group-wide sell-off after comments from its CEO raised concerns it won't be able to meet demand fueled by the AI buildout (CJ Haddad/CNBC)
CJ Haddad / CNBC : Seagate shares drop 6.87%, leading a group-wide sell-off after comments from its CEO raised concerns it won't be able to meet demand fueled by the AI buildout — Shares of memory chip maker Seagate closed down more than 6% Monday, leading a group-wide sell-off, after comments …
- Sigma, which sells a cloud-native analytics platform that sits on top of data warehouses, raised an $80M Series E led by Princeville Capital at a $3B valuation (Cristian Dina/The Next Web)
Cristian Dina / The Next Web : Sigma, which sells a cloud-native analytics platform that sits on top of data warehouses, raised an $80M Series E led by Princeville Capital at a $3B valuation — The round was led by Princeville Capital, with new strategic investors Databricks Ventures, ServiceNow Ventures, and Workday Ventures also participating.
- Musk v. Altman: the jury unanimously rejects Elon Musk's claims against Sam Altman, finding that the claims were filed outside of the statute of limitations (CNBC)
CNBC : Musk v. Altman: the jury unanimously rejects Elon Musk's claims against Sam Altman, finding that the claims were filed outside of the statute of limitations — After less than two hours of deliberations, a jury on Monday rejected Elon Musk's claims against OpenAI CEO Sam Altman …
- Cursor releases Composer 2.5, saying it's better at sustained work on long-running tasks and follows complex instructions more reliably; it's built on Kimi K2.5 (Cursor)
Cursor : Cursor releases Composer 2.5, saying it's better at sustained work on long-running tasks and follows complex instructions more reliably; it's built on Kimi K2.5 — Composer 2.5 is now available in Cursor. — It's a substantial improvement in intelligence and behavior over Composer 2.
- NextEra's $67B deal to buy Dominion, the largest utility merger in US history, signals a new era of utility consolidation to accommodate AI-driven power demand (Emily Forgash/Bloomberg)
Emily Forgash / Bloomberg : NextEra's $67B deal to buy Dominion, the largest utility merger in US history, signals a new era of utility consolidation to accommodate AI-driven power demand — NextEra Energy Inc.'s $67 billion deal for rival Dominion Energy Inc., the largest utility acquisition in US history …
- Sony is hiking the starting price of one-month and three-month PlayStation Plus subscriptions in "select regions", blaming "ongoing market conditions" (Jay Peters/The Verge)
Jay Peters / The Verge : Sony is hiking the starting price of one-month and three-month PlayStation Plus subscriptions in “select regions”, blaming “ongoing market conditions” — One-month and three-month PlayStation Plus subscriptions are getting more expensive.
Solidot(15)
- 你生活的地点与你衰老的速度相关
根据发表在《Cell》期刊上的一项研究,研究人员通过分析欧洲、东亚和南亚的 322 名健康人去构建迄今最详尽的遗传祖先和环境如何塑造人类生物学特征的图谱。通过招募居住在不同大洲、具有相同遗传背景的人群,科学家得以以前所未有的清晰度,将 DNA 的影响与环境的影响区分开来。研究人员发现,无论搬到哪里,种族背景会对免疫系统、新陈代谢和肠道菌群产生深远影响。南亚人表现出更高的病原体暴露水平。欧洲人的肠道微生物多样性更丰富,且与心脏病风险相关的化合物含量更高。跨州迁移会改变主要的代谢途径,改变肠道微生物的平衡。研究的一大发现是你生活的地点与你衰老的速度相关。居住在亚洲外的东亚人比东亚人生物年龄更大。欧洲人则相反,居住在欧洲外的欧洲人生物年龄更小。
- 伊朗要求通过霍尔木兹海峡的海底光缆付费
伊朗军方发言人 Ebrahim Zolfaghari 在 X 上宣布对通过霍尔木兹海峡的海底光缆收费。暂时不清楚伊朗只是发出一种威胁,还是会将威胁付诸实施。伊朗的计划将要求 Google、微软、Meta和亚马逊等公司遵守其法律,同时海底光缆公司被要求支付通行许可费,而维修和维护权则完全授予伊朗公司。海底光缆传输着欧洲、亚洲和波斯湾之间的网络和金融流量,破坏光缆将会引发数字灾难,威胁到从银行系统、军事通信和 AI 云基础设施到远程办公、在线游戏和流媒体服务。
- 微软将修改 Edge 加载密码的方式
安全研究员本月初披露,Edge 内置的密码管理器会在浏览器启动时候解密所有密码然后加载到内存里。他联络了微软,结果收到的回应是“源于设计(by design)”,认为不是安全隐患。微软当时强调这是应用的预期功能。然而仅仅过了几天,微软就改变了主意,宣布未来版本的 Edge 不会再在启动时加载密码。相关补丁已经释出到 Edge Canary 版本,将包含在 Edge build 148 或更新版本中。
- 《Terraria》 15 年售出 7000 万份拷贝
独立沙盒游戏《Terraria》的开发商宣布游戏上市 15 年共售出了七千万份拷贝,其中 PC 平台销量最高 3960 万份,主机 1070 万份,移动 1970 万份,Mod 工具 tModLoader 下载量 1230 万次。过去一年《Terraria》PC 版本日均玩家 46.1 万最高 140 万,PC 玩家平均游戏时长 101 小时 18 分钟。开发商表示会继续更新《Terraria》。在史上最畅销的游戏中,《Terraria》排在第 7 位。销量最高的是《我的世界(Minecraft)》(如果不考虑《俄罗斯方块》),售出超过 3.5 亿份拷贝,其次是《Grand Theft Auto V》的 2.25 亿份,《Wii Sports》的 8290 万份、《Red Dead Redemption 2》的 8200 万份,《马里奥赛车8》的 7954 万份, 《绝地求生》的 7500 万份等。
- 三星电子工会威胁总罢工
三星电子工会已经宣布将于 21 日启动为期 18 天的总罢工,双方就绩效奖金的上限存在分歧。目前韩国政府对此事表达了高度关注,总理金民锡周日表示若罢工对国民经济造成巨大损失,政府将为保护国民经济而采取包括行使紧急调整权在内的所有可行手段。周一三星电子劳资双方展开了最新一轮谈判。韩国法院同一天就三星电子资方针对工会提出的禁止其进行违法集体斗争行为的申请作出裁定,支持资方大部分诉求,要求工会即便罢工也不得耽误生产。该决定或给劳资谈判以及工会的罢工计划带来不小影响。专家估计每罢工一天造成的损失最高达到 20 亿美元,18 天总罢工将接近 170 亿美元。而 JPMorgan 估计损失最高将达到 280 亿美元。
- NASA 维护旅行者号代码的工程师日益稀少
NASA 在 48 年前先后发射了两艘旅行者号探测器,当年曾为旅行者号写代码的工程师如今早已白发苍苍,甚至已经去世。旅行者号机载计算机运行的是汇编语言,是专为通用电气开发的处理器编写的。探测器上有三个计算机系统:计算机指令子系统(CCS)、姿态调节控制子系统(ACS)以及飞行数据子系统(FDS)。其底层飞行工作依赖于专门的汇编语言,地面系统和早期任务工具使用了 Fortran 语言。探测器上的计算机内存非常小,总容量仅为 64-70 KB。几十年来,地面控制团队成员不断减少,也逐渐老去。更糟的是很多原始文档遗失或支离破碎。项目文件大多是纸质的,每次项目搬迁办公室,会有更多的文件丢失。NASA JPL Interplanetary Network Directorate 项目主任 Suzy Dodd 在 2024 年称建造探测器的人都已经不在人世。Larry Zottarelli 是最后一位仍在工作的原始团队工程师,他于 2016 年 80 岁时退休。目前旅行者号维护团队大多数人都年过八旬,团队还依赖于一份退休工程师名单,以便在紧急情况下呼叫。该名单每年都在缩小。
- 美国青少年睡眠时间比以往任何时候都少
根据发表在《Pediatrics》期刊上的一项研究,美国所有年龄段的青少年睡眠时间都呈持续下降趋势。只有 22%的青少年表示每晚睡至少 7 个小时。研究人员称导致睡眠减少的因素包括家庭作业、课外活动、社交压力和工作等。黑人和拉丁裔青少年,以及父母受教育程度较低的青少年,获得充足睡眠的可能性越来越低。受影响最大的是年龄较大的青少年。随着年龄的增长,他们的睡眠时间稳步减少,睡眠时长和睡眠充足感从青春期早期到后期显著下降。研究利用了1991-2023 年间逾 40 万名美国八年级、十年级和十二年级学生的调查数据。为了缓解青少年睡眠不足的问题,研究人员提出的一项建议是将高中上课时间推迟到早上 8:30 或更晚。
- 北极野火释放封存的古老碳汇
北极气候严寒,植物生长缓慢,动植物残体长期以泥炭等形式在土壤中不断堆积,历经数百年乃至数千年持续累积。这使得北极及周边北方林区土壤长期扮演碳汇角色,持续吸收大气中的二氧化碳。如今北极地区火灾规模不断扩大、发生频次持续攀升,这一平衡格局正在被打破。为探明实际情况,研究团队在多处近期过火区域采集土壤岩芯样本开展研究。样本分析显示,在多数区域,地表植被快速燃烧后,会引燃土壤深层的老旧有机质并发生阴燃,进而释放大量黑碳与二氧化碳。黑碳能够吸收太阳热能,直接加剧大气升温。此外,在寒冷地区,黑碳沉降至冰雪表面会使其颜色加深、吸热变强,从而加速冰雪非正常消融。越靠近北极腹地,远古碳的外泄风险越高,这是因为北极土壤层更薄,有机质大多富集于浅层地表。
- Linus Torvalds 称 AI 发现的 Bug 报告让安全邮件列表几乎无法管理
Linus Torvalds 在内核邮件列表上宣布释出 Linux 7.1-rc4 时特别强调了大量涌入的 AI Bug 报告问题。Torvalds 称,AI 报告的持续涌入,让安全邮件列表几乎完全无法管理,不同的人使用相同的工具发现了相同的问题,出现了大量的重复报告。这都是毫无意义的空耗,因为 AI 检测到的 bug 几乎不是秘密,而报告者甚至不看彼此的报告。AI 工具固然好,但前提是真的能提供帮助,而不是造成不必要的麻烦和无意义的虚假工作。请随意使用这些工具,但请确保以高效率且能带来更佳体验的方式使用它们。大部分报告的 bug 都是普通 bug 而不是安全漏洞。 Torvalds 强调,“如果你使用 AI 工具发现了一个 bug,那么很可能其他人也发现了它。”如果你想要创造真正的价值那么最好阅读文档开发一个补丁,不要不经大脑思考就递交报告。
- Windows 11 KB5089549 会导致部分 PC 安装失败
微软证实本月释出的例行安全更新 Windows 11 KB5089549 会导致部分 PC 安装失败,原因是对系统启动至关重要的 EFI 系统分区(ESP)空间不足。如果 ESP 可用空间不足 10 MB,KB5089549 安装会失败,返回 0x800f0922 错误,用户会看到安装卡在了 35-36%,然后回滚,提示空间不足。微软提供了一个临时的修复方法:以管理员身份打开命令提示符。运行以下命令:reg add “HKLM\SYSTEM\CurrentControlSet\Control\Bfsvc /v EspPaddingPercent /t REG_DWORD /d 0 /f”。然后重启受影响的设备。
- Fisker Ocean 车主将其变成一个开源汽车项目
2024 年 6 月 Ocean SUV 制造商 Fisker 公司申请破产,它总共交付了 1.1 万辆电动汽车。斥巨资购买汽车的车主面临汽车失去维修的难题,零部件替换、电池、软件、电子钥匙等问题横在他们面前,如果无法解决他们的汽车将会沦为昂贵的垃圾。接下来发生的堪称电动汽车行业历史上最引人入胜的故事。车主们没有认命,他们组织成立了一个非盈利组织 Fisker Owners Association(FOA),对汽车的私有软件进行逆向工程,破解 CAN 总线网络,在 GitHub 上发布开源工具,最终在 Fisker 的废墟上建起一家由志愿者运营的开源汽车公司。Fisker 不是唯一一家破产的美国电动汽车公司,Nikola、Canoo 和 Arrival 的车主都面临类似的困境。
- AMD Mesa 驱动主开发正为 Valve 工作
AMD Mesa 驱动开发者 Marek Olšák 更新了个人资料,显示他正为 Valve 工作。Marek 是资深 Linux GPU 驱动开发者,长期致力于 AMD Mesa 驱动开发,从 R300g 时代开始参与 Mesa 项目。他最初是 Mesa 项目的独立开发者,在大学毕业后加入 AMD。十多年以来他一直是 AMD Linux Mesa 驱动的主要开发者,也是最活跃的开发者之一。现在他将代表 Valve 继续参与 Mesa 项目开发,估计薪水会更高。Valve 的 Steam Deck 和 Steam Machine 采用了定制的 AMD SoC,是 AMD 开源驱动的重要支持者。
- Eric Schmidt 在毕业典礼上谈 AI 收到了学生的嘘声
前 Google CEO Eric Schmidt 在亚利桑那大学的毕业典礼上谈及了 AI,结果现场学生嘘声四起。Schmidt 说:“我们原以为自己是在为人类几个世纪以来一直构建的知识殿堂添砖加瓦,但我们构建的世界最终却比我们预想的复杂得多。那些连接我们的工具,也让我们彼此疏离。那些赋予每个人发言权的平台——就像你们现在正使用的——却也侵蚀了公共领域。”“我毕业后的几年里,没有人会坐下来决心去开发一种使民主制度极化、扰乱一代年轻人生活的技术。这并非我们的初衷,但它却发生了。”他谈到了 AI:“我知道很多人对此的感受。我能听到你们的声音。你们感到恐惧,你们这一代人害怕未来已被预先设定,害怕机器即将到来,害怕工作岗位在消失,害怕气候在恶化,害怕政治四分五裂,害怕你们正继承一个并非由你们造成的烂摊子。”他称这些恐惧是“合理的”,但他鼓励毕业生适应这项技术,并极参与塑造它未来的应用方式。“问题不在于 AI 是否会塑造世界。它肯定会。问题在于你们是否会塑造 AI。”
- 调查显示六成 PC 玩家未来两年没有升级 PC 的计划
对逾 1500 名用户的调查显示,六成 PC 玩家未来两年没有升级 PC 或组装新 PC 的计划。AI 热导致 DRAM 芯片供应严重短缺,进而导致使用到 DRAM 的 PC 组件如内存、SSD 以及显卡等价格上涨,其中内存价格飙升了数倍之多。接受调查的用户中,15% 的人表示会在未来两年内组装 PC,25% 的人计划在未来 12 个月内尝试组装一台新 PC。有很多人都在等待电商平台的促销活动,希望届时价格会略有下降,当然价格回落到一年前是不可能的。
- 刚果再次爆发埃博拉疫情
WHO 周日(5月17日)将刚果民主共和国(DRC)与乌干达暴发的埃博拉疫情列为“国际关注的突发公共卫生事件”。这次疫情尚未达到《国际卫生条例》所定义的大流行紧急事件标准,但与刚果民主共和国接壤的国家面临极高的进一步扩散风险。此次疫情由 Bundibugyo 毒株引发,已在刚果民主共和国造成数十人死亡。截至周六,该国伊图里省已报告 80 例疑似死亡病例、8 例实验室确诊病例,以及 246 例疑似病例。刚果卫生部长表示:“Bundibugyo 毒株目前没有疫苗,也没有针对性地治疗方法。这一毒株的致死率非常高,可高达50%。”乌干达首都坎帕拉在周五和周六也报告了两例实验室确诊病例,其中一人死亡。这两名患者均为从刚果民主共和国入境的人员。这是自 1976 年发现埃博拉病毒以来,刚果民主共和国发生的第 17 次埃博拉疫情。
OrangeBot Weekly
5 Claude Code skills worth using each week — with my verdict on what’s actually good. No hype.