WEEK · 2026-W20

Weekly Digest — 2026-W20

218 unique stories (2026-05-112026-05-17), aggregated across 8 sources.

Hacker News(42)

  1. TanStack NPM Packages Compromised (github.com)
  2. GitLab Announces Workforce Reduction and End of Their CREDIT Values (about.gitlab.com)
  3. Can someone please explain whether Cloudflare blackmailed Canonical? (www.flyingpenguin.com)
  4. Microsoft Israel chief leaves amid ethical controversy (en.globes.co.il)
  5. CUDA-oxide: Nvidia's official Rust to CUDA compiler (nvlabs.github.io)
  6. Nullsoft, 1997-2004 (2004) (slate.com)
  7. Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model (github.com)
  8. CERT is releasing six CVEs for serious security vulnerabilities in dnsmasq (lists.thekelleys.org.uk)
  9. Googlebook (googlebook.google)
  10. Canada’s Bill C-22 Is a Repackaged Version of Last Year’s Surveillance Nightmare (www.eff.org)
  11. Instructure pays ransom to Canvas hackers (www.insidehighered.com)
  12. Why senior developers fail to communicate their expertise (www.nair.sh)

GitHub Trending(23)

  1. bytedance / UI-TARS-desktop
  2. CloakHQ / CloakBrowser
  3. yikart / AiToEarn
  4. playcanvas / supersplat
  5. datawhalechina / easy-vibe
  6. decolua / 9router
  7. tinyhumansai / openhuman
  8. rohitg00 / agentmemory
  9. apernet / hysteria
  10. mattpocock / skills
  11. anonfaded / FadCam
  12. obra / superpowers

Product Hunt(41)

  1. Snapseed 4.0

    Google’s best photo editor just got seriously better

  2. articuler.ai

    Describe your goal. Meet the right professional.

  3. Known Agents

    Track the bots and AI agents crawling your website

  4. Grok Connectors

    Bring your daily apps into Grok

  5. Genpire

    Make Real Products with AI, literally.

  6. Graphbit PRFlow

    AI code reviewer that catches what others miss

  7. Vexilo

    Claude Code planner w/ 31 agents, 92 commands, + 121 skills

  8. Seer Platform

    The fastest way to go from idea to physical product

  9. Pixcode

    A self-hosted control room for AI coding agents

  10. ARKAD Wallet

    The budgeting app you’ll actually use.

  11. display.dev

    Publish agent-generated HTML behind company auth

  12. Hyperswitch Prism

    Library to plug-n-switch payment processors

Hugging Face(31)

  1. Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

    Scaling Diffusion Transformers (DiTs) to hundreds of layers introduces a structural vulnerability: networks can enter a silent, mean-dominated collapse state that homogenizes token representations and suppresses centered variation. Through mechanistic auditing, we isolate the trigger event of this collapse as Mean Mode Screaming (MMS). MMS can occur even when training appears stable, with a mean-coherent backward shock on residual writers that opens deep residual branches and drives the network into a mean-dominated state. We show this behavior is driven by an exact decomposition of these gradients into mean-coherent and centered components, compounded by the structural suppression of attention-logit gradients through the null space of the Softmax Jacobian once values homogenize. To address this, we propose Mean-Variance Split (MV-Split) Residuals, which combine a separately gained centered residual update with a leaky trunk-mean replacement. On a 400-layer single-stream DiT, MV-Split prevents the divergent collapse that crashes the un-stabilized baseline; it tracks close to the baseline's pre-crash trajectory while remaining substantially better than token-isotropic gating methods such as LayerScale across the full schedule. Finally, we present a 1000-layer DiT as a scale-validation run at boundary scales, establishing that the architecture remains stably trainable at extreme depth.

  2. MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

    With the rise of online dance-video platforms and rapid advances in AI-generated content (AIGC), music-driven dance generation has emerged as a compelling research direction. Despite substantial progress in related domains such as music-driven 3D dance generation, pose-driven image animation, and audio-driven talking-head synthesis, existing methods cannot be directly adapted to this task. Moreover, the limited studies in this area still struggle to jointly achieve high-quality visual appearance and realistic human motion. Accordingly, we present MACE-Dance, a music-driven dance video generation framework with cascaded Mixture-of-Experts (MoE). The Motion Expert performs music-to-3D motion generation while enforcing kinematic plausibility and artistic expressiveness, whereas the Appearance Expert carries out motion- and reference-conditioned video synthesis, preserving visual identity with spatiotemporal coherence. Specifically, the Motion Expert adopts a diffusion model with a BiMamba-Transformer hybrid architecture and a Guidance-Free Training (GFT) strategy, achieving state-of-the-art (SOTA) performance in 3D dance generation. The Appearance Expert employs a decoupled kinematic-aesthetic fine-tuning strategy, achieving state-of-the-art (SOTA) performance in pose-driven image animation. To better benchmark this task, we curate a large-scale and diverse dataset and design a motion-appearance evaluation protocol. Based on this protocol, MACE-Dance also achieves state-of-the-art performance. Code is available at https://github.com/AMAP-ML/MACE-Dance.

  3. Flow-OPD: On-Policy Distillation for Flow Matching Models

    Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward hacking. Inspired by the success of On-Policy Distillation (OPD) in the large language model community, we propose Flow-OPD, the first unified post-training framework that integrates on-policy distillation into Flow Matching models. Flow-OPD adopts a two-stage alignment strategy: it first cultivates domain-specialized teacher models via single-reward GRPO fine-tuning, allowing each expert to reach its performance ceiling in isolation; it then establishes a robust initial policy through a Flow-based Cold-Start scheme and seamlessly consolidates heterogeneous expertise into a single student via a three-step orchestration of on-policy sampling, task-routing labeling, and dense trajectory-level supervision. We further introduce Manifold Anchor Regularization (MAR), which leverages a task-agnostic teacher to provide full-data supervision that anchors generation to a high-quality manifold, effectively mitigating the aesthetic degradation commonly observed in purely RL-driven alignment. Built upon Stable Diffusion 3.5 Medium, Flow-OPD raises the GenEval score from 63 to 92 and the OCR accuracy from 59 to 94, yielding an overall improvement of roughly 10 points over vanilla GRPO, while preserving image fidelity and human-preference alignment and exhibiting an emergent 'teacher-surpassing' effect. These results establish Flow-OPD as a scalable alignment paradigm for building generalist text-to-image models.

  4. HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents

    Existing multimodal search agents process target entities sequentially, issuing one tool call per entity and accumulating redundant interaction rounds whenever a query decomposes into independent sub-retrievals. We argue that effective multimodal agents should search wider rather than longer: dispatching multiple grounded queries concurrently within a round. To this end, we present HyperEyes, a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurrent search across multiple entities while treating inference efficiency as a first-class training objective. HyperEyes is trained in two stages. For cold-start supervision, we develop a Parallel-Amenable Data Synthesis Pipeline covering visual multi-entity and textual multi-constraint queries, curating efficiency-oriented trajectories via Progressive Rejection Sampling. Building on this, our central contribution, a Dual-Grained Efficiency-Aware Reinforcement Learning framework, operates at two levels. At the macro level, we propose TRACE (Tool-use Reference-Adaptive Cost Efficiency), a trajectory-level reward whose reference is monotonically tightened during training to suppress superfluous tool calls without restricting genuine multi-hop search. At the micro level, we adapt On-Policy Distillation to inject dense token-level corrective signals from an external teacher on failed rollouts, mitigating the credit-assignment deficiency of sparse outcome rewards. Since existing benchmarks evaluate accuracy as the sole metric, omitting inference cost, we introduce IMEB, a human-curated benchmark of 300 instances that jointly evaluates search capability and efficiency. Across six benchmarks, HyperEyes-30B surpasses the strongest comparable open-source agent by 9.9% in accuracy with 5.3x fewer tool-call rounds on average.

  5. Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

    Reinforcement learning with verifiable rewards (RLVR) has become a standard approach for large language models (LLMs) post-training to incentivize reasoning capacity. Among existing recipes, group-based policy gradient is prevalent, which samples a group of responses per prompt and updates the policy via group-relative advantage signals. This work reveals that these optimization strategies share a common geometric structure: each implicitly defines a target distribution on the response simplex and projects toward it via first-order approximation. Building on this insight, we propose Listwise Policy Optimization (LPO) to explicitly conduct the target-projection, which demystifies the implicit target by restricting the proximal RL objective to the response simplex, and then projects the policy via exact divergence minimization. This framework provides (i) monotonic improvement on the listwise objective with bounded, zero-sum, and self-correcting projection gradients, and (ii) flexibility in divergence selection with distinct structural properties through the decoupled projection step. On diverse reasoning tasks and LLM backbones, LPO consistently improves training performance over typical policy gradient baselines under matched targets, while intrinsically preserving optimization stability and response diversity.

  6. LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

    Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. However, existing TTS strategies are largely hand-crafted: researchers manually design reasoning patterns and tune heuristics by intuition, leaving much of the computation-allocation space unexplored. We propose an environment-driven framework, AutoTTS, that changes what researchers design: from individual TTS heuristics to environments where TTS strategies can be discovered automatically. The key to AutoTTS lies in environment construction: the discovery environment must make the control space tractable and provide cheap, frequent feedback for TTS search. As a concrete instantiation, we formulate width--depth TTS as controller synthesis over pre-collected reasoning trajectories and probe signals, where controllers decide when to branch, continue, probe, prune, or stop and can be evaluated cheaply without repeated LLM calls. We further introduce beta parameterization to make the search tractable and fine-grained execution trace feedback to improve discovery efficiency by helping the agent diagnose why a TTS program fails. Experiments on mathematical reasoning benchmarks show that the discovered strategies improve the overall accuracy--cost tradeoff over strong manually designed baselines. The discovered strategies generalize to held-out benchmarks and model scales, while the entire discovery costs only $39.9 and 160 minutes. Our data, and code will be open-source at https://github.com/zhengkid/AutoTTS.

  7. Qwen-Image-2.0 Technical Report

    We present Qwen-Image-2.0, an omni-capable image generation foundation model that unifies high-fidelity generation and precise image editing within a single framework. Despite recent progress, existing models still struggle with ultra-long text rendering, multilingual typography, high-resolution photorealism, robust instruction following, and efficient deployment, especially in text-rich and compositionally complex scenarios. Qwen-Image-2.0 addresses these challenges by coupling Qwen3-VL as the condition encoder with a Multimodal Diffusion Transformer for joint condition-target modeling, supported by large-scale data curation and a customized multi-stage training pipeline. This enables strong multimodal understanding while preserving flexible generation and editing capabilities. The model supports instructions of up to 1K tokens for generating text-rich content such as slides, posters, infographics, and comics, while significantly improving multilingual text fidelity and typography. It also enhances photorealistic generation with richer details, more realistic textures, and coherent lighting, and follows complex prompts more reliably across diverse styles. Extensive human evaluations show that Qwen-Image-2.0 substantially outperforms previous Qwen-Image models in both generation and editing, marking a step toward more general, reliable, and practical image generation foundation models.

  8. CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models

    Recent "Thinking with Video" approaches use Video Generation Models (VGMs) for visual reasoning by producing temporally coherent Chain-of-Frames as reasoning artifacts. Even strong VGMs, however, exhibit two recurring failure modes on goal-directed tasks: long-horizon drift on multi-step tasks and mid-clip simulation errors that compound. Both stem from the absence of explicit reasoning built upon the VGM's short-horizon visual prior, a role naturally filled by Vision-Language Models (VLMs), but where to place the VLM is non-trivial: upfront plans commit before any frame is generated and post-hoc critiques over whole videos intervene too late. We propose VLM-VGM Collaborative Video Reasoning (CollabVR), a closed-loop framework that couples the VLM with the VGM at step-level granularity: the VLM plans the immediate next action, inspects the clip the VGM generates, and folds the verifier's diagnosis directly into the next action prompt to repair detected failures. On Gen-ViRe and VBVR-Bench, CollabVR improves both open-source and closed-source VGMs over single-inference, Pass@k, and prior test-time scaling baselines at matched compute, with the largest gains on the hardest tasks. It also yields further improvements on top of a reasoning-fine-tuned VGM, indicating that step-level VLM supervision is orthogonal to and stackable with reasoning-oriented fine-tuning. We provide video samples and additional qualitative results at our project page: https://joow0n-kim.github.io/collabvr-project-page.

  9. TMAS: Scaling Test-Time Compute via Multi-Agent Synergy

    Test-time scaling has become an effective paradigm for improving the reasoning ability of large language models by allocating additional computation during inference. Recent structured approaches have further advanced this paradigm by organizing inference across multiple trajectories, refinement rounds, and verification-based feedback. However, existing structured test-time scaling methods either weakly coordinate parallel reasoning trajectories or rely on noisy historical information without explicitly deciding what should be retained and reused, limiting their ability to balance exploration and exploitation. In this work, we propose TMAS, a framework for scaling test-time compute via multi-agent synergy. TMAS organizes inference as a collaborative process among specialized agents, enabling structured information flow across agents, trajectories, and refinement iterations. To support effective cross-trajectory collaboration, TMAS introduces hierarchical memories: the experience bank reuses low-level reliable intermediate conclusions and local feedback, while the guideline bank records previously explored high-level strategies to steer subsequent rollouts away from redundant reasoning patterns. Furthermore, we design a hybrid reward reinforcement learning scheme tailored to TMAS, which jointly preserves basic reasoning capability, enhances experience utilization, and encourages exploration beyond previously attempted solution strategies. Extensive experiments on challenging reasoning benchmarks demonstrate that TMAS achieves stronger iterative scaling than existing test-time scaling baselines, while hybrid reward training further improves scaling effectiveness and stability across iterations. Code and data are available at https://github.com/george-QF/TMAS-code.

  10. PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents

    A LaTeX manuscript that compiles without error is not necessarily publication-ready. The resulting PDFs frequently suffer from misplaced floats, overflowing equations, inconsistent table scaling, widow and orphan lines, and poor page balance, forcing authors into repetitive compile-inspect-edit cycles. Rule-based tools are blind to rendered visuals, operating only on source code and log files. Text-only LLMs perform open-loop text editing, unable to predict or verify the two-dimensional layout consequences of their changes. Reliable typesetting optimization therefore requires a visual closed loop with verification after every edit. We formalize this problem as Visual Typesetting Optimization (VTO), the task of transforming a compilable LaTeX paper into a visually polished, page-budget-compliant PDF through iterative visual verification and source-level revision, and introduce a five-category taxonomy of typesetting defects to guide diagnosis. We present PaperFit, a vision-in-the-loop agent that iteratively renders pages, diagnoses defects, and applies constrained repairs. To benchmark VTO, we construct PaperFit-Bench with 200 papers across 10 venue templates and 13 defect types at different difficulty. Extensive experiments show that PaperFit outperforms all baselines by a large margin, establishing that bridging the gap from compilable source to publication-ready PDF requires vision-in-the-loop optimization and that VTO constitutes a critical missing stage in the document automation pipeline.

  11. SEIF: Self-Evolving Reinforcement Learning for Instruction Following

    Instruction following is a fundamental capability of large language models (LLMs), yet continuously improving this capability remains challenging. Existing methods typically rely either on costly external supervision from humans or strong teacher models, or on self-play training with static-difficulty instructions that cannot evolve as the model's capabilities improve. To address these limitations, we propose SEIF (Self-Evolving Reinforcement Learning for Instruction Following), a self-evolving framework for enhancing the instruction-following ability of LLMs. SEIF forms a closed self-evolution loop that improves the model's instruction-following ability, where instruction difficulty evolution and model capability evolution reinforce each other. SEIF consists of four roles: an Instructor that generates increasingly challenging instructions, a Filter that removes conflicting or invalid instructions to ensure data quality, a Follower that learns to follow evolved instructions, and a Judger that provides reward signals for reinforcement learning. The Instructor and Follower are alternately trained and co-evolve throughout the process. Experiments across multiple model scales and architectures show that SEIF consistently improves instruction-following performance, suggesting strong generality. Further analyses reveal the sources of improvement and identify an effective training strategy for self-evolution on open-ended tasks: sufficient early-stage training to build a solid foundation, followed by moderate late-stage training to mitigate overfitting and achieve better final performance. The code and data are publicly available at https://github.com/Rainier-rq1/SEIF.

  12. WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

    Commercial video generation systems such as Seedance2.0 and Veo3.1 have rapidly improved, strengthening the view that video generators may be evolving into "world simulators." Yet the community still lacks a benchmark that directly tests whether a model can reason about how an observed world should evolve over time. We introduce WorldReasonBench, which reframes video generation evaluation as world-state prediction: given an initial state and an action, can a model generate a future video whose state evolution remains physically, socially, logically, and informationally consistent? WorldReasonBench contains 436 curated test cases with structured ground-truth QA annotations spanning four reasoning dimensions and 22 subcategories. We evaluate generated videos with a human-aligned two-part methodology: Process-aware Reasoning Verification uses structured QA and reasoning-phase diagnostics to detect temporal and causal failures, while Multi-dimensional Quality Assessment scores reasoning quality, temporal consistency, and visual aesthetics for ranking and reward modeling. We further introduce WorldRewardBench, a preference benchmark with approximately 6K expert-annotated pairs over 1.4K videos, supporting pair-wise and point-wise reward-model evaluation. Across modern video generators, our results expose a persistent gap between visual plausibility and world reasoning: videos can look convincing while failing dynamics, causality, or information preservation. We will release our benchmarks and evaluation toolkit to support community research on genuinely world-aware video generation at https://github.com/UniX-AI-Lab/WorldReasonBench/.

Techmeme(42)

  1. Musk v. Altman: Satya Nadella says Elon Musk never contacted him with concerns that Microsoft's investments in OpenAI violated any special terms or commitments (CNBC)

    CNBC : Musk v. Altman: Satya Nadella says Elon Musk never contacted him with concerns that Microsoft's investments in OpenAI violated any special terms or commitments —  Microsoft CEO Satya Nadella took the stand in the Musk v. Altman trial on Monday, where he testified that Elon Musk never contacted …

  2. Musk v. Altman: Ilya Sutskever testifies that his OpenAI stake is worth ~$7B and he had concerns about Altman for a year before Altman's brief ouster as CEO (Rachel Metz/Bloomberg)

    Rachel Metz / Bloomberg : Musk v. Altman: Ilya Sutskever testifies that his OpenAI stake is worth ~$7B and he had concerns about Altman for a year before Altman's brief ouster as CEO —  OpenAI co-founder and former chief scientist Ilya Sutskever said his stake in the ChatGPT maker is worth roughly $7 billion …

  3. Google's TIG says it likely thwarted the use of an AI-generated zero-day in a "mass exploitation event" and tools like OpenClaw are being used to find exploits (Samantha Subin/CNBC)

    Samantha Subin / CNBC : Google's TIG says it likely thwarted the use of an AI-generated zero-day in a “mass exploitation event” and tools like OpenClaw are being used to find exploits —  Google's Threat Intelligence Group said in a report on Monday that it thwarted an effort by hackers …

  4. Sources: the White House's Office of the National Cyber Director and Commerce Department's CAISI are fighting over which agency should lead AI model evaluations (Washington Post)

    Washington Post : Sources: the White House's Office of the National Cyber Director and Commerce Department's CAISI are fighting over which agency should lead AI model evaluations —  As the White House grapples with cybersecurity threats from artificial intelligence models, intelligence officials want sway in AI policy overseen by Commerce.

  5. An Anthropic engineer argues HTML is a better output format for AI agents than Markdown, citing information density, ease of sharing, and two-way interaction (@trq212)

    @trq212 : An Anthropic engineer argues HTML is a better output format for AI agents than Markdown, citing information density, ease of sharing, and two-way interaction —  Using Claude Code: The Unreasonable Effectiveness of HTML

  6. Apple releases iOS 26.5, introducing end-to-end encryption for RCS messaging in beta with supported carriers; the setting is enabled by default (Chance Miller/9to5Mac)

    Chance Miller / 9to5Mac : Apple releases iOS 26.5, introducing end-to-end encryption for RCS messaging in beta with supported carriers; the setting is enabled by default —  iOS 26.5 is now available to everyone after six weeks of beta testing.  The update adds fresh wallpapers, new features to Apple Maps, and more.

  7. Sources: Anthropic is in early talks to raise at least $30B at a $900B+ valuation; the round is expected to close as soon as the end of this month (Bloomberg)

    Bloomberg : Sources: Anthropic is in early talks to raise at least $30B at a $900B+ valuation; the round is expected to close as soon as the end of this month —  Anthropic PBC is in early talks with investors to raise at least $30 billion in fresh financing, according to people familiar with the matter …

  8. Qualcomm closed down 11.46% on Tuesday as chip stocks pull back from record AI-driven rally; Intel closed down 6.82%, Sandisk dropped 6%, and Micron 3.61% (Samantha Subin/CNBC)

    Samantha Subin / CNBC : Qualcomm closed down 11.46% on Tuesday as chip stocks pull back from record AI-driven rally; Intel closed down 6.82%, Sandisk dropped 6%, and Micron 3.61% —  Chip stocks dropped on Tuesday, pulling back from a massive rally that broadened the artificial intelligence trade beyond Nvidia and propelled the sector to new highs.

  9. Samsung and its South Korean labor union fail to reach a pay deal; the union has said workers will strike for 18 days from May 21 if its demands are not met (Reuters)

    Reuters : Samsung and its South Korean labor union fail to reach a pay deal; the union has said workers will strike for 18 days from May 21 if its demands are not met —  Samsung Electronics (005930.KS) and its South Korean labor union failed to reach a pay deal on Wednesday, its union leader said …

  10. Meta schedules its annual Connect event for September 23-24 and says the event will focus on "the latest in VR, wearables, metaverse, and AI" (Ben Lang/Road to VR)

    Ben Lang / Road to VR : Meta schedules its annual Connect event for September 23-24 and says the event will focus on “the latest in VR, wearables, metaverse, and AI” —  Meta's annual Connect event is set to return on September 23-24.  The company teased what appears to be a new pair of smart glasses …

  11. Meta offers to give rival AI chatbots free access to WhatsApp for a month while it discusses commitments with EU antitrust regulators to address their concerns (Foo Yun Chee/Reuters)

    Foo Yun Chee / Reuters : Meta offers to give rival AI chatbots free access to WhatsApp for a month while it discusses commitments with EU antitrust regulators to address their concerns —  Meta Platforms (META.O) has offered to give rival AI chatbots free access to its social messaging service WhatsApp for a month …

  12. CME Group and Silicon Data announce a futures market for computing capacity, with contracts based on daily GPU benchmarks for on-demand rental rates (Tobias Burns/CNBC)

    Tobias Burns / CNBC : CME Group and Silicon Data announce a futures market for computing capacity, with contracts based on daily GPU benchmarks for on-demand rental rates —  A new futures market for semiconductors will let traders hedge their artificial intelligence investments with bets on the increasingly expensive price of computing power.

Solidot(39)

  1. 美国分析师称主权云在中美之外很难实现

    Gartner 副总裁 Douglas Toombs 认为完全拥有自主权的主权云在中美之外不太可能实现。他称只有美国和中国拥有主权云所需的所有技术。即使 AWS Outposts、Azure Local 或 Oracle Dedicated Cloud Regions 之类的本地云服务也需要与母公司通信。他认为欧洲的主权云的尝试不会成功,并引用了波士顿咨询集团的“三四法则(The Rule of Three and Four)”:一个稳定的竞争市场中的主要竞争对手的数量永远不会超过三个,其中最大的竞争对手的市场份额不会超过最小竞争对手的四倍。他预测云市场将围绕 AWS、Google 和微软三家公司稳定下来。

  2. 本田新专利是为电动摩托车模拟离合器

    最近披露的一项专利显示本田正在为电动摩托车开发模拟离合器,在电动摩托车上模拟传统燃油摩托车的驾驶体验。模拟离合器系统提供了扭矩增强起步功能,甚至还有触觉反馈。系统利用电子元件根据离合器杆的位置改变电机响应。半拉离合器,系统会按比例降低电机输出;完全拉起离合器,动力会完全切断。根据专利,骑手在离合器拉住的情况下,先扭转电子油门,让电机处于高转速状态,然后快速松开离合器,从而实现类似燃油摩托车的“爆发式起步”效果。这种技巧在竞技场景中可帮助骑手在松软地形或起步时获得更快的加速。专利还描述了安装在车把和离合器杆附近的多个振动电机,用于提供触觉反馈,模拟发动机振动,甚至模拟离合器接合时的“咬合点”感觉。

  3. Mythos 发现了一个 curl 漏洞

    Anthropic 上个月宣布的新 AI 模型 Mythos 引发了媒体的广泛关注,它宣传 Mythos 能极其精确的发现源代码中的安全漏洞。它的识别能力如此强大以至于 Anthropic 暂不向公众发布该模型,而是先提供给少数几家公司,以便于它们能优先解决其发现的安全漏洞。curl 维护者 Daniel Stenberg 认为这是一次极其成功的营销噱头。curl 是广泛使用的开源项目,因此他获得了 Mythos 的访问权限。curl 目前包含了 17.6 万行 C 代码,共 66 万个单词。Mythos 最终返回了一份安全报告,声称确认了五个安全漏洞。但 curl 的安全团队在仔细检查后发现其中 3 个是误报,1 个是 Bug,还有 1 个是低危级别的安全漏洞,将会在下个月释出的版本中修复。安全报告还详细纪录了约 20 个 bug,基本上都是正确的。Stenberg 表示他没有看到任何证据表明 Mythos 在发现安全漏洞上比之前的其它工具更胜一筹,Mythos 可能略好一点,但不足以对代码分析产生显著影响。

  4. 《Forza Horizon 6》游戏文件提前 10 天泄露

    微软旗下工作室 Playground Games 提前 10 天向 Steam 上传了《Forza Horizon 6》的未加密游戏文件。多个盗版网站已经放出了《Forza Horizon 6》的下载。《Forza Horizon 6》是以日本为背景的赛车游戏,预计于 5 月 19 日正式发售,游戏容量约 155 GB。这不是第一次 3A 游戏作品以这种方式泄露,今年 3 月小岛工作室游戏《Death Stranding 2》的 PC 版本也是在发售前几天以未加密的方式将游戏文件上传到 Steam。

  5. 你继承了父亲的 RNA

    南京大学的生化学家 Xin Yin 在一个明媚的下午给小鼠当私人训练员,将小鼠放到小型跑步机上跑步。它们都是运动健将,比对照组跑的更久,乳酸积累也更少。但这些小鼠和对照组在基因上并无差异,它们之所以运动表现更出色可能与它们的父亲的运动习惯相关。这一发现表明,跑步不仅对运动者本人有益,也可能对未出生的孩子有益。Xin Yin 的团队发现,运动小鼠精子中被称为 microRNA 的 RNA 片段浓度比不运动小鼠高。将这些分子注射到不相关的胚胎内,产下的后代与有运动习惯的父亲的后代的运动表现一样出色。过去二十年对小鼠的研究发现,除了 DNA,精子还会将 microRNA 等 RNA 片段遗传给后代。这些 RNA 片段的浓度会随运动或懒惰、高脂肪或高糖饮食、日常压力、童年创伤、酗酒以及接触杀虫剂有害物质等因素发生波动。研究发现,父母超重或承受心理健康压力其后代也更容易出现这些状况。

  6. 近 3000 篇同行评审医学论文包含虚假引用

    哥伦比亚大学护理学院一项使用 AI 展开的评估发现,近 3000 篇同行评审医学论文包含了虚假引用,这些引用在科学数据库里并不存在,是 AI 捏造的。研究团队开发了一个自动化验证系统,使用 AI 扫描了 2023 年 1 月 1 日至 2026 年 2 月 18 日期间发表在 PubMed Central 开放获取数据库中的 250 万篇论文,在 9710 万个已验证的引用文献中,研究人员在 2810 篇论文中发现了 4046 个虚假引用。自 2023 年以来,虚假引用率增长了 12 倍以上,2024 年中期开始出现最为显著的增长,这与 AI 写作工具的流行吻合。研究人员称,他们发现一篇论文的 30 个引用中有 18 个是虚假的。部分虚假引用已被其他论文引用,出现在为临床诊疗提供依据的系统评价中。

  7. 社媒上的毒性

    2025 年 12 月斯坦福大学的研究人员分析了 22 亿条社媒帖子,寻找模式识别发布有毒内容的用户比例。所谓有毒内容指的是充斥着仇恨的极端主义内容。那么发布有毒内容的用户比例多高呢?可能比你想象的低得多,但此类内容被推荐算法放大而让很多人以为它们是主流。在 Twitter/X 上,有毒推文的转发量比非有毒推文高约 86%,曝光度高约 27%;0.3% 的用户分享了 80% 的争议新闻;6% 的用户发布了约 73% 的政治推文。在 TikTok 上,25% 的用户发布了 98% 的公开视频。具体数字有所不同,但本质相同:少数活跃用户压倒了绝大多数用户。研究人员发现的社媒模式是:沉默的大多数,因担心表达异议而社交孤立,大多数用户要么保持沉默要么离开平台,将平台空间让给了表达极端观点的用户;积极发帖的少数人会陷入认知偏差,认为自己属于多数派。

  8. 土星冰环可能源自其卫星

    长期以来,土星环究竟是如何形成的,一直都是争论的焦点。最新的数值模拟指出,壮丽的行星环系统并非与土星同时诞生,而是在约 1 亿年前才形成。这项由美中联合研究团队提出的假说,将环的起源归功于一颗被命名为蝶蛹(Chrysalis)的古老卫星,在强大引力作用下发生的结构性毁灭。该卫星的物理规格与现今的土星第三大卫星土卫八(Iapetus)相仿,直径约 1,469 公里,且具备分层化的内部结构,由岩石核心与外层冰壳组成。研究指出,蝶蛹卫星原本运行于非常狭长的椭圆轨道,最近轨道距离土星半径的1至1.5倍区域,这正是冰质天体的洛希极限(Roche limit)临界范围。在此区域内,土星强大的潮汐力克服了卫星自身的结构强度,迫使其在引力撕扯下发生彻底的崩解。卫星解体后的残骸大部分被土星引力捕获,历经演化后形成了广阔的行星环,其余部分则逃逸至太空。研究显示,初期的土星环规模可能远超现今观测所见,但随后受到土卫六(Titan)等大型卫星的引力影响,大量物质被移除或重新分配。

  9. 欧盟准备对 TikTok 和 Instagram 的成瘾性设计采取行动

    欧盟委员会主席 Ursula von der Leyen 周二表示欧盟将在今年晚些时候对 TikTok 和 Instagram 等平台上的成瘾性设计功能采取行动。此类功能包括了无限滚动、自动播放和推送通知。欧盟委员会最早将在今年夏天公布一项法律提议,目前正在等待 Special Panel of experts on Child Safety Online 的调查报告。

  10. 研究发现工作时间减少与肥胖率下降相关

    欧洲肥胖大会公布的一项研究比较了 1990-2022 年间 33 个经合组织国家的工作模式和肥胖率。结果发现,美国、墨西哥和哥伦比亚等年工作时间较长的国家肥胖率也更高,即使北欧国家的平均能量和脂肪摄入量高于拉美国家。年工作时间减少 1% 与肥胖率下降 0.16% 相关。研究人员认为,工作压力和缺乏锻炼时间可能是工作时长更多的人容易发胖的原因。研究主要作者、澳大利亚昆士兰大学的 Pradeepa Korale-Gedara 博士表示,压力增加会提高皮质醇激素水平,导致人们在无法消耗能量的工作中储存更多脂肪。研究人员强调这一发现是相关性的,并不代表因果关系。但它促使专家再次呼吁推行四天工作制,四天工作制有助于人们在饮食、运动和睡眠方面做出更健康的选择,有助于促进整个社会的健康。

  11. Digg 再次尝试重启,将转向 AI 新闻聚合

    Digg 今年一月初上线了一个 Reddit 克隆版本,提供类似的基于兴趣的社区。但两个月后就宣布关闭,理由是机器人账号泛滥。现在 Digg 准备再次尝试重启,这一次是转向它曾经的模样:新闻聚合。Digg 向 Beta 测试用户展示了新网站的预览,目标是追踪某个领域最具影响力的声音,推送真正值得关注的新闻。AI 是 Digg 目前测试的领域,如果成功将扩展到其他主题。Digg 会实时从 X 抓取内容以判断讨论热点,同时还会进行情感分析、聚类分析和信号检测,判断哪些内容最重要。

  12. Forza Horizon 6 开发商严惩玩泄密版本的玩家

    《Forza Horizon 6》游戏文件在上市 10 天前就提前泄密,盗版网站比正版网站提前放出了可游戏版本。开发商 Playground Games 证实游戏提前泄露,警告玩家不要玩盗版版本,威胁会进行严惩,包括“全系列封禁和硬件封禁”。一名 YouTube 主播使用泄露版本上传了一段 45 分钟的游戏视频,该玩家随后遭到了终身封禁,其封禁期一直持续到 9999 年 12 月 31 日。玩家即使没有上传视频,如果检测到玩泄露版本也会严惩。《Forza Horizon 6》将于 5 月 19 日发售。