WEEK · 2026-W25

Weekly Digest — 2026-W25

69 unique stories (2026-06-152026-06-21), aggregated across 8 sources.

Hacker News(12)

  1. A backdoor in a LinkedIn job offer (roman.pt)
  2. Typst 0.15.0 (typst.app)
  3. Hetzner Price Adjustment (docs.hetzner.com)
  4. TinyWind: A pixel pirate sailing game with real wind physics (380k+ kms sailed) (tinywind.io)
  5. Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?
  6. My Homelab AI Dev Platform (rsgm.dev)
  7. GrapheneOS has been ported to Android 17 and official releases are coming soon (discuss.grapheneos.org)
  8. U.S. pulling ocean sensors a 'shock' for Canadian research as El Niño nears (www.timescolonist.com)
  9. Apple is about to make Hide My Email useless (arseniyshestakov.com)
  10. Calvin and Hobbes and the price of integrity (therepublicofletters.substack.com)
  11. Stop Using JWTs (gist.github.com)
  12. Is Meta destroying its engineering organization? (newsletter.pragmaticengineer.com)

GitHub Trending(9)

  1. iptv-org / iptv
  2. teslamate-org / teslamate
  3. Panniantong / Agent-Reach
  4. meshery / meshery
  5. chatwoot / chatwoot
  6. krahets / hello-algo
  7. freeCodeCamp / freeCodeCamp
  8. swc-project / swc
  9. puppeteer / puppeteer

Product Hunt(12)

  1. ColibotAI

    Translate, summarize & explain any text on-device

  2. IdleDev

    Get paid while your AI agent thinks

  3. AgentBrush

    Your coding agent's missing tool: image generation

  4. MockPilot

    Turn live websites into editable mockups

  5. VEXI

    Open-source AI coding agent for your terminal

  6. Notchcode

    Claude Code + Codex agents in your notch

  7. looquee

    Copilot for college applications

  8. Ledgerly

    Available on the App Store

  9. Fluxmail

    AI email inbox and assistant

  10. Botme

    AI customer support agent, live on your website in 5 minutes

  11. Athena Desktop

    A local command room for AI coding agents.

  12. Obotiq

    Autonomous care robots that give caregivers time back

Hugging Face(12)

  1. OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

    Cloning camera motion from reference videos is an important task in video generation, as videos provide intuitive and precise control. Existing methods either directly use parametric representations that fail to handle multi-shot generation or synthesize cross-paired data, which suffer from data scarcity, resulting in poor performance in complicated camera motion cloning. To address these issues, we introduce a general camera motion representation that encodes cameras as grid motion videos. This camera grid represents the camera parameters visually and supports the integration of diverse trajectories for multi-shot video generation. Building upon this, we propose OmniDirector, a unified framework trained on a million-scale camera grid-video pairs that coordinates characters, actions, and cameras to provide director-level control for multimodal diffusion transformers. Furthermore, we design a novel hierarchical prompt expansion agent that harmoniously integrates different control signals by systematically describing camera motion and visual content through understanding signal relationships. Extensive experiments demonstrate the superior performance and outstanding controllability of our framework. Project page: https://ymlinfeng.github.io/OmniDirector.github.io/

  2. APPO: Agentic Procedural Policy Optimization

    Recent advances in agentic Reinforcement Learning (RL) have substantially improved the multi-turn tool-use capabilities of large language model agents. However, most existing methods assign credit over coarse heuristic units, such as tool-call boundaries or fixed workflows, making it difficult to identify which intermediate decisions influence downstream outcomes. In this work, we study agentic RL from two perspectives: where to branch and how to assign credit after branching. Our pilot analysis shows that influential decision points are broadly distributed throughout the generated sequence rather than concentrated at tool calls, while token entropy alone does not reliably reflect their impact on final outcomes. Motivated by these observations, we propose Agentic Procedural Policy Optimization (APPO), which shifts branching and credit assignment from coarse interaction units to fine-grained decision points in the sequence. APPO selects branching locations using a Branching Score that combines token uncertainty with policy-induced likelihood gains of subsequent continuations, enabling more targeted exploration while filtering out spurious high-entropy positions. It further introduces procedure-level advantage scaling to better distribute credit across branched rollouts. Experiments on 13 benchmarks show that APPO consistently improves strong agentic RL baselines by nearly 4 points, while keeping efficient tool-calls and maintaining behavior interpretability.

  3. Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

    Despite recent progress, LLM agents still struggle with reasoning over long interaction histories. While current memory-augmented agents rely on a static retrieve-then-reason paradigm, this rigid pipeline design prevents them from dynamically adapting memory access to intermediate evidence discovered during inference. To bridge this gap, we propose MRAgent, a framework that combines an associative memory graph with an active reconstruction mechanism. We represent memory as a Cue-Tag-Content graph, where associative tags serve as semantic bridges connecting fine-grained cues to memory contents. Operating on this structure, our active reconstruction mechanism integrates LLM reasoning directly into memory access, allowing the agent to iteratively explore and prune retrieval paths based on accumulated evidence. This ensures that memory retrieval is dynamically adapted to the reasoning context while avoiding combinatorial explosion caused by unconstrained expansion. Experiments on the LoCoMo benchmark and LongMemEval benchmark demonstrate significant improvements over strong baselines (up to 23%), while substantially reducing token and runtime cost, highlighting the effectiveness of active and associative reconstruction for long-horizon memory reasoning.

  4. From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI

    Large Language Models (LLMs) are undergoing a fundamental transformation from conversational generators into integrated AI systems capable of reasoning, action, memory, and self-improvement. We conceptualize this transition as a shift from Chatbot to Digital Colleague: from conversational answers to persistent work. We organize this transition along two tightly coupled dimensions. First, at the cognitive core level, LLMs are advancing from Chatbot-era "fast thinking" systems driven by next-token prediction toward Thinking LLMs that leverage inference-time computation, Chain-of-Thought reasoning, reflection, process supervision, and reinforcement learning to support more deliberate and reliable cognition. Second, at the tool-augmented task execution level, LLMs are progressing from tool-calling Agents that invoke external resources in an ad hoc manner toward OpenClaw-style workstation systems (OpenClaw) equipped with persistent Workspaces, skills, verification loops, and governance. The "Workspace + Skill" paradigm makes episodic tool use colleague-like via state persistence, reusable procedures, task closure, and experience reuse. We examine data construction shifts from instruction-response pairs to State-Action-Observation trajectories and evaluation from static benchmarks to sandboxed, auditable, self-evolving AI ecosystems.

  5. Orchestra-o1: Omnimodal Agent Orchestration

    The recent success of agent swarms has shifted the paradigm of large language model (LLM)-based agents from single-agent workflows to multi-agent systems, highlighting the importance of agent orchestration for task decomposition and collaboration. However, existing orchestration frameworks are limited to a narrow set of modalities and struggle to generalize to more complex settings where heterogeneous modalities coexist and interact. This limitation becomes particularly pronounced in omnimodal scenarios, where tasks require the unified understanding and coordination of diverse inputs such as text, image, audio, and video. In this work, we propose Orchestra-o1, an omnimodal agent orchestration framework designed to support efficient agent collaboration across multiple modalities. Orchestra-o1 introduces a unified orchestration mechanism that enables modality-aware task decomposition, online sub-agent specialization, and parallel sub-task execution. This scalable design allows agent systems to effectively tackle complex real-world tasks involving heterogeneous information sources, surpassing the second-best approach by 10.3% accuracy on the OmniGAIA benchmark. Furthermore, we introduce decision-aligned group relative policy optimization (DA-GRPO), an efficient agentic reinforcement learning approach for training Orchestra-o1-8B, which also achieves state-of-the-art performance against all existing open-source omnimodal agents.

  6. HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

    AI agent performance depends critically on the runtime harness, comprising the prompts, tools, memory, and control flow that mediate how a model observes, reasons, and acts. Yet today's harnesses remain largely hand-crafted and static: each new model or task still demands bespoke scaffolding, and the rich traces produced during execution are rarely distilled back into systematic improvement. We introduce HarnessX, a foundry for composable, adaptive, and evolvable agent harnesses. HarnessX assembles typed harness primitives via a substitution algebra, adapts them through AEGIS, a trace-driven multi-agent evolution engine grounded in an operational mirror between symbolic adaptation and reinforcement learning, and closes the harness-model loop by turning trajectories into both harness updates and model training signal. Across five benchmarks (ALFWorld, GAIA, WebShop, tau^3-Bench, and SWE-bench Verified), HarnessX yields an average gain of +14.5% (up to +44.0%), with gains largest where baselines are lowest. These results suggest that agent progress need not come from model scaling alone: composing and evolving runtime interfaces from execution feedback is an actionable and complementary lever. The complete codebase will be open-sourced in a future release.

  7. JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

    Many moments in the real world do not wait for a user to ask. A fire starts on a security monitor, an expression flickers across a video call, or a product a viewer wants flashes by in a livestream. Yet today's large models remain mostly turn-based by design: they answer only when addressed, and even video-call apps that appear interactive still operate as question-answer systems, reacting only when polled or prompted. We argue for a different paradigm: a model that is present in the world like a person. It continuously watches what is happening now, decides on its own whether to speak or stay silent, interacts in real time, and delegates to a background model when the problem is hard. To advance interaction models and their adoption across domains, we make two fully open-sourced contributions. First, we release JoyAI-VL-Interaction, an 8B-scale, vision-first VL-interaction model. The model makes the response decision internally, choosing each second to stay silent, respond, or delegate to a background model, and it excels at vision-triggered responsiveness and time awareness. We pair it with a transferable training recipe, from which capabilities we never trained for emerge, such as guiding a shopper through changing app screens or improvising a lecture from a slide deck. Second, we release a complete, deployable system built around that model. The system streams any ongoing video into the model, making it genuinely present in the world. All other components are pluggable, including ASR/TTS modules, memory, visualization UI, and a background brain that can connect to any API or agent. Across six real-world scenarios, human raters prefer JoyAI-VL-Interaction over the in-app video-call assistants of Doubao and Gemini by a wide margin. To our knowledge, this is the first open, vision-driven interaction model released together with its training recipe, data, and complete deployable system.

  8. Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

    Data tells stories that shape society; the data journalist's job is to turn raw information into stories non-experts can trust. A high-quality news feature takes a newsroom team weeks: hunting for context, running statistics, choosing an angle, and designing visuals. Recent agents handle individual steps well: data-science agents close the analysis loop, while design agents synthesize beautiful websites. But can an agent serve as a data journalist end to end? We introduce Data Journalist Agent (Data2Story), a multi-agent framework that orchestrates specialized roles into a single virtual newsroom. Data2Story contributes two innovations. (i) Claims are evidence-grounded: an Inspector links every number, angle, and asset back to data, code, or an external reference. (ii) Articles are multimodally generative: rather than defaulting to plain text and static charts, Data2Story reasons about what readers will want to see, then deploys multimodal tools, such as interactive maps for geography and audio for music. We evaluate Data2Story on 18 articles, each paired with the originally published expert piece, along four axes: (a) human-agent angle coverage; (b) rubric evaluation with 53 participants across five dimensions; (c) computer-use agents as judges, a cost-saving proxy for how readers navigate interactive articles; and (d) verifiability, where a coding verifier re-executes statements against the data and checks claims against references. Data2Story produces competitive, evidence-traceable multimedia stories, with particular strength in transparency and auditability. Human articles retain an edge in editorial angle, creative design, and presentation. We position Data2Story as a collaborator for journalists, enabling more evidence-based, transparent, and verifiable reporting. Code and demos are available at https://data2story.github.io.

  9. Geometric Action Model for Robot Policy Learning

    Generalist robot policies must follow user instructions while reasoning about how objects, cameras, and robot actions interact in the 3D physical world. Recent vision-language-action models (VLAs) and video world-action models (WAMs) inherit strong semantic or temporal priors from large-scale foundation models, but they still operate primarily on 2D image frames or 2D-derived latent spaces, leaving implicit the 3D geometry required for contact-rich manipulation. We propose the Geometric Action Model (GAM), a language-conditioned manipulation policy that directly repurposes a pretrained geometric foundation model (GFM) as a shared substrate for perception, temporal prediction, and action decoding. GAM splits the GFM at an intermediate layer: the shallow layers serve as an observation encoder, and a causal future predictor inserted at the split layer forecasts future latent tokens conditioned on language, proprioception, and action history. The predicted future tokens are then routed through the remaining GFM blocks for feature propagation and decoding, allowing a single backbone to produce both future geometry and actions. This design equips the GFM with language-conditioned temporal world modeling through minimal architectural modification while preserving its rich geometric priors. Across a broad suite of simulation and real-robot manipulation benchmarks, GAM is more accurate, more robust, faster, and lighter than current foundation-model-scale baselines.

  10. DreamX-World 1.0: A General-Purpose Interactive World Model

    DreamX-World 1.0 is a general-purpose interactive text/image-to-video world model for controllable long-horizon generation. It supports camera navigation, revisits to previously observed regions, and promptable events across photorealistic, game-style, and stylized domains. Our data engine combines camera-accurate Unreal Engine rendering, action-rich gameplay recordings, and real-world videos with recovered camera geometry. For camera control, we introduce E-PRoPE, a lightweight variant of projective positional encoding that retains PRoPE's projective camera geometry while applying camera-aware attention to spatially reduced tokens. We convert a bidirectional video generator into a few-step autoregressive world model using causal forcing, DMD-style distillation, and long-rollout training. Training on self-generated long-horizon contexts exposes the model to its own generated history and reduces the style and color drift that accumulates across autoregressive chunks. Memory-Conditioned Scene Persistence retrieves earlier views through camera-geometry-based retrieval, while residual recycling makes the conditioning path less sensitive to imperfect memory latents. Event Instruction Tuning adds composable event control, and reinforcement learning alignment recovers camera control and visual quality after distillation. With mixed-precision DiT execution, residual reuse, 75\%-pruned VAE decoding, and asynchronous pipeline parallelism, DreamX-World 1.0 reaches up to 16\,FPS on eight RTX\,5090 GPUs. On our 5-second basic evaluation, DreamX-World 1.0 achieves a camera-control score of 73.75 and an overall score of 84.76, outperforming HY-WorldPlay 1.5 and LingBot-World in overall score, which achieve 80.79 and 80.45, respectively.

  11. FastContext: Training Efficient Repository Explorer for Coding Agents

    Large Language Model (LLM) coding agents have achieved strong results on software engineering tasks, yet repository exploration remains a major bottleneck: locating relevant code consumes substantial token budget and pollutes the agent's context with irrelevant snippets. In most agents, the same model explores the repository and solves the task, leaving exploratory reads and searches in the solver's history. We present FastContext, a dedicated exploration subagent that separates repository exploration from solving. Invoked on demand, FastContext issues parallel tool calls and returns concise file paths and line ranges as focused context. FastContext is powered by specialized exploration models spanning 4B--30B parameters. We bootstrap them from strong reference-model trajectories and refine them with task-grounded rewards for broad first-turn search, multi-turn evidence gathering, and precise citation generation. Across SWE-bench Multilingual, SWE-bench Pro, and SWE-QA, integrating FastContext into Mini-SWE-Agent improves end-to-end resolution rates up to 5.5\% while reducing coding-agent token consumption up to 60\%, with marginal overhead. These results show that repository exploration can be separated from solving and handled effectively by specialized models. Code and data: https://github.com/microsoft/fastcontext

  12. VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

    This technical report introduces VibeThinker-3B, a compact dense model with 3B parameters developed to investigate how far verifiable reasoning can be pushed within a strictly small-model regime. Building upon the Spectrum-to-Signal post-training paradigm, we systematically enhance the model through an optimized pipeline that includes curriculum-based supervised fine-tuning, multi-domain reinforcement learning, and offline self-distillation. Experimental evaluations demonstrate that VibeThinker-3B achieves frontier-level performance on highly demanding verifiable tasks. Specifically, it attains a score of 94.3 on AIME26 (improving to 97.1 with claim-level test-time scaling), an 80.2 Pass@1 on LiveCodeBench v6, and exhibits strong out-of-distribution generalization with a 96.1\% acceptance rate on recent unseen LeetCode contests. This effectively places it in the performance band of first-tier reasoning systems, matching or exceeding flagship models that are orders of magnitude larger, such as DeepSeek V3.2, GLM-5, and Gemini 3 Pro. Furthermore, a score of 93.4 on IFEval confirms that this extreme reasoning enhancement does not compromise strict instruction controllability. Extending our previous 1.5B work, these findings motivate the Parametric Compression-Coverage Hypothesis, which views verifiable reasoning as compressible into compact reasoning cores, while open-domain knowledge and general-purpose competence require broad parameter coverage over facts, concepts, and long-tail scenarios. This perspective suggests that compact models are not merely deployment-efficient substitutes, but a complementary path toward frontier-level performance in parameter-dense capability regimes.

Techmeme(12)

  1. Sources: several Xbox studios, including Hellblade maker Ninja Theory, are in talks with Microsoft to buy themselves back and go independent to avoid closure (Jason Schreier/Bloomberg)

    Jason Schreier / Bloomberg : Sources: several Xbox studios, including Hellblade maker Ninja Theory, are in talks with Microsoft to buy themselves back and go independent to avoid closure —  The studios, which include Compulsion Games and Double Fine, are in active negotiations with Xbox and may be given the chance to go independent.

  2. Source: Qualcomm is in talks to buy AI chip designer Tenstorrent for $8B to $10B; Tenstorrent discussed raising $800M at a ~$3.2B valuation last year (The Information)

    The Information : Source: Qualcomm is in talks to buy AI chip designer Tenstorrent for $8B to $10B; Tenstorrent discussed raising $800M at a ~$3.2B valuation last year —  Qualcomm has been in talks to buy Tenstorrent, a startup that designs chips for AI, according to a person with direct knowledge of the deal.

  3. OpenRouter debuts Fusion, a tool for prompting multiple AI models in parallel, claiming it can achieve "Fable-level intelligence at half the price" (Brian Thomas/OpenRouter Blog)

    Brian Thomas / OpenRouter Blog : OpenRouter debuts Fusion, a tool for prompting multiple AI models in parallel, claiming it can achieve “Fable-level intelligence at half the price” —  We've found that synthesizing the results of multiple models can significantly outperform what individual models are capable of.

  4. SpaceX stock closed up 19.6% on Monday, its first full day of trading; Musk said it "might be able to reach" around $1T revenue in 2030, up from $18.7B in 2025 (Arjun Kharpal/CNBC)

    Arjun Kharpal / CNBC : SpaceX stock closed up 19.6% on Monday, its first full day of trading; Musk said it “might be able to reach” around $1T revenue in 2030, up from $18.7B in 2025 —  SpaceX shares climbed 20% on Monday, the first full day of trading following a record-breaking debut last week on the Nasdaq.

  5. Meta launches new AI features, including an "AI Mode" for search that uses Meta AI to surface answers pulled from public posts across Facebook (Lauren Forristal/TechCrunch)

    Lauren Forristal / TechCrunch : Meta launches new AI features, including an “AI Mode” for search that uses Meta AI to surface answers pulled from public posts across Facebook —  As Meta tries to catch up in the AI race and boost engagement with its AI bot, the company announced Monday that it's rolling …

  6. Arcade, which helps companies manage which actions AI agents are authorized to take, raised a $60M Series A led by SYN Ventures, following a $12M seed in 2025 (Steven Rosenbush/Wall Street Journal)

    Steven Rosenbush / Wall Street Journal : Arcade, which helps companies manage which actions AI agents are authorized to take, raised a $60M Series A led by SYN Ventures, following a $12M seed in 2025 —  The startup aims to help companies manage the challenge of determining which actions AI agents are authorized to take

  7. SpaceX closed up 4.8% on Tuesday with a $2.65T market cap, just above Amazon's, after popping ~12% intraday and briefly overtaking Microsoft's $2.93T market cap (CNBC)

    CNBC : SpaceX closed up 4.8% on Tuesday with a $2.65T market cap, just above Amazon's, after popping ~12% intraday and briefly overtaking Microsoft's $2.93T market cap —  SpaceX shares popped about 12% on Tuesday, as Elon Musk's rocket builder continued its meteoric rise following a record-breaking IPO on Friday.

  8. Sources: PayPal to shutter its 10-year-old PayPal Ventures arm amid a broader shakeup under a new CEO and has hired Jefferies to explore selling some positions (Ben Weiss/Fortune)

    Ben Weiss / Fortune : Sources: PayPal to shutter its 10-year-old PayPal Ventures arm amid a broader shakeup under a new CEO and has hired Jefferies to explore selling some positions —  PayPal is shuttering its 10-year-old venture team amid a broader corporate shakeup, according to five sources familiar with the matter.

  9. Melbourne-based Everlab, which is building an AI-powered preventive healthcare platform, raised a AU$65M Series A led by Airtree Ventures (Tegan Jones/SmartCompany)

    Tegan Jones / SmartCompany : Melbourne-based Everlab, which is building an AI-powered preventive healthcare platform, raised a AU$65M Series A led by Airtree Ventures —  Melbourne healthtech startup Everlab has raised $65 million in Series A funding led by Airtree Ventures as it expands its preventative healthcare platform into global markets.

  10. Sensor Tower: ChatGPT's market share fell to 46.4% by the end of May, as Gemini rose to 27.7% and Claude to 10.3%; Grok, Meta AI, and others have less than 5% (Ivan Mehta/TechCrunch)

    Ivan Mehta / TechCrunch : Sensor Tower: ChatGPT's market share fell to 46.4% by the end of May, as Gemini rose to 27.7% and Claude to 10.3%; Grok, Meta AI, and others have less than 5% —  More than three and a half years after ChatGPT's initial release, AI assistants are now used by millions of people worldwide, and the competitive landscape is changing fast.

  11. Z.ai debuts GLM-5.2, saying it has significant improvements for coding, agentic, and long-horizon tasks, with a 1M context window and MIT-licensed open weights (Z.ai)

    Z.ai : Z.ai debuts GLM-5.2, saying it has significant improvements for coding, agentic, and long-horizon tasks, with a 1M context window and MIT-licensed open weights —  We're introducing GLM-5.2, our latest flagship model for long-horizon tasks.  It marks a substantial leap in long-horizon task capability …

  12. Google launches Android 17 and Wear OS 7, first on Pixel devices, with support for the latest AI models, a bubble bar UI, and live updates on Wear OS (Sarah Perez/TechCrunch)

    Sarah Perez / TechCrunch : Google launches Android 17 and Wear OS 7, first on Pixel devices, with support for the latest AI models, a bubble bar UI, and live updates on Wear OS —  Google on Tuesday released the final version of its Android 17 operating system, as well as its counterpart for smartwatches, Wear OS 7.

Solidot(12)

  1. 瑞士选民否决了将人口设限千万的提案

    瑞士于 6 月 14 日举行全民公投,决定是否在 2050 年前将全国常住人口限制在一千万以内。瑞士的人口出生率为每名妇女生育 1.29 个孩子,远低于 2.1 的人口替代率,它的人口增长主要归因于外来移民。目前瑞士人口已超过 900 万,官方数据显示,2024 年外国公民占到了瑞士总人口的 27% 以上。右翼的瑞士人民党(Swiss People's Party)支持的提案要求“2050 年前瑞士常住人口不得超过 1000 万,且瑞士应放弃与欧盟的自由流动协议”。瑞士选民最终否决了这一被称为“瑞士脱欧”的提案,有 54.79% 的选民反对,45.21% 的选民支持,投票率为 58.86%。

  2. 俄罗斯计划退役漏气的国际空间站 PrK 模块

    位于 Progress(进步号)气闸舱和 Zvezda(星辰号)服务舱之间的 PrK 模块因结构裂缝导致的漏气过去几年一直困扰着国际空间站,今年初漏气问题一度被认为已经修复,但本月早些时候报告漏气再次加剧,该模块的裂缝总数达到 16 处。10 天前俄罗斯宇航员试图用锯子拆除该模块的一个承重支架,此举招致了 NASA 的强烈反对,认为可能会产生严重后果,下令宇航员进入与空间站对接的 Crew Dragon 飞船,穿上宇航服,准备必要时紧急撤离。俄罗斯航天局最终放弃了拆支架的计划。双方在幕后反复的拉锯之后,最终俄罗斯通知 NASA 将退役 PrK 模块。这意味着宇航员将不再进入 PrK 模块,或再次尝试对其进行加压。而俄罗斯将需要使用其它端口向空间站转移补给。

  3. Arch Linux 遭遇新一轮 AUR 恶意程序攻击

    Arch Linux 项目的用户软件仓库 Arch User Repository(AUR)上周遭遇了大规模恶意攻击,在处理了逾 1500 个软件包之后开发者认为问题已经得到了控制。然而仅仅过了一天,AUR 遭遇了新一轮的恶意攻击,这一次攻击者使用了代码混淆技术掩盖其意图。AUR 是用户贡献的软件包库,并非官方软件库,Arch Linux 项目可能需要暂时下线 AUR 以免遭遇一轮又一轮的恶意攻击。

  4. 数百万学生就读学校位于有毒污染场地 5 公里内

    根据智库 Centre for Global Development 的地理分析,数百万儿童就读的学校附近存在已知的铅、汞、砷和杀虫剂等有毒污染。研究发现,亚洲、非洲和拉丁美洲 17 个国家的逾 25.2 万所学校位于有毒污染场地 5 公里范围内。这些学校有 4300 多万名儿童,其中 520 万名儿童位于 1 公里范围内。发达国家受污染影响的负担不成比例的落在贫困学生和非白人学生身上,但在发展中国家污染集中在富裕人群居住的城市,城市学校的规模通常更大,因此学生也更多——以菲律宾为例,9% 的学校靠近污染场地,而这些学校的学生总数占到全国学生总数的 27%。分析还显示,私立学校比公立学校更有可能位于污染场地附近。加纳 41% 的私立学校靠近污染场地,而公立学校的这一比例仅为 18%。

  5. 英国将禁止 16 岁以下青少年访问社交媒体

    英国首相 Keir Starmer 宣布,英国将禁止 16 岁以下青少年访问社交媒体。英国的社媒禁令范围以及强度都高于澳大利亚的类似禁令。社媒禁令涵盖所有社交媒体,对包含聊天功能的游戏等网络产品也有单独限制,如禁止青少年与陌生人聊天。Starmer 说政府总要做出选择,他认为全面禁令是正确的选择。

  6. 测试显示 AI 的数学解题能力仍然不如人类专家

    AI 模型的解题水平仍不及顶尖数学家。这项测试隶属 First Proof 项目,旨在评估 AI 解决复杂数学难题的能力。研究人员向 4 款 AI 系统提出 10 道科研级数学难题,再由相关数学领域的匿名专家评审团对作答结果进行打分。这次测试首次同时满足三大核心标准:题目均为前沿科研级数学问题、所有题目从未出现在模型训练数据中、由专业数学家评阅。10 名来自不同数学细分领域的研究人员,各自拿出一道本人研究过程中已解答但尚未公开发表的原创题目。这次测试中,各大推理模型依然频繁出现幻觉问题,这也是大语言模型的通病。而且所有 AI 作答在文献引用方面都“严重缺失”,全程没有标注来源。

  7. 垂直绿化给城市降温

    气候变化和城市化加剧了热岛效应,城市地区的温度显著高于农村地区,而更高的温度又推动了制冷需求和加剧了电网压力,形成某种恶性循环。日本大阪府大学 Jihui Yuan 副教授领导的团队调查了垂直绿化等城市降温策略。他们的研究显示,朝南绿墙可将室内热条件改善最多 1.7°C;低反照率外表面能改善室外热舒适度最多 1.5°C;高反照率外表面则有助于降低室内温度。

  8. GLP-1 减肥药在降低体重的同时也降低了骨折率

    GLP-1 减肥药如 Ozempic、Wegovy、Rybelsus 能快速降低体重,此前有担忧认为快速的体重下降可能导致骨质疏松,增加骨折风险。然而最新研究发现,相比其它起效较慢的减肥药,GLP-1 减肥药能将骨折风险降低 15%。研究人员承认需要更多研究去证实相关性。研究人员分析了逾 59,000 名患者,其中 26,324 名服用了 GLP-1 减肥药,对照组的 33,555 人服用的是非 GLP-1 减肥药。结果显示,实验组发生 794 例骨折,对照组则发生 1045 例。

  9. 亚马逊数据中心 2025 年使用了 25 亿加仑的水

    根据亚马逊公布的数据,它的数据中心在 2025 年使用了 25 亿加仑的水。电商巨人声称它的用水量远低于主要竞争对手。亚马逊称,其数据中心用水量为每千瓦时 0.12 升(L/kWh),称微软在 2025 年的用水量为每千瓦时 0.27 升,Meta 在 2024 年的用水量为每千瓦时 0.19 升,Google 最糟糕达到每千瓦时 1.15 升。亚马逊表示,其设施约 90% 的时间都采用“自然空气冷却”,即引入室外空气使其流经服务器吸收热量,无需用水——但在最炎热的天气里会使用水蒸发降温。

  10. Commodore 宣布反社交网络的翻盖手机

    曾经的家用 PC 巨人 Commodore 又回来了,它宣布了一款翻盖手机 Callback 8020,运行基于 Linux 的 Sailfish OS 操作系统,不支持任何社交媒体、浏览器或工作应用如电子邮件,但支持地图、播客、拼车、以及流行的消息应用如 WhatsApp、Signal、Telegram 和微信(WeChat)——因为对很多人而言没有这些应用手机什么也不是。这款手机是 Commodore 公司推出的,当然也有 Commodore 模拟器。选择翻盖手机是因为它是作为一种多用途工具,你打开翻盖就是为了用它。这款手机不便宜,6 月 30 日开放预购,售价 499 美元,主要配置是 4GB 内存,64 GB SSD,索尼 4800 万像素相机,显示屏分辨率 480 x 640,电池可移除,容量为 1550mAh。

  11. 禁止使用科技产品提升了学生的阅读能力

    在数字化时代,一名教师的低科技实验显示学生的阅读能力有了显著提升。明尼阿波利斯 Washburn 高中的 AP 文学和英语教师 Maureen Mulvaney 在学生抄袭、注意力不集中以及阅读能力下降等问题之后开始了低科技实验,在家长的支持下,她禁止学生使用手机和笔记本电脑,要求所有作业都必须用纸笔完成。尽管学生一开始有抵触,但效果立竿见影:实验前的 2025 年 9 月只有 46% 的学生对阅读能力有信心,到了今年 2 月该比例飙升至 95%。大多数学生能写至少两页部分学生甚至能写五六页英文文章。79% 的学生表示在纸上写作和组织思路比在屏幕上更容易。

  12. 开源模型能否战胜 OpenAI?

    中美两国的 AI 公司采取了不同的发布策略:中国侧重于开源权重模型,美国公司如 OpenAI 和 Anthropic 则采用闭源策略。Hugging Face 前亚太生态系统高管 Tiezhen Wang 表示,OpenAI 和 Anthropic 指责中国 AI 公司蒸馏其模型,他认为蒸馏是中性的,美国 AI 公司是通过抓取互联网上的信息训练模型,它们并非知识的创造者,却试图阻止其他人重复利用知识,有点讽刺。所有 AI 生成的内容都应该没有版权,否则拥有算力的人能滥用权力,生成各种组合内容然后对所有内容都申请版权。他发现中国公司和美国公司在最大化使用 token 上有明显差异,因为中国有很多开源权重模型,其使用成本没有美国大,因此中国互联网公司都鼓励员工最大化使用 token,鼓励员工成为 AI 原生开发者,甚至禁止他们手动完成撰写文档之类的日常工作。