Weekly Digest — 2026-W26
70 unique stories (2026-06-22 → 2026-06-28), aggregated across 8 sources.
Hacker News(12)
- Flock-Powered Police Chiefs Stalking Women Shows Why Warrants Are Needed (ipvm.com)
- Jobs and Software Is Fucked (urflow.bearblog.dev)
- Canada is looking to build up to 10 new nuclear reactors over the next 15 years (www.cbc.ca)
- Steam Machine launches today (store.steampowered.com)
- The text in Claude Code’s “Extended Thinking” output (patrickmccanna.net)
- Moebius: 0.2B image inpainting model with 10B-level performance (hustvl.github.io)
- Jerry's Map (www.jerrysmap.com)
- Claude Tag (www.anthropic.com)
- F3 (github.com)
- AI's Affordability Crisis (blog.dshr.org)
- Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX (tikz.dev)
- Mistral OCR 4 (mistral.ai)
GitHub Trending(10)
Product Hunt(12)
- OnBrand by SlideSpeak
Design context for AI agents
- AlgoFly AI
The all-in-one place to build and deploy vision AI
- MD+HTML Reader
Review AI-generated Markdown and HTML in a focused workspace
- Cloudflare Temporary Accounts
Let agents deploy before signup
- uwait
Get paid while AI thinks
- Clawd
A context-aware browser mascot with 100% local offline AI
- Latitude
Fix what's breaking in your AI agent
- Cotypist
Local AI Autocomplete in your voice, anywhere on your Mac
- NeuralAgent 3.0
AI that executes UI actions on your computer in ~285ms
- Blazly SEO
Dominate SEO with an AI content operating system
- OpenArt Director
Direct cinematic videos through chat
- Steam Machine
A tiny, powerful PC for big-screen gaming
Hugging Face(12)
- PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models
Multimodal large language models (MLLMs) have achieved remarkable progress in visual understanding tasks. However, most existing MLLMs rely on autoregressive generation, which limits their efficiency for perception tasks that require captioning multiple regions. In this work, we propose PerceptionDLM, a multimodal diffusion language model optimized for efficient parallel region perception. Built upon PerceptionDLM-Base, a strong foundational baseline that achieves state-of-the-art performance among open-source diffusion MLLMs, our architecture fully leverages the parallel decoding nature of DLMs. Specifically, we introduce efficient prompting and structured attention masking to enable simultaneous perception of multiple masked regions, allowing the model to generate region descriptions in parallel at both the sequence and token levels. This design significantly improves inference efficiency compared with existing approaches that process regions sequentially. To systematically evaluate the parallelism property of visual perception capability for DLMs, we construct a new Parallel Detailed Localized Captioning Benchmark (ParaDLC-Bench) by scaling the DLC-Bench to include multiple region masks per image, enabling joint evaluation of both caption quality and inference efficiency. Experiments demonstrate that PerceptionDLM maintains competitive performance in region captioning while achieving substantial speed improvements for multi-region perception tasks. Our results highlight the potential of multimodal diffusion language models for efficient, parallel visual perception. To the best of our knowledge, we are the first to achieve parallel region caption and perception by leveraging the advantages of diffusion language models. Code, models, and datasets are released.
- MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision
Personalized presentation generation requires more than conditioning on a current prompt or template: agents must preserve stable user preferences across tasks, retain newly introduced preferences and constraints during multi-turn revision, and carry out local edits reliably. We propose MemSlides, a hierarchical memory framework for personalized presentation agents that separates long-term memory from working memory and further divides long-term memory into user profile memory and tool memory. User profile memory stores intent-conditioned profiles for round-0 personalization, working memory carries active preferences and session constraints across revision rounds, and tool memory stores reusable execution experience for reliable localized editing. MemSlides pairs this memory design with scoped slide-local revision, so targeted updates act on the smallest affected region instead of repeatedly regenerating the full deck. In controlled experiments, user profile memory improves persona-alignment judgments on a multi-persona, multi-intent profile bank, tool-memory injection improves closed-loop modify behavior in diagnostic matched-pair settings, and qualitative cases illustrate working memory's ability to carryover preferences. Taken together, these results suggest that effective personalization in presentation authoring depends on separating persistent user profiles, session-level working memory, and reusable execution experience across generation and localized revision.
- GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents
Memory benchmarks for LLM agents largely assume single-user settings, leaving shared assistants for hospitals, workplaces, campuses, and households understudied. In these deployments, multiple principals write to a common memory pool and query it under different roles, scopes, and relationships, so memory quality requires governance as well as recall. We introduce GateMem, a benchmark for multi-principal shared-memory agents. GateMem jointly evaluates utility for legitimate long-horizon requests with state updates, access control across contextual authorization boundaries, and agent-facing active forgetting after explicit deletion requests. It spans medical, office, education, and household domains, with long-form multi-party episodes, incremental memory injection, hidden checkpoints, structured judging, and leak-target annotations. Across diverse baselines and backbone models, no method simultaneously achieves strong utility, robust access control, and reliable forgetting. Long-context prompting often yields the best governance score at high token cost, while retrieval-based and external-memory methods reduce cost yet still leak unauthorized or deleted information. These results show current memory agents remain far from reliable shared institutional deployment.
- Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models
While reasoning on autoregressive (AR) models is often performed by chain-of-thought reasoning and reflection, their refinement of previous outputs still relies on fully sequential generation, even when only local edits are needed. In contrast, the masking mechanism in Mask Diffusion Models (MDMs) naturally supports explicit local edits on previous outputs, allowing selective refinement without discarding previous answers and generating another from scratch. While this property more closely aligns with how humans correct mistakes by iterative local refinement, existing MDMs do not support multi-turn masking and denoising. We propose Reflective Masking (RM), which elicits such an intrinsic reasoning capability in MDMs via lightweight post-training. RM provides a native test-time scaling, where an MDM iteratively revisits and revises its prior outputs based on evolving context. To exploit insights from previous turns like AR reasoning, we further introduce History Reference, a parameter-free mechanism that leverages intermediate denoising states during revision. Our approach requires no architectural changes and is easily applicable to existing MDMs. Across diverse tasks and modalities, including text generation, Sudoku, and image editing, Reflective Masking consistently outperforms standard masking-based baselines and demonstrates strong generality, positioning RM as a fundamental primitive for reasoning on MDMs.
- BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation
Three-dimensional (3D) brain MRI is central to clinical neurology and neuro-oncology, where generative models could augment under-represented cohorts, simulate disease trajectories, and support privacy-preserving data sharing. Latent diffusion has been the go-to solution for modeling imaging data, but it places two competing demands on the tokenizer: encoder embeddings must retain the clinical information that downstream tasks act on, and the decoder must reconstruct anatomically faithful volumes. Existing reconstruction-driven tokenizers achieve the second at the expense of the first. To address this, we introduce a fully volumetric masked-autoencoder (MAE) based tokenizer for 3D brain MRI latent diffusion, decoupling encoder and decoder: a frozen 3D MAE encoder produces clinically informative embeddings, while a dedicated CNN decoder reconstructs voxels from a linear projection of those embeddings. We pretrain the encoder on 35,309 volumes from 18 public cohorts spanning four modalities, ten disease categories, and 200+ acquisition sites, and demonstrate its dual utility in two settings. First, on a 23-task linear-probing benchmark, the encoder outperforms or matches SOTA models (i.e., BrainIAC, BrainSegFounder, and MedicalNet) on 21 of 23 tasks. Second, a conditional diffusion transformer (DiT) trained on these clinically informative embeddings supports both conditional generation across six variables and patient-specific longitudinal forecasting. Together these results establish a single 3D brain-MRI embedding space capable of both downstream clinical tasks and controllable generation.
- SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG
Retrieval-augmented generation (RAG) systems must balance retrieval granularity with contextual coherence, a challenge that existing methods address through LLM-guided chunking, single-level context expansion, or hierarchical summarization. These approaches variously depend on costly LLM calls during indexing or retrieval, limit context aggregation to a single granularity level, or introduce information loss through summarization. We present SproutRAG, an attention-guided hierarchical RAG framework that addresses this trade-off by organizing sentence-level chunks into progressively larger but semantically coherent units, using learned inter-sentence attention to construct a binary chunking tree. Unlike prior approaches that rely on external LLMs, fixed context expansion, or lossy summarization, SproutRAG learns which attention heads and layers best capture semantic document structure, enabling multi-granularity retrieval without additional LLM calls or compressed summaries. At retrieval time, SproutRAG uses hierarchical beam search to retrieve candidates at multiple granularities, capturing multi-sentence relevance beyond flat retrieval. The framework is trained end-to-end with a joint objective that improves both embeddings and tree structure. Experiments across four benchmarks spanning scientific, legal, and open-domain settings demonstrate that SproutRAG improves information efficiency (IE) by 6.1% on average over the strongest baseline. Code is available on https://github.com/AmirAbaskohi/SproutRAG.
- PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems
LLM agents increasingly operate in large tool ecosystems, where real-world tasks require discovering relevant tools, inferring implicit sub-goals, and adapting to dynamic environments over long horizons. However, existing benchmarks rarely evaluate planning under retrieval-limited tool visibility. To address this gap, we introduce PlanBench-XL, an interactive benchmark of 327 retail tasks over 1,665 tools that tests whether agents can iteratively retrieve usable tools, invoke them to uncover intermediate evidence for subsequent calls toward the final goal. PlanBench-XL further features an optional blocking mechanism that simulates real-world unpredictability through missing, failing, or distracting tool functions, forcing agents to detect disrupted paths and adapt at runtime. Experiments on ten leading LLMs show that massive-tool planning remains challenging: while GPT-5.4 achieves 51.90% accuracy in block-free settings, it collapses to 11.36% under the most severe blocking condition. Further analysis shows that agents are especially vulnerable when failures lack explicit error signals or when recovery requires longer alternative tool-use paths. These results establish PlanBench-XL as a testbed for diagnosing agentic planning failures and highlight the need for robust adaptive planning in long-horizon tasks with large, imperfect tool environments.
- OpenRath: Session-Centered Runtime State for Agent Systems
Modern agent systems often suffer from fragmented runtime state: transcripts, tool effects, memory events, workspace placement, branch provenance, and replay evidence are recorded separately and become difficult to inspect or reproduce. OpenRath addresses this issue with a PyTorch-like programming model for multi-agent, multi-session systems. The analogy concerns the role of a central first-class runtime abstraction, not tensor computation. Its core abstraction is Session, the runtime value passed between agents and workflows. A Session is branchable, inspectable, replayable, backend-aware, and composable. It records conversation chunks, sandbox placement, lineage metadata, token usage, pending work, and tool evidence, while defining where memory interactions enter the runtime record. Since this state is carried by the same value used in program execution, fork, merge, and replay become explicit runtime operations rather than states reconstructed from external traces. OpenRath further defines Sandbox, Tool, Agent, Memory, Workflow, and Selector, with Selector turning control flow into runtime-routed decisions. This report presents the programming model, architecture, audited milestones, and evidence protocol. Its claims are limited to controlled runtime properties, while broad quantitative comparisons, live-provider quality, optional-backend availability, and memory quality are left for follow-on evaluation. The central thesis is that Session provides agent systems with a first-class runtime value for auditable composition.
- DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams
Massive unstructured multimodal streams suffer from high "data entropy," impeding both efficient human knowledge acquisition and high-quality AI post-training. Existing passive annotation paradigms, heavily reliant on heuristic rules or general VLMs, are costly, monotonous, and fail to unlock the deep procedural logic embedded in raw data. We elevate data processing to a learnable capability, proposing a paradigm shift towards Agentic Data Tailoring, which actively refining and structuring data to align with diverse user and downstream intents. To overcome the data scarcity bottleneck in training such high-order capabilities, we design a two-stage pipeline grounding generative semantic synthesis in deterministic Factual Anchors, yielding a large-scale dataset spanning five core physical and digital domains. Building upon this, DataClaw_0-9B model synergizes Supervised Fine-Tuning (SFT) with Group Relative Policy Optimization (GRPO), achieving robust alignment with complex refinement and tailoring intents. To systematically quantify this capability, we construct DataClaw_0-val, the first benchmark dedicated to data refinement. Crucially, we adopt downstream post-training as the ultimate validation touchstone. Evaluations on video generation, real-world VQA, and GUI navigation confirm that DataClaw_0 delivers high-information-density tailored data, facilitating efficient model adaptation to new tasks under limited training data regimes. Project page: https://czjdsg.github.io/MakeAnyData
- EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions
Enterprise agents increasingly operate inside workspaces: they read heterogeneous files, invoke tools, and deliver business artifacts. We introduce EnterpriseClawBench, an enterprise agent benchmark constructed from proprietary, real-world agent sessions. Starting from a large archive of workplace sessions, the EnterpriseClawBench produces 852 reproducible tasks, each paired with recovered fixtures, rewritten prompts, role classes, skill subclasses, hard rules, and semantic rubrics. Because the sessions contain internal enterprise content, we do not release the benchmark data; instead, our reusable contribution is the construction and evaluation protocol. On EnterpriseClawBench, the best configuration reaches only 0.663 (Codex with GPT-5.5). These results show that enterprise agent evaluation must report harness--model combinations, artifact delivery, visual quality, cost, runtime, and skill-transfer behavior, rather than collapsing performance into a single score. Code: https://github.com/FrontisAI/EnterpriseClawBench
- Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention
Self-attention is central to Transformer performance and is often the most expensive part of the Transformer at long context lengths because its pairwise token interactions scale quadratically with sequence length. Standard dense attention also applies the same set of attention heads to every token regardless of token difficulty or information content. This uniform activation can waste compute, especially as sequences grow longer and attention cost increases rapidly. We propose Grouped Query Experts (GQE), a mixture-of-experts layer on top of grouped-query attention (GQA). Within each GQA group, a router selects k query-head experts per token while all key-value (KV) heads remain dense and unchanged. Thus, GQE keeps the KV cache benefits of GQA and reduces only the active query-head computation. On a fixed 30B token budget at the 250M parameter scale, GQE matches the all-active GQA baseline in downstream accuracy while activating half the query heads per token.
- KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking
As retrieval systems scale, high-quality reranking becomes increasingly important. However, most existing rerankers, whether encoder-based or decoder-based, jointly encode the query and passage, tightly coupling their computation and limiting deployment efficiency as well as flexibility. We present KaLM-Reranker-V1, a fast but not late-interaction (FBNL) reranker that decouples query and passage computation while retaining expressive relevance modeling. Built on an encoder-decoder architecture, KaLM-Reranker-V1 uses the encoder to pre-encode passages with Matryoshka embedding pooling, while the decoder models the system instruction, user instruction, and query intent; cross-attention then captures relevance between the query context and passage representations. This design makes KaLM-Reranker-V1 efficient through decoupled passage encoding, yet not late interaction, by preserving rich relevance modeling through cross-attention. We instantiate KaLM-Reranker-V1 in three sizes, Nano, Small, and Large, with 0.27B, 1B, and 4B activated parameters, respectively. Extensive experiments on BEIR, MIRACL, and LMEB demonstrate that KaLM-Reranker-V1 achieves strong reranking performance with superior efficiency. On BEIR, KaLM-Reranker-V1 achieves state-of-the-art performance, on par with strong industrial models such as the Qwen3-Reranker series; on MIRACL, despite not being extensively trained on multilingual data, KaLM-Reranker-V1 still shows excellent reranking performance. Moreover, on LMEB, reranking models demonstrate a clear advantage, with even the 0.27B Nano model remaining competitive with 7-12B embedding models.
Techmeme(12)
- President Trump signs two executive orders aimed at speeding the development of advanced quantum computers and mitigating the security threats they present (Amrith Ramkumar/Wall Street Journal)
Amrith Ramkumar / Wall Street Journal : President Trump signs two executive orders aimed at speeding the development of advanced quantum computers and mitigating the security threats they present — Administration set an ambitious new 2028 target for a system that can conduct scientific research
- Sources: Meta internally exposed data from its employee-tracking program meant to help train its AI models, including full prompts and private conversations (Wired)
Wired : Sources: Meta internally exposed data from its employee-tracking program meant to help train its AI models, including full prompts and private conversations — Employees had previously raised concerns about the initiative, which involves collecting workers' keystroke data to train AI models.
- Air Space Intelligence won an $875M, 12-year FAA contract to develop AI tools that map flight trajectories and identify areas of congestion to reduce delays (Allyson Versprille/Bloomberg)
Allyson Versprille / Bloomberg : Air Space Intelligence won an $875M, 12-year FAA contract to develop AI tools that map flight trajectories and identify areas of congestion to reduce delays — Air Space Intelligence Inc. won a US government contract to develop artificial intelligence technologies for managing flight traffic …
- Sakana AI launches Fugu, a multi-agent orchestration system accessible through a single model API, claiming Fugu Ultra matches Fable and Mythos on benchmarks (Carl Franzen/VentureBeat)
Carl Franzen / VentureBeat : Sakana AI launches Fugu, a multi-agent orchestration system accessible through a single model API, claiming Fugu Ultra matches Fable and Mythos on benchmarks — Last night, the increasingly enterprise-focused AI startup Sakana launched Fugu, a multi-agent orchestration system …
- Sources: Vimeo owner Bending Spoons seeks to raise ~$1.62B in a US IPO, selling 58M shares at $26 to $28 apiece, at a valuation of $19B at the top of the range (Echo Wang/Reuters)
Echo Wang / Reuters : Sources: Vimeo owner Bending Spoons seeks to raise ~$1.62B in a US IPO, selling 58M shares at $26 to $28 apiece, at a valuation of $19B at the top of the range — Bending Spoons, an Italian technology company that acquires and revamps software businesses, is seeking to raise as much as $1.62 billion …
- Valve Steam Machine review: much smaller than PS5, surprisingly smooth, and navigable with any modern gamepad but very expensive and needs manual configuration (Sean Hollister/The Verge)
Sean Hollister / The Verge : Valve Steam Machine review: much smaller than PS5, surprisingly smooth, and navigable with any modern gamepad but very expensive and needs manual configuration — My first day with the Steam Machine was a mess. Instead of enjoying a worry-free game console, I spent hours troubleshooting what felt like a finicky PC.
- Sources: the Trump administration is pressing Meta to submit its AI models for voluntary review; Meta is the only major US AI developer without an agreement (New York Times)
New York Times : Sources: the Trump administration is pressing Meta to submit its AI models for voluntary review; Meta is the only major US AI developer without an agreement — Federal officials are urging the lone major tech company holdout to allow government safety evaluations, weeks after ordering Anthropic to pull its latest model.
- Cerebras reports Q1 revenue up 94% YoY to $193.4M, net loss down 41% to $14M, and forecasts core gross margin to shrink in Q2; CBRS drops 8%+ after hours (Jordan Novet/CNBC)
Jordan Novet / CNBC : Cerebras reports Q1 revenue up 94% YoY to $193.4M, net loss down 41% to $14M, and forecasts core gross margin to shrink in Q2; CBRS drops 8%+ after hours — - Cerebras reported financials for the first time since its IPO in May. … Cerebras said revenue almost doubled in the AI chipmaker's …
- Mistral debuts OCR 4, a model featuring structured document extraction with bounding boxes, block classification, and inline confidence scores, in 170 languages (Mistral AI Blog)
Mistral AI Blog : Mistral debuts OCR 4, a model featuring structured document extraction with bounding boxes, block classification, and inline confidence scores, in 170 languages — Today, we're releasing Mistral OCR 4, featuring bounding boxes, block classification, and inline confidence scores alongside extracted text.
- Sources: Hadrian, which is building AI-powered factories to produce space and defense parts, is in talks to raise ~$1B at a ~$7.5B post-money valuation (Bloomberg)
Bloomberg : Sources: Hadrian, which is building AI-powered factories to produce space and defense parts, is in talks to raise ~$1B at a ~$7.5B post-money valuation — Company runs AI-powered factories that aim to speed up manufacturing — Defense manufacturing startup Hadrian Automation Inc …
- Sources: Miami-based cybersecurity company Varonis is exploring options including a potential sale after receiving takeover interest; VRNS jumps 6%+ (Bloomberg)
Bloomberg : Sources: Miami-based cybersecurity company Varonis is exploring options including a potential sale after receiving takeover interest; VRNS jumps 6%+ — Cybersecurity company Varonis Systems Inc. is exploring options including a potential sale after receiving takeover interest, according to people familiar with the matter.
- The FCC says an auction of wireless mid-band spectrum raised $3.5B+, which will largely be used to fund the replacement of Chinese telecom equipment in the US (David Shepardson/Reuters)
David Shepardson / Reuters : The FCC says an auction of wireless mid-band spectrum raised $3.5B+, which will largely be used to fund the replacement of Chinese telecom equipment in the US — The U.S. Federal Communications Commission said Thursday an auction of wireless mid-band spectrum raised more than $3.5 billion …
Solidot(12)
- 回顾对 AUR 的攻击
由用户递交的软件仓库 Arch User Repository(AUR)最近遭遇了大规模恶意攻击,攻击者创建了一系列新账号,然后通过这些账号接管无人维护的软件包(被称为 orphaned packages),植入恶意代码,推送恶意更新。Arch 项目的维护者现已关闭了新用户注册,正在讨论如何处理这些被恶意滥用的无人维护软件包。AUR 中的软件包由用户递交,其他用户可通过搜索下载 PKGBUILD 文件、解依、编译、安装和更新软件。它不提供软件的二进制版本。目前 AUR 中有逾 107,000 个软件包,其中近 14,000 个无人维护可供认领。任何注册用户都可以认领和修改无人维护的软件包。它提供的软件包未经审核,风险由用户自己承担。其它 Linux 发行版也都有类似的软件仓库,如 Fedora 的 Copr,openSUSE 的 Open Build Service (OBS),Ubuntu 的 Personal Package Archives (PPA)。但这些服务与 AUR 有显著区别:它们提供了类似官方软件包的构建环境,而且不允许预编译二进制文件或私有软件。AUR 的要求过于宽松而在这次攻击中遭到了滥用。
- HPV 疫苗将 30 岁前死于宫颈癌的风险降至几乎为零
根据 WHO 的数据,宫颈癌是女性第四大常见癌症,其 99% 的病例是由高危型人乳头瘤病毒(HPV)引起的。虽然 HPV 疫苗能预防约 90% 的宫颈癌,但疫苗对生存率的影响尚不清楚。根据发表在《柳叶刀》期刊上的新研究,伦敦玛丽皇后学院的研究人员发现,自 2008 年 HPV 疫苗引入以来,疫苗接种者宫颈癌死亡率显著下降。HPV 疫苗对降低死亡率的影响如此之大,以至于研究人员估计,12 或 13 岁接种疫苗的女孩在 30 岁之前死于宫颈癌的可能性几乎为零。对于 30-34 岁的接种过疫苗的女性,死于宫颈癌的相对风险降低了 63%。2020-2024 年间英格兰有记录历史上首次没有 20-24 岁的女性死于宫颈癌。HPV 疫苗除了预防宫颈癌,还能预防肛门癌、阴茎癌、阴道癌、外阴癌、口腔癌和咽喉癌,以及生殖器疣,8 年级的男孩和女孩都会接种该疫苗,部分地区为 9 年级和 10 年级学生提供补种服务。新冠疫情前疫苗接种率接近了 WHO 的目标,但疫情之后接种率大幅下降。
- Anthropic 对特定功能访问要求身份验证
Anthropic 更新了其隐私政策,从 2026 年 7 月 8 日起,部分功能将需要身份验证,该验证将由 Persona 公司负责。Persona 是一家第三方身份验证公司,由 Peter Thiel 投资。此前 Discord 因用户强烈反对以及 2026 年 2 月发生的一起数据泄露事件而终止了在年龄验证上与 Persona 的合作。
- Linux 7.2 内核完全移除 strncpy 函数
在 6 年 362 个补丁之后,Linux 7.2 内核终于完全移除了 strncpy() 函数。strncpy() 是一个 C 语言字符串复制函数,内核文档将其标记为“极度危险(actively dangerous)”。strncpy()是一类内存错误的主要来源:包含敏感数据的内核缓冲区可能会在未终止字符串边界外泄漏字节,导致内存信息泄露。strncpy()被 5 个不同函数取代:strscpy() 用于 NUL 结尾的目的地址,strscpy_pad() 用于 NUL 结尾零填充的目标地址, strtomem_pad() 用于非 NUL 结尾固定宽度字段,memcpy_and_pad() 用于显式填充的有边界复制,memcpy()用于已知长度的内存复制。
- 霸王龙到 40 岁才完全成年
科学家多年来一直认为霸王龙在 25 岁左右达到成年体型,但一项新研究显示,霸王龙要到 40 岁才会完全成年。最新研究是基于对 17 具霸王龙化石的分析,这些霸王龙的年龄从幼年到成年不等。新研究采用了更先进的技术估计恐龙的年龄,并利用复杂统计模型整合多个标本的信息,更完整了解霸王龙整个生命周期的生长情况。结果表明,霸王龙的生长期比之前认为的要长约 15 年。
- 日本宣布新超算理究
日本理化学研究所 19 日宣布,为利用 AI 进行科学研究而建设的新超级计算机命名为“理究”。该名称寓意利用 AI 探“究”自然现象背后的“理”。该超算将设在神户市中央区的理研神户地区,力争 7 月投入使用。理化所还在同一天宣布了另一台量子计算-高性能计算混合平台超算 ROQUO,两台超算都使用了英伟达的 GB200 NVL4 系统。其中 ROQUO 配备了 135 个计算节点,540 (NVIDIA Blackwell) GPU 以及 270 (NVIDIA Grace) CPU,FP64 峰值逾 21 PFLOPS,FP8 峰值 5 EFLOPS 等。
- 高温干旱高 CO2 下大豆蛋白质含量会下降
大豆是重要的蛋白质来源,但气候变化正日益影响其产量和营养品质。根据发表在《Food Research International》上的一项研究,高浓度二氧化碳会使大豆种子产量增加最高 142%,而高温和干旱则分别会使产量降低 91% 和 60%。在高浓度二氧化碳+高温+干旱三重效应下,大豆种子产量可能会增加 50%,可溶性糖含量增加 35%,氨基酸含量增加 175%,同时淀粉含量降低 20%,蛋白质含量降低 6%。
- 中国新超算灵晟登顶 Top500 榜单
Top500 公布了最新的超算榜单,深圳国家超算中心的灵晟首次亮相即登顶榜单。灵晟理论峰值 2.736 Exaflop/s,在 HPL 测试中达到了 2.198 Exaflop/s,是 Top500 榜单中首个仅靠 CPU 实现持续双精度浮点性能逾 2 Exaflops 的超算系统。灵晟使用了 304 个核心的 LX2 CPU,总共 1379 万个核心,运行频率 1.55 GHz,操作系统是麒麟,功耗为 42.2 兆瓦。榜单前五的超算性能都超过了 Exaflops:灵晟;美国劳伦斯利弗莫尔国家实验室的 El Capitan,使用 AMD 第四代 EPYC 处理器,性能 1.809 Exaflop/s;橡树岭国家实验室(ORNL)的 Frontier,使用 AMD 第三代 EPYC,性能 1.353 Exaflop/s;阿贡国家实验室 Aurora 使用英特尔 Xeon CPU,性能 1.012 Exaflop/s,德国 Jülich 超算中心的 JUPITER Booster,使用英伟达 GH Superchip 72C 3GHz,性能 1 Exaflop/s。之后还有意大利 HPC7,微软 Microsoft Azure 超算 Eagle,意大利 HPC6,日本超算富岳(Fugaku),瑞士 Alps。排名前十的超算有四台使用了 AMD EPYC 处理器,两台英伟达处理器,两台英特尔处理器,灵晟的 CPU 架构没有说明。在 Top 500 中,美国有 162 台,日本 44 台,德国 41 台,中国 30 台;联想制造的超算最多有 129 台,其次是 HPE 的 124 台,BULL 的 58 台,戴尔的 49 台,英伟达的 37 台。
- 甲骨文过去一年裁员 2.1 万
根据甲骨文的最新年报,该公司过去一年在全球裁员约 2.1 万人,原因是它正围绕 AI 重塑业务。截至 2026 年 5 月 31 日,甲骨文全职员工总数约 14.1 万人,而去年同期为 16.2 万人。甲骨文在其报告中称,AI 技术在运营中的部署已经导致且可能继续导致员工总数减少。裁员人数约占甲骨文员工总数的 13%。就业追踪公司估计,过去一年中有逾 10 万科技从业者被裁员。甲骨文称,过去一年它支付了 18 亿美元的遣散费和其它重组费用。
- 维基百科联合创始人 Larry Sanger 被封禁
拥抱保守派、支持 MAGA 的维基百科联合创始人 Larry Sanger 再次现身维基百科,理由是帮助维基百科进行改革——aka 将其从自由派手中夺回来。他发起了“WikiProject Intellectual Diversity”提案,想要增加更多保守派的声音。他通过其社交媒体账号宣传该提案,违反了维基百科关于“隐蔽拉票(Stealth canvassing)”的政策,他在维基社区引发了争议,最终被封禁。
- 当代年轻人生物衰老速度更快
华盛顿大学医学院 Yin Cao 博士领导的团队分析了英国生物银行 (UK Biobank) 的超过 15.4 万名参与者的数据,以及美国 NIH All of Us Research Program 项目逾万名参与者的数据,评估了他们的系统性衰老和器官衰老。研究人员发现,1965-1974 年出生的英国人相比 1950-1954 年出生的英国人,在排除实际年龄的影响后,前者的生物衰老速度更快,这一差异具有统计上的显著性,达到了 0.23 个标准差。美国的数据也出现类似的模式:相比 1965-1969 年出生的美国人,1990-1999 年出生人群的生物衰老速度更快,统计显著性达到了 0.92 个标准差。年轻人群的生物衰老速度加速与早发性癌症风险增加相关。
- 野狼重返欧洲
去年夏天,一位女士带着两幼儿在荷兰 Utrecht 附近的天然公园散步,她看到一只体型较大的动物猛冲过来,她起初以为是一只顽皮的狗,但很快听到 6 岁大儿子发出尖叫,这只动物正将他拖进树林。附近两位恰好路过的成年人用棍子赶跑了它。袭击男孩的不是狗,而是一只狼。狼群数量在欧洲多地激增,引发了如何处理野狼的激烈争论。得益于严格的法律保护,灰狼(Canis lupus)的数量自 2000 年以来大幅增长,但袭击牲畜和袭人事件也日益频发。欧盟委员会去年放宽规定允许更多捕杀野狼,科学家对此表示反对,认为基因证据表明狼群数量并不像表面看起来那么庞大,认为用电围栏和护卫犬保护牲畜比捕杀更有效。科学家估计目前欧盟成员国境内共有约 23,000 只狼,相比下 2012 年只有约 12,000 只。