Monthly Digest — 2026-07
67 unique stories across 31 days and 8 sources.
Hacker News(12)
- Fable 5 Is Back (twitter.com)
- ZCode: Claude Code from the Makers of GLM (zcode.z.ai)
- FFmpeg 9.1's new AAC encoder (hydrogenaudio.org)
- Apple 'Hide My Email' vulnerability reveals peoples' real email addresses (easyoptouts.com)
- Virginia bans sale of geolocation data (www.hunton.com)
- Podman v6.0.0 (blog.podman.io)
- Spain Orders Blacklist of Palantir from Public and Private Companies (clashreport.com)
- Since Linux 6.9, LUKS suspend stopped wiping disk-encryption keys from memory (mathstodon.xyz)
- Espionage Against the European Parliament (citizenlab.ca)
- Holes (xkcd.com)
- Costco is the anti-Amazon (phenomenalworld.org)
- 60% Fable cost cut by converting code to images and having the model OCR it (github.com)
GitHub Trending(7)
Product Hunt(12)
- Stigg 2.0
The usage runtime for AI products
- Gemini Omni Flash
High-quality video generation and conversational editing
- Adam CAD Copilot
AI CAD inside Onshape and Fusion
- RunInfra
Describe the AI model you need and get an optimized AI
- Macro
Unifies your work into one app with shared memory
- PieterPost MCP
Connect your AI agent to postal mail
- Flowly
A personal AI agent that runs on your desktop and iPhone
- Basedash Actions
A BI tool that can take action for you
- nxt
Talk to your to do list and get what's next
- Osloq
An AI agent that reproduces GitHub issues for you
- Tamamon
A desktop pet that grows as you code with Claude Code
- Glaze by Raycast
Create your own Mac apps by chatting with AI
Hugging Face(12)
- Orca: The World is in Your Mind
We introduce Orca, an initial instantiation of a general world foundation model. Orca learns a unified world latent space from multimodal world signals and exposes it through multimodal readout interfaces. Rather than optimizing isolated next-token, next-frame, or next-action prediction, we are centered on Next-State-Prediction modeling, offering a unified state-transition modeling route toward understanding, predicting, and acting upon the world. Orca learns through two complementary paradigms: unconscious learning captures dense natural state transitions from continuous videos, and conscious learning models sparse meaningful state transitions by language-described events and VQA supervision. For pre-training, we construct a large-scale world-learning inventory data, including 125K hours of video data and 160M event annotations. After pre-training, Orca learns a unified world latent space. To examine whether the learned latent supports downstream, we evaluate it by three representative downstream readouts: text generation, image prediction, and embodied action generation. Orca's backbone is frozen, and only the lightweight modality-specific decoders are trainable. Experiments show the scalability of the proposed paradigm and verify that stronger world latent enables stronger downstream readouts. Orca outperforms similar-sized specialized baselines. These results show that Orca, as a general world foundation model, presents a promising approach to understanding, predicting, and acting upon the world. Finally, we discuss the current limitations, aiming to provide useful insights and inspiration for the community.
- Dockerless: Environment-Free Program Verifier for Coding Agents
Program verifiers play a central role in training coding agents, including selecting trajectories for supervised fine-tuning (SFT) and providing rewards for reinforcement learning (RL). Standard execution-based verification requires running unit tests inside per-repository environments such as Docker images, incurring substantial environment setup costs. We propose Dockerless, an environment-free agentic patch verifier that evaluates generated code patches without executing them. Rather than simply matching candidate patches to references, Dockerless judges patch correctness using evidence gathered through agentic repository exploration. On a verifier evaluation benchmark, Dockerless outperforms the strongest open-source verifier by 14.3 AUC points. Using Dockerless as both the SFT trajectory filter and the RL reward enables a fully environment-free post-training pipeline. The resulting model reaches 62.0%, 50.0%, and 35.2% resolve rate on SWE-bench Verified, Multilingual, and Pro, respectively. It surpasses the Qwen3.5-9B baseline by 2.4, 8.7, and 2.9 points, matching environment-based post-training.
- DOPD: Dual On-policy Distillation
On-policy distillation (OPD) offers superior capacity transfer by supervising student-sampled trajectories with dense token-level signals. To furnish high-quality supervision sources and thereby elevate the performance frontier of distillation, an intuitive direction is to infuse privileged information to either teacher or student itself. However, this additional input induces a potential failure mode we dub privilege illusion: a pattern that conflates the transferable capability gap that students are meant to close, and the information asymmetry gap that can only be mimicked but never replicated. This issue is further amplified by the inherent non-uniformity of token-level supervision, where only a small subset of tokens carries pivotal capability-bearing signals. To this end, we propose DOPD, an advantage-aware dual distillation paradigm that dynamically routes token-level supervision between privileged teacher and privileged student policies based on their advantage gap and relative probabilities. Each token receives supervision of different strength, objective, and strategy from either teacher or student itself, which transfers credible capability while simultaneously receiving auxiliary signals, to alleviate privilege illusion. Extensive experiments on both large language model (LLM) and vision-language model (VLM) settings demonstrate that DOPD consistently outperforms Vanilla OPD and other counterparts. Further results on stability, robustness, continual learning, and out-of-distribution tasks validate its superiority.
- BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding
Speculative decoding accelerates inference by using a lightweight draft model to generate candidate tokens in parallel, and are then verified by the target model, enabling lossless acceleration. Recently, diffusion-based speculative decoding further improves parallelism by generating multiple tokens per forward pass via block-level diffusion, achieving state-of-the-art (SOTA) performance. However, existing methods adopt a fixed inference block size and assume a uniform optimal decoding strategy across all inputs. In this paper, we show that this assumption is suboptimal, as the optimal block size varies across samples and plays a critical role in speculative decoding performance. Moreover, these values exhibit a clear local structure, concentrating around the training block size, which reduces the problem to a low-dimensional and structured decision space. Based on these insights, we propose BlockPilot, a sample-adaptive policy that predicts the optimal block size from the prefilling representation. Specifically, we formulate block size selection as a lightweight policy learning problem and propose an instance-adaptive decision mechanism that predicts the optimal block size based on the representation of the prefilling stage. The prediction is performed only once after prefilling, allowing for seamless integration. Extensive experiments demonstrate that our method is plug-and-play, introduces minimal overhead, and consistently improves efficiency, achieving an acceptance length of 5.92 and a 4.20times speedup on Qwen3-4B under temperature T=1.
- PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception
We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic matching to rigorous atomic auditing, PerceptionRubrics pairs 1,038 information-dense images with over 12,000 instance-specific rubrics. These criteria are derived from golden captions constructed via a novel Circular Peer-Review consensus pipeline and then distilled into a dual-stream system of Must-Right (essential facts) and Easy-Wrong (fine-grained details) rubrics. Crucially, PerceptionRubrics implements a Gated Scoring mechanism: unlike linear averages, failure on mandatory visual facts triggers sharp binary penalties. Extensive evaluation yields critical insights: (1) The Reliability Gap: models often verify fragmented elements correctly yet fail strict conjunctive constraints, exposing brittleness in dense domains; (2) Open-Closed Stratification: contrary to reasoning trends, we reveal a persistent 8% perception deficit between open-source and proprietary frontiers; and (3) Human-Aligned Rigor: our gated metrics substantially out-align conventional benchmarks, validating that strict perceptual fidelity is the prerequisite for reliable generation.
- ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving
In prefill-decode (PD) disaggregated LLM serving, each request is assigned to a decode worker after prefill. Existing decode routers balance only load; for mixture-of-experts (MoE) models this is incomplete: equally loaded workers can differ in latency, since each decode step loads the weights of every distinct expert its batch activates. We present ELDR, an expert-locality-aware decode router for PD-disaggregated MoE serving. From a request's prefill expert activations, ELDR builds an expert signature predicting the experts it will activate during generation. Offline, balanced K-means partitions signature space across decode workers; online, locality-band routing sends each request to the least-loaded worker among those best matching its signature. A signature cache, co-indexed with the KV cache at KV-block granularity, keeps signatures exact under prefix caching. Implemented in vLLM and evaluated on deployments of up to 40 GPUs, ELDR reduces median TPOT by 5.9-13.9% over the strongest of four load-balancing baselines across three MoE models and two workloads, with model outputs unchanged.
- TurboServe: Serving Streaming Video Generation Efficiently and Economically
Streaming video generation is emerging as a new serving workload in which users interact with long-lived sessions that generate video progressively, chunk by chunk. Unlike offline video generation or typical LLM serving, streaming video generation must preserve session state across active and idle periods, repeatedly schedule ongoing sessions, and deliver each chunk under a tight latency target. This creates two key serving challenges in multi-user, multi-GPU environments: session duration heterogeneity, where long-running sessions make placement decisions suboptimal over time, and temporal user-demand heterogeneity, where the number of active sessions fluctuates sharply across bursts and idle periods. We present TurboServe, the first serving system designed specifically for streaming video generation workloads. TurboServe formulates serving as an online scheduling problem that jointly coordinates session placement and GPU provisioning. Its closed-loop scheduling algorithm combines a migration-aware placement controller, which rebalances sessions across GPUs to reduce the maximum per-chunk latency, with a load-driven autoscaling controller, which adapts the GPU budget to workload variation for improved cost efficiency. To support these decisions at runtime, TurboServe implements coalesced chunk processing for batching concurrent active sessions on the same GPU, GPU-CPU offloading for session suspension and resumption, and NCCL-based GPU-GPU migration for online rebalancing. We evaluate TurboServe on real-world production traces from Shengshu Technology across multiple model sizes and GPU clusters with up to 64 NVIDIA B300 GPUs. Compared with baseline serving configurations, TurboServe reduces worst-case per-chunk latency by 37.5% and total GPU operating cost by 37.2% on average. Our code is publicly available at https://github.com/shengshu-ai/TurboServe.
- MemSyco-Bench: Benchmarking Sycophancy in Agent Memory
Memory has emerged as a cornerstone of modern LLM-based agents, supporting their evolution from single-turn assistants to long-term collaborators. However, memory is not always beneficial: retrieved memories often induce a critical issue of sycophancy, causing agents to over-align with the user at the cost of factual accuracy or objective reasoning. Despite this emerging risk, existing memory benchmarks primarily evaluate whether memories are correctly stored, retrieved, or updated, while overlooking how retrieved memories influence downstream reasoning and decision-making. To bridge this gap, we propose MemSyco-Bench, a comprehensive benchmark for evaluating memory-induced sycophancy in agent systems. MemSyco-Bench measures when memory should influence a decision and how valid memory should be used. Specifically, it covers five tasks that assess whether agents can reject memory as factual evidence, respect its applicable scope, resolve conflicts between memory and objective evidence, track memory updates, and use valid memory for personalization. All related resources are collected for the community at https://github.com/XMUDeepLIT/MemSyco-Bench.
- Program-as-Weights: A Programming Paradigm for Fuzzy Functions
Many everyday programming tasks resist clean rule-based implementation, such as alerting on important log lines, repairing malformed JSON, or ranking search results by intent, and are increasingly outsourced to large language model APIs at the cost of locality, reproducibility, and price. We propose fuzzy-function programming: compiling such a function from a natural-language specification into a compact, locally-executable neural artifact. We instantiate this paradigm with Program-as-Weights (PAW), in which a 4B compiler trained on FuzzyBench, a 10M-example dataset we release, emits parameter-efficient adapters for a frozen, lightweight interpreter. A 0.6B Qwen3 interpreter executing PAW programs matches the performance of direct prompting of Qwen3-32B, while using roughly one fiftieth of the inference memory and running at 30 tokens/s on a MacBook M3. PAW reframes the foundation model from a per-input problem solver into a tool builder: invoked once per function definition, it produces a small reusable artifact whose subsequent calls per function application are cheap and offline.
- EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments
Autonomous agents are increasingly expected to improve executable policies through feedback, yet existing evaluations often collapse this process into a final score or confound it with open-ended software-engineering progress. We introduce Autonomous Policy Evolution, a controlled evaluation setting in which a harness-model agent repeatedly edits an executable policy system under a fixed interaction budget. We instantiate this setting in EvoPolicyGym, a benchmark built from compact interactive RL environments that evaluates how agents iteratively improve explored policies. On the EvoPolicyGym suite, GPT-5.5 achieves the strongest aggregate rank score and top-two performance on all 16 environments. Beyond leaderboard results, EvoPolicyGym also provides trajectory-level diagnostics that distinguish how agents allocate budget, convert feedback into parametric tuning. These analyses show that strong autonomous policy evolution depends not only on isolated task wins, but on discovering task-appropriate mechanisms and refining policies under bounded feedback.
- AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents
Memory for a long-horizon LLM agent is a contract about what each future decision is allowed to see. The simplest contract appends past observations, tool calls, and reflections to every prompt, which makes prior context easy to access but also turns it into a jumbled mixture in which the effect of any single memory component is hard to isolate. We introduce and instrument an alternative bounded contract: every decision is made from a fresh user message assembled by typed retrieval, with no raw cross-decision transcript appended. The prompt thus stays bounded across runs of any length, and any single layer can be ablated in isolation. We instantiate the contract in Slay the Spire 2, a closed-rule stochastic deck-building game whose runs require hundreds of tactical and strategic decisions. A public online benchmark of frontier LLMs on the same game reports zero wins at the lowest difficulty across five configurations, and the developer-reported human win rate at the same difficulty is 16%; the task is hard but not saturated. Within our harness, a fixed-A0 ablation shows the largest observed difference when triggered strategic skills are enabled: the no-store baseline wins 3/10 games and adding the skill layer 6/10. At this sample size the comparison is directional rather than statistically decisive (Fisher exact p\approx0.37); a cross-backbone probe and public accumulating-context baselines are reported as operational comparisons rather than controlled tests of the contract variable itself. We release a reproducible testbed: 298 completed trajectories with condition tags, frozen memory/skill snapshots, prompt records, and analysis scripts -- an agent design and a validated, reusable methodology for studying how explicit memory layers shape long-horizon LLM-agent decisions.
- Morphing into Hybrid Attention Models
Hybrid attention models improve long-context efficiency by retaining only a subset of full-attention layers and replacing the remaining layers with linear attention. However, the effectiveness of Transformer-to-hybrid conversion critically depends on which layers preserve full attention. Existing hybrid layer selection methods typically rely on heuristic strategies such as fixed placement patterns or layerwise scoring, implicitly treating layer importance as isolated and overlooking the interdependent layer effect under a global hybrid configuration. In this work, we formulate hybrid layer selection as a budget-constrained subset optimization problem. We further propose FlashMorph (Fast LAyer Selection for Hybrid MORPHing), an effective, efficient and scalable layer selection method for Transformer-to-hybrid conversion. FlashMorph first constructs a morphable model by equipping each full-attention layer with a converted linear-attention branch. It then freezes all model weights and jointly optimizes layerwise gates on synthetic long-context retrieval data, with a linearization regularization that encourages the model to rely on linear attention for efficiency. The learned gates are discretized under a preset full-attention budget to instantiate the hybrid architecture, followed by standard logits distillation and long-context finetuning. Extensive experiments show that FlashMorph discovers more effective hybrid configurations, preserves strong long-context recall and general benchmark performance while substantially reducing layer selection cost compared with existing layer selection methods, demonstrating its effectiveness, efficiency, and scalability.
Techmeme(12)
- Lime's shares closed up 4% in its Nasdaq debut on Wednesday, valuing the company at ~$1.7B, after Lime and existing stockholders raised ~$174M in its US IPO (Reuters)
Reuters : Lime's shares closed up 4% in its Nasdaq debut on Wednesday, valuing the company at ~$1.7B, after Lime and existing stockholders raised ~$174M in its US IPO — Uber-backed (UBER.N) Lime's (LIME.O) shares rose 4% in its Nasdaq debut on Wednesday, signaling robust investor confidence …
- Sources: Apple is working on new iPad Pros and an entry-level 14" MacBook Pro with a new design in line with the upcoming touch-screen laptops for H1 2027 (Mark Gurman/Bloomberg)
Mark Gurman / Bloomberg : Sources: Apple is working on new iPad Pros and an entry-level 14" MacBook Pro with a new design in line with the upcoming touch-screen laptops for H1 2027 — Apple Inc. is preparing an upgraded iPad Pro line and a redesigned entry-level MacBook Pro for next year, adding to a slate of several …
- Alibaba and its US payment processor AUS agree to pay $600M to resolve DOJ allegations that they failed to prevent illegal sales of drugs and other products (Reuters)
Reuters : Alibaba and its US payment processor AUS agree to pay $600M to resolve DOJ allegations that they failed to prevent illegal sales of drugs and other products — Chinese technology and e-commerce giant Alibaba (9988.HK) and its U.S.-based payment processor have agreed to pay $600 million …
- Sources: SoftBank has reopened talks for a $10B loan backed by its OpenAI stake and is offering to guarantee repayment if OpenAI collateral proves insufficient (Echo Wang/Reuters)
Echo Wang / Reuters : Sources: SoftBank has reopened talks for a $10B loan backed by its OpenAI stake and is offering to guarantee repayment if OpenAI collateral proves insufficient — SoftBank Group has reopened talks with a consortium of lenders for a $10 billion loan backed by its stake in OpenAI …
- Sources: Crusoe is in active talks to raise ~$3B in a funding round expected to value the company in the ~$30B range, up from a ~$10B valuation in October (Bloomberg)
Bloomberg : Sources: Crusoe is in active talks to raise ~$3B in a funding round expected to value the company in the ~$30B range, up from a ~$10B valuation in October — Crusoe, the data center upstart with contracts to supply AI computing power for the likes of Meta Platforms Inc. and Oracle Corp. …
- At a town hall, Mark Zuckerberg said Meta's AI agent development has not accelerated as expected and its reorganization was not as "clean" as it could have been (Katie Paul/Reuters)
Katie Paul / Reuters : At a town hall, Mark Zuckerberg said Meta's AI agent development has not accelerated as expected and its reorganization was not as “clean” as it could have been — Meta (META.O) Chief Executive Mark Zuckerberg told an internal town hall on Thursday that AI agent development …
- Source: crypto payments and settlement startup Mesh is raising funding led by Binance at a ~$2B valuation, six months after raising $75M at a $1B valuation (Lucinda Shen/Axios)
Lucinda Shen / Axios : Source: crypto payments and settlement startup Mesh is raising funding led by Binance at a ~$2B valuation, six months after raising $75M at a $1B valuation — Binance is set to lead Mesh's funding round that will value the crypto payments and settlement company at up to $2 billion, Axios Pro has learned.
- Travel app Hopper agrees to a $35M FTC settlement over allegations the company misled users by imposing hidden fees and misrepresenting the total costs (Lauren Forristal/TechCrunch)
Lauren Forristal / TechCrunch : Travel app Hopper agrees to a $35M FTC settlement over allegations the company misled users by imposing hidden fees and misrepresenting the total costs — Hopper, the travel app known for its AI-driven flight and hotel price predictions, has agreed to a $35 million settlement following …
- Meta could use its compute for its own models, ad scaling, SpaceX-like neocloud deals, and hosting 3rd-party models; it may be close to an Anthropic deal (Jeremie Eliahou Ontiveros/SemiAnalysis)
Jeremie Eliahou Ontiveros / SemiAnalysis : Meta could use its compute for its own models, ad scaling, SpaceX-like neocloud deals, and hosting 3rd-party models; it may be close to an Anthropic deal — Zuck Takes Plan B? SpaceX 2.0, Bedrock 2.0, MSL Isn't Giving Up, Scaling RecSys by 10x... ClusterMAX ranking coming soon?
- Meta getting into the cloud business has been inevitable for a long time, as it seeks to diversify beyond ad revenue and monetize its AI buildout (M.G. Siegler/Spyglass)
M.G. Siegler / Spyglass : Meta getting into the cloud business has been inevitable for a long time, as it seeks to diversify beyond ad revenue and monetize its AI buildout — Their need to diversify the business meets the AI build out concerns... Meta has a problem. Well, two of them, actually.
- Instagram has been running ads promoting child sexual abuse material in India, with terms like "rape video" and "child video" and linking to Telegram channels (Divya Arya/BBC)
Divya Arya / BBC : Instagram has been running ads promoting child sexual abuse material in India, with terms like “rape video” and “child video” and linking to Telegram channels — Warning: This story contains descriptions of abuse — Instagram has been running paid adverts promoting …
- An interview with Sriram Krishnan, who says "there will not be an FDA for AI" under Trump, blames the AI backlash on the industry's "doomer" messaging, and more (Financial Times)
Financial Times : An interview with Sriram Krishnan, who says “there will not be an FDA for AI” under Trump, blames the AI backlash on the industry's “doomer” messaging, and more — Sriram Krishnan tells the FT the president is against a centralised regulator as AI backlash grows
Solidot(12)
- 瑞典法院判决 Google 向比价网站赔偿 15 亿美元
瑞典法院以 Google 在搜索结果中偏袒自家购物服务为由判决它向比价网站 PriceRunner 赔偿约 15 亿美元(143 亿瑞典克朗)。这是瑞典法院在反垄断诉讼中判处的最高金额罚款,但远低于 PriceRunner 寻求的 780 亿瑞典克朗赔偿。PriceRunner 于 2022 年起诉 Google,指控 Google 操纵搜索结果。2008 年 Google 开始在搜索结果中突出展示其比价购物服务,导致竞争对手的比价网站流量急剧下降。2017 年时任欧盟竞争事务专员 Margrethe Vestager 以 Google 利用其比价购物服务获取不公平优势对其处以罚款。Google 于 2021 年对该裁决提出上诉但被驳回。之后欧洲的多家比价网站提起了赔偿诉讼。
- 索尼 PS 从 2028 年 1 月起不再发售新游戏的光盘版
数字游戏是未来,索尼正式宣布其 PS 游戏机从 2028 年 1 月起不再发售新游戏的实体光盘版本。这也意味着未来的的 PS 游戏机不会再发售包含蓝光光驱的型号。索尼称 2028 年 1 月之前已发售或即将发售的游戏实体光盘版不受影响。消费者普遍偏爱数字媒介而不是实体光盘,索尼表示它只是顺应这一趋势罢了。
- Godot 拒绝接受 AI 生成的代码
开源项目都面临 AI 代码越来越多的问题,现在负责开发开源游戏引擎 Godot 的基金会宣布修订贡献者政策,禁止递交 AI 署名的代码和 AI 智能体提交的 pull request,以及在人与人之间的沟通中禁止 AI 生成文本——机器翻译除外。新政策旨在限制 AI Slop,鼓励维护者审查代码,将新贡献者培养成未来的维护者,最重要的是要求所有贡献都必须来自对代码负责的人类,修复出现问题的代码。基金会称,“AI 不能承担责任,我们也不能指望 AI 的重度用户能充分理解他们的代码并能进行修正。”
- LHC 第三次停机维护
CERN 宣布了 LHC 的第三次长时间停机维护(Long Shutdown 3)。这次维护和升级将为下一阶段的 High-Luminosity LHC(HiLumi LHC)的运行做准备。LHC 于 1998-2008 年建造,2009 年投入运行,2010 年首次实现 3.5TeV 粒子对撞,2012 年宣布发现了希格斯玻色子。2013-2015 年 LHC 进行了第一次维护升级,使得粒子对撞的总能量提高到了 13.0TeV;2018 年底到 2022 年 4 月 LHC 进行第二次维护升级。第三次停机维护将是至今最大规模的升级改造,HiLumi LHC 计划于 2030 年投入运行,其亮度提高最多十倍,将使研究人员能收集规模更大的数据集,对希格斯玻色子进行更精确的研究,增强发现标准模型之外现象的潜力。
- Google 的 2025 年用电量增长了 37%
Google 通过最新的可持续发展报告承认,该公司自 2019 年以来用电量增长了逾 250%,在 2024 年增长 27% 基础上 2025 年又增长了 37%。Google 将这一切归于 Google Cloud、YouTube 视频串流以及 AI 基础设施的建造和运营的持续增长。Google 数据中心在 2025 年消耗了逾 4200 万 MWh 电力,2024 年则是 3060 万 MWh。这意味着 Google 数据中心的能源消耗量相当于新西兰、丹麦和尼日利亚等国全国的电力消耗量。
- OpenAI 磋商将 5% 股份送给美国政府
随着 AI 公司试图缓和与特朗普政府的关系,OpenAI 正磋商向美国政府捐赠其 5% 的股份。OpenAI CEO Sam Altman 认为,向美国公众提供该公司的股份是分享 AI 好处的最佳方式。它的提议还建议还其它美国 AI 公司向政府捐出类似的股份,目前尚不清楚 Anthropic、Google 和 Meta 等公司是否会同意该计划。OpenAI 高管建议,美国 AI 公司应将 5% 股份捐给主权基金 Alaska Permanent Fund。这一谈判是“概念性的”,还处于早期阶段,任何协议可能需要国会通过法案才能实施。
- 轨道数据中心的炒作和现实
SpaceX 创始人 Elon Musk 今年一月在达沃斯世界经济论坛上宣称,最迟三年轨道数据中心就能实现。随后 SpaceX 向 FCC 递交申请发射 100 万颗卫星建立轨道数据中心星座。Musk 总是喜欢夸大其词,他说完全自动驾驶汽车将在 2017 年实现,载人火星任务将在 2024 年实现,到 2025 年底将会制造出 1 万台 Optimus 人形机器人。目前地球轨道上约有 14,500 颗卫星,Starlink 星座占了三分之二,要部署 100 万颗卫星,SpaceX 的火箭发射频率和卫星制造能力都需要大幅提升。SpaceX 下一代火箭 Starship 能将 60 颗卫星发射到轨道上,100 万颗卫星至少需要执行 16,666 次发射。SpaceX 在 2025 年创下了 165 次轨道发射纪录,如果将发射频率提高到 10 倍,也需要十年才能发射完毕。Starlink 卫星的建造速度为每年 4000 颗,除非卫星制造发生革命性变革,制造 100 万颗卫星也需要约 25 年。轨道数据中心星座距离现实还遥遥无期。这还没有考虑轨道数据中心所需要的庞大散热器、以及辐射、维护、轨道碎片等问题。那么为什么 SpaceX 要大力宣传轨道数据中心?为了钱。IEEE Spectrum 的 Dina Genkina 称,Musk 在自己给自己发钱上几乎是天才,他让 xAI 负责建造数据中心,SpaceX 负责将它们发射到太空,特斯拉负责制造太阳能电池板,他就像是自己给自己发工资。
- DGX Spark 黑客松线上训练营:4 小时干货,从环境配置到具身智能,手把手教你搭出能跑的 Agent
NVIDIA DGX Spark 黑客松开赛即报满,但赛事之外还有一场更适合"先蹭一波再决定要不要打"的硬核直播 直播时间:7 月 12 日 10:00 - 12:00 训练营内容: 1· 黑客松赛事规则说明:解读赛事机制、评审标准与提交流程,帮助团队明确方向、高效备赛。 2 · 基于 DGX Spark 和 Step 3.7 搭建本地 Agent Team 的最佳实践:从环境配置到模型推理,讲解如何在 DGX Spark 上高效落地Stepfun3.7模型能力。 3 · Agent 一键出片:基于 DGX Spark 搭建本地视觉生成智能体 演示如何构建具备视觉理解与内容生成能力的本地 Agent,打通从提示词到成片的完整链路。 4 · 从本地 AI 到具身智能:基于 DGX Spark 构建桌面机器人 Agent 开发平台 探索 Agent 从软件走向物理世界的实现路径,展示 DGX Spark 在具身智能场景下的开发实践。
- 现代生活可能与人类大脑不匹配
人类大脑演化适应了一个充满熟人、直接威胁以及小型社交群体的世界。但今天世界变化速度远超人类演化适应的速度。这种不匹配或能解释解释人类所经历的压力、孤独和持续攀比心理。新加坡研究人员在《Behavioral Sciences》期刊上发表论文,探讨了如何从演化角度理解压力、竞争和孤独。在熟人小群体中形成的行之有效的反应,在现代生活中显得格格不入,甚至令人难以承受。这种演化上的不匹配在社交媒体时代尤其显著。论文合作者 Jose Yong 博士称,竞争不是新鲜事,但现代生活让它无处不在,从演化论的角度看,或许可以解释为什么人们对对比和害怕落后的反应如此强烈,即便这些信号并非来自小群体,而是来自陌生人或屏幕。
- NASA 发射探测器拯救坠落中的 Swift 天文台
2004 年发射的 Neil Gehrels Swift 天文台设计监测宇宙中的伽马射线暴,它最初的运行轨道高度约 600 公里,但由于大气阻力其轨道高度已衰减至约 400 公里。2024 年太阳极大期间的太阳活动导致了地球大气膨胀,加速了轨道衰减。如果不进行拯救,Swift 天文台将于 2026 年底不受控重返大气层。Swift 天文台本身没有推进系统。去年 9 月 NASA 授予 Katalyst Space Technologies 一份 3000 万美元的合同,用于开发和发射一艘与 Swift 对接并提升其轨道的探测器 LINK。LINK 于 7 月 3 日成功发射,未来几个月它将尝试用三个机械臂抓住 Swift 天文台,启动推进器,将其送回 600 公里高度的安全轨道。这是一项雄心勃勃的任务,如果成功,那么下一个拯救任务有可能是哈勃太空望远镜。
- 蜂后会将农药负担转移给蜂卵
蜜蜂是重要的授粉昆虫,但其数量在世界各地都出现了下降,原因可能与农药有关。发表在《Current Biology》期刊上的一项研究在实验室条件下跟踪了农药在一个小型蜂群中的流动。研究发现,工蜂最初通过食物过滤和在蜂巢中沉积将食物中的农药水平降低 95%,但到第 10 天过滤效率下降至 86%;蜂后体内的农药含量显著低于工蜂,但随着时间的推移,蜂后会在卵巢中积累农药并将其转移到正在发育的蜂卵;蜂后的存在会改变整个蜂群的化学物质分布,工蜂会集中接触农药,增加蜂蜡中农药的沉积。
- 阿里巴巴禁止员工使用 Claude Code
阿里巴巴因担心后门禁止员工在工作中使用 Anthropic 的 Claude Code,它要求员工使用自家的编程平台 Qoder。尽管 Anthropic 限制了中国用户和实体的访问,但 Claude Code 在中国程序员中间仍然非常受欢迎。Anthropic 上个月指控阿里巴巴蒸馏了其模型,几天前它的 Claude Code 被发现包含了检测用户是否来自中国的代码。