OrangeBot.AI Digest — 2026-06-15
90 headlines across 8 sources, aggregated for this day.
Hacker News(15)
- A backdoor in a LinkedIn job offer (roman.pt)
- Typst 0.15.0 (typst.app)
- Hetzner Price Adjustment (docs.hetzner.com)
- TinyWind: A pixel pirate sailing game with real wind physics (380k+ kms sailed) (tinywind.io)
- Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?
- My Homelab AI Dev Platform (rsgm.dev)
- Copper transport drug restores memory and clears toxic Alzheimer's proteins (www.monash.edu)
- Iroh 1.0 (www.iroh.computer)
- Hetzner increased dedicated server prices 3-4x
- CrankGPT (crankgpt.com)
- Fox to buy Roku (www.wsj.com)
- Salesforce to Acquire Fin (formerly Intercom) for $3.6B (www.salesforce.com)
- Anthropic's Safety Superpower (stratechery.com)
- Openrouter Fusion API (openrouter.ai)
- What happened to nerds? (mrmarket.lol)
GitHub Trending(15)
- iptv-org / iptv
- teslamate-org / teslamate
- Panniantong / Agent-Reach
- meshery / meshery
- chatwoot / chatwoot
- krahets / hello-algo
- freeCodeCamp / freeCodeCamp
- trycua / cua
- jwasham / coding-interview-university
- rohitg00 / ai-engineering-from-scratch
- music-assistant / server
- Free-TV / IPTV
- Introduction-to-Autonomous-Robots / Introduction-to-Autonomous-Robots
- Raphire / Win11Debloat
- mikeroyal / Self-Hosting-Guide
Product Hunt(15)
- ColibotAI
Translate, summarize & explain any text on-device
- IdleDev
Get paid while your AI agent thinks
- AgentBrush
Your coding agent's missing tool: image generation
- MockPilot
Turn live websites into editable mockups
- VEXI
Open-source AI coding agent for your terminal
- Notchcode
Claude Code + Codex agents in your notch
- Verol
Stop AI hallucinations
- EmailFlow.AI
Like Claude Design for Email Newsletters
- MiMo Code
A coding agent with explicit long-term memory architecture
- Wobo 2.0
Tinder for jobs: swipe right and AI applies for you
- Dropmatico
Drop. Pick. Done.
- Tinfoil Pigeons
See the aircraft flying over you on a retro radar scope
- Synopsule
On device private AI meeting transcripts
- Momentra
A cozy camera app for beautifully framed memories
- AEVS
proof-of-execution for AI agents
Hugging Face(15)
- OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data
Cloning camera motion from reference videos is an important task in video generation, as videos provide intuitive and precise control. Existing methods either directly use parametric representations that fail to handle multi-shot generation or synthesize cross-paired data, which suffer from data scarcity, resulting in poor performance in complicated camera motion cloning. To address these issues, we introduce a general camera motion representation that encodes cameras as grid motion videos. This camera grid represents the camera parameters visually and supports the integration of diverse trajectories for multi-shot video generation. Building upon this, we propose OmniDirector, a unified framework trained on a million-scale camera grid-video pairs that coordinates characters, actions, and cameras to provide director-level control for multimodal diffusion transformers. Furthermore, we design a novel hierarchical prompt expansion agent that harmoniously integrates different control signals by systematically describing camera motion and visual content through understanding signal relationships. Extensive experiments demonstrate the superior performance and outstanding controllability of our framework. Project page: https://ymlinfeng.github.io/OmniDirector.github.io/
- APPO: Agentic Procedural Policy Optimization
Recent advances in agentic Reinforcement Learning (RL) have substantially improved the multi-turn tool-use capabilities of large language model agents. However, most existing methods assign credit over coarse heuristic units, such as tool-call boundaries or fixed workflows, making it difficult to identify which intermediate decisions influence downstream outcomes. In this work, we study agentic RL from two perspectives: where to branch and how to assign credit after branching. Our pilot analysis shows that influential decision points are broadly distributed throughout the generated sequence rather than concentrated at tool calls, while token entropy alone does not reliably reflect their impact on final outcomes. Motivated by these observations, we propose Agentic Procedural Policy Optimization (APPO), which shifts branching and credit assignment from coarse interaction units to fine-grained decision points in the sequence. APPO selects branching locations using a Branching Score that combines token uncertainty with policy-induced likelihood gains of subsequent continuations, enabling more targeted exploration while filtering out spurious high-entropy positions. It further introduces procedure-level advantage scaling to better distribute credit across branched rollouts. Experiments on 13 benchmarks show that APPO consistently improves strong agentic RL baselines by nearly 4 points, while keeping efficient tool-calls and maintaining behavior interpretability.
- Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents
Despite recent progress, LLM agents still struggle with reasoning over long interaction histories. While current memory-augmented agents rely on a static retrieve-then-reason paradigm, this rigid pipeline design prevents them from dynamically adapting memory access to intermediate evidence discovered during inference. To bridge this gap, we propose MRAgent, a framework that combines an associative memory graph with an active reconstruction mechanism. We represent memory as a Cue-Tag-Content graph, where associative tags serve as semantic bridges connecting fine-grained cues to memory contents. Operating on this structure, our active reconstruction mechanism integrates LLM reasoning directly into memory access, allowing the agent to iteratively explore and prune retrieval paths based on accumulated evidence. This ensures that memory retrieval is dynamically adapted to the reasoning context while avoiding combinatorial explosion caused by unconstrained expansion. Experiments on the LoCoMo benchmark and LongMemEval benchmark demonstrate significant improvements over strong baselines (up to 23%), while substantially reducing token and runtime cost, highlighting the effectiveness of active and associative reconstruction for long-horizon memory reasoning.
- From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI
Large Language Models (LLMs) are undergoing a fundamental transformation from conversational generators into integrated AI systems capable of reasoning, action, memory, and self-improvement. We conceptualize this transition as a shift from Chatbot to Digital Colleague: from conversational answers to persistent work. We organize this transition along two tightly coupled dimensions. First, at the cognitive core level, LLMs are advancing from Chatbot-era "fast thinking" systems driven by next-token prediction toward Thinking LLMs that leverage inference-time computation, Chain-of-Thought reasoning, reflection, process supervision, and reinforcement learning to support more deliberate and reliable cognition. Second, at the tool-augmented task execution level, LLMs are progressing from tool-calling Agents that invoke external resources in an ad hoc manner toward OpenClaw-style workstation systems (OpenClaw) equipped with persistent Workspaces, skills, verification loops, and governance. The "Workspace + Skill" paradigm makes episodic tool use colleague-like via state persistence, reusable procedures, task closure, and experience reuse. We examine data construction shifts from instruction-response pairs to State-Action-Observation trajectories and evaluation from static benchmarks to sandboxed, auditable, self-evolving AI ecosystems.
- Orchestra-o1: Omnimodal Agent Orchestration
The recent success of agent swarms has shifted the paradigm of large language model (LLM)-based agents from single-agent workflows to multi-agent systems, highlighting the importance of agent orchestration for task decomposition and collaboration. However, existing orchestration frameworks are limited to a narrow set of modalities and struggle to generalize to more complex settings where heterogeneous modalities coexist and interact. This limitation becomes particularly pronounced in omnimodal scenarios, where tasks require the unified understanding and coordination of diverse inputs such as text, image, audio, and video. In this work, we propose Orchestra-o1, an omnimodal agent orchestration framework designed to support efficient agent collaboration across multiple modalities. Orchestra-o1 introduces a unified orchestration mechanism that enables modality-aware task decomposition, online sub-agent specialization, and parallel sub-task execution. This scalable design allows agent systems to effectively tackle complex real-world tasks involving heterogeneous information sources, surpassing the second-best approach by 10.3% accuracy on the OmniGAIA benchmark. Furthermore, we introduce decision-aligned group relative policy optimization (DA-GRPO), an efficient agentic reinforcement learning approach for training Orchestra-o1-8B, which also achieves state-of-the-art performance against all existing open-source omnimodal agents.
- HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry
AI agent performance depends critically on the runtime harness, comprising the prompts, tools, memory, and control flow that mediate how a model observes, reasons, and acts. Yet today's harnesses remain largely hand-crafted and static: each new model or task still demands bespoke scaffolding, and the rich traces produced during execution are rarely distilled back into systematic improvement. We introduce HarnessX, a foundry for composable, adaptive, and evolvable agent harnesses. HarnessX assembles typed harness primitives via a substitution algebra, adapts them through AEGIS, a trace-driven multi-agent evolution engine grounded in an operational mirror between symbolic adaptation and reinforcement learning, and closes the harness-model loop by turning trajectories into both harness updates and model training signal. Across five benchmarks (ALFWorld, GAIA, WebShop, tau^3-Bench, and SWE-bench Verified), HarnessX yields an average gain of +14.5% (up to +44.0%), with gains largest where baselines are lowest. These results suggest that agent progress need not come from model scaling alone: composing and evolving runtime interfaces from execution feedback is an actionable and complementary lever. The complete codebase will be open-sourced in a future release.
- Rethinking RAG in Long Videos: What to Retrieve and How to Use It?
Retrieval-augmented generation is moving beyond text into long, egocentric video, where systems must select query-relevant chunks across multiple modalities and temporal granularities. Yet progress in VideoRAG is limited by two gaps: existing benchmarks allow queries to be answered without the video, obscuring retrieval errors, and prior methods apply a single modality-granularity configuration per query, ignoring chunk-level variability. We address both by introducing V-RAGBench, a benchmark of langlequery, evidence chunk, answerrangle triplets that enables faithful, decoupled evaluation of retrieval and generation, and CARVE, a simple method that runs parallel retrievers across configurations and employs chunk-adaptive reranking to identify the winning configuration for each chunk. Each chunk then enters the generator under its winning configuration selected during retrieval, yielding an interleaved evidence form where the chunk-level decision propagates across both stages. CARVE outperforms eight recent VideoRAG baselines, with the chunks supplied to the generator interleaving multiple configurations rather than sharing a single one, a behavior unattainable by query-level methods.
- From AGI to ASI
Over the last decade, building human-level artificial general intelligence has moved from far-fetched speculation to being a concrete next-decade target for many of the largest AI organisations. Achieving this goal would have profound and far-reaching impacts on human society, which raises many complex questions for the decade ahead. This report investigates how AI itself might continue to develop in a post-AGI world along the continuum of machine intelligence. The endpoint of this continuum, Universal AI, is theoretically well understood, which provides some formal grounding for the main focus of this report: the transition from human-level AGI to artificial general superintelligence, which, intuitively, can be understood as a system that is more intelligent and cognitively capable than large organisations of humans. After characterizing ASI, the report discusses four potential pathways from AGI to ASI: scaling AGI, AI paradigm shifts, recursive improvement, and ASI emerging from large-scale multi-agent collectives. The report then discusses possible frictions and bottlenecks along these pathways. Determining whether the impact of these frictions will be negligible or substantial raises a number of concrete open research questions. Due to large uncertainties for predicting ASI progress, it cannot be ruled out that AI progress might continue to accelerate over the next years. This could imply that the image of a single transformative step change, caused by the introduction of human-level AGI into our society, could be inaccurate. More apt might be the prospect of a series of transformative societal changes caused by AI-enabled progress and breakthroughs across many areas of science and technology. Preparing for this prospect requires a massively interdisciplinary endeavour of global scope and interest.
- OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains
Current automated pipelines for audio-visual Question Answering (QA) generally adopt a ``video-caption-QA'' paradigm. However, these methods typically segment videos into short clips and generate separate descriptions for audio and visual modalities. This decoupled processing severs inherent associations between sounds and their visual sources, while independent clip processing often causes inconsistent descriptions of the same entity across segments. Furthermore, coupling long-text comprehension and QA synthesis into a single step often restricts models to localized events, yielding questions lacking long-term temporal connections and deep cross-modal reasoning. To address these issues, we propose an automated data engine featuring two mechanisms: (1) Entity-Anchored Video Scripting transforms videos into structured scripts, comprising summaries, main entity lists, and segment-wise audio-visual descriptions. The entity list serves as a global prior to ensure cross-segment referential consistency and reconstruct audio-visual associations. (2) Clue-Guided QA Generation prompts models to first mine cross-segment, multimodal clues from the script, and subsequently generate QA pairs based on these high-value clues. Leveraging this pipeline, we construct the instruction-tuning dataset OmniVideo-100K and a human-verified test set, OmniVideo-Test. Fine-tuning VITA-1.5, Qwen2.5-Omni-7B and Qwen3-Omni-30B on OmniVideo-100K yields performance gains of up to 20.59% on OmniVideo-Test, demonstrating strong generalization (up to 12.64% improvements) across established benchmarks like Daily-Omni and JointAVBench.
- Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO
We identify a new dimension for enhancing rollout diversity in Group Relative Policy Optimization (GRPO) for LLMs. While GRPO relies on diverse rollouts, prevailing strategies primarily increase diversity by injecting more token-level randomness, which may introduce step-wise noise and lead to incoherent trajectories. We uncover that smaller models within the same model family inherently exhibit higher policy-level diversity, indicated by their superior pass@k relative to larger counterparts as sample counts increase. Unlike token-level noise, this diversity is temporally correlated, preserves logical consistency, and provides structured exploration signals for gradient estimation. We thus propose S2L-PO (Small-to-Large Policy Optimization), a framework that leverages fixed small models as natural explorers to train larger models. To balance exploration and exploitation, we design a progressive annealing strategy that transitions from offline small-model rollouts to the large learner's own sampling. This shift elegantly avoids mid-training performance drops caused by the small model's capacity limits, achieving faster convergence and unlocking a higher performance ceiling. S2L-PO improves accuracy on diverse mathematical reasoning benchmarks (e.g., +8.8% on AIME 24 using a 1.7B explorer to guide the 8B model) while reducing rollout compute.
- Measuring Epistemic Resilience of LLMs Under Misleading Medical Context
Large language models (LLMs) now reach expert-level scores on medical licensing exams, encouraging the assumption that high scores imply safe medical judgment while patients increasingly use them for health advice. We show this assumption is fragile: when misleading context is injected into questions that LLMs originally answer correctly, they abandon the correct answer. We call the ability to maintain correct judgment under adversarial context epistemic resilience, and introduce MedMisBench to measure it. MedMisBench contains 10,932 medical question items and 48,889 misleading context-option pairs spanning medical reasoning, agentic capability, and patient-journey evaluation. Across 11 model configurations, mean accuracy falls from 71.1% on original questions to 38.0% under focused misleading context, with 51.5% attack success. The most damaging injections are formal, rule-like fabrications: authority-framed falsehoods reach 69.5% attack success and exception-poisoning claims reach 64.1%. A 14-member clinical panel from 7 countries identified serious potential harm in 38.2% of reviewed cases. MedMisBench exposes a structural blind spot in LLM evaluation in medical settings: existing benchmarks measure what models know, but not whether they preserve correct medical judgment under misleading context.
- RedAct: Redacting Agent Capability Traces for Procedural Skill Protection
Users rely on execution traces to observe agent behavior, diagnose failures, and ensure accountability. These traces contain rich procedural detail, including tool invocations, intermediate decisions, and error-recovery logic. Yet this detail can expose private procedural skills, allowing downstream methods to recover key formulas, thresholds, and strategies without access to model weights or skill files. To quantify this risk and evaluate protection, we construct CapTraceBench, a benchmark of 75 specialized long-horizon tasks and 154 curated skills across seven domains. We also introduce RedAct https://github.com/XuShuwenn/RedAct, a protected trace release framework that localizes protected key information, rewrites traces while preserving verifier-critical evidence, and embeds behavioral watermarks for downstream provenance analysis. Across representative trace reuse methods, RedAct reduces normalized skill transfer (NST) from 44.7--67.1\% on raw traces to below the no-skill baseline, while preserving audit evidence. Its standalone behavioral watermarks reach 93.6--100.0\% true detection with a false alarm rate of at most 1.9\%. These results frame public agent traces as security interfaces and show that selective redaction can reduce procedural capability leakage without removing audit evidence.
- Skip a Layer or Loop It? Learning Program-of-Layers in LLMs
Large language models (LLMs) perform inference by following a fixed depth and order, non-recurrent execution of all layers. We reveal the wide existence of training-free, flexible, dynamic program-of-layers (PoLar), where pretrained layers can be packed as modules and then skipped or looped to form a customized program for each input. For most inputs, substantially shorter program executions can achieve the same or better accuracy, while incorrect predictions of the original LLM can be corrected by alternative programs with fewer layers. These observations indicate that inference admits multiple valid latent computations beyond the standard forward pass. To efficiently achieve PoLar in practice, we propose a lightweight PoLar prediction network, which learns to generate execution programs that dynamically skip or repeat pretrained layers for each input. Experiments on mathematical reasoning benchmarks demonstrate that PoLar consistently improves accuracy over standard inference and prior dynamic-depth methods, often while executing fewer layers, and that these gains persist under out-of-distribution evaluation. Our results suggest that fixed-depth execution captures only a narrow subset of an LLM's latent reasoning capacity.
- LLM Agents Can See Code Repositories
Coding agents powered by large language models have demonstrated strong performance on software engineering tasks. Yet most agents consume repositories almost entirely as text, which differs from how human developers use visual structure such as folder hierarchies and dependency relationships to orient themselves in large codebases. With multimodal large language models (MLLMs), it is an open question whether agents can effectively benefit from visual representations of repositories. This paper presents the first systematic empirical study of visual repository representations for LLM-based agents on repository-level issue resolution. We evaluate four recent multimodal models. Our results show that a strictly vision-only setup degrades accuracy and increases token cost, because agents lack sufficient symbolic detail and compensate with repeated visual queries. In contrast, integrating visual graphs of repository structure as a supplementary modality alongside standard text interfaces helps agents understand structure more efficiently: input token consumption decreases by up to 26% while issue-resolution accuracy is maintained or improved. Visualization is most useful during fault localization and when the agent autonomously controls exploration depth. These findings point to a practical hybrid text-and-vision design for next-generation coding agents.
- iMaC: Translating Actions into Motion and Contact Images for Embodied World Models
Embodied world models have emerged as a pivotal paradigm for visual robotic decision-making and interactive environment simulation. However, conventional embodied frameworks rely on low-dimensional structured action vectors (e.g., joint angles and end-effector poses), which suffer from limited expressive capacity, poor generalization across diverse embodiments, and unnatural dynamic modeling for complex physical interactions. To address these limitations, this paper proposesiMac (Image as Action Control), a novel unified control paradigm that treats raw visual images as native action representations for embodied world models. Departing from traditional explicit kinematic action encoding, iMac formulates continuous visual manipulation as image-based action tokens, which inherently encapsulate spatial motion intentions, interactive geometric constraints and subtle physical dynamics. We construct a dual-branch embodied architecture consisting of an image-action encoder and a dynamic world predictor: the encoder compresses target-driven visual images into compact action embeddings, while the predictor learns environment transition rules conditioned on image actions to achieve high-fidelity future state prediction and closed-loop embodied control. Extensive experiments are conducted on public embodied manipulation benchmarks and real-world robotic scenarios. The results demonstrate that iMac outperforms vector-based action control baselines in prediction accuracy, task success rate and cross-scene generalization ability. Moreover, our image-action design eliminates the reliance on manually defined action spaces, realizing flexible and universal control for heterogeneous embodied agents. This work provides an innovative visual-action perspective for embodied world models, offering a simple yet effective paradigm for scalable robotic perception and manipulation.
Techmeme(15)
- Sources: several Xbox studios, including Hellblade maker Ninja Theory, are in talks with Microsoft to buy themselves back and go independent to avoid closure (Jason Schreier/Bloomberg)
Jason Schreier / Bloomberg : Sources: several Xbox studios, including Hellblade maker Ninja Theory, are in talks with Microsoft to buy themselves back and go independent to avoid closure — The studios, which include Compulsion Games and Double Fine, are in active negotiations with Xbox and may be given the chance to go independent.
- Source: Qualcomm is in talks to buy AI chip designer Tenstorrent for $8B to $10B; Tenstorrent discussed raising $800M at a ~$3.2B valuation last year (The Information)
The Information : Source: Qualcomm is in talks to buy AI chip designer Tenstorrent for $8B to $10B; Tenstorrent discussed raising $800M at a ~$3.2B valuation last year — Qualcomm has been in talks to buy Tenstorrent, a startup that designs chips for AI, according to a person with direct knowledge of the deal.
- OpenRouter debuts Fusion, a tool for prompting multiple AI models in parallel, claiming it can achieve "Fable-level intelligence at half the price" (Brian Thomas/OpenRouter Blog)
Brian Thomas / OpenRouter Blog : OpenRouter debuts Fusion, a tool for prompting multiple AI models in parallel, claiming it can achieve “Fable-level intelligence at half the price” — We've found that synthesizing the results of multiple models can significantly outperform what individual models are capable of.
- SpaceX stock closed up 19.6% on Monday, its first full day of trading; Musk said it "might be able to reach" around $1T revenue in 2030, up from $18.7B in 2025 (Arjun Kharpal/CNBC)
Arjun Kharpal / CNBC : SpaceX stock closed up 19.6% on Monday, its first full day of trading; Musk said it “might be able to reach” around $1T revenue in 2030, up from $18.7B in 2025 — SpaceX shares climbed 20% on Monday, the first full day of trading following a record-breaking debut last week on the Nasdaq.
- Meta launches new AI features, including an "AI Mode" for search that uses Meta AI to surface answers pulled from public posts across Facebook (Lauren Forristal/TechCrunch)
Lauren Forristal / TechCrunch : Meta launches new AI features, including an “AI Mode” for search that uses Meta AI to surface answers pulled from public posts across Facebook — As Meta tries to catch up in the AI race and boost engagement with its AI bot, the company announced Monday that it's rolling …
- Arcade, which helps companies manage which actions AI agents are authorized to take, raised a $60M Series A led by SYN Ventures, following a $12M seed in 2025 (Steven Rosenbush/Wall Street Journal)
Steven Rosenbush / Wall Street Journal : Arcade, which helps companies manage which actions AI agents are authorized to take, raised a $60M Series A led by SYN Ventures, following a $12M seed in 2025 — The startup aims to help companies manage the challenge of determining which actions AI agents are authorized to take
- A US judge dismisses xAI's lawsuit alleging OpenAI stole trade secrets, saying xAI failed to show OpenAI induced a former xAI engineer to divulge trade secrets (Jonathan Stempel/Reuters)
Jonathan Stempel / Reuters : A US judge dismisses xAI's lawsuit alleging OpenAI stole trade secrets, saying xAI failed to show OpenAI induced a former xAI engineer to divulge trade secrets — A federal judge on Monday dismissed a lawsuit by Elon Musk's artificial intelligence company xAI that accused rival Sam Altman's OpenAI …
- Anthropic's belief in its own commitment to safety gives Anthropic the license to aggressively favor its business and even challenge the US government (Ben Thompson/Stratechery)
Ben Thompson / Stratechery : Anthropic's belief in its own commitment to safety gives Anthropic the license to aggressively favor its business and even challenge the US government — I'm sympathetic to the cynics who consistently characterize Anthropic's public statements, particularly those surrounding their model releases …
- Radical Numerics, which is developing AI models that learn directly from biological data, raised a $50M seed led by Emergence Capital (Natalie Breymeyer/Axios)
Natalie Breymeyer / Axios : Radical Numerics, which is developing AI models that learn directly from biological data, raised a $50M seed led by Emergence Capital — Radical Numerics, an AI research lab for biological data, raised a $50 million seed round, CEO Eric Nguyen tells Axios.
- Source: Anthropic was given 90 minutes to comply and was not provided with detailed concerns before the export control order was issued (Financial Times)
Financial Times : Source: Anthropic was given 90 minutes to comply and was not provided with detailed concerns before the export control order was issued — Export controls on Fable and Mythos raise doubts over how US will police the most powerful AI systems — The Trump administration's decision …
- Google says a Chinese-linked hacking group targeted US and Canadian academic, medical, and military research institutions from September 2023 to November 2025 (A.J. Vicens/Reuters)
A.J. Vicens / Reuters : Google says a Chinese-linked hacking group targeted US and Canadian academic, medical, and military research institutions from September 2023 to November 2025 — A Chinese-linked hacking group spent more than a year secretly stealing data from U.S. and Canadian academic …
- NewCore, which helps companies manage both human and AI agent identities in a single system, emerges from stealth with a $66M seed led by Cyberstarts (Jagmeet Singh/TechCrunch)
Jagmeet Singh / TechCrunch : NewCore, which helps companies manage both human and AI agent identities in a single system, emerges from stealth with a $66M seed led by Cyberstarts — Cybersecurity startup NewCore emerged from stealth with $66 million in funding on Monday, aiming to solve a challenge it believes …
- Cybersecurity startup Chainguard, Cisco, Cloudflare, JPMorgan Chase, and others launch Athena, a coalition to secure open-source software using AI (Rachel Metz/Bloomberg)
Rachel Metz / Bloomberg : Cybersecurity startup Chainguard, Cisco, Cloudflare, JPMorgan Chase, and others launch Athena, a coalition to secure open-source software using AI — More than two dozen companies including JPMorgan Chase & Co. and an array of cybersecurity firms are collaborating to remedy software flaws spotted …
- American Express agrees to acquire restaurant booking platform TheFork from Tripadvisor in an all-cash deal worth $700M; TRIP jumps 6%+ (Reuters)
Reuters : American Express agrees to acquire restaurant booking platform TheFork from Tripadvisor in an all-cash deal worth $700M; TRIP jumps 6%+ — American Express (AXP.N) will buy restaurant booking platform TheFork from Tripadvisor (TRIP.O) in an all-cash deal worth $700 million …
- A US judge rejects Meta's bid to dismiss a lawsuit from Strike 3, which owns porn production companies, that alleges Meta torrented its videos for AI training (Ernesto Van der Sar/TorrentFreak)
Ernesto Van der Sar / TorrentFreak : A US judge rejects Meta's bid to dismiss a lawsuit from Strike 3, which owns porn production companies, that alleges Meta torrented its videos for AI training — To keep their piracy lawsuit alive, adult film producers Strike 3 Holdings and Counterlife Media don't have to prove Meta used their films for AI training.
Solidot(15)
- 瑞士选民否决了将人口设限千万的提案
瑞士于 6 月 14 日举行全民公投,决定是否在 2050 年前将全国常住人口限制在一千万以内。瑞士的人口出生率为每名妇女生育 1.29 个孩子,远低于 2.1 的人口替代率,它的人口增长主要归因于外来移民。目前瑞士人口已超过 900 万,官方数据显示,2024 年外国公民占到了瑞士总人口的 27% 以上。右翼的瑞士人民党(Swiss People's Party)支持的提案要求“2050 年前瑞士常住人口不得超过 1000 万,且瑞士应放弃与欧盟的自由流动协议”。瑞士选民最终否决了这一被称为“瑞士脱欧”的提案,有 54.79% 的选民反对,45.21% 的选民支持,投票率为 58.86%。
- 俄罗斯计划退役漏气的国际空间站 PrK 模块
位于 Progress(进步号)气闸舱和 Zvezda(星辰号)服务舱之间的 PrK 模块因结构裂缝导致的漏气过去几年一直困扰着国际空间站,今年初漏气问题一度被认为已经修复,但本月早些时候报告漏气再次加剧,该模块的裂缝总数达到 16 处。10 天前俄罗斯宇航员试图用锯子拆除该模块的一个承重支架,此举招致了 NASA 的强烈反对,认为可能会产生严重后果,下令宇航员进入与空间站对接的 Crew Dragon 飞船,穿上宇航服,准备必要时紧急撤离。俄罗斯航天局最终放弃了拆支架的计划。双方在幕后反复的拉锯之后,最终俄罗斯通知 NASA 将退役 PrK 模块。这意味着宇航员将不再进入 PrK 模块,或再次尝试对其进行加压。而俄罗斯将需要使用其它端口向空间站转移补给。
- Arch Linux 遭遇新一轮 AUR 恶意程序攻击
Arch Linux 项目的用户软件仓库 Arch User Repository(AUR)上周遭遇了大规模恶意攻击,在处理了逾 1500 个软件包之后开发者认为问题已经得到了控制。然而仅仅过了一天,AUR 遭遇了新一轮的恶意攻击,这一次攻击者使用了代码混淆技术掩盖其意图。AUR 是用户贡献的软件包库,并非官方软件库,Arch Linux 项目可能需要暂时下线 AUR 以免遭遇一轮又一轮的恶意攻击。
- 数百万学生就读学校位于有毒污染场地 5 公里内
根据智库 Centre for Global Development 的地理分析,数百万儿童就读的学校附近存在已知的铅、汞、砷和杀虫剂等有毒污染。研究发现,亚洲、非洲和拉丁美洲 17 个国家的逾 25.2 万所学校位于有毒污染场地 5 公里范围内。这些学校有 4300 多万名儿童,其中 520 万名儿童位于 1 公里范围内。发达国家受污染影响的负担不成比例的落在贫困学生和非白人学生身上,但在发展中国家污染集中在富裕人群居住的城市,城市学校的规模通常更大,因此学生也更多——以菲律宾为例,9% 的学校靠近污染场地,而这些学校的学生总数占到全国学生总数的 27%。分析还显示,私立学校比公立学校更有可能位于污染场地附近。加纳 41% 的私立学校靠近污染场地,而公立学校的这一比例仅为 18%。
- 英国将禁止 16 岁以下青少年访问社交媒体
英国首相 Keir Starmer 宣布,英国将禁止 16 岁以下青少年访问社交媒体。英国的社媒禁令范围以及强度都高于澳大利亚的类似禁令。社媒禁令涵盖所有社交媒体,对包含聊天功能的游戏等网络产品也有单独限制,如禁止青少年与陌生人聊天。Starmer 说政府总要做出选择,他认为全面禁令是正确的选择。
- 测试显示 AI 的数学解题能力仍然不如人类专家
AI 模型的解题水平仍不及顶尖数学家。这项测试隶属 First Proof 项目,旨在评估 AI 解决复杂数学难题的能力。研究人员向 4 款 AI 系统提出 10 道科研级数学难题,再由相关数学领域的匿名专家评审团对作答结果进行打分。这次测试首次同时满足三大核心标准:题目均为前沿科研级数学问题、所有题目从未出现在模型训练数据中、由专业数学家评阅。10 名来自不同数学细分领域的研究人员,各自拿出一道本人研究过程中已解答但尚未公开发表的原创题目。这次测试中,各大推理模型依然频繁出现幻觉问题,这也是大语言模型的通病。而且所有 AI 作答在文献引用方面都“严重缺失”,全程没有标注来源。
- 中国就食品安全问题约谈山姆
中国市场监管部门因食品安全问题约谈沃尔玛旗下的山姆会员店(Sam's Club),对这家在全球第二大经济体快速扩张的仓储式连锁业务带来挑战。国家市场监督管理总局周一表示,“针对一段时期以来监管发现和媒体曝光的山姆线下门店及线上网店多发的食品安全问题”,已对该公司进行约谈。通报补充说,监管机构要求沃尔玛严格遵守中国食品安全法律,但并未说明会面具体时间,也未披露涉嫌违规的具体情况。
- 中国高校撤掉了 1.2 万个过时专业
中国高校正在大规模课程重组,撤掉了数千个“过时”专业,转而开设以科技为导向的新兴专业。这场教育改革正值中国力争成为众多高科技未来产业的全球领导者,解决严重的毕业生就业危机之际。这场危机已导致数百万年轻人难以找到工作。根据新华社援引教育部的数据,2021-2025 年间,中国高校撤销或暂停了 12200 个本科专业,同时新增了 10200 个专业,意味着逾三成的高校课程进行了调整。
- 黄金并不是惰性的
黄金是少数几乎不会被氧化的金属之一,但黄金的纳米粒子却能充当催化剂。根据发表在《Physical Review Letters》期刊上的一项研究,科学家指出黄金的惰性并非源于原子本身,而是源于黄金晶体形成的表面。黄金会形成晶体。如果沿着不同的原子平面切割晶体,会得到不同的表面排列。在黄金中,部分平面呈正方晶格,部分平面呈六方晶格。研究人员测试了不同黄金晶体表面对氧分子的吸附能力。结果显示,最常见的六方晶格黄金晶体对氧的吸附力较弱,而正方晶格则很容易吸附氧分子,能促使其发生形变至分裂,在这种情况下黄金被氧化了。
- 人类管理推动水稻产量过去五十年翻倍
伊利诺伊大学厄巴纳-香槟分校科学家的一项最新研究表明,尽管气候变化带来重重挑战,但全球水稻产量在过去半个世纪依然几乎翻了一番。研究揭示,水稻增产的秘诀并非天公作美,而是人类的管理决策,比如扩大灌溉、增施养料,以及推行能有效提升单产的耕作方式,共同维持了水稻产量,并抵消了气候相关因素带来的损失。这表明,未来的粮食安全不仅取决于环境条件,更取决于人们如何管理和调整水稻生产系统,以适应不断变化的世界。研究揭示,气候变化是导致水稻减产的首要因素。在 2006-2015 年间,因气温升高、热害频发和水资源短缺,全球水稻产量估计减少了 7%。然而大气中二氧化碳浓度的升高也成了主要的增产推手,因为它能增强光合作用、提升水分利用效率。这些发现共同描绘了一幅复杂的图景:环境变化对农业生产的影响是多面的,甚至彼此对立。
- 内存成本占到了手机成本的五成以上
Nothing CEO 兼联合创始人 Carl Pei 说,如果你考虑升级手机,最佳时机是昨天。Carl Pei 称,内存短缺影响到了 Nothing 的中端手机。内存已成为智能手机最昂贵的组件,比处理器更贵,比显示屏更贵,可能占到硬件总成本的五成以上。以 Phone (4a)为例,自决定生产这款设备到它上市,内存成本翻了一番。此后又翻了一番。手机价格在上涨,明年还会继续涨。自 2 月以来,新上市手机比上一代产品贵了 100 美元。印度售价 3 万卢比以上的手机价格涨了 7000 卢比或更多。
- Linux 7.1 释出
因所在时区差异 Linus Torvalds 在美国时间周日早晨释出了 Linux 7.1。主要新特性包括:移除了部分基于 486 的旧架构;龙芯加入高内存支持;因缺乏维护移除 RISC-V 立即执行支持;新 clone()flags 简化进程管理;io_uring 子系统加入 BPF 支持;ublk 用户空间块驱动支持零拷贝 I/O;sched_ext 初步支持子调度器(sub-scheduler);改进交换机制;完全重写 NTFS 实现,等等。
- 本田思域容易遭到“邪恶女佣攻击”
邪恶女佣攻击(Evil maid attack)是对无人值守设备的一种攻击方式,具有物理访问权限的攻击者,用某种无法检测的手段对设备进行更改,以便后续访问该设备或设备中的数据。本田思域也很容易面临类似的攻击,比如邪恶的酒店代客泊车员。研究人员发现,本田汽车使用的 Android 软件包使用了公开的 AOSP 测试密钥进行签名,只要能物理访问汽车的 USB 接口,就可以刷入任意软件包,执行任意代码。
- 科学家再生受损膝关节软骨逆转关节炎
骨关节炎是最常见的关节炎类型,美国有五分之一成年人患有骨关节炎。它会逐渐破坏关节软骨,导致疼痛、僵硬和肿胀。现有疗法主要是缓解疼痛,病情严重则需进行关节置换手术。尚无药物能减缓、阻止或逆转关节炎。名为 15-PGDH 的蛋白质与软骨的衰老相关,研究人员对比了年轻和年长小鼠的软骨,发现 15-PGDH 的水平随年龄增长翻了一番。研究人员测试了一种能阻断 15-PGDH 活性的小分子药物,发现它能修复年长小鼠受损的膝关节软骨,预防严重关节损伤后关节炎的发生。人体组织测试也表现出了类似的效果。
- 印度工人训练将会替代他们的 AI 机器人
家庭主妇 Nagireddy Sriramyachandra 头上绑着智能手机,拍摄自己切芒果的视频,以训练 AI 机器人在未来能做家务。她每录制一小时视频能赚到 250 卢比。看似普普通通的视频对科技巨头而言却弥足珍贵,能帮助机器学习如何在现实世界里像人一样行动。这位 25 岁的年轻女性是印度越来越多的 AI 训练大军成员之一。她说只是做家务谁会每小时给你 250 卢比?她表示自己未来也许会拥有一台机器人。她通过专门的应用将拍摄的视频发送给一家 AI 数据公司,该公司在印度和美国设有办事处,其客户包括多家财富 500 强跨国公司。据估计到 2050 年全球将有逾 10 亿台人形机器人投入使用,主要用于工业和商业用途。印度将自身定位为全球 AI 数据创建、处理和标注的中间商。
OrangeBot Weekly
5 Claude Code skills worth using each week — with my verdict on what’s actually good. No hype.