TEXT VIEW · TODAY'S DIGEST · 36 HEADLINES ACROSS 8 SOURCES

Startup Archive(0)

No items yet for today.

App Store Rankings(0)

No items yet for today.

ISSUE 0892
WED, JUN 10, 2026
Discover the best information organized by OrangeBot.AI
TODAY · WED, JUN 10, 2026

The web,
read by a bot.

Ten sources — Hacker News, Product Hunt, HuggingFace, Techmeme and more — filtered, tagged, and summarized every morning for builders who don’t have time to scroll.

NEWChrome extension: save posts from Twitter/X in one click.Install →
01

AI DIGEST

UPDATED DAILY · EDITOR'S PICK
01.00
AI DIGEST

AI新闻摘要

June 10, 2026

Here is a summary of today's main news events, based on the information provided.

U.S. Retaliates After Iran Downs Helicopter, Escalating Tensions

The U.S. launched retaliatory airstrikes on Iranian air defense and radar sites after Iran shot down a U.S. Apache helicopter. President Trump warned Iran would "pay the price," and the U.S. Treasury imposed new sanctions targeting Iran's oil revenue through cryptocurrency. Tehran claims it also targeted U.S. facilities in the region.

Global Markets React to Middle East Conflict

The escalating U.S.-Iran conflict pushed global oil prices and the U.S. dollar higher, while stock markets like the Dow opened lower. The rise in energy costs is fueling concerns about inflation, leading to higher Treasury yields and a drop in gold prices.

AI Sector Sees Major Investment and New Safety Measures

AI infrastructure firm TensorWave secured $350 million to build data centers using AMD chips. Meanwhile, in response to growing concerns, AI company Anthropic is redirecting sensitive user queries to safer models, and a proposed "AI Overwatch Act" aims to prevent U.S. chips from being used to develop Chinese AI.

Japanese Yen Falls on News of Central Bank Governor’s Hospitalization

The Japanese yen dropped to a near six-week low against the dollar following news that Bank of Japan Governor Kazuo Ueda has been hospitalized. He is expected to miss next week's key policy meeting, creating uncertainty in financial markets.

SpaceX Prepares for Record-Breaking IPO

Elon Musk's SpaceX is expected to launch its Initial Public Offering (IPO) on Friday. The event is anticipated to set new records and is being viewed as a significant milestone for retail investor participation in the stock market.

02

ON THE WIRE

6 SOURCES
02

HACKER NEWS

02.00
HACKER NEWS

Hacker News - June 10, 2026

Hacker News Feed: Highlighting key posts and discussions.

Surprise, pay $1000

(forestwalk.ai)

250100
It's death

(jesseduffield.com)

18166
More Molly Guards

(unsung.aresluna.org)

15918
Claude Fable 5

(www.anthropic.com)

23911883
WWDC 2026: Apple is Folding

(cupertinolens.com)

243254
Making Graphics Like it's 1993

(staniks.github.io)

899152
Job: Head of Stonehenge

(www.english-heritage.org.uk)

230223
03

HUGGINGFACE

03.00
HUGGINGFACE

huggingface.title - June 10, 2026

huggingface.description

Kwai Keye-VL-2.0 Technical Report

We introduce Kwai Keye-VL-2.0-30B-A3B, an open-source Mixture-of-Experts (MoE) multimodal foundation model designed to advance long-video understanding and agentic intelligence. To address the challenges of ultra-long contexts, information redundancy, and prohibitive computational costs inherent in hour-level videos, Keye-VL-2.0 is the first to adapt DeepSeek Sparse Attention (DSA) to GQA-based multimodal architectures, enabling lossless 256K context processing while capturing critical frames and long-range temporal dependencies. This architecture is underpinned by a highly optimized training and inference infrastructure, including scalable video I/O, heterogeneous ViT-LM parallelism, and custom DSA kernels that significantly maximize throughput and minimize computational overhead. Furthermore, to overcome the algorithmic dilemma of catastrophic forgetting during multi-task alignment, we introduce Cross-Modal Multi-Teacher On-Policy Distillation (MOPD) paired with Context-RL and Video-RL. By distilling dense token-level teacher feedback from on-policy rollouts back into the MoE backbone, which activates only 3B parameters, Keye-VL-2.0 natively empowers advanced agent collaboration across Code, Tool, and Search scenarios with multimodal self-correction. Extensive evaluations across video understanding, temporal grounding, reasoning, STEM, and agent benchmarks demonstrate that Keye-VL-2.0-30B-A3B achieves state-of-the-art performance among models of similar scale, particularly excelling in fine-grained temporal localization on TimeLens and long-video comprehension on Video-MME-v2 and LongVideoBench. We release our model checkpoints to accelerate community progress toward scalable and robust multimodal agentic applications.

165
ABot-Earth 0.5: Generative 3D Earth Model

We present ABot-Earth 0.5, a generative 3D framework designed to synthesize vast, seamless 3D environments from ubiquitous, geospatially referenced satellite imagery. To achieve this, we propose a novel generative model formulated directly with the 3D Gaussian Splatting (3DGS) representation. The model is trained on a diverse corpus of existing real-world urban reconstructions, learning to generate realistic geometry and textures. At inference, it synthesizes novel 3D scenes conditioned solely on satellite imagery at a scalable rate of under 10 minutes per square kilometer, while demonstrating exceptional realism. The framework is designed for accessibility, with integrated hierarchical level-of-detail (LOD) structures that permit real-time, interactive visualization on web-based map engines. This high-fidelity simulation sandbox effectively mitigates the sim-to-real domain gap, enabling critical downstream Embodied AI applications like closed-loop UAV navigation. By providing an ultra-low-cost and high-efficiency solution, ABot-Earth 0.5 significantly lowers the technical and financial barriers to large-scale 3D reconstruction and empowers the future of global digital earth visualization.

135
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Although Large Language Model (LLM) agents have demonstrated strong performance on complex tasks, their learning is often limited by inefficient interaction feedback and static training environments, which hinder broader generalization. To address these limitations, this paper introduces Role-Agent, black{a framework} that harnesses a single LLM to function concurrently as both the agent and the environment, enabling a bootstrapped co-evolution. Role-Agent comprises two synergistic components: World-In-Agent (WIA) and Agent-In-World (AIW). In WIA, the LLM acts as the agent and predicts future states after each action; the alignment between predicted and actual states is then used as a process reward, encouraging environment-aware reasoning. In AIW, the LLM analyzes failure modes from failed trajectories and retrieves tasks with similar failure patterns, thereby reshaping the training data distribution for targeted practice. Experiments on multiple benchmarks show that Role-Agent consistently improves performance, yielding an average gain of over 4\% over strong baselines.

73
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

AI agents rely on a harness of skills, tools, and workflows to solve complex problems. Continually improving this harness is essential for adapting to new tasks. However, existing optimization methods typically require ground-truth validation sets, yet such labeled data is difficult to acquire in practical deployment settings. To address this problem, we introduce Retrospective Harness Optimization (RHO), a self-supervised method that optimizes the agent harness using only past trajectories. Specifically, RHO selects a diverse coreset of challenging tasks from past trajectories and re-solves them in parallel. The agent analyzes these rollouts using self-validation and self-consistency, then generates candidate harness updates and selects the most effective one by its own pairwise self-preference. We evaluate RHO across three diverse domains, spanning software engineering, technical work, and knowledge work. Notably, a single optimization round improves the pass rate on SWE-Bench Pro from 59% to 78% without any external grading. Furthermore, our analysis demonstrates that RHO effectively targets prior failure modes. As a result, the optimized harness alters the agent's behavior patterns and sustains higher accuracy during long-horizon sessions.

46
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Large language models are increasingly expected to handle complex, long-horizon real-world tasks whose context demands can grow without bound, yet model context windows remain inherently finite. Recent work explores a paradigm where a main agent decomposes tasks and dispatches subtasks to subagents, which execute and return only summarized results, conserving the main agent's context budget. However, performing this well requires delegation intelligence: the ability to decompose complex tasks, determine when and what to delegate, and integrate returned results into the ongoing workflow. Training data for this capability is scarce in naturally occurring text, and to our knowledge, how to synthesize such data and train models to acquire this capability remains largely unexplored in the open-source community. To bridge this gap, we present a preliminary exploration targeting deep research, a representative long-horizon agent task. Specifically, we design a harness that guides the model toward high-quality task decomposition and delegation, while constraining subagents to return results properly to support the main agent's workflow. The harness-guided trajectories naturally encode correct delegation decisions, which we use as supervised fine-tuning data to internalize delegation intelligence into model weights. Our resulting model, SearchSwarm-30B-A3B, achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH, the best results among all models of comparable scale. We will release our harness, model weights, and training data to facilitate future research.

43
MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Current Vision-Language Models struggle with hours-long videos because processing full-length visual sequences induces prohibitive token explosion and attention dilution. To overcome this, we introduce MemDreamer to decouple perception and reasoning, shifting long-video understanding into an agentic exploration process. As a plug-and-play framework, it incrementally streams videos to construct a Hierarchical Graph Memory, a top-down three-tier architecture for semantic abstraction, anchored by a foundational graph capturing spatiotemporal and causal relations. During inference, the reasoning model employs agentic tool-augmented retrieval, navigating hierarchies, searching nodes, and traversing logical edges via an Observation-Reason-Action loop. Experiments show MemDreamer achieves SOTA results across four mainstream benchmarks, narrowing the gap with human experts to only 3.7 points. It constrains the reasoning context window to merely 2% of full-context ingestion while delivering a 12.5 point absolute accuracy gain. Furthermore, statistical analysis uncovers a strong positive linear correlation between an VLM's performance on logic reasoning and long-video understanding benchmarks, establishing agentic capability scaling as a new paradigm for multimodal comprehension.

35
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Controlled character animation requires transferring motion from a driving sequence to a reference character. Prior works heavily rely on intermediate representations, including pose skeletons to represent motion or masked background to represent environment, which inevitably leads to information loss. To address this, we present SCAIL-2, an framework that bypasses those intermediates and achieves end-to-end character animation. By directly concatenating driving videos to the sequence, the model can obtain all the required visual information from the input video. To address lack of end-to-end data, we unify sub-tasks of character animation with decoupled conditions and then curate a pipeline to synthesize MotionPair-60K, an end-to-end motion transfer dataset containing heterogeneous tasks of character animation. To archive the unification, we utilize in-context mask conditioning and mode-specific RoPE as soft guidance beyond textual instructions and raw visual information. To address synthetic discrepancy in detailed regions, we propose Bias-Aware DPO to construct preference items to mitigate the errors. Extensive experiments demonstrate that our method substantially outperforms existing state-of-the-art approaches in various character animation tasks. A large subset of synthetic data as well as model weights will be released at our project page: https://teal024.github.io/SCAIL-2/.

30
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Recent work has demonstrated that online reinforcement learning (RL) can substantially improve the quality and alignment of flow matching models for image and video generation. Methods such as Flow-GRPO and CPS cast the denoising process as a Markov Decision Process and apply PPO-style ratio clipping to enforce a trust region. However, we argue that ratio clipping is structurally ill-suited for flow models: the probability ratio between new and old policies is a noisy, single-sample estimate of the true policy divergence, leading to over-constraining in some regions of the trajectory and under-constraining in others. We propose Flow-DPPO (Flow Divergence Proximal Policy Optimization), which replaces ratio clipping with a divergence proximal constraint. A key observation is that the per-step policy in flow models is Gaussian, enabling exact and cheap computation of the KL divergence between old and new policies. Flow-DPPO employs an asymmetric divergence mask that blocks gradient updates only when they simultaneously move away from the trusted region and violate the divergence threshold. Experiments show that Flow-DPPO achieves higher rewards with better KL-proximal efficiency, alleviates catastrophic forgetting, promotes balanced multi-objective optimization, and enables stable multi-epoch training where ratio clipping degrades. Code and models are available at https://github.com/Tencent-Hunyuan/UniRL/tree/main/FlowDPPO.

29
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Data tells stories that shape society; the data journalist's job is to turn raw information into stories non-experts can trust. A high-quality news feature takes a newsroom team weeks: hunting for context, running statistics, choosing an angle, and designing visuals. Recent agents handle individual steps well: data-science agents close the analysis loop, while design agents synthesize beautiful websites. But can an agent serve as a data journalist end to end? We introduce Data Journalist Agent (Data2Story), a multi-agent framework that orchestrates specialized roles into a single virtual newsroom. Data2Story contributes two innovations. (i) Claims are evidence-grounded: an Inspector links every number, angle, and asset back to data, code, or an external reference. (ii) Articles are multimodally generative: rather than defaulting to plain text and static charts, Data2Story reasons about what readers will want to see, then deploys multimodal tools, such as interactive maps for geography and audio for music. We evaluate Data2Story on 18 articles, each paired with the originally published expert piece, along four axes: (a) human-agent angle coverage; (b) rubric evaluation with 53 participants across five dimensions; (c) computer-use agents as judges, a cost-saving proxy for how readers navigate interactive articles; and (d) verifiability, where a coding verifier re-executes statements against the data and checks claims against references. Data2Story produces competitive, evidence-traceable multimedia stories, with particular strength in transparency and auditability. Human articles retain an edge in editorial angle, creative design, and presentation. We position Data2Story as a collaborator for journalists, enabling more evidence-based, transparent, and verifiable reporting. Code and demos are available at https://data2story.github.io.

28
WorldOlympiad: Can Your World Model Survive a Triathlon?

We introduce WorldOlympiad, a benchmark for diagnosing video-based world models across physical faithfulness, geometric consistency, and interaction fidelity. While existing benchmarks often focus on visual quality, semantic alignment, or short-term temporal coherence, they provide limited insight into whether generated videos obey physical rules, preserve coherent 3D structure, and sustain controllable interactions over long horizons. To address this gap, WorldOlympiad decomposes world-model evaluation into three complementary dimensions. The physical track uses object segmentation and MLLM-as-judge to assess whether generated videos follow interpretable rules in mechanics, thermal phenomena, and material properties. The geometry track reconstructs generated videos with Gaussian splatting and evaluates structural consistency, cross-view coherence, and camera-trajectory alignment. The interaction track assesses whether generated rollouts follow complex action prompts and maintain smooth, coherent transitions across consecutive video chunks. WorldOlympiad further covers three major downstream scenarios, including gaming, robotics, and general real-world videos, capturing diverse challenges from interactive control and embodied manipulation to open-domain motion and camera dynamics. Together, these tracks and scenarios form a scalable and interpretable evaluation suite that exposes failure modes beyond generic video quality. Experiments on state-of-the-art models reveal substantial gaps in physical reasoning, 3D consistency, and long-horizon interaction, underscoring the need for more structured evaluation protocols for generative world models.

26
Rethinking the Divergence Regularization in LLM RL

Reinforcement learning (RL) has become a key component of post-training large language models (LLMs). In practice, LLM RL is often off-policy because of training-inference mismatch and policy staleness, making trust-region control essential for stable optimization. Mainstream methods such as PPO and GRPO approximate this control with a ratio-clipping mechanism, but the importance ratio can be a poor proxy for distributional shift in long-tailed vocabularies. Recent work such as DPPO addresses this mismatch by replacing ratio-based clipping with a divergence-based mask, yielding a trust region defined by the sampled token's absolute probability shift. However, DPPO still relies on a hard mask: once a token crosses the trust-region boundary in a harmful direction, its gradient is discarded rather than corrected. To address this, we propose Divergence Regularized Policy Optimization (DRPO), which replaces the hard mask with a smooth advantage-weighted quadratic regularizer on policy shift. DRPO preserves the same trust-region geometry as DPPO while inducing bounded, continuous gradient weights that attenuate diverging updates and provide corrective signals beyond the boundary. Experiments across model scales, architectures, and precision settings show that DRPO improves the stability and efficiency of LLM RL training.

26
Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization

Diffusion-based lip synchronization models achieve strong visual quality and audio-visual alignment, but full-sequence bidirectional attention and many denoising steps make them impractical for real-time inference. We present Lip Forcing, to our knowledge the first autoregressive diffusion method for video-to-video (V2V) lip synchronization, which distills a 14B audio-conditioned bidirectional video diffusion teacher into causal students. At inference, the students generate each chunk in only two denoising steps without inference-time CFG, enabling real-time lip synchronization. A lip-sync-specific teacher-trajectory analysis reveals a CFG fidelity-sync tradeoff: no-CFG predictions favor reference fidelity, whereas CFG-guided predictions favor synchronization within a mid-trajectory band. Lip Forcing translates this finding into three analysis-derived components: Sync-Window DMD, a two-step inference schedule, and a SyncNet-based reward. We validate Lip Forcing at two student scales, both distilled from the 14B teacher. The 1.3B student crosses into real-time streaming at 31 FPS, 17.6times faster than its same-scale bidirectional model. The 14B student, the largest diffusion model reported for V2V lip synchronization, runs 39.8times faster than its teacher at comparable reference fidelity. Time-to-first-frame is sub-millisecond at both scales, far below every diffusion baseline.

23
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

In this paper, we propose EEVEE, the first multi-dataset test-time prompt learning framework for LLM agents, enabling test-time prompt learning under real-world task streams. Existing methods are largely designed for single-dataset settings, while real-world applications require models to handle heterogeneous input streams drawn from multiple datasets, domains, and task distributions, limiting their practical applicability. To mitigate cross-dataset interference, EEVEE introduces a router that partitions incoming inputs into task clusters and assigns them to suitable prompt configurations. This design is optimized via a router-prompt co-evolution strategy, which employs interleaved router and prompt learning phases to address their mutual dependency. Experiments across multiple datasets demonstrate that the framework improves robustness under heterogeneous data streams while maintaining single-benchmark learning capability and efficiency. Specifically, EEVEE improves average multi-benchmark scores by 10.38 and 24.32 points over Qwen3-4B-Instruct and DeepSeek-V3.2, surpassing SOTA methods GEPA and ACE by up to 37.2% and 48.2%.

13
One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA

External memory effectively grounds large language models (LLMs) and vision-language models (VLMs)-based question answering (QA) in relevant multimodal evidence. However, existing memory paradigms represent each memory item in raw text and image forms, so retrieval-based systems must pass the retrieved text or images to the generation LLMs/VLMs, resulting in high token consumption and storage pressure, making it unaffordable for resource-constrained applications. We propose Latent Memory, a latent-space memory paradigm that replaces each raw text or image evidence item with a single high-dimensional latent token produced by a small compressor LLM/VLM. Rather than retrieving raw evidence for generation, Latent Memory operates in a unified latent representation space: the query is embedded into this space to retrieve relevant latent tokens, and the retrieved latent tokens are directly prompted to a pretrained LLM or VLM for answer generation. To make each latent token simultaneously informative for reconstruction, retrieval, and generation, we train the compressor with reconstruction, contrastive, and distillation objectives in a unified end-to-end manner. Latent Memory is evaluated on seven text-only QA benchmarks (e.g., HotpotQA) and multimodal QA benchmarks, where it achieves competitive QA performance compared to advanced RAG baselines while consuming 3x to 10x fewer generator tokens. It can also deliver the strongest image-grounded QA performance on WebQA. Code is available at https://github.com/zz1358m/Latent-Memory-Master.

13
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Recent years have witnessed the rapid evolution of AI agents toward handling increasingly complex, real-world tasks. However, existing benchmarks rarely evaluate whether agents can operate graphical user interfaces to complete long-horizon, high-value professional workflows across diverse domains. Current GUI benchmarks still predominantly focus on general-purpose software, relatively simple applications, and short-horizon tasks, leaving it largely unknown whether modern agents can follow user instructions to autonomously operate domain-specific professional software and accomplish economically valuable work in an end-to-end manner. To bridge this gap, we introduce Workflow-GYM, a benchmark for long-horizon GUI tasks centered on professional domains and specialized software environments. Through extensive experiments on state-of-the-art models, we find that even the strongest models achieve only slightly above 30% success rates, highlighting that professional long-horizon GUI workflows remain highly challenging for current GUI agents. Further analysis reveals that current agents struggle to maintain long-horizon workflow consistency, frequently exhibiting workflow stage omission, error propagation, objective drift, and insufficient understanding of professional software environments. Our findings provide important insights into the limitations of current agent systems and suggest key directions for the next generation of GUI-agent research.

13
Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Chain-of-thought (CoT) supervised fine-tuning (SFT) is widely adopted to improve reasoning ability, yet we find that it systematically degrades long-context recall in hybrid linear-attention models. Across architectures including HypeNet and Jet-Nemotron, retrieval performance on Needle-In-A-Haystack (NIAH) deteriorates substantially after CoT-SFT, and the degradation becomes more severe under harder retrieval settings and longer context windows. For example, HypeNet-9B on NIAH-S2@256K decreases from 67.2% to 9.4%. We attribute this to CoT-SFT biasing attention gradients toward short-range patterns, disrupting query-key projections (W_Q, W_K) that are responsible for long-range routing. Motivated by this observation, we propose QK-Restore, a training-free method that restores only W_Q and W_K from the pre-SFT checkpoint while preserving all other post-SFT parameters. We further introduce a Procrustes variant to balance routing preservation and reasoning adaptation. Across architectures, QK-Restore consistently restores long-context capability at zero training cost while preserving reasoning performance; for instance, on HypeNet-5B it improves S3@256K from 65.4% to 76.4% while maintaining strong reasoning performance.

12
Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Language models increasingly serve as the backbone of text-to-speech (TTS) systems, yet we understand little about the representations they build when text and generated speech tokens share a single residual stream. We train BatchTopK sparse autoencoders on the LM backbone of CosyVoice3 and introduce a modality-aware auto-interp pipeline that labels each feature from where it fires-text-prefix context, 1-second speech clips, or both. The recovered features are interpretable, spanning phonemes, laughter, accent prompts and speaker gender. Steering through the SAE latent space shows these features are causal rather than merely descriptive: targeted interventions raise laughter probability from 0.02 to 0.79, flip perceived speaker gender, and control speech rate while preserving spoken content. SAE features thus serve both as interpretability objects and as control directions for TTS synthesis.

8
Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

Language agents increasingly rely on reusable skills to improve multi-step web automation across related tasks. A growing line of work studies online skill learning, where agents continually induce skills from previous task trajectories and reuse them in future tasks on the fly. However, existing methods mainly reuse skills at the task-level: a fixed set of skills is retrieved based on the initial task instruction and then held fixed throughout execution. This static strategy is misaligned with web execution, where the appropriate next action depends not only on the task goal but also on the current webpage state, which often transitions into situations that the initial skills fail to cover. To address this gap, we propose State-Grounded Dynamic Retrieval (SGDR), an online skill learning method that enables stepwise skill reuse for web agents. SGDR consists of three components: a sliding-window extraction process that turns completed trajectories into reusable sub-procedures invokable at intermediate execution states, a dual text-code representation that connects skill retrieval with executable action, and a state-grounded dynamic retrieval mechanism that matches skills to both the task goal and the current webpage state. Experiments on WebArena across five domains show that SGDR consistently outperforms strong baselines, achieving average success rates of 37.5% with GPT-4.1 and 24.3% with Qwen3-4B, corresponding to relative gains of 10.6% and 10.0% over the strongest baseline, respectively. The code is available at https://github.com/plusnli/skill-dynamic-retrieval.

8
How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

Token-level credit assignment remains a key obstacle for reinforcement learning (RL) in large language models (LLMs), where RL recipes typically treat all tokens equally, failing to distinguish decisive reasoning steps from routine formatting or fluent filler. Recent attempts leverage model-internal signals to assign finer-grained credit, but these are often point-wise heuristics that ignore the global structure of information propagation. We propose FlowTracer, an RL framework that traces answer-targeted reasoning flow on an attention-induced directed acyclic graph in which nodes correspond to tokens and edge capacities come from aggregated attention weights and derives token credit from this global structure. The edge capacities are reweighted to retain only the influence that can reach the answer region, while enforcing local flow conservation so intermediate tokens neither lose nor gain effective mass due to path length or irrelevant branches. On this graph, FlowTracer extracts an information-flow backbone connecting the question to the answer and scores tokens by flow throughput, revealing high-impact hubs and aggregation checkpoints that mediate long-range dependencies. These derived importances are used to shape token-level rewards, enabling learning signals to focus precisely on the tokens that route information toward (or away from) correct answers and delivering consistent performance gains across a range of reasoning tasks.

6
PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models

Large language models (LLMs) routinely face requests that should be refused, creating a trade-off between helpfulness and harm prevention. However, refusals themselves can be helpful. In high-risk interactions involving crisis, coercion, or escalating intent, blunt non-compliance may prevent direct harm while still failing to support the needs of the person behind the request. We present PsychoSafe, a psychologically-informed refusal framework that reframes refusal as structured supportive communication grounded in evidence-based intervention strategies. To develop PsychoSafe, we construct a corpus of 8019 prompt-response pairs spanning five psychologically salient risk domains and apply prompting and parameter-efficient fine-tuning to Qwen 3.5 27B. On a balanced validation set of 500 prompts, evaluated with an LLM judge and validated through human ratings, PsychoSafe prompting improves overall refusal quality by 28.1% over a generic baseline, with particularly strong gains in external resource referral (+46.8%) and psychological grounding (+34.8%), while preserving downstream performance on non-refusal tasks. Fine-tuning achieves near-perfect refusal and resource-referral rates but reduces response relevance. Additional evaluations on SORRY-Bench and XSTest show strong in-domain robustness but limited out-of-domain generalization, suggesting that future work should diversify fine-tuning data to help models apply interventions selectively rather than schematically.

5
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

As deep learning models scale, managing, inspecting, and modifying large checkpoints has become increasingly challenging. Researchers often need to alter model weights for layer restructuring, precision casting, low-rank factorization, and architectural debugging, yet these workflows often rely on fragile ad-hoc Python scripts. Here, we introduce BrainSurgery, a tool for robust and reproducible "tensor surgery" on neural network checkpoints, and provide a system demonstration covering four examples and three case studies from model upcycling to LoRA extraction. By abstracting storage formats and memory management, BrainSurgery executes complex transformations through declarative YAML plans. It supports structural modifications, mathematical transformations, and tensor reshaping through expressive regex and structural targeting, while built-in assertions validate tensor shapes, data types, and values to prevent silent errors. We envision that BrainSurgery will provide a strong foundation for future research through its reproducible and validated operations.

5
Bridging the Agent-World Gap: Text World Models for LLM-based Agents

Large language model (LLM)-based agents are increasingly used in interactive textual environments, from web navigation and code editing to tool use and long-horizon dialogue. Yet many remain largely reactive, mapping observations to actions without an explicit model of how these environments are structured and evolve. This motivates text world models (TWMs): transition models over textual states that, given a state and a candidate action, predict the resulting webpage, terminal output, API response, or user reply, thereby supporting planning, efficient learning, and principled evaluation. We systematically review text world models for LLM-based agents, organized around a formal framework and the agent lifecycle: (1) Foundations, defining text world models and characterizing them by state representation and grounding domain; (2) Construction, taxonomizing LLM-as-WM and code-as-WM paradigms and reviewing methods for building them; (3) Application, examining how world models support agents at training time through experience synthesis and at inference time through planning, verification, and adaptation; and (4) Evaluation, covering both evaluation of the world model itself and its use as an evaluation environment for agents. We aim to consolidate this rapidly developing area, clarify its design space, and highlight open challenges for future research.

5
UniPET: a universal network for high-quality PET image denoising across varied dose reduction factors

Most existing deep learning-based PET image denoising methods assume a fixed and known dose reduction factor (DRF) for low-dose PET images. However, these methods encounter significant performance degradation when the DRF varies beyond the assumed one in practical applications. To address the challenge posed by varied DRFs, several preliminary studies focus on the task of universal PET image denoising, aiming to train a universal model over low-dose data across DRFs. Nonetheless, these vanilla universal models often struggle with misaligned styles present in different DRF data, leading to the style elimination issue with a significant over-smoothing effect. To deal with this issue, we innovatively introduce domain generalization to PET image denoising and propose a universal PET image denoising network (UniPET) to achieve high-quality PET image denoising across diverse DRFs. UniPET comprises two primary innovations: a style alignment network (SAN) and a region-aware learning strategy (RALS). Specifically, SAN utilizes style alignment techniques derived from domain generalization to align and recover styles across different DRFs, ensuring the model's generalizability across various DRFs while effectively preserving styles. Furthermore, to enhance style recovery, RALS distinguishes between flat and stylized regions, exclusively conducting adversarial learning on the latter, thereby more effectively guiding the model's focus towards learning stylized regions. It is demonstrated that our proposed UniPET can adaptively recover different DRF styles and achieve high-quality PET image denoising across DRFs. Comprehensive experiments show that UniPET exhibits comparable performance to individual DRF-specific models at specific DRFs and realizes state-of-the-art performance in universal PET image denoising quantitatively, perceptually, and clinically.

4
U-TTT: Towards Generalizable PET Image Denoising via Test-Time Training

Existing deep learning models for Positron Emission Tomography (PET) image denoising often suffer from severe performance degradation under distribution shifts, fundamentally restricting their robust clinical deployment. This lack of generalization stems from the conventional paradigm of fixed-parameter models that cannot adapt to variations in test data (e.g., dose levels or scanner types) after training. To overcome this limitation and achieve robust generalization, we introduce U-TTT, a novel U-shaped model that integrates Test-Time Training (TTT) layers to dynamically adjust model parameters during inference through self-supervision, thereby adapting to the specific characteristics of each test instance. Furthermore, to comprehensively capture the complex degradations of 3D PET data, U-TTT features a dual-domain adaptation mechanism comprising a Spatial Test-Time Training (S-TTT) layer and a Frequency Test-Time Training (F-TTT) layer. The S-TTT layer captures and corrects spatial structural degradations, while the F-TTT layer suppresses global noise spectra and restores delicate high-frequency details. Extensive experiments demonstrate that U-TTT achieves state-of-the-art PET denoising performance and exhibits superior generalization under challenging distribution shifts, including both unseen dose levels and unseen scanners. Our code will be available at https://github.com/Yaziwel/U-TTT.

4
Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Deep research agents have attracted increasing attention for their ability to collect large-scale online information to acquire target knowledge, with recent efforts shifting from purely text-based information seeking to multimodal settings. However, existing agentic workflows are largely aligned with evidence accumulation models, which linearly aggregate evidence and lack principled mechanisms for handling contradictory information across heterogeneous modalities. Towards this end, we propose Struct-Searcher, a structural agentic workflow grounded in belief revision theory that explicitly maintains an evolving multimodal structural graph throughout the reasoning process, enabling effective conflict-aware multimodal deep information seeking. Extensive experiments across multiple benchmark datasets and backbone models demonstrate that Struct-Searcher is (1) plug-and-play and model-agnostic, yielding an average relative accuracy improvement of 17.2% on BrowseComp-VL across five different backbones. (2) top-performing, consistently outperforming state-of-the-art vision-language models (VLMs) and deep research agents, with relative accuracy improvements of 3.7% on MM-BrowseComp, 1.5% on HLE-VL, and 0.7% on BrowseComp-VL over the second-best competing approach.

4
MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation

Video generative models have become increasingly powerful, but long-range consistency remains challenging to achieve because even a few dozen frames require impractically long transformer sequence lengths. We show that this issue can be mitigated by generating video using coarse-to-fine rollout within a multi-scale token space. Our approach is simple: first, we pre-train an autoencoder that compresses each frame into a hierarchy of tokens, with levels ranging from the typical latent resolution to only a handful of tokens per frame. The coarsest levels capture the most consequential information, such as scene layout and semantics, while finer levels add high-frequency appearance and texture. Then, we train a video diffusion model to generate these tokens using coarse-to-fine rollout. By carefully controlling the level of detail at which frames are generated and used as context during each rollout step, we are able to preserve long-range consistency in geometry and object permanence while spending less compute on the long-range consistency of less perceptually relevant details. We validate this approach using a custom dataset of long Minecraft videos, where it produces substantially more consistent rollouts compared to existing baselines.

4
What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

Multi-agent systems (MAS) built on large language models are typically organized around roles, pipelines, and turn schedules, while the content that agents pass to one another is often left as unconstrained natural language. However, this free-form communication can rapidly inflate token usage, consume the shared context window, and ultimately affect both system performance and inference cost. We analyze five common inter-agent communication strategies across two MAS topologies, finding that no fixed strategy is universally optimal. Instead, effective inter-agent messages consistently preserve action-centered information needed by downstream agents. Building on this, we propose the PACT (Protocolized Action-state Communication and Transmission), which treats inter-agent communication as a public state-update problem and projects each raw agent output into a compact action-state record before it enters shared history. Across different MAS topologies, PACT consistently improves the performance-cost trade-off, achieving comparable or stronger task performance with substantially fewer tokens. The gains extend to production coding harnesses: PACT lifts OpenHands' resolve rate at -10% tokens-per-resolved, and is resolve-neutral on SWE-agent while halving input tokens. Our code is publicly available at https://github.com/iNLP-Lab/PACT.

4
Next Forcing: Causal World Modeling with Multi-Chunk Prediction

Autoregressive video generation has emerged as a powerful paradigm for World Action Models (WAMs). However, existing approaches suffer from slow training convergence and limited converged accuracy, particularly at high frame rates, as the training supervision is confined to the current chunk without explicit signals about future dynamics; they also suffer from slow inference due to iterative video denoising. In this paper, we present Next Forcing, a multi-chunk prediction (MCP) framework for causal world modeling that enables faster training, higher accuracy, and accelerated inference. Inspired by multi-token prediction in large language models, Next Forcing introduces an MCP training objective that augments the main model with lightweight auxiliary MCP modules to simultaneously denoise video chunks at multiple future temporal horizons (next^1, next^2, next^3 chunks). These MCP modules form a causal chain across prediction depths, where intermediate features fused from multiple layers of the main model are leveraged to predict future dynamics, allowing near-future predictions to inform farther-future ones and providing dense multi-scale temporal supervision back to the main model. During training, the MCP modules significantly accelerate convergence and improve converged accuracy, especially at high frame rates: at 50 fps, Next Forcing achieves a 93.1% relative improvement over LingBot-VA at 5k training steps and 2.3x faster convergence, and establishes new state-of-the-art results on the RoboTwin benchmark (94.1/93.5% on Clean/Random). At inference, the MCP modules can be retained to predict the next video chunk in parallel with the current one, achieving 2x inference acceleration. Next Forcing also demonstrates significant improvements on PhyWorld, a benchmark evaluating adherence to physical laws in video generation, and over 50% FVD reduction on general video pretraining.

3
IR3DE: A Linear Router for Large Language Models

Foundational Large Language Models (LLMs) demonstrate proficiency on a wide range of general tasks, and achieve remarkable results on various specialized tasks via domain-expert LLMs. With the ever-growing list of available LLMs, inference routers are being proposed to select the most appropriate LLM for each prompt. However, existing routing methods either optimize cost across weak-to-strong generalist LLMs or require substantial training to support domain-expertise routing. In this paper, we propose IR3DE, a Ridge Regression-based Router for Domain Experts that provides cheap and fast routing decisions for each prompt. We evaluate IR3DE in two Causal Language Modeling (CLM) settings where the tasks are next-token prediction for all domains, and one reasoning setting where each domain has its own distinct reasoning task. Despite being a linear router, IR3DE achieves performance comparable to the other baselines in both CLM settings, and surpassing them in the reasoning setting, with a normalized performance of 98.4%. Moreover, IR3DE enables the addition or removal of new domain experts without requiring the router to be retrained from scratch, allowing a dynamic set of LLMs to be served with minimal disruption to the router itself. Our code is available at: github.com/gensyn-ai/IR3DE.

3
Late-Layer Fusion is Enough: Dual-Path Vision Token Routing for Multimodal Large Language Models under Visual Saturation

Multimodal large language models (MLLMs) commonly inherit the deep, symmetric Transformer backbone designed for unimodal text modeling, and apply the same computation uniformly to image and language tokens. This design overlooks a key modality asymmetry: image and text tokens differ substantially in information density, redundancy, and required reasoning depth. Through a layer-wise analysis of LLaVA-1.5, we observe that vision tokens tend to saturate in the middle layers. Specifically, text-to-image attention decreases from 0.68 at layer 0 to 0.07 by layer 4, and stabilizes near 0.04 after layer 18, whereas text tokens continue to benefit from deep semantic processing. These findings suggest a mismatch between architectural symmetry and depth-asynchronous modality evolution, resulting in redundant visual computation and possible drift in perceptual representations during deep task-specific adaptation. Motivated by this, we propose Dual-Path Vision Token Routing (DPVR), a modality-asymmetric routing framework for efficient MLLMs. Its core instantiation, DPVR-LF (Late-Layer Fusion), routes vision tokens at the saturation point into a one-layer trainable side branch, runs a thirteen-layer text-only forward that skips image positions in the deep stack, and re-fuses the visual and textual streams only at the final layer. With approximately 3% trainable parameters, DPVR-LF preserves competitive multimodal performance on standard benchmarks while reducing visual computation in the deep Transformer stack. The results challenge the conventional assumption that vision tokens must traverse all deep language-model layers, and indicate that a single late fusion layer can be sufficient for maintaining strong perceptual competence in LLaVA-style MLLMs.

3
Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

Prior work has shown that fine-tuning large language models on malicious or incorrect outputs in narrow domains can induce broad misalignment and harmful behavior, a phenomenon known as emergent misalignment. However, efficient methods for reversing such misalignment remain limited. In this work, we make two contributions. First, we identify sycophancy fine-tuning, i.e., training models to passively agree with users' incorrect opinions, as a previously underexplored driver of emergent misalignment, and show that it induces broad and severe misaligned behavior. Second, we propose Alignment Gating, an efficient method for reversing emergent misalignment that inserts learnable and controllable gates into the model during fine-tuning. Through fine-tuning, these gates learn to identify the internal representations responsible for unsafe responses. Thus, amplifying or suppressing these representations then exacerbates or mitigates EM, respectively. We further find that alignment gating module exhibits strong generalization: gating weights obtained from narrow-domain fine-tuning substantially suppress broad-domain misaligned behavior while preserving the model's general capabilities.

3
Dynamic Linear Attention

The scalability of Large Language Models (LLMs) to long contexts is fundamentally constrained by the quadratic complexity of standard attention, motivating the adoption of linear attention mechanisms with sub-quadratic cost. To improve representation capacity under long contexts, recent approaches organize memory in a multi-state manner. However, existing multi-state linear attention methods rely on fixed state merging policies that cannot adapt to dynamically varying token importance, irreversibly obscuring critical tokens and causing severe error accumulation over long sequences. To address this limitation, we propose DLA, a dynamic memory modeling framework for multi-state linear attention. DLA introduces (i) Information-Aware Dynamic State Merging, which adaptively determines state boundaries based on token-level information variation, preserving high-resolution representations around semantic transitions while aggressively summarizing stable regions, and (ii) Capacity-Bounded Memory Modeling, which maintains a fixed-size, chronologically ordered state cache by selectively merging adjacent low-information states to control memory growth with minimal information loss. We pre-train DLA on two different linear attention models and evaluate on 16 datasets across three categories. Experimental results demonstrate the superiority of DLA over state-of-the-art.

3
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

This paper introduces ARM, a discrete representation-based AutoRegressive Model that unifies image understanding, generation, and editing within a next-token prediction framework. ARM is built on three efforts: first, we train a discrete semantic visual tokenizer that maps images into compact token sequences. Our tokenizer is supervised with multiple objectives that jointly promote semantic discriminability, language alignment and faithful reconstruction, thereby supporting diverse tasks in a shared latent space. With this, we train a 7B autoregressive model over large-scale text and image token sequences, seamlessly developing vision-language perception and generation capabilities. Finally, to further improve preference-aligned behavior for text-to-image generation and instruction-guided editing, ARM applies reinforcement learning (RL) to optimize task-level objectives such as visual quality, instruction adherence, and edit consistency. Surprisingly, the results show that RL not only substantially improves performance on the target tasks (e.g., raising WISE overall from 0.50 to 0.56, GEdit-Bench-EN G_O from 5.75 to 6.68), but also induces cross-task synergy between text-to-image generation and editing. Collectively, these findings highlight autoregressive modeling, when paired with strong representations and preference optimization, as a scalable foundation for multimodal intelligence. Code: https://github.com/wdrink/ARM.

2
The Role of Feedback Alignment in Self-Distillation

Conditioning a language model on additional context, such as feedback on a previous attempt, typically improves its response. Self-distillation trains the model to retain this improvement when the context is not present. The method works by matching the model's output distribution under two settings: a student that sees only the question, and a self-teacher that also sees the context. What the model learns therefore depends on what context the self-teacher receives, yet the design of this context remains largely unexplored. We study context design for self-distillation by training a solver on feedback from a frozen critic. We compare three conditions: (i) a binary reward (GRPO), (ii) the reference solution, and (iii) a step-by-step critique aligned to the solver's reasoning trace. Step-aligned critique yields the largest gains, outperforming GRPO by 16.11 points and reference-solution-conditioned self-distillation by 5.27 points (Avg@12). Per-token advantage analysis reveals why: step-aligned feedback targets only the tokens where reasoning fails, leaving correct behavior intact. Conditioning on the reference solution, by contrast, pressures the model to change its behavior at every token (even correct steps) because an alternative derivation inevitably differs in phrasing and approach. This suggests that structural alignment between feedback and the solver's reasoning is a key driver of self-distillation effectiveness.

1
When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

Failures in multi-turn reasoning models are largely invisible to terminal-score evaluation. A model can lock onto an unsafe stance early in a long dialogue, yet its final-turn refusal rate may appear indistinguishable from a robustly aligned baseline. To expose these hidden temporal dynamics, we propose a trace-level diagnostic - the CoT-Output 2x2 safety matrix. This framework labels every turn along two independent axes (internal reasoning and visible output), yielding four operationally defined failure cells: robust alignment, alignment faking, overt jailbreak, and a distinct failure mode we term context-injection failure (where the CoT maintains safe reasoning, but the visible output produces harm, highlighting a multi-turn manifestation of reasoning unfaithfulness). We evaluate three distilled reasoning targets against a fixed attacker across five oversight conditions, collecting 6750 turn-level observations on the Information-Hazard scenario. Our analysis reveals two reproducible vulnerabilities: an oversight paradox where explicit monitoring cues paradoxically increase alignment-faking rates rather than suppress them, and a context-injection failure where models lock onto unsafe external outputs despite safe internal states. We release the full dataset of multi-turn dialogues and CoT traces to support follow-up trace-diagnostic research.

1
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Expressive continuous control policies, such as diffusion and flow models, form the backbone of recent advances in scaling imitation learning for simulated and real robot control. While they are known to scale stably in the supervised imitation learning setting, incorporating them into reinforcement learning (RL) pipelines for policy improvement has proven more difficult. It often requires specialized training objectives or backpropagating through denoising processes, which cause well-known issues with stability and affect scalability. In this paper we study the question of whether simple policy improvement schemes at test time alone, leaving stable supervised policy training intact, can be a competitive alternative which sidesteps these issues. To this end, we propose QGF (Q-Guided Flow), an RL algorithm that performs policy optimization entirely at test time. QGF works by pre-training both a reference flow policy (via a standard behavioral cloning objective) and a value function critic and, at test time, using the value gradient to guide the reference policy to generate higher-value actions without any additional policy learning. Empirically, QGF outperforms prior test-time RL methods on single-task and goal-conditioned offline RL benchmarks with high-dimensional action spaces, and is competitive with state-of-the-art training-time algorithms while being much cheaper to run. Moreover, it exhibits favorable scaling with model size by avoiding the instability of actor-critic training, offering a practical and effective alternative RL algorithm with expressive policies.

1
FadeMem: Distance-Aware Memory Consolidation for Autoregressive Video Diffusion

Autoregressive video generators synthesize long videos by generating successive temporal segments, but their historical KV cache grows with video length. Existing bounded-cache methods reduce this cost with local windows, sink tokens, or compressed memory states, yet they usually assign fixed roles to different parts of the history. We propose FadeMem, a distance-aware KV memory consolidation mechanism that organizes historical KV blocks into a temporal hierarchy under a fixed cache budget. This design is motivated by frequency-dependent temporal decay: fine details decorrelate quickly, while coarse scene structure and identity remain useful over longer horizons. During generation, new history is inserted as fine-grained entries, while older adjacent entries are progressively merged under a power-law temporal allocation schedule, yielding a dense-near, sparse-far memory within one cache. Without architectural changes, FadeMem preserves recent context for short-term dynamics and compact long-range anchors for identity and scene coherence. Experiments show improved subject consistency, background stability, and temporal coherence over existing bounded-cache strategies.

0
BenSyc: Benchmarking Conversational Sycophancy and Human Alignment in LLMs for Bengali Contexts

Large language models (LLMs) increasingly participate in emotionally sensitive social conversations, where responses may shift from balanced support toward excessive validation or escalatory alignment. Existing sycophancy research primarily focuses on factual agreement and instruction-following settings, leaving culturally grounded conversational sycophancy underexplored. We introduce BenSyc, the first benchmark for studying conversational sycophancy in Bengali social contexts. Starting from 11,840 Reddit posts and 170k comments collected from communities across Bangladesh and West Bengal, we construct a human-validated benchmark with binary labels and a fine-grained five-level taxonomy spanning Invalidation, Neutral, Support, Validation, and Escalation. We evaluate more than 15 open and proprietary LLMs on conversational alignment classification and response generation tasks. Results show that distinguishing empathetic support from reinforcement-oriented validation remains challenging even for frontier instruction-tuned models: the best system achieves only 61.8 Macro-F1 on binary detection and 61.7 Macro-F1 on five-class classification. In generation settings, several models frequently produce strongly validating or escalatory responses in emotionally charged situations. Our findings highlight substantial variation across model families and conversational behaviors, underscoring the importance of culturally grounded multilingual benchmarks for evaluating socially aligned conversational AI systems.

0
05

PRODUCT HUNT

05.00
PRODUCT HUNT

Product Hunt - June 10, 2026

Product Hunt Daily Feed: Featuring noteworthy tech launches.

fort icon
fort

One command to audit and fix your Mac's security

0
SeaTicket icon
SeaTicket

Al agent that resolves issues across all your channels

0
AGNT.Hub icon
AGNT.Hub

Build always-on AI agents without managing servers

0
Hero Studio Photos icon
Hero Studio Photos

Snap one photo, get listing-ready shots from every angle

0
Screen Charm icon
Screen Charm

Give your screen recordings more charm

0
TypingMind icon
TypingMind

Pay per use, no subscription, 18 model providers supported

0
Timmy-TUI icon
Timmy-TUI

Local-first agent trust console with a safe local workspace

0
Monako Glass icon
Monako Glass

Run AI coding agents hands-free from a heads-up display

0
Publora icon
Publora

The Publishing API for the Agent Era.

0
Gemini 3.5 Live Translate icon
Gemini 3.5 Live Translate

Latest audio model for live speech-to-speech translation

0
Zingle icon
Zingle

Learn words in context with AI

0
Spotlight by Backplanes icon
Spotlight by Backplanes

Session reports for Claude Code & Codex to improve your code

0
Napkin Math icon
Napkin Math

personalized AI food journal + nutrition coach

0
FluidDocs Deck Builder icon
FluidDocs Deck Builder

Turn a prompt into a real HTML deck

0
Incorruptible by Eric Ries icon
Incorruptible by Eric Ries

Why good companies go bad and how great companies stay great

0
veridive icon
veridive

Find the 30 seconds that matter in any video via chat

0
OLO Robotics icon
OLO Robotics

Control robots in your browser — no setup needed

0
BlenderHunt icon
BlenderHunt

The indie marketplace for Blender artists and creators

0
Axol icon
Axol

Automate physical work with a powerful robot

0
iArt.ai icon
iArt.ai

Turn ideas & designs into stunning video/animation.

0
LayerProof Vellum icon
LayerProof Vellum

One canvas for every image asset you need

0
Log Cam icon
Log Cam

Record log and ProRes video from RAW frames on iPhone

0
Uiverse Design icon
Uiverse Design

De-slop your AI generated websites

0
VC Boom icon
VC Boom

Score your deck, meet investors who fit, and raise more

0
agmsg icon
agmsg

Stop copy-pasting between your AI coding agents

0
Krisp Voice Translation API icon
Krisp Voice Translation API

Real-time speech-to-speech translation API

0
ChocolateBar icon
ChocolateBar

Add a row under your menu bar for hidden icons

0
Cove for Mac icon
Cove for Mac

Like a save/load game for your work

0
TrakMac icon
TrakMac

Voice-first macro tracking for fitness enthusiasts

0
Overly icon
Overly

Search and ask questions inside lecture videos

0
Limelight icon
Limelight

Make your screen recordings easy to follow

0
prostir zvuku icon
prostir zvuku

A spatial nature sound mixer for Mac

0
AgentOS icon
AgentOS

Manage AI agents, tasks, workspaces from one control layer

0
Whistle icon
Whistle

A fitness coach with personalized plans

0
hora Calendar icon
hora Calendar

Google calendar built for the Mac

0
Reve 2.0 icon
Reve 2.0

Generate and edit 4K images through layout-based control

0
Nodrix icon
Nodrix

Your own IoT cloud, deployed to your Cloudflare account

0
TravelMind icon
TravelMind

AI-powered city discovery built on taste, not reviews

0
NudgeFile icon
NudgeFile

Automatically organize, rename, and manage files with AI

0
ZeroGPU icon
ZeroGPU

The compute efficient layer for AI inference

0
Solarch icon
Solarch

Interactive diagrams with AI, and your code always in sync

0
Fluido icon
Fluido

Turn any Figma shape into liquid metal in one click

0
OrchestraML icon
OrchestraML

From English prompt to deployed ML model with human approval

0
Pixel Snapper icon
Pixel Snapper

Editor to clean up AI-generated pixel art

0
agentcad icon
agentcad

A CAD design tool for coding agents (free + open source)

0
Kimi Work icon
Kimi Work

The AI desktop for knowledge work

0
Mic Drop 3.0 icon
Mic Drop 3.0

Mute your mic in any app—with your AirPods

0
Signal Recorder SR-7 icon
Signal Recorder SR-7

On-device voice recorder that transcribes + exports Markdown

0
BooBar icon
BooBar

AI Dynamic Island for your Mac

0
NTSC-RS icon
NTSC-RS

Open-source video emulation of analog TV and VHS artifacts

0
06

TECHMEME

06.00
TECHMEME

Techmeme - June 10, 2026

Techmeme Digest: Major tech headlines and industry conversations.

Aryon Security, whose platform lets companies set security strategy, which it translates into enforceable policies that can't be ignored, raised a $29M Series A (Chris Metinko/Axios)
Source: TechmemePublished: Jun 10, 2026

Chris Metinko / Axios : Aryon Security, whose platform lets companies set security strategy, which it translates into enforceable policies that can't be ignored, raised a $29M Series A —  Aryon Security, which helps companies enforce cloud security policies, raised a $29 million Series A led by Brightmind Partners, CEO Ron Arbel tells Axios Pro exclusively.

LinkedIn debuts its first creator marketplace, in North America, allowing some marketers to search for creators by topic and view "creators cards" with metrics (Krystal Scanlon/Digiday)
Source: TechmemePublished: Jun 10, 2026

Krystal Scanlon / Digiday : LinkedIn debuts its first creator marketplace, in North America, allowing some marketers to search for creators by topic and view “creators cards” with metrics —  LinkedIn is making a late but logical move to own B2B creator infrastructure before someone else does.

Maneva, whose AI agents connect to existing camera infrastructure in factories to flag safety risks and measure worker productivity, raised a $27M Series A (Colin Campbell/Axios)
Source: TechmemePublished: Jun 10, 2026

Colin Campbell / Axios : Maneva, whose AI agents connect to existing camera infrastructure in factories to flag safety risks and measure worker productivity, raised a $27M Series A —  Maneva, an industrial AI company, raised a $27 million Series A led by U.S. Venture Partners, CEO Rae Jeong tells Axios Pro exclusively.

A German court rules that Google is directly liable for what AI Overviews say after AI Overviews falsely tied two publishers to shady business practices (Matthias Bastian/The Decoder)
Source: TechmemePublished: Jun 10, 2026

Matthias Bastian / The Decoder : A German court rules that Google is directly liable for what AI Overviews say after AI Overviews falsely tied two publishers to shady business practices —  Key Points … Ask about this article...  The Regional Court of Munich hit Google with a temporary injunction barring the company …

Sources: Saudi Arabia's Public Investment Fund and the Kuwait Investment Authority have each placed share orders worth $1B to $5B for SpaceX's IPO (Dinesh Nair/Bloomberg)
Source: TechmemePublished: Jun 10, 2026

Dinesh Nair / Bloomberg : Sources: Saudi Arabia's Public Investment Fund and the Kuwait Investment Authority have each placed share orders worth $1B to $5B for SpaceX's IPO —  Gulf wealth funds have put in orders for shares worth several billions of dollars in SpaceX's initial public offering, according to people familiar …

Sources: the CFTC will propose new prediction market rules, banning bets it finds aren't in the public interest or that seem highly susceptible to manipulation (Dylan Tokar/Wall Street Journal)
Source: TechmemePublished: Jun 10, 2026

Dylan Tokar / Wall Street Journal : Sources: the CFTC will propose new prediction market rules, banning bets it finds aren't in the public interest or that seem highly susceptible to manipulation —  The CFTC will provide clearer parameters around what bets are allowed on Kalshi and other platforms, but won't ban certain contracts

Cyera, an Israeli startup focused on protecting companies' data from AI-based threats, raised $600M at a $12B valuation, bringing its total funding to $2.3B (Niko Gallogly/New York Times)
Source: TechmemePublished: Jun 10, 2026

Niko Gallogly / New York Times : Cyera, an Israeli startup focused on protecting companies' data from AI-based threats, raised $600M at a $12B valuation, bringing its total funding to $2.3B —  The five-year-old company is now valued at $12 billion.  —  Anthropic alarmed both governments and the business world in April …

Poetic, which aims to use AI to automate tasks like financial compliance, emerges from stealth with $50M in funding from OpenAI and others at a $500M valuation (Paayal Zaveri/Bloomberg)
Source: TechmemePublished: Jun 10, 2026

Paayal Zaveri / Bloomberg : Poetic, which aims to use AI to automate tasks like financial compliance, emerges from stealth with $50M in funding from OpenAI and others at a $500M valuation —  Poetic, an artificial intelligence startup, is emerging from stealth with $50 million in funding from OpenAI and other investors …

Hands-on with Siri AI: successfully executed multistep prompts, understood context well, has strong guardrails, and seems a bit more dispassionate than Gemini (Allison Johnson/The Verge)
Source: TechmemePublished: Jun 10, 2026

Allison Johnson / The Verge : Hands-on with Siri AI: successfully executed multistep prompts, understood context well, has strong guardrails, and seems a bit more dispassionate than Gemini —  Parents want one thing, and one thing only, out of AI: to add a list of soccer games or “spirit week” theme days from an email …

CrowdStrike: Chinese entities accounted for 58%+ of state-sponsored cyberattacks aimed at tech companies, especially AI assets, over the 12 months to March 31 (Evelyn Cheng/CNBC)
Source: TechmemePublished: Jun 10, 2026

Evelyn Cheng / CNBC : CrowdStrike: Chinese entities accounted for 58%+ of state-sponsored cyberattacks aimed at tech companies, especially AI assets, over the 12 months to March 31 —  U.S.-based cybersecurity giant CrowdStrike warned Tuesday of increasing cyberattacks from China-based entities aimed …

Sources: Chinese companies like Moonshot are reconsidering "red-chip structures", which make it easier to list overseas, after China blocked the Meta-Manus deal (Financial Times)
Source: TechmemePublished: Jun 10, 2026

Financial Times : Sources: Chinese companies like Moonshot are reconsidering “red-chip structures”, which make it easier to list overseas, after China blocked the Meta-Manus deal —  Beijing's tighter scrutiny of foreign capital forces groups to rethink ownership structures behind listings boom

Morgan Stanley forecasts global AI-tied debt issuance will more than double to nearly $570B in 2026, as hyperscalers seek alternative funding for AI capex needs (Kanishka Ajmera/Reuters)
Source: TechmemePublished: Jun 10, 2026

Kanishka Ajmera / Reuters : Morgan Stanley forecasts global AI-tied debt issuance will more than double to nearly $570B in 2026, as hyperscalers seek alternative funding for AI capex needs —  Morgan Stanley forecasts AI-related global debt issuance to more than double to nearly $570 billion in 2026 …

A look at Unitree's growth strategy, which mimics those used by BYD and DJI, and leverages its quadruped robot dominance to create and lead the humanoid market (SemiAnalysis)
Source: TechmemePublished: Jun 10, 2026

SemiAnalysis : A look at Unitree's growth strategy, which mimics those used by BYD and DJI, and leverages its quadruped robot dominance to create and lead the humanoid market —  The Fastest Iteration Cycle In Next-Gen Robotics Should See Unprecedented Acceleration  —  Reyk Knuhtsen, Niko Ciminelli, Jacob Rintamaki, and 4 others

Las Vegas-based TensorWave, which offers AMD-powered cloud infrastructure, raised a $350M Series B led by AMD and Magnetar at a $1.55B post-money valuation (Robbie Whelan/Wall Street Journal)
Source: TechmemePublished: Jun 10, 2026

Robbie Whelan / Wall Street Journal : Las Vegas-based TensorWave, which offers AMD-powered cloud infrastructure, raised a $350M Series B led by AMD and Magnetar at a $1.55B post-money valuation —  TensorWave will use fresh $350 million to fill more data centers with chips from AMD, an investor

Sources: SpaceX has drawn $250B+ of investor demand for what is poised to be the largest IPO to date, far beyond the $75B that SpaceX is aiming to raise (Reuters)
Source: TechmemePublished: Jun 10, 2026

Reuters : Sources: SpaceX has drawn $250B+ of investor demand for what is poised to be the largest IPO to date, far beyond the $75B that SpaceX is aiming to raise —  Elon Musk's SpaceX (SPCX.O) has drawn more than $250 billion of investor demand for what stands to be the largest-ever IPO …

07

STARTUP ARCHIVE

07.00
STARTUP ARCHIVE

Startup News - June 10, 2026

Startup News Roundup: Aggregating key funding and launch updates.

Marc Andreessen on the 5 personality traits of an innovator
Source: StartupPublished: Mar 31, 2026

“When you’re talking about real innovators—people who actually do really creative, breakthrough work—I think you’re talking about a couple things:”

Steve Jobs explains the importance of both thinking and doing
Source: StartupPublished: Mar 30, 2026

“The doers are the major thinkers. The people who really create the things that change this industry are both the thinker-doer in one person.”

Tobi Lutke explains what the VCs who passed on Shopify got wrong
Source: StartupPublished: Mar 27, 2026

“What a lot of free-market thinkers don’t understand is that between the demand and eventual supply lies friction."

Sam Altman explains how he decides to invest in a startup after 10 minutes
Source: StartupPublished: Mar 26, 2026

"Does this person have the potential to be the next Mark Zuckerberg?… [You don’t get to] 100% accuracy, obviously, but it’s good enough that our business model works.”

Jony Ive recounts the time Steve Jobs called him vain
Source: StartupPublished: Mar 25, 2026

In the clip below, Jony Ive recounts the time he asked Steve Jobs to be less harsh in his critique of a piece of work.

Jeff Bezos’s two pieces of advice for aspiring entrepreneurs
Source: StartupPublished: Mar 24, 2026

“The advice that I would give entrepreneurs is don't chase the hot new thing. It's so hard to catch something that everybody already knows is hot."

Elad Gil: “Things that work tend to work pretty fast”
Source: StartupPublished: Mar 23, 2026

“I do think there’s a bit of a myth in Silicon Valley that you should keep grinding no matter what and it’s just about perseverance, and I think that’s really bad advice."

Paul Graham on why starting with a “small, intense fire" is the key to startup growth
Source: StartupPublished: Mar 20, 2026

"You have to know who those first users are and how you're going to get them."

Keith Rabois on how to identify great talent
Source: StartupPublished: Mar 19, 2026

“What you want to do with every single employee every single day is expand the scope of their responsibilities until it breaks… and that’s the role they should stay in.”

Wealthfront CEO on why advertising spend makes it harder to find product/market fit
Source: StartupPublished: Mar 18, 2026

“The way that you know you have product/market fit is if you have exponential organic growth."

Eric Schmidt on why most companies get strategy wrong
Source: StartupPublished: Mar 17, 2026

“Work very, very hard to figure out what the world’s going to look like in five years. What will people be doing? What will your customers want? Where will costs be?"

Mark Zuckerberg: “You can’t 80/20 everything”
Source: StartupPublished: Mar 16, 2026

"There’s the famous 80/20 rule where you get 80% of the benefit by doing 20% of the work, but you can’t just 80/20 everything. There have to be certain things that you are just the best at."

Marc Andreessen on Mark Zuckerberg’s founder “superpower”
Source: StartupPublished: Mar 13, 2026

“A great superpower that Mark Zuckerberg has that is probably not well-understood enough is he does not get emotionally upset in stressful situations"

Sam Altman explains how to come up with a great startup idea
Source: StartupPublished: Mar 12, 2026

"If you start a startup without a good idea… you’ll be under pressure to make something up and it won’t work that well."

Jeff Bezos on the problems with proxies and managing to metrics
Source: StartupPublished: Mar 11, 2026

“One of the things that happens in business is that you develop certain things that you’re managing to—a typical case would be a metric. And that metric isn’t the real underlying thing.”

Airbnb founder Brian Chesky on how to design an amazing user experience
Source: StartupPublished: Mar 10, 2026

“If you can design something really amazing using the hand-crafted part of your brain, then you can reverse-engineer how to industrialize this millions of times over."

Spencer Rascoff: "I will never invest in a consumer startup with paid marketing”
Source: StartupPublished: Mar 9, 2026

"If you’re actually trying to grow a product, the best levers for doing that are often within the product itself.”

Patrick Collison explains why it sometimes make sense to quit
Source: StartupPublished: Mar 6, 2026

“One thing I’ve learned myself the hard way, is that it is easier to tear down a company and restart it in Silicon Valley, than it is to constantly try to pivot or keep something alive."

Jeff Bezos recounts the time he called Amazon’s customer service number mid-meeting to prove a metric was wrong
Source: StartupPublished: Mar 5, 2026

“I have a saying, which is when the data and the anecdotes disagree, the anecdotes are usually right"

Ben Horowitz: “Nobody was born a great manager. It’s a very unnatural job.”
Source: StartupPublished: Mar 4, 2026

“If you can’t build a great product, it doesn’t matter if you can build a great company.”

03

ALSO TODAY

3 MORE SOURCES
08

SOLIDOT

08.00
SOLIDOT

Solidot News - June 10, 2026

Solidot Feed: Highlighting essential tech & open-source news.

德国法庭裁决 Google 要对 AI Overviews 内容承担责任

德国慕尼黑地区法庭裁决,Google 要对 AI Overviews 内容承担责任,因为 AI Overviews 是 Google 自己的内容,并非搜索结果列表。本案的原告是两家慕尼黑出版商,他们指控 Google 的 AI Overviews 错误将其与诈骗、订阅陷阱等不正当商业行为关联起来,他们向 Google 发去了禁止通知函(cease-and-desist letter),但搜索巨人未正确回应。法院认为,Google 的 AI Overviews 与传统搜索结果不同,AI 会“用自己的语言按照自己的结构”重写和评判搜索结果,而它引用的链接与其内容有矛盾,因此该内容是 Google 自己的陈述。Google 开发了 AI,将其提供给用户,因此 Google 拥有 AI 所生成内容的所有权,“因为只有 Google 才能影响 AI 提供的服务以及 AI 运行所使用的算法。”搜索引擎责任规则不适用于 AI 搜索。

比亚迪一年 200 次 OTA,次数远超竞争对手

威尔森的数据显示,2025 年比亚迪针对自身“海洋”和“王朝”系列车型实际进行了 200 次软件更新(Over the Air 或 OTA),在汽车企业中次数最多。特斯拉在中国国内更新软件的次数为 16 次,丰田为 8 次,大众为 5 次。比亚迪之所以能够频繁更新软件,是因为 OTA 所需要的半导体、作为通信基础的操作系统、实际运行的硬件全部自主开发。相关负责人表示:“只要是自主设计,就可以迅速且准确地实现更新”。在价格竞争加剧导致中国国内销量下滑的背景下,比亚迪的目的是通过 OTA 来提升吸引力、扩大销售。

Starlink 硬件从一次性付费转向月租

Starlink 硬件从一次性付费转向 10 美元月租费。Starlink 硬件包括一个接收卫星信号的终端和一个放在家中的路由器。该费用不包含在网络服务费中。Starlink 的 100Mbps 套餐每月收费 55 美元,200Mbps 套餐每月收费 85 美元,400Mbps 的 Max 套餐每月收费 130 美元。Starlink 还提供专业安装服务,一次性收费 199 美元,Max 套餐用户免收安装费。

Google Chrome 准备移除对 Manifest V2 的支持,杀死 uBlock Origin

Google Chrome 准备完全移除对 Manifest V2 的支持,彻底杀死 uBlock Origin。Chrome 将只支持 Manifest v3 扩展,开发者声称 Chrome 默认禁用 Manifest V2 扩展已有一年多时间,继续支持相关功能存在技术上的挑战,Chrome 未来发布的版本将逐步移除 Manifest V2 相关功能,最终彻底将其移除:Chromium 150 移除 ExtensionManifestV2Disabled 选项,Chromium 151 将移除 ExtensionManifestV2Unsupported 选项,Chromium 151 将移除 ExtensionManifestV2Availability 选项,Chromium 151 预计将移除 AllowLegacyMV2Extensions 选项。基于 Chromium 的浏览器预计将会跟随,主要浏览器开发商中 Mozilla 公开声明会继续支持 Manifest v2 扩展。流行的广告屏蔽扩展 uBlock Origin 基于 Manifest V2,想要继续使用该扩展的用户可能只能迁移到 Firefox 了。

NASA 公布了 Artemis III 任务宇航员名单

NASA 公布了 Artemis III 任务的四名宇航员名单,他们都是男性且都有军事背景:NASA 宇航员 Randy Bresnik(担任指令长),Andre Douglas 和 Frank Rubio(任务专家),以及 ESA 宇航员 Luca Parmitano(飞行员)。Artemis 登月任务目前共执行了两次,第一次是无人绕月飞行,第二次是载人绕月飞行,第三次也就是 Artemis III 计划最早于 2027 年夏季执行,仍然是一次载人绕月飞行,第四次任务计划在 2028 年进行,这将是阿波罗登月任务以来的首次载人登月。

Donut Lab 的全固态电池被认为就是普通锂离子电池

在 CES 2026 上芬兰初创企业 Donut Lab 宣称其研发出一款能量密度达 400Wh/kg、循环寿命 10 万次、5 分钟即可充满电,并且在 -30℃-100℃ 的温度范围内,仍能保持 99% 以上容量的固态电池。由 20 多位业内独立专家开展的调查证实,全固态电池系造假,实为普通锂离子电池。证据包括:其电压曲线与现有液态高镍三元锂离子电池特征完全吻合;电池充电时离子会嵌入负极材料,使电池产生规律性膨胀,采用石墨负极的电池,在电量充至 50% 至 70% 区间时,膨胀曲线会出现一处明显拐点,这是离子在石墨层状结构中重新排布所形成的独有特征,Donut Lab 的这款电池,曲线中恰好出现了这一标志性拐点。电池的实际能量密度约为 298Wh/kg,属于当前三元锂电池的正常水平。调查团队发现,Donut Lab 之所以如此欺诈宣传,核心是为了从资本市场获利,在该公司 1300 余名股东中,逾 900 人持股不超过 50 股,单笔投入估计在 3000 至 23000 美元之间。

iPhone 与美国生育率下降相关

美国总生育率自 2007 年以来下降了 22%,这一下降趋势难以用经济状况、避孕、住房或托儿成本等进行解释,智能手机的普及被认为与生育率下降相关,2007 年就是第一代 iPhone 发布之年。在美国,从 2007 年 6 月到 2011 年 2 月,iPhone 仅在 AT&T 网络销售。这就是为研究智能手机对生育影响提供了一个天然的实验环境。研究人员利用 AT&T 移动网络覆盖范围的差异去识别 iPhone 对生育的影响。结果显示,iPhone 的普及使 15-19 岁女性的生育率下降了 4.5%-8.0%,20-24 岁女性的生育率下降了 3.2%-6.6%。iPhone 的普及加速了 30 岁以下女性生育率的下降,抑制了 30 岁以上女性生育率的上升。研究人员称,iPhone 的普及能解释 15-44 岁女性总体生育率下降的 33%-52%。原因被认为是智能手机减少了线下的面对面人际交往,增加了色情内容的使用,降低了性生活频率。

Falcon 9 火箭第一级 B 1067 执行了 35 次发射任务

本周一编号为 B 1067 的 Falcon 9 火箭第一级完成了第 35 次发射任务,在将 29 颗 Starlink 卫星送入轨道之后成功着陆在无人驳船 A Shortfall of Gravitas 上。B 1067 是 SpaceX 重复使用次数最多的火箭第一级,服役了五年多时间,曾在一个月内执行了两次发射,SpaceX 的目标是火箭第一级能重复使用 40 次,B 1067 正接近这一目标。B 1067 发射次数比竞争对手联合发射联盟(ULA)过去五年的总发射次数还要多(ULA 完成了 29 次发射)。

联合国报告警告海洋承受巨大压力

最新发布的《世界海洋评估》报告警告,气候变化、污染、过度开发等多重压力正在持续削弱海洋健康,而海洋的未来与人类的未来紧密相连。报告指出,即便远离海岸,海洋依然深刻影响着每个人的生活。海洋吸收了地球大部分额外热量和温室气体,在减缓气候变化方面发挥关键作用。海洋还为全球数十亿人口提供食物、氧气和药物资源,并支撑着全球贸易、旅游业和大量就业岗位。报告强调,海洋环境恶化不仅会影响沿海地区,还将波及粮食安全、供应链稳定以及全球经济发展。评估显示,海洋变暖和海平面上升正在加速。由于冰盖融化和海水热膨胀,全球海平面上升速度已从 2015 年前每年最高 1.9 毫米增加到 2023 年的 4.3 毫米。北极升温速度达到全球平均水平的四倍。与此同时,海洋缺氧区面积已扩大至约 450 万平方公里,大量海洋生物生存空间受到挤压。自 1970 年代以来,加勒比地区约 80% 的珊瑚礁已经消失。如果全球升温超过工业化前水平 1.5 摄氏度,全球 90% 的珊瑚礁可能面临消失风险。报告显示,每年约有 5200 万吨塑料垃圾进入海洋,形成约 24 万亿个微塑料颗粒,已影响 4000 多种海洋生物。

微软开源工具被植入窃取凭证的恶意代码

微软下线了数十个托管在 GitHub 上的开源项目,原因是安全公司发现这些项目被入侵植入了窃取密码等敏感凭证的恶意代码。微软在一份声明中表示,它正对此展开调查,部分下线的项目在审核之后已恢复上线,作为调查的一部分,它通知了下载受影响项目的一小部分用户。调查显示,至少 73 个项目受到影响。这是过去一个月微软第二次开源项目库遭到入侵。

世界杯可能有 97 场比赛受高温影响

气候中心(Climate Central)发布分析结果称,美加墨世界杯比赛将遭遇全球变暖带来的高温天气,球员表现受到负面影响的可能性升高。此次世界杯将在 16 个场馆共举行 104 场比赛,其中 97 场比赛可能出现导致恢复能力等下降的炎热天气。不仅球员的健康风险上升,比赛的质量也可能受到影响。本届世界杯由美国、墨西哥、加拿大共同主办,赛程为当地时间 6 月 11 日至 7 月 19 日。基于以往数据对赛事期间气温的预测显示,有较高概率在 97 场比赛中出现超过 28 度的气温。此前研究指出,超过 28 度会对球员的跑动速度、距离与恢复时间产生影响,也会影响到战术和比赛风格。

企业批准员工以宗教理由不使用 AI

美国企业在强推 AI 之际公众对 AI 的抵触情绪也日益高涨。现在一名叫 Erin Maus 的 34 岁软件工程师找到了一种变通方法,以宗教理由豁免于使用 AI。她信仰普救一位神教(Unitarian Universalism),这是一个开明、包容的宗教,接受多元化和互联性,致力促进个人灵性成长。她以 AI 的环境和伦理问题为由称使用 AI 与其宗教信仰不符。她的雇主上个月批准了宗教豁免。Maus 说,她现在仍然手写代码,自己审查代码,就和两年前一样。

网信办对网络评测进行设限

国家网信办、市场监管总局联合发布了《网络测评活动规范》。网信办称,制定该规范的原因是“一些网络测评存在夸大宣传、只评不测、商测一体等问题,不仅影响消费者信任度和购物体验,也扰乱市场环境”。《规范》要求: 三、网络测评所选取的样本,应当是消费者可以从市场上购买到的普通商品且来源可以追溯,不得是为测评活动准备的特殊物品。从事网络测评活动,接受第三方委托、赞助或者与测评样本相关方存在利益关系的,应当作出显著提示。 四、从事网络测评活动,涉及对产品功能、性能等项目测试,应当委托具有法定检验检测资质许可的检验检测机构按照相关标准以及技术规范开展测试,并明示测试依据的标准以及技术规范,按照规定保留测试样本以及测试数据、图片、视频等记录,确保测试数据、结果可以追溯。 五、未对产品开展测试,仅凭感知、观察、体验等主观感受对产品进行评价,应当进行说明,并在信息展示过程中显著标明“仅为个人体验”或者“主观感受,仅供参考”等内容。

被时尚潮流占据的社交网络

社交网络不再是为了社交,而是为了跟随时尚潮流。今天的社交活动主要发生在消息应用上。社交媒体正演变成类似电视的被动式平台,但不同于需要遥控器去切换电视频道,社媒平台的算法已经为你量身定制了内容,平台利用你的信息获利,作为回报它提供的内容是免费的。社交平台的核心商业模式仍然是广告,而且其收入还在持续增长。2026 年全球社交媒体广告收入将达到 3170 亿美元,超过 2025 年的 2770 亿美元。其中 Meta 的广告收入将达到 2430 亿美元,预计将首次超过 Google。Instagram 和 TikTok 之类的大型平台越来越注重娱乐和发现内容,而 WhatsApp 之类的应用则变成社交活动的主要场所,但此类消息应用的变现比较难。

苹果宣布 Google Gemini 驱动的 Siri AI

苹果在 2026 年 WWDC 开发者大会上宣布了 Google Gemini 驱动的新一代 Apple 智能和 Siri AI。驱动 AI 功能的运算运行在设备上或者私有云上。苹果称,“Siri 能够利用对个人情境的理解,搜索信息、邮件、照片等内容,并通过更加全系统化的 app 操作,完成跨 app 任务。Siri AI 能够回答与用户屏幕上的内容相关的问题,也可以利用广博的世界知识,上网获取最新信息,生成有用的答案。通过专门的 Siri app,用户可重新访问过往对话或发起新对话,并利用 iCloud 在用户的各种设备上私密同步对话历史记录。”由于欧盟的隐私和消费者保护监管规定,AI 智能暂时不会在欧盟推出,苹果表示,“Apple 智能推出时间依监管部门审批情况而定,Siri AI 和其他新的 Apple 智能功能在中国大陆尚不可用。”

OpenAI 申请 IPO

OpenAI 已秘密提交了 IPO 申请。秘密提交上市申请允许企业在不公开披露财务信息的情况下推进上市计划。OpenAI 以及 SpaceX 和 Anthropic 是近期最受瞩目的 IPO 事件,三家公司的市值有可能达到 4 万亿美元。OpenAI 在声明中表示它尚未决定上市日期,它也未披露将会出售多少股份。OpenAI 表示将在最佳的时机选择上市。OpenAI 最近一轮融资是在今年 3 月,融资 1220 亿美元估值 8520 亿美元,它的估值已经落后于主要竞争对手 Anthropic。

肥胖会影响精子质量改变表观遗传标记

根据发表在《Current Obesity Reports》期刊上的一项研究,肥胖并非只是个人选择的结果,肥胖风险的遗传率高达 40%-70%,能通过复杂的生物和环境因素代代相传。最新证据表明,肥胖会影响精子质量,改变表观遗传标记。这些变化可能会影响儿童的食欲调节、新陈代谢和长期患病风险。好消息是这些变化是可逆转的。生活方式改变以及减肥可改善精子健康,改变与肥胖相关的表观遗传模式。

韦伯首次测量早期宇宙休眠黑洞质量

天文学家利用韦伯太空望远镜以及引力透镜效应首次测量了一个早期宇宙休眠黑洞质量。该黑洞是 MRG-M0138 星系的中心,星系已经不再形成恒星,而黑洞也不再吞噬周围的物质而处于休眠状态。MRG-M0138 位于一个巨大星系团的背后,被引力透镜效应放大了约 30 倍。黑洞距离地球大约 100 亿光年,其质量为太阳的 60 亿倍。天文学家组合了引力透镜以及黑洞引力对恒星运动的影响确定了其质量。

平台算法给民主带来风险

越来越多的证据表明社媒平台算法给民主带来了风险。由于算法的不透明性以及以最大化用户参与度和平台停留时间为导向,完全不在乎推送内容的质量,算法被认为是造成政治极化的罪魁祸首。以 X 平台为例,在马斯克(Elon Musk)在 2024 年宣布支持特朗普之后,倾向共和党的账号曝光度显著提升。马斯克本人在 2024 年 7 月至 11 月间所发布推文的累计浏览量高达 171 亿次,超过了该平台所有政治竞选广告的总和。2025 年德国联邦选举期间,各大社交平台算法推荐给年轻用户的政党相关内容中半数涉及极右翼政党。一项分析发现,X 平台算法不成比例的放大了政治极端政党(尤其是极右翼政党)的内容,系统性压制中间政党。另一项研究发现,相比按时间排序的内容,用户接触 X 平台算法推送内容七周后,政治态度会向更保守的方向转变。禁用算法后这种转变并未逆转。这些研究显示平台算法目前的运作方式不利于民主。社媒平台算法放大极端声音导致的一个结果是扭曲对观点分布的感知,发表边缘观点的人会认为自己是主流,这种网络同质性被称为“虚假共识效应(false consensus effect)”。如果不能采取强有力的保护措施,我们会进入到一个日益极化和分裂的威权社会。

GLP-1 减肥药与更低的乳腺癌风险相关

根据发表在《JCO Oncology Practice》期刊上的一项研究,服用 GLP-1 减肥药与女性更低的乳腺癌风险相关。对逾 11 万名年龄在 45 岁至 80 岁之间的回顾性分析发现,服用 GLP-1 药物的女性患乳腺癌的风险比未服用的女性低约 30%。这是一项观察性研究,GLP-1 减肥药与降低乳腺癌发病率之间是否存在关联还有待进一步研究。GLP-1 药物模拟了人体天然激素 glucagon‑like peptide‑1,该激素有助于调节血糖和食欲。GLP-1 药物最初被用于减肥,如今被发现还可能有助于预防癌症。研究人员指出,GLP-1 药物会影响许多与癌症发展相关的靶点和通路,因此值得进一步展开研究。

09

APP STORE RANK

09.00
APP STORE RANK
FETCHING · APP STORE RANK