White Papers

Detailed research and high-level summaries of AI advancements.

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

Executive synthesis of LLMSurgeon: Diagnosing Data Mixture of Large Language Models

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Executive synthesis of Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Reasoning with Sampling: Cutting at Decision Points

Executive synthesis of Reasoning with Sampling: Cutting at Decision Points

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Executive synthesis of Unlocking the Working Memory of Large Language Models for Latent Reasoning

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Executive synthesis of VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Recursive Self-Correction via Latent Symbolic Traces (LS-Trace)

Analysis of Recursive Self-Correction via Latent Symbolic Traces (LS-Trace)

OmniWorld: Unified Spatiotemporal World Models for Autonomous Generalization

Analysis of OmniWorld: Unified Spatiotemporal World Models for Autonomous Generalization

SparseHyper-MoE: Dynamic Routing via Hyper-Network Weight Prediction

Analysis of SparseHyper-MoE: Dynamic Routing via Hyper-Network Weight Prediction

Efficient Sparse Attention via Dynamic Token Pruning in Multi-Modal Models

Executive summary of Efficient Sparse Attention via Dynamic Token Pruning in Multi-Modal Models

Steer-to-Detect: Probing Hidden Representations for Detection of LLM-Generated Texts

Executive summary of Steer-to-Detect: Probing Hidden Representations for Detection of LLM-Generated Texts

Scaling Laws for Neural-Symbolic Integration in Large Language Models

Executive summary of Scaling Laws for Neural-Symbolic Integration in Large Language Models

Agentic Scaling Laws: Emergent Reasoning in Iterative Loops

Research synthesis and original whitepaper for Agentic Scaling Laws: Emergent Reasoning in Iterative Loops

Ultra-Low Bit Quantization for On-Device Intelligence

Research synthesis and original whitepaper for Ultra-Low Bit Quantization for On-Device Intelligence

The Synthetic Data Equilibrium: Avoiding Model Collapse

Research synthesis and original whitepaper for The Synthetic Data Equilibrium: Avoiding Model Collapse