Understanding Cross-Attention in Multi-Modal Architectures
A short framework for reading cross-attention design choices in modern vision-language model papers.
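As a concrete reference point for the design choices the framework covers, here is a minimal sketch of scaled dot-product cross-attention in which text tokens (queries) attend to visual tokens (keys/values). All names, shapes, and the random stand-in weights are illustrative assumptions, not taken from any particular paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(text_hidden, image_hidden, d_k=64, seed=0):
    """Illustrative single-head cross-attention: text tokens act as
    queries over image tokens (keys/values). Projection weights are
    random stand-ins for learned parameters."""
    rng = np.random.default_rng(seed)
    d_t = text_hidden.shape[-1]
    d_i = image_hidden.shape[-1]
    W_q = rng.standard_normal((d_t, d_k)) / np.sqrt(d_t)
    W_k = rng.standard_normal((d_i, d_k)) / np.sqrt(d_i)
    W_v = rng.standard_normal((d_i, d_k)) / np.sqrt(d_i)
    Q = text_hidden @ W_q              # (n_text, d_k)
    K = image_hidden @ W_k             # (n_img, d_k)
    V = image_hidden @ W_v             # (n_img, d_k)
    scores = Q @ K.T / np.sqrt(d_k)    # (n_text, n_img)
    attn = softmax(scores, axis=-1)    # each text token's distribution over image tokens
    return attn @ V                    # (n_text, d_k)
```

Most design choices in vision-language papers vary some part of this sketch: where the visual keys/values come from, how many layers use cross-attention, and whether the projections are shared across modalities.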