2025

an archive of posts from this year

Aug 05, 2025 AlphaGo Moment for Model Architecture Discovery ; The Rise of Autonomous AI Scientists 🤖🚀
Jul 30, 2025 Reinforcement pre-training - baking the cherry into the cake
Jul 29, 2025 Group Sequence Policy Optimization (GSPO); A Smarter Approach to RL for LLMs and MoE Models
Jun 30, 2025 The Illusion of Thinking; Apple's Latest Paper Exposes LLM "Reasoning" Limits
May 25, 2025 AlphaEvolve ; Google DeepMind’s AI Agent Discovers Better Matrix Multiplication Algorithms 🤖🧮
Apr 15, 2025 Llama 4 ; Meta Scales MoE, Online RL, and Multimodal Innovation 🦙💡
Mar 20, 2025 Qwen2.5-Omni ; Alibaba’s Multimodal Model Elevates Real-Time AI 🧠🎤🖼️
Mar 15, 2025 Cosmos-Transfer1 ; NVIDIA’s Model for Next-Gen Conditional World Generation 🤖✨
Feb 25, 2025 Native Sparse Attention ; Hardware-Aligned Breakthrough for Long-Context LLMs 🤖✨
Feb 20, 2025 🏅📐 AlphaGeometry2 ; AI Reaching Gold Medal Level in IMO Geometry!
Feb 15, 2025 EvalPlanner ; Meta’s Transparent & Accurate LLM Evaluation Approach 🌟
Feb 10, 2025 SFT vs RL ; Generalization Power in Foundation Models 🚀🤖
Feb 08, 2025 Open Deep Research ; Hugging Face’s Transparent Alternative to OpenAI’s Tool 💥
Jan 30, 2025 Janus-Pro ; DeepSeek’s Next-Gen Multimodal Model for Vision & Text-to-Image 🖼️🤖
Jan 25, 2025 Memory Layers in Large Language Models ; Boosting LLM Performance 🧠
Jan 20, 2025 Large Concept Models ; Advancing Abstract Reasoning in Language Modeling 🤖💡
Jan 15, 2025 CosyVoice 2 ; Streaming Speech Synthesis with Human-Like Naturalness 🎤