| Oct 09, 2025 | Tiny Recursive Model ; Small, Simple… and Surprisingly Strong 🤖🧩 |
| Aug 19, 2025 | GLM-4.5 ; A Unified Open-Source Powerhouse for Agents, Reasoning & Coding 🤖✨ |
| Aug 05, 2025 | AlphaGo Moment for Model Architecture Discovery ; The Rise of Autonomous AI Scientists 🤖🚀 |
| Jul 30, 2025 | Reinforcement Pre-Training ; Baking the Cherry into the Cake |
| Jul 29, 2025 | Group Sequence Policy Optimization (GSPO) ; A Smarter Approach to RL for LLMs and MoE Models |
| Jun 30, 2025 | The Illusion of Thinking ; Apple’s Latest Paper Exposes LLM “Reasoning” Limits |
| May 25, 2025 | AlphaEvolve ; Google DeepMind’s AI Agent Discovers Better Matrix Multiplication Algorithms 🤖🧮 |
| Apr 15, 2025 | Llama 4 ; Meta Scales MoE, Online RL, and Multimodal Innovation 🦙💡 |
| Mar 20, 2025 | Qwen2.5-Omni ; Alibaba’s Multimodal Model Elevates Real-Time AI 🧠🎤🖼️ |
| Mar 15, 2025 | Cosmos-Transfer1 ; NVIDIA’s Model for Next-Gen Conditional World Generation 🤖✨ |
| Feb 25, 2025 | Native Sparse Attention ; Hardware-Aligned Breakthrough for Long-Context LLMs 🤖✨ |
| Feb 20, 2025 | 🏅📐 AlphaGeometry2 ; AI Reaching Gold Medal Level in IMO Geometry! |
| Feb 15, 2025 | EvalPlanner ; Meta’s Transparent & Accurate LLM Evaluation Approach 🌟 |
| Feb 10, 2025 | SFT vs RL ; Generalization Power in Foundation Models 🚀🤖 |
| Feb 08, 2025 | Open Deep Research ; Hugging Face’s Transparent Alternative to OpenAI’s Tool 💥 |
| Jan 30, 2025 | Janus-Pro ; DeepSeek’s Next-Gen Multimodal Model for Vision & Text-to-Image 🖼️🤖 |
| Jan 25, 2025 | Memory Layers in Large Language Models ; Boosting LLM Performance 🧠 |
| Jan 20, 2025 | Large Concept Models ; Advancing Abstract Reasoning in Language Modeling 🤖💡 |
| Jan 15, 2025 | CosyVoice 2 ; Streaming Speech Synthesis with Human-Like Naturalness 🎤 |