- AI
- LLM
- MultimodalAI
- Innovation
- OpenSource
- MachineLearning
- Research
-
AlphaGo Moment for Model Architecture Discovery; The Rise of Autonomous AI Scientists 🤖🚀
An autonomous, agentic AI system discovers novel architectures and accelerates scientific progress—breaking the human creativity bottleneck.
-
Reinforcement Pre-Training; Baking the Cherry into the Cake
Exploring Microsoft Research's approach to training language models with reinforcement learning from the ground up.
-
Group Sequence Policy Optimization (GSPO); A Smarter Approach to RL for LLMs and MoE Models
Review and notes on the GSPO paper from the Qwen team at Alibaba — improving reinforcement learning stability in LLMs, especially for Mixture-of-Experts architectures.
-
The Illusion of Thinking; Apple's Latest Paper Exposes LLM "Reasoning" Limits
Takeaways from Apple's "The Illusion of Thinking" paper, which challenges what we call "AI reasoning" and highlights critical limitations in current large language and reasoning models.
-
AlphaEvolve; Google DeepMind’s AI Agent Discovers Better Matrix Multiplication Algorithms 🤖🧮
Google DeepMind introduces AlphaEvolve, a Gemini-powered autonomous coding agent that improves on Strassen's algorithm for matrix multiplication—raising new questions about practical AI-driven breakthroughs in foundational algorithms.