State of AI- Models, Research & Innovation

Latest advances in AI models, research, and industry innovation.

Tiny Recursive Model ; Small, Simple… and Surprisingly Strong 🤖🧩

The 7M-parameter Tiny Recursive Model (TRM) outperforms larger networks on reasoning puzzles like Sudoku, Maze, and ARC-AGI through clever recursion.

2 min read · October 09, 2025

2025 · AI MachineLearning DeepLearning Reasoning Recursion LLM AgenticAI GenAI Walmart WalmartGlobalTech · research-notes
GLM-4.5 ; A Unified Open-Source Powerhouse for Agents, Reasoning & Coding 🤖✨

GLM-4.5 from Zhipu AI & Tsinghua unifies agents, reasoning, and coding in a massively efficient Mixture-of-Experts model—advancing open-source AI.

2 min read · August 19, 2025

2025 · AI LLM OpenSource MixtureOfExperts MoE AgenticAI Reasoning Coding ZhipuAI GLM4 Walmart FoundationModel · research-notes
AlphaGo Moment for Model Architecture Discovery ; The Rise of Autonomous AI Scientists 🤖🚀

An autonomous, agentic AI system discovers novel architectures and accelerates scientific progress—breaking the human creativity bottleneck.

2 min read · August 05, 2025

2025 · AI ASI DeepLearning LLM Research Innovation Automation ScalingLaws StateOfTheArt Walmart WalmartGlobalTech SOTA Agents AgenticAI GenAI · research-notes
Reinforcement pre-training - baking the cherry into the cake

exploring microsoft research's revolutionary approach to training language models with reinforcement learning from the ground up

2 min read · July 30, 2025

2025 · AI reinforcement-learning LLM pretraining research · machine-learning
Group Sequence Policy Optimization (GSPO); A Smarter Approach to RL for LLMs and MoE Models

Review and notes on the GSPO paper from the Qwen team at Alibaba — improving reinforcement learning stability in LLMs, especially for Mixture-of-Experts architectures.

2 min read · July 29, 2025

2025 · AI ReinforcementLearning GSPO LLM Qwen3 PolicyOptimization MachineLearning WalmartGlobalTech Walmart DeepLearning MoE AIResearch · research-notes

State of AI- Models, Research & Innovation

Latest advances in AI models, research, and industry innovation.

Tiny Recursive Model ; Small, Simple… and Surprisingly Strong 🤖🧩

GLM-4.5 ; A Unified Open-Source Powerhouse for Agents, Reasoning & Coding 🤖✨

AlphaGo Moment for Model Architecture Discovery ; The Rise of Autonomous AI Scientists 🤖🚀

Reinforcement pre-training - baking the cherry into the cake

Group Sequence Policy Optimization (GSPO); A Smarter Approach to RL for LLMs and MoE Models