learning with yacine

learning with yacine

Home
Notes
Tutorials
Archive
About
Hierarchical Reasoning Model assembly manual for toddlers
fun time for the whole family.
Sep 9 • 
Yacine Mahdid
7
2

August 2025

get your A in college and stop wasting your life
it's not complicated.
Aug 25 • 
Yacine Mahdid
4
how to stop feeling lost in tech: the wafflehouse method
you need to sit for this one for 48h
Aug 20 • 
Yacine Mahdid
95
14
muon optimizer explained to a toddler
there's no way you won't get it
Aug 19 • 
Yacine Mahdid
14

May 2025

Next Frontier for LLM is Quality Long Context
Which is much harder than you think.
May 26 • 
Yacine Mahdid
2
4

April 2025

How Minimax-01 Achieves 1M Token Context Length with Linear Attention (MIT)
I've dig into the internal of an MIT licensed MoE system that makes use of Linear Attention (Lightning Attention) to extend it's context length to 1M…
Apr 1 • 
Yacine Mahdid
2

March 2025

You shouldn't build fully autonomous agent.
That's a big no no.
Mar 10 • 
Yacine Mahdid
2

February 2025

AI Engineering is Stochastic Software Development
It's not related to ML.
Feb 21 • 
Yacine Mahdid
7
Learning to program? In 2025? Why??
AI can do it for you!
Feb 19 • 
Yacine Mahdid
1
2
Going too deep in R1...
Why Deepseek R1 KL divergence looks like that?
Feb 17 • 
Yacine Mahdid
3
The TLDR on DeepSeek R1
Simpler than I originally thought!
Feb 3 • 
Yacine Mahdid
5

September 2024

Advice to an Undergraduate Researcher
Short advice for young machine learning researchers (and my former-self).
Sep 15, 2024 • 
Yacine Mahdid
6
© 2025 Yacine Mahdid
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture