The math is not designed to intimidate but rather approach the "how to build sequence model" in a principled way from state space models, which draws from an arguably longer literature than neural networks.
Some of concepts are better explained here than anywhere else, and make it straightforward to make sense of Mamba, which is increasingly popular.
I did not mean it in a negative way, this is a great resource. But the math will be intimidating regardless for most devs who don't have a solid math/signal processing background. It's way beyond the simple linear algebra plus chain rule from calculus that are required to understand basic neural networks training.
Some of concepts are better explained here than anywhere else, and make it straightforward to make sense of Mamba, which is increasingly popular.