Discussion about this post

User's avatar
Pushpendre Rastogi's avatar

Sorry if this I am being too dense, but I didn't understand this part.

> Gradient descent is literally integral control! The “plant” takes as input a vector and outputs the gradient of some function evaluated at the input. The “controller” integrates the plant outputs and sends in a new vector proportional to the negative of that integral.

Wouldn't a better analogy for integral control be methods with momentum , or nesterov acceleration type methods? If loss equals error, then vanilla gradient descent seems more like the D in PID.

Vmax's avatar

I'm surprised you mention that control courses prove everything in maximum generality and then talk about PID later. This is the exact opposite in most universities in my experience. Most mainstream undergraduate control courses/books heavily emphasize the basic PID, lead-lag, etc. approaches before doing anything that bears any semblance of generality; in fact, it would be highly unusual to encounter "general" control design in such a course. The view that PID control systems are foundational is precisely the introductory viewpoint. While I agree that this is not the case at the graduate level, it's typically assumed that people have already had a "PID" course by that stage of their education and are familiar with its "universality" as a design tool.

No posts

Ready for more?