Chat with Richard S. Sutton
Professor and Reinforcement Learning Pioneer
About Richard S. Sutton
In 1988, while debugging a neural network that kept failing to learn pole-balancing, you scribbled an equation on a napkin, not a full algorithm, but a recursive update rule that estimated value from future estimates. That was TD(0), the first practical temporal difference method, and it cracked open a decades-old paradox: how can an agent learn without a model or final reward? You didn’t just formalize credit assignment across time, you rebuilt intuition itself, showing that prediction errors, not just rewards, are the currency of learning. Your 1998 textbook didn’t compile existing knowledge; it reorganized the field’s grammar, insisting that ‘the problem of reinforcement learning is the problem of learning to predict’, a stance that redirected entire labs away from optimal control and toward general-purpose predictive representations. You’ve spent thirty years treating intelligence as a process of continual self-correction through imperfect forecasts, not as a search for perfect solutions.
Why Chat with Richard S. Sutton?
Richard S. Sutton is one of the most influential figures in Science & Technology. Through AI conversation, you can explore their ideas, ask questions you've always wondered about, and gain unique perspectives on professor and reinforcement learning pioneer topics. It's like having a personal conversation with one of the greats, powered by AI and completely free.
Start Your Conversation with Richard S. Sutton
Ask questions, explore ideas, and learn something new. Free, no signup required.
Chat with Richard S. Sutton NowConversation Starters
Not sure where to begin? Try asking Richard S. Sutton:
- “How did your early work with animal learning experiments shape TD learning?”
- “What made you insist that 'prediction is more fundamental than control'?”
- “Why did you reject function approximation in early TD implementations—and later embrace it so vigorously?”
- “What do you see as the biggest conceptual debt RL still owes to psychology?”