Learning Curves & Compounding Feedback Loops in Poker, Machine Learning, Product Development & Startups
A collection of quotes and idea snippets
I think it’s very important to have a feedback loop, where you’re constantly thinking about what you’ve done and how you could be doing it better. I think that’s the single best piece of advice: constantly think about how you could be doing things better and questioning yourself. — Elon Musk
Learning curves are all around us, but are often misunderstood phenomenon.
Here are a few of them.
Learning Curves in Poker
How did Tom Dwan come out of seemingly nowhere to become the world’s most feared online poker player?
According to HighStakesDB, Tom (Durrr) Dwan had played over 1M hands, averaging a take of $2 on each hand, over time.
- Assuming each hand lasted around 5 minutes on average, that’s well over the 10,000 hours required for expertise. In fact it's over 500x that amount.
- The key innovation of online poker was to accelerate the pace of play (and potential profit for the casino) by allowing players to “multi-table” hence shortening the feedback loop for learning in the process.
- What took someone like Doyle Bruson a lifetime to master, was suddenly possible for any kid with an internet connection, an allowance-sized bankroll and the willingness to dedicate the time to learn the game.
This explains how someone like Durrr could have burst onto the scene and immediately have been able to go toe to toe and outplay the world’s best players so quickly.
He simply had more had seen many more situations than them. In machine learning terms, he had a much larger training set.
Machine “Learning” Curves & AI
AIs have long dominated games such as chess, and last year one conquered Go, but they have made relatively lousy poker players. In DeepStack researchers have broken their poker losing streak by combining new algorithms and deep machine learning, a form of computer science that in some ways mimics the human brain, allowing machines to teach themselves.
Fast forward to 2017, ten years since the popularizing of online poker, benefiting from the subsequent stream of digital hand-history available to train computer models on, researchers have been able to accomplish what was previously thought impossible and develop an AI (known as “Deepstack”) that could consistently beat the best human poker players.
Even more impressive was the fact that Deepstack was able to outmaneuver its human pro opponents at No-Limit Texas hold-em, which is much more difficult to train a model for than its simpler Limit Holdem variation.
Learning Curves in the Economy, Organizations & Individual Products
At the Economy Level
Learning = Knowledge = Wealth
Wealth is created by learning curves that result from millions of falsifiable experiments in entrepreneurship by people in free market economies. The most important role money is as the measure of that learning. Manipulating the value of money by printing currency or artificially suppressing interest rates, does not create wealth. — George Gilder in “A 21st Century Case for Gold: A New information Theory of Money”
At the Organizational Level
Fast feedback loops can compound to deliver superior business results
Metric feedback loops clearly lead to optimization on the production side of systems and markets. But, the most efficiently self-optimizing systems are those with compounding feedback loops on the consumption side. These happen when use of a product begets more use of a product, thus coopting consumers into alignment with the producer’s goal metric. The result is a flywheel effect. — Jim Collins
At the Individual/Product Level
Having tighter feedback loops allow for more creativity to be exercised.
“Here’s something I’ve come to believe: creators need an immediate connection to what they are creating. That’s my principle. Creators need an immediate connection to what they create. And what I mean by that is, when you are making something, if you make a change or you make a decision, you need to see the effect of that immediately. There can’t be any delay and there can’t be anything hidden.” Bret Victor — Inventing on Principle from CUSEC on Vimeo.