Learning from developments in AI
Beyond the hype, the methods used to develop AI technologies are a case study for those interested in great product practice
Generative AI is, quite obviously, a major development. Aside from the tech itself, in the few months since ChatGPT’s launch, we’ve already seen Microsoft try to make Google ‘dance’, US Congressmen cosy up to OpenAI’s Sam Altman, and heard countless commentators forecast everything from judgement day to a data-driven utopia. From a product perspective, witnessing the rush to find viable use cases has been fascinating. But in this article, I want to talk about something slightly different. Because to me - a product manager with a strong interest in process and organisational development - the methods through which this technology is being developed are something we can learn from and apply in our daily practice.
On March 31 2023, OpenAI published the snappily titled, ‘Improving mathematical reasoning with process supervision’. This paper details a training method which has delivered a massive leap in GPT4’s ability to accurately solve mathematical problems. Without attempting to relay all the details, the process involved a switch from scoring GPT4 not only on its final answers to problems, but also on the steps it takes when moving towards these answers. So, in layman’s terms, GPT4 is now being asked to show its workings. As the father of two young boys, this is fascinating, because it echoes the manner in which they are taught - their teachers repeatedly ask them to explain how they reached a conclusion. And while I’m not going to argue that GPT4 is developing understanding in the same way that children develop a sense of ‘number’, working through specific processes engenders repeatability. It also makes it easier to understand what is happening under the hood in a world where computations can be rather oblique.
But how does this relate to product practice?
In essence, OpenAI is training GPT4 to follow the right process, because that will deliver better results more of the time. This has strong parallels to approaches such as User Centred Design and agile, as follows:
Focusing on leading metrics improves overall outcomes
If we consider the correct answer in a complex equation the ‘lagging indicator’ delivered after the work is done, then the prior steps taken as the problem is broken down are the ‘leading indicators’. As product managers, we know that optimising for carefully selected leading indicators is an important precursor to success. The success of ‘process supervision’ perfectly encapsulates the benefit of this approach.
Fast feedback loops allow for quick course correction
Because OpenAI are effectively optimising GPT4 by focusing on what happens before the answer, they are also giving the model more signals, faster. The ability to receive and respond to feedback - whether quantitative or qualitative - is a key facet of good product practices, while the cadence of a learning cycle directly impacts a team’s ability to iterate towards success.
Reproducible steps support good outcomes
In The Checklist Manifesto, Atul Gawande explains how, in industries ranging from medicine to aeronautics, the use of checklists radically increases the likelihood of good outcomes. Just as asking new skiers to work on their turns is safer than sending them speeding down a mountain, requiring GPT4 to move through a series of checkpoints keeps it on the straight-and-narrow and tunes its level of control. For product teams, moving through set patterns supports clarity and purpose. This is why rigorous, standardised approaches to experimentation can yield such incredible results over time. There might be considerable artistry and imagination involved, but no ‘magic’.
Disincentivising toxic behaviours
The concept of alignment is incredibly important in AI: as the science fiction of Issac Asimov shows, discrepancies in incentives could theoretically cascade to catastrophic effect (Note: feel free to watch I, Robot, if you want the gist). In my last post, I described how misaligned metrics can create organisational dysfunction. Teams - and AI - are no different. Incentivising transparency and integrity as the collective works towards mutually beneficial common goals creates the conditions for long-term success.
But what about being ‘outcomes focussed’?
Of course, some might see a tension here. In some ways, rewarding good practice and process, while emphasising impact on leading indicators, even when delivering imperfect results seems obtuse. However, it’s important to realise that genAI is, like product development, probabilistic. The likelihood of GPT4 generating the correct answer massively increased through process supervision. Similarly, while testing early and often, speaking to customers and systematically reducing the assumptions you carry will not reduce your chances of releasing a dud to 0, it will increase your hit rate massively. Plus, far less effort will be spent on bad ideas along the way. Focusing on process is, quite simply, the path to repeatable success.
Similarly, an additional parallel between product and AI can be found in the concept of ‘emergent’ capabilities. For AI, this means the development of capabilities that were not (pre-)determined by system designers: the models can do unexpected things. For product managers, harnessing good practice in consistent ways can deliver emergent benefits in one of two forms. Firstly, iterative design and customer feedback is likely to help us uncover new solutions to existing problems and, secondly, we might find new use cases or even new markets to help us pivot to product-market fit or expansion. The manner in which Airbnb honed its product and discovered opportunities to rent out more than spare rooms at conferences is a powerful example of this approach.
Methods amidst the madness
So, while I encourage you to stay on top of new AI-powered products, and the market and regulatory changes these might bring, it’s also important to learn from the manner in which these products are being developed and launched. This is cutting edge science in a world where go-to-market strategies are exposed and accelerated. There is much to learn from watching things unfold.