Innovation, regional economic growth, and the UK’s productivity problem

A week ago I gave a talk with this title at a conference organised by the Smart Specialisation Hub. This organisation was set up to help regional authorities develop their economic plans; given the importance of local industrial strategies in the government’s overall industrial strategy, its role is all the more important.

Other speakers at the conference represented central government, the UK’s innovation agency InnovateUK, and the Smart Specialisation Hub itself. Representing no-one but myself, I was able to be more provocative in my own talk, which you can download here (PDF, 4.7 MB).

My talk had four sections. Opening with the economic background, I argued that the UK’s stagnation in productivity growth and its regional economic inequality have broken our political settlement. Looking at what’s going on in Westminster at the moment, I don’t think this is an exaggeration.

I went on to discuss the implications of the 2.4% R&D target – it’s not ambitious by developed world standards, but will be a stretch from our current position, as I discussed in an earlier blogpost: Reaching the 2.4% R&D intensity target.

Moving on to the regional aspects of research and innovation policy, I argued (as I did in this blog post: Making UK Research and Innovation work for the whole UK) that the UK’s regional concentration of R&D (especially public sector) is extreme and must be corrected. To illustrate this point, I used this version of Tom Forth’s plot splitting out the relative contributions of public and private sector to R&D regionally.

I argued that this plot gives a helpful framework for thinking about the different policy interventions needed in different parts of the country. I summarised this in this quadrant diagram [1].

Finally, I discussed the University of Sheffield’s Advanced Manufacturing Research Centre as an example of the kind of initiative that can help regenerate the economy of a de-industrialised area. Here a focus on translational research and skills at all levels both drives inward investment by international firms at the technology frontier and helps the existing business base upgrade.

I set this story in the context of Shih and Pisano’s notion of the “industrial commons” [2] – a set of resources that supports the collective knowledge, much of it tacit, that drives innovations in products and processes in a successful cluster. A successful industrial commons is rooted in a combination of large anchor companies and institutions, networks of supplying companies, R&D facilities, informal knowledge networks and formal institutions for training and skills. I argued that a focus of regional economic policy should be a conscious attempt to rebuild the “industrial commons” in an industrial sector which allows the opportunities of new technology to be embraced, yet which works with the grain of the existing industry and institutional base. “Smart specialisation” is a good framework for identifying the right places to look.

1. As a participant later remarked, I’ve omitted the South East from this diagram – it should be in the bottom right quadrant, albeit with less business R&D than East Anglia, though with the benefits more widely spread.

2. See Pisano, G. P., & Shih, W. C. (2009). Restoring American Competitiveness. Harvard Business Review, 87(7-8), 114–125.

The semiconductor industry and economic growth theory

In my last post, I discussed how “econophysics” has been criticised for focusing on exchange, not production – in effect, for not concerning itself with the roots of economic growth in technological innovation. Of course, some of that technological innovation has arisen from physics itself – so here I talk about what economic growth theory might learn from an important episode of technological innovation with its origins in physics – the development of the semiconductor industry.

Economic growth and technological innovation

In my last post, I criticised econophysics for not talking enough about economic growth – but to be fair, it’s not just econophysics that suffers from this problem – mainstream economics doesn’t have a satisfactory theory of economic growth either. And yet economic growth and technological innovation provide an all-pervasive background to our personal economic experience. We expect to be better off than our parents, who were themselves better off than our grandparents. Economics without a theory of growth and innovation is like physics without an arrow of time – a marvellous intellectual construction that misses the most fundamental observation of our lived experience.

Defenders of economics at this point will object that it does have theories of growth, and there are even some excellent textbooks on the subject [1]. Moreover, they might remind us, wasn’t the Nobel Prize for economics awarded this year to Paul Romer, precisely for his contribution to theories of economic growth? This is indeed so. The mainstream approach to economic growth pioneered by Robert Solow regarded technological innovation as something externally imposed, and Romer’s contribution has been to devise a picture of growth in which technological innovation arises naturally from the economic models – the “post-neoclassical endogenous growth theory” that ex-Prime Minister Gordon Brown was so (unfairly) lampooned for invoking.

This body of work has undoubtedly highlighted some very useful concepts, stressing the non-rivalrous nature of ideas and the economic basis for investments in R&D, especially for the day-to-day business of incremental innovation. But it is not a theory in the sense that a physicist would understand the term – it doesn’t explain past economic growth, so it can’t make predictions about the future.

How the information technology revolution really happened

Perhaps to understand economic growth we need to turn to physics again – this time, to the economic consequences of the innovations that physics provides. Few would disagree that a – perhaps the – major driver of technological innovation, and thus economic growth, over the last fifty years has been the huge progress in information technology, with the exponential growth in the availability of computing power that is summed up by Moore’s law.

The modern era of information technology rests on the solid-state transistor, which was invented by William Shockley at Bell Labs in the late 1940s (with Brattain and Bardeen – the three received the 1956 Nobel Prize for Physics). In 1956 Shockley left Bell Labs and went to Palo Alto (in what would later be called Silicon Valley) to found a company to commercialise solid-state electronics. However, his key employees in this venture soon left – essentially because he was, by all accounts, a horrible human being – and founded Fairchild Semiconductor in 1957. Key figures amongst those refugees were Gordon Moore – of eponymous law fame – and Robert Noyce. It was Noyce who, in 1960, made the next breakthrough, inventing the silicon integrated circuit, in which a number of transistors and other circuit elements were combined on a single slab of silicon to make an integrated functional device. Jack Kilby, at Texas Instruments, had, more or less at the same time, independently developed an integrated circuit on germanium, for which he was awarded the 2000 Nobel Prize for Physics (Noyce, having died in 1990, was unable to share this). Integrated circuits didn’t take off immediately, but according to Kilby it was their use in the Apollo mission and the Minuteman ICBM programme that provided a turning point in their acceptance and widespread use [2] – the Minuteman II guidance and control system was the first mass-produced computer to rely on integrated circuits.

Moore and Noyce founded the electronics company Intel in 1968, to focus on developing integrated circuits. Moore had already, in 1965, formulated his famous law about the exponential growth with time of the number of transistors per integrated circuit. The next step was to incorporate all the elements of a computer on a single integrated circuit – a single piece of silicon. Intel duly produced the first commercially available microprocessor – the 4004 – in 1971, though this had been (possibly) anticipated by the earlier microprocessor that formed the flight control computer for the F-14 Tomcat fighter aircraft. From these origins emerged the microprocessor revolution and personal computers, with their giant wave of derivative innovations, leading up to the current focus on machine learning and AI.
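As an aside, the quantitative content of Moore’s law is just a constant doubling time for the transistor count. A minimal sketch, using an illustrative two-year doubling period and the 4004’s roughly 2,300 transistors as a round-number starting point (neither figure is meant to be precise), shows how quickly the exponential compounds:

```python
# Illustrative only: Moore's law as a fixed doubling time.
# The two-year doubling period and the 1971 starting point (the 4004's
# ~2,300 transistors) are round numbers, not precise historical data.

def transistors(year, n0=2300, t0=1971, doubling_time=2.0):
    """Transistor count per chip implied by a constant doubling time."""
    return n0 * 2 ** ((year - t0) / doubling_time)

for year in (1971, 1981, 1991, 2001, 2011):
    print(year, f"{transistors(year):,.0f}")
# Forty years of doubling every two years is a factor of 2**20 - about a million.
```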

Lessons from Moore’s law for growth economics

What should be clear from this very brief account is that classical theories of economic growth cannot account for this wave of innovation. The motivations that drove it were not economic – they arose from a powerful state with enormous resources at its disposal pursuing complex, but entirely non-economic, projects – such as the goal of being able to land a nuclear weapon on any point of the earth’s surface with an accuracy of a few hundred metres.

Endogenous growth theories perhaps can give us some insight into the decisions companies made about R&D investment and the wider spillovers that such spending led to. They would need to take account of the complex institutional landscape that gave rise to this innovation. This isn’t simply a distinction between public and private sectors – the original discovery of the transistor was made at Bell Labs – nominally in the private sector, but sustained by monopoly rents arising from government action.

The landscape in which this innovation took place is much more complex than growth economics – with its array of firms employing undifferentiated labour and capital, all benefiting from some kind of soup of spillovers – seems able to handle. Semiconductor fabs are perhaps the most capital intensive plants in the world, with just a handful of bunny-suited individuals tending a clean-room full of machines that individually might be worth tens or even hundreds of millions of dollars. Yet the value of those machines represents, as much as anything physical, the embodied value of the intangible investments in R&D and process know-how.

How are the complex networks of equipment and materials manufacturers coordinated to make sure technological advances in different parts of this system happen at the right time and in the right sequence? These are independent companies operating in a market – but the market alone has not been sufficient to transmit the information needed to keep it coordinated. An enormously important mechanism for this coordination has been the National Technology Roadmap for Semiconductors (later the International Technology Roadmap for Semiconductors), initiated by a US trade body, the Semiconductor Industry Association. This was an important social innovation which allowed companies to compete in meeting collaborative goals; it was supported by the US government by the relaxation of anti-trust law and the foundation of a federally funded organisation to support “pre-competitive” research – SEMATECH.

The involvement of the US government reflected the importance of competition between nation states in driving technological innovation. Because of the cold war origins of the integrated circuit, the original competition was with the Soviet Union, which created an industry to produce ICs for military use, based around Zelenograd. The degree to which this industry was driven by indigenous innovation as against the acquisition of equipment and know-how from the west isn’t clear to me, but it seems that by the early 1980s the gap between Soviet and US achievements was widening, contributing to the sense of stagnation of the later Brezhnev years and the drive for economic reform under Gorbachev.

From the 1980s, the key competitor was Japan, whose electronics industry had been built up in the 1960s and 70s, driven not by defence but by consumer products such as transistor radios, calculators and video recorders. In the mid-1970s the Japanese government’s MITI provided substantial R&D subsidies to support the development of integrated circuits, and by the late 1980s Japan appeared within sight of achieving dominance, to the dismay of many commentators in the USA.

That didn’t happen, and Intel still remains at the technological frontier. Its main rivals now are Korea’s Samsung and Taiwan’s TSMC. Their success reflects different versions of the East Asian developmental state model; Samsung is Korea’s biggest industrial conglomerate (or chaebol), whose involvement in electronics was heavily sponsored by its government. TSMC was a spin-out from a state-run research institute in Taiwan, ITRI, which grew by licensing US technology and then very effectively driving process improvements.

Could one build an economic theory that encompasses all this complexity? For me, the most coherent account has been Bill Janeway’s description of the way government investment combines with the bubble dynamics that drive venture capitalism, in his book “Doing Capitalism in the Innovation Economy”. Of course, the idea that financial bubbles are important for driving innovation is not new – that’s how the UK got a railway network, after all – but the econophysicist Didier Sornette has extended this to introduce the idea of a “social bubble” driving innovation [3].

This long story suggests that the ambition of economics to “endogenise” innovation is a bad idea, because history tells us that the motivations for some of the most significant innovations weren’t economic. To understand innovation in the past, we don’t just need economics, we need to understand politics, history, sociology … and perhaps even natural science and engineering. The corollary of this is that devising policy solely on the basis of our current theories of economic growth is likely to lead to disappointing outcomes. At a time when the remarkable half-century of exponential growth in computing power seems to be coming to an end, it’s more important than ever to learn the right lessons from history.

[1] I’ve found “Introduction to Modern Economic Growth”, by Daron Acemoglu, particularly useful

[2] Jack Kilby: Nobel Prize lecture, https://www.nobelprize.org/uploads/2018/06/kilby-lecture.pdf

[3] See also that great authority, The Onion: “Recession-Plagued Nation Demands New Bubble to Invest In”.

The Physics of Economics

This is the first of two posts which began life as a single piece with the title “The Physics of Economics (and the Economics of Physics)”. In this first part, I discuss some of the ways physicists have attempted to contribute to economics. In the second, I turn to the lessons that economics should learn from the history of a technological innovation with its origins in physics – the semiconductor industry.

Physics and economics are two disciplines which have quite a lot in common – they’re both mathematical in character, many of their practitioners are not short of intellectual self-confidence, and they both have imperialist tendencies towards their neighbouring disciplines. So the interaction between the two fields should be, if nothing else, interesting.

The origins of econophysics

The most concerted attempt by physicists to colonise a part of economics has been in the behaviour of financial markets – the field which calls itself “econophysics”. Actually, at its origins the traffic went both ways – the mathematical theory of random walks that Einstein developed to explain the phenomenon of Brownian motion had been anticipated by the French mathematician Bachelier, who derived the theory to explain the movements of stock markets. Much later, the economic theory that markets are efficient brought this line of thinking back into vogue – it turns out that financial markets can quite often be modelled as simple random walks – but not quite always. The random steps that markets take aren’t drawn from a Gaussian distribution – the distribution has “fat tails”, so rare events – like big market crashes – aren’t anywhere near as rare as simple theories assume.
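To make the “fat tails” point concrete, here is a minimal sketch of my own (not drawn from the econophysics literature): compare the chance of a ten-sigma one-day move under a Gaussian model with the chance under a heavier-tailed Student-t distribution of the same scale.

```python
# Illustrative comparison of tail probabilities: Gaussian vs. a fat-tailed
# Student-t distribution. All parameters are round numbers chosen for the example.
from scipy import stats

daily_vol = 0.01          # assume a ~1% typical daily move
crash = 0.10              # a 10% one-day fall, i.e. a ten-sigma event

p_gauss = stats.norm.sf(crash, scale=daily_vol)
p_fat = stats.t.sf(crash / daily_vol, df=3)   # t-distribution with 3 degrees of freedom

print(f"Gaussian:   P(move > 10%) ~ {p_gauss:.2e}")   # effectively never
print(f"Fat-tailed: P(move > 10%) ~ {p_fat:.2e}")     # rare, but entirely possible
```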

Empirically, it turns out that the distributions of these rare events can sometimes be described by power laws. In physics, power laws are associated with what are known as critical phenomena – behaviours such as the transition from a liquid to a gas or from a magnet to a non-magnet. These phenomena are characterised by a certain universality, in the sense that the quantitative laws – typically power laws – that describe the large scale behaviour of these systems don’t strongly depend on the details of the individual interactions between the elementary objects (the atoms and molecules, in the case of magnetism and liquids) whose interaction leads collectively to the larger scale phenomenon we’re interested in.

For “econophysicists” – whose background has often been in the study of critical phenomena – it is natural to try and situate theories of the movements of financial markets in this tradition, finding analogies with other places where power laws can be found, such as the distribution of earthquake sizes and the behaviour of sand-piles. In terms of physicists’ actual impact on participants in financial markets, though, there’s a paradox. Many physicists have found (often very lucrative) employment as quantitative traders, but the theories that academic physicists have developed to describe these markets haven’t made much impact on the practitioners of financial economics, who have their own models to describe market movements.

Other ideas from physics have made their way into discussions about economics. Much of classical economics depends on ideas like the “representative household” or the “representative firm”. Physicists with a background in statistical mechanics recognise this sort of approach as akin to a “mean field theory”. The idea that a complex system is well represented by its average member is one that can be quite fruitful, but in some important circumstances fails – and fails badly – because the fluctuations around the average become as important as the average itself. This motivates the idea of agent based models, to which physicists bring the hope that even simple “toy” models can bring insight. The Schelling model is one such very simple model that came from economics, but which has a formal similarity with some important models in physics. The study of networks is another place where one learns that the atypical can be disproportionately important.

If markets are about information, then physics should be able to help…

One very attractive emerging application of ideas from physics to economics concerns the place of information. Friedrich Hayek stressed the compelling insight that one can think of a market as a mechanism for aggregating information – but a physicist should understand that information is something that can be quantified, and (via Shannon’s theory) that there are hard limits on how much information can be transmitted in a physical system. Jason Smith’s research programme builds on this insight to analyse markets in terms of an information equilibrium [1].
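To give a sense of the kind of hard limit Shannon’s theory implies – quoting the standard Shannon–Hartley result purely as a general illustration, not as part of Smith’s framework – a noisy channel of bandwidth B and signal-to-noise ratio S/N can carry information at a rate of at most C = B log₂(1 + S/N). However the information is encoded, no mechanism can aggregate it faster than its underlying physical channels allow.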

Some criticisms of econophysics

How significant is econophysics? A critique from some (rather heterodox) economists – Worrying trends in econophysics – is now more than a decade old, but still stings (see also this commentary from the time from Cosma Shalizi – Why Oh Why Can’t We Have Better Econophysics?). Some of the criticism is methodological – and could be mostly summed up by saying: just because you’ve got a straight bit on a log-log plot doesn’t mean you’ve got a power law. Some criticism is about the norms of scholarship – in brief: read the literature and stop congratulating yourselves for reinventing the wheel.
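The log-log point is easy to demonstrate. In this sketch of my own, a lognormal sample – which has no power-law tail at all – still produces a stretch of its tail that looks convincingly straight on log-log axes:

```python
# Illustration of the "straight bit on a log-log plot" trap: a lognormal
# distribution is not a power law, yet its empirical tail can look linear
# on log-log axes over a couple of decades.
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
x = np.sort(rng.lognormal(mean=0.0, sigma=2.5, size=100_000))
ccdf = np.arange(len(x), 0, -1) / len(x)   # empirical P(X >= x)

plt.loglog(x, ccdf)
plt.xlabel("x")
plt.ylabel("P(X >= x)")
plt.title("Lognormal tail: locally straight on log-log axes, but not a power law")
plt.show()
# A proper test (e.g. the likelihood-based method of Clauset, Shalizi & Newman)
# is needed before claiming a power law.
```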

But the most compelling criticism of all is about the choice of problem that econophysics typically takes. Most attention has been focused on the behaviour of financial markets, not least because these provide a wealth of detailed data to analyse. But there’s more to the economy – much, much more – than the financial markets. More generally, the areas of economics that physicists have tended to apply themselves to have been about exchange, not production – studying how a fixed pool of resources can be allocated, not how the size of the pool can be increased.

[1] For a more detailed motivation of this line of reasoning, see this commentary, also from Cosma Shalizi, on Francis Spufford’s great book “Red Plenty” – “In Soviet Union, Optimization Problem Solves You”.

The UK’s top six productivity underperformers

The FT has been running a series of articles about the UK’s dreadful recent productivity performance, kicked off with this very helpful summary – Britain’s productivity crisis in eight charts. One important aspect of this was to focus on the (negative) contribution of formerly leading sectors of the economy which have, since the financial crisis, underperformed:

“Computer programming, energy, finance, mining, pharmaceuticals and telecoms — which together account for only one-fifth of the economy — generated three-fifths of the decline in productivity growth.”

The original source of this striking statistic is a paper by Rebecca Riley, Ana Rincon-Aznar and Lea Samek – Below the Aggregate: A Sectoral Account of the UK Productivity Puzzle.

What this should stress is that there’s no single answer to the productivity crisis. We need to look in detail at different industrial sectors, different regions of the UK, and identify the different problems they face before we can work out the appropriate policy responses.

So what can we say about what’s behind the underperformance of each of these six sectors, and what lessons should policy-makers learn in each case? Here are a few preliminary thoughts.

Mining. This is dominated by North Sea oil. The oil is running out, and won’t be coming back – production peaked in 2000; what oil is left is more expensive and difficult to get out.
Lessons for policy makers: more recognition is needed that the UK’s prosperity in the 1990s and early 2000s depended as much on the accident of North Sea oil as on any particular strength of the policy framework.

Finance. It’s not clear to me how much of the apparent pre-crisis productivity boom was real, but post-crisis increased regulation and greater capital requirements have reduced apparent rates of return in financial services. This is as it should be.
Lessons for policy makers: this sector is the problem, not the solution, so calls to relax regulation should be resisted, and so-called “innovation” that in practice amounts to regulatory arbitrage discouraged.

The end of North Sea oil and the finance bubble cannot be reversed – these are headwinds that the economy has to overcome. We have to find new sources of productivity growth rather than looking back nostalgically at these former glories (for example, there’s a risk that the enthusiasm for fracking and fintech represents just such nostalgia).

Energy. Here, a post-privatisation dysfunctional pseudo-market has prioritised sweating existing assets rather than investing. Meanwhile there’s been an unclear and inconsistent government policy environment; sometimes the government has willed the ends without providing the means (e.g. nuclear new build), and elsewhere it has introduced perverse and abrupt changes of tack (e.g. in its support for onshore wind and solar).
Lessons for policy makers: develop a rational, long-term energy strategy that will deliver the necessary decarbonisation of the energy economy. Then stick to it, driving innovation to support the strategy. For more details, read chapter 4 – Decarbonisation of the energy economy – of the Industrial Strategy Commission’s final report.

Computer programming. Here I find myself on less sure ground. Are we seeing the effects of increasing overseas outsourcing and competition, for example to India’s growing IT industry? Are we seeing the effect of more commoditisation of computer programming, with new business models such as “software as a service”?

Telecoms. Again, here I’m less certain of what’s been going on. Are we seeing the effect of lengthening product cycles as the growth in processor power slows? Is this the effect of overseas competition – for example, rapidly growing Chinese firms like Huawei – moving up the value chain? Here it’s also likely that measurement problems – in correctly accounting for improvements in quality – will be most acute.

Pharmaceuticals. As my last blogpost outlined, productivity growth in pharmaceuticals depends on new products being developed through formal R&D, with their value protected by patents. There has been a dramatic, long-term fall in the productivity of pharma R&D, so it is unsurprising that this is now feeding through into reduced labour productivity.
Lessons for policy makers: see the recent NESTA report “The Biomedical Bubble”.

Many of these issues were already discussed in my 2016 SPERI paper Innovation, research and the UK’s productivity crisis. Two years on, the productivity crisis seems even more pressing, and as the FT series illustrates, is receiving more attention from policy makers and economists (though still not enough, in view of its fundamental importance for living standards and fiscal stability). The lesson I would want to stress is that, to make progress, policy makers and economists need to go beyond generalities, and pay more attention to the detailed particulars of individual industries, sectors and regions, and the different ways innovation takes place – or hasn’t been taking place – within them.

The biomedical bubble

I have a new report out, written with science policy expert James Wilsdon for the innovation foundation NESTA, entitled The Biomedical Bubble: Why UK research and innovation needs a greater diversity of priorities, politics, places and people. Here’s a summary of the report:

Biomedical science and innovation has benefited from significant increases in public investment over the past 15 years. This builds on the remarkable strengths of the UK’s academic life sciences base and pharmaceutical industry. But continuing to prioritise the biomedical, in a period when government aims to boost research and development (R&D) spending to 2.4 per cent of GDP, risks unbalancing our innovation system, and is unlikely to deliver the economic benefits or improvements to health outcomes that society expects.

For too long, the pharmaceutical and biotechnology sectors have dominated policy thinking about translating research, but these sectors are in deep trouble, with R&D productivity plummeting and R&D investment falling. Meanwhile, much of the wider innovation needed for the NHS, public health and social care has been under-resourced. Greater emphasis needs to be given to the social, environmental, digital and behavioural determinants of health, and decisions about research priorities need to involve a greater diversity of perspectives, drawn from across the country. The creation of UK Research and Innovation (UKRI), which aims to bring a more strategic approach to funding and prioritisation, is the right moment to rethink this balance. This paper sets out why and how the UK needs to escape the biomedical bubble if it is to realise the economic, social and health potential of extra investment in R&D.

There are some shorter pieces discussing different aspects of our arguments:

In the Guardian Political Science blog: It’s time to burst the biomedical bubble in UK research

On the WonkHE website, on why building an industrial strategy around the pharma/biotech industry is a bet on the US healthcare system remaining unreformed:
Rethinking the life sciences strategy

On the ResearchProfessional website (subscription required), focusing on the task UKRI faces in balancing its portfolio:
Examine funding balance to pop ‘biomedical bubble’, UKRI told

A news piece about the report in the Times Higher:
UK’s biomedical research funding ‘bubble’ is ‘about to burst’

Bad Innovation: learning from the Theranos debacle

Earlier this month, Elizabeth Holmes, founder of the medical diagnostics company Theranos, was indicted on fraud and conspiracy charges. Just four years ago, Theranos was valued at $9 billion, and Holmes was being celebrated as one of Silicon Valley’s most significant innovators – not only the founder of one of the mythical unicorns but also, through the public value of her technology, a benefactor of humanity. How this astonishing story unfolded is the subject of a tremendous book by the journalist who first exposed the scandal, John Carreyrou. “Bad Blood” is a compelling read – but it’s also a cautionary tale, with some broader lessons about the shortcomings of Silicon Valley’s approach to innovation.

The story of Theranos

The story begins in 2003. Holmes had finished her first year as a chemical engineering student at Stanford. She was particularly influenced by one of her professors, Channing Robertson; she took his seminar on drug delivery devices, and worked in his lab in the summer. Inspired by this, she was determined to apply the principles of micro- and nano-technology to medical diagnostics, and wrote a patent application for a patch which would sample a patient’s blood, analyse it, use the information to determine the appropriate response, and release a controlled amount of the right drug. This closed-loop system would combine diagnostics with therapy – hence the name Theranos (from “theranostic”).

Holmes dropped out of Stanford in her second year to pursue her idea, encouraged by Robertson. By the end of 2004, the company she had incorporated with one of Robertson’s PhD students, Shaunak Roy, had raised $6 million from angels and venture capitalists.

The nascent company soon decided that the original theranostic patch idea was too ambitious, and focused on diagnostics. Holmes settled on the idea of doing blood tests on very small volumes – the droplets of blood you get from a finger prick, rather than the larger volumes you get by drawing blood with a needle and syringe. It’s a great pitch for those scared of needles – but the true promise of the technology was much wider than this. Automatic units could be placed in patients’ homes, cutting out all the delay and inconvenience of having to go to the clinic for the blood draw, and then waiting for the results to come back. The units could be deployed in field situations – with the US Army in Iraq and Afghanistan – or in places suffering from epidemics of Ebola or Zika. They could be used in drug trials to continuously monitor patient reactions and pick up side-effects quickly.

The potential seemed huge, and so were the revenue projections. By 2010, Holmes was ready to start rolling out the technology. She negotiated a major partnership with the pharmacy chain Walgreens, and the supermarket Safeway loaned the company $30 million with a view to opening a chain of “wellness centres”, built around the Theranos technology, in its stores. The US military – in the powerful figure of General James Mattis – was seriously interested.

In 2013, the Walgreens collaboration was ready to go live; the company had paid Theranos a $100 million “innovation fee” and extended a $40 million loan on the basis of a 2013 launch. The elite advertising agency Chiat\Day, famous for its work with Apple, was engaged to polish the image of the company – and of Elizabeth Holmes. Investors piled into a new funding round, at the end of which Theranos was valued at $9 billion – and Holmes was a paper billionaire.

What could go wrong? There turned out to be two flies in the ointment: Theranos’s technology couldn’t do even half of what Holmes had been promising, and even on the tests it could do, its results were unacceptably inaccurate. Carreyrou’s book is at its most compelling as he gives his own account of how he broke the story, in the face of deception, threats, and some very expensive lawyers. None of this would have come out without some very brave whistleblowers.

At what point did the necessary optimism about a yet-to-be developed technology turn first into self-delusion, and then into fraud? To answer this, we need to look at the technological side of the story.

The technology

As is clear from Carreyrou’s account, Theranos had always taken secrecy about its technology to the point of paranoia – and it was this secrecy that enabled the deception to continue for so long. There was certainly no question of the company publishing anything about its methods and results in the open literature. But, from the insiders’ accounts in the book, we can trace the evolution of Theranos’s technical approach.

To go back to the beginning, we can get a sense of what was in Holmes’s mind at the outset from her first patent, originally filed in 2003. This patent – “Medical device for analyte monitoring and drug delivery” – is hugely broad, at times reading like a digest of everything that anybody at the time was thinking about when it came to nanotechnology and diagnostics. But one can see the central claim – an array of silicon microneedles would penetrate the skin to extract the blood painlessly, this would be pumped through 100 µm wide microfluidic channels, combined with reagent solutions, and then tested for a variety of analytes through detecting their binding to molecules attached to surfaces. In Holmes’s original patent, the idea was that this information would be processed, and then used to initiate the injection of a drug back into the body. One example quoted was the antibiotic vancomycin, which has rather a narrow window of effectiveness before side effects become severe – the idea would be that the blood was continuously monitored for vancomycin levels, which would then be automatically topped up when necessary.
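Purely to illustrate the closed-loop logic the patent describes – and this is a toy sketch with invented numbers, not anything Theranos built – the control idea is: measure the drug level, compare it with a therapeutic window, and release a dose only when the level falls below the floor.

```python
# Toy sketch of a closed-loop "theranostic" controller of the kind the 2003
# patent describes: monitor a drug level and top it up when it falls below a
# therapeutic window. All numbers are invented for illustration.

TARGET_LOW = 10.0    # hypothetical lower bound of the therapeutic window (mg/L)
TARGET_HIGH = 20.0   # hypothetical upper bound (mg/L)

def control_step(measured_level, dose_increment=2.0):
    """Decide a dose (mg) from one blood measurement."""
    if measured_level < TARGET_LOW:
        return dose_increment        # level too low: release a small dose
    return 0.0                       # within or above the window: do nothing

# Example: a falling concentration triggers dosing only once it crosses the floor.
for level in (18.0, 14.0, 11.0, 9.5, 8.0):
    print(f"measured {level:4.1f} mg/L -> dose {control_step(level):.1f} mg")
```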

Holmes and Roy, having decided that the complete closed-loop theranostic device was too ambitious, began work to develop a microfluidic device to take a very small sample of blood from a finger prick, route it through a network of tiny pipes, and subject it to a battery of scaled-down biochemical tests. This all seems doable in principle, but fraught with practical difficulties. After three years, despite some progress, Holmes seems to have decided that this approach wasn’t going to work in time, so in 2007 the company switched direction away from microfluidics, and Shaunak Roy left amicably.

The new approach was based around a commercial robot they’d acquired, designed for the automatic dispensing of adhesives. The idea of basing their diagnostic technology on this “gluebot” is less odd than it might seem. There’s nothing wrong with borrowing bits of technology from other areas, and reliably glueing things together depends on precise, automated fluid handling, just as diagnostic analysis does. But what this did mean was that Theranos no longer aspired to be a microfluidics/nanotech firm, but instead was in the business of automating conventional laboratory testing. This is a fine thing to do, of course, but it’s an area with much more competition from existing firms, like Siemens. No longer could Theranos honestly claim to be developing a wholly new, disruptive technology. What’s not clear is whether its financial backers, or its board, were told enough or had enough technical background to understand this.

The resulting prototype was called Edison 1.0 – and it sort-of worked. It could only do one class of tests – immunoassays – it couldn’t run many of these tests at the same time, and its results were not reproducible or accurate enough for clinical use. To fill in the gaps between what they promised their proprietary technology could do and its actual capabilities, Theranos resorted to modifying a commercial analysis machine – the Siemens Advia 1800 – to be able to analyse smaller samples. This was essential to fulfil Theranos’s claimed USP of being able to analyse the drops of blood from pin-pricks, rather than the larger volumes drawn for standard blood tests by needle and syringe from a vein.

But these modifications presented their own difficulties. What they amounted to was simply diluting the small blood sample to make it go further – but of course this reduces the concentration of the molecules the analyses are looking for, often below the range of sensitivity of the commercial instruments. And there remained a bigger question, one that actually hangs over the viability of the whole enterprise – can one take blood from a pin-prick that isn’t contaminated to an unknown degree by tissue fluid, cell debris and the like? Whatever the cause, it became clear that the test results Theranos was providing – to real patients, by this stage – were erratic and unreliable.
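The arithmetic of the dilution problem is stark. As a purely illustrative example (the volumes and limits below are my own invented numbers), diluting a finger-prick sample up to the volume a conventional analyser expects divides every analyte concentration by the same factor, which can easily push it below the detection limit:

```python
# Illustrative dilution arithmetic (invented numbers, not Theranos's own):
# diluting a small sample to the volume a conventional analyser expects
# reduces every analyte concentration by the same factor.

sample_volume_ul = 50.0        # finger-prick sample, microlitres (assumption)
required_volume_ul = 500.0     # volume the commercial analyser needs (assumption)
dilution = required_volume_ul / sample_volume_ul   # factor of 10

analyte_conc = 2.0             # hypothetical analyte level, ng/mL in whole blood
detection_limit = 0.5          # hypothetical instrument detection limit, ng/mL

diluted = analyte_conc / dilution
print(f"after {dilution:.0f}x dilution: {diluted:.2f} ng/mL "
      f"({'below' if diluted < detection_limit else 'above'} the detection limit)")
```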

Theranos was working on a next generation analyser – the so-called miniLab – with the goal of miniaturising existing lab testing methods into a single, very versatile device. This project never came to fruition. Again, it was unquestionably an avenue worth pursuing. But Theranos wasn’t alone in this venture, and it’s difficult to see what special capabilities they brought that rivals with more experience and a longer track record in this area didn’t have already. Other portable analysers exist already (for example, the Piccolo Xpress), and the miniaturised technologies they would use were already in the market-place (for example, Theranos were studying the excellent miniaturised IR and UV spectrophotometers made by Ocean Optics – used in my own research group). In any case, events had overtaken Theranos before they could make progress with this new device.

Counting the cost and learning the lessons

What was the cost of this debacle? There was a human cost, not fully quantified, in terms of patients being given unreliable test results, which surely led to wrong diagnoses and missed or inappropriate treatments. And there is the opportunity cost – Theranos spent around $900 million, some of this on technology development, but rather too much on fees for lawyers and advertising agencies. But I suspect the biggest cost was the effect Theranos had in slowing down and squeezing out innovation in an area that genuinely did have the potential to make a big difference to healthcare.

It’s difficult to read this story without starting to think that something is very wrong with intellectual property law in the United States. The original Theranos patent was astonishingly broad, and given the amount of money they spent on lawyers, there can be no doubt that other potential innovators were dissuaded from entering this field. IP law distinguishes between the conception of a new invention and its necessary “reduction to practice”. Reduction to practice can be by the testing of a prototype, but it can also be by the description of the invention in enough detail that it can be reproduced by another worker “skilled in the art”. Interpretation of “reduction to practice” seems to have become far too loose. Rather than giving the right to an inventor to benefit from a time-limited monopoly on an invention they’ve already got to work, patent law currently seems to allow the well-lawyered to carve out entire areas of potential innovation for their exclusive investigation.

I’m also struck from Carreyrou’s account by the importance of personal contacts in the establishment of Theranos. We might think that Silicon Valley is the epitome of American meritocracy, but key steps in funding were enabled by who was friends with whom and by family relationships. It’s obvious that far too much was taken on trust, and far too little actual technical due diligence was carried out.

Carreyrou rightly stresses just how wrong it was to apply the Silicon Valley “fake it till you make it” philosophy to a medical technology company, where what follows from the fakery isn’t just irritation at buggy software, but life-and-death decisions about people’s health. I’d add to this a lesson I’ve written about before – doing innovation in the physical and biological realms is fundamentally more difficult, expensive and time-consuming than innovating in the digital world of pure information, and if you rely on experience in the digital world to form your expectations about innovation in the physical world, you’re likely to come unstuck.

Above all, Theranos was built on gullibility and credulousness – optimism about the inevitability of technological progress, faith in the eminence of the famous former statesmen who formed the Theranos board, and a cult of personality around Elizabeth Holmes – a cult that was carefully, deliberately and expensively fostered by Holmes herself. Magazine covers and TED talks don’t by themselves make a great innovator.

But in one important sense, Holmes was convincing. The availability of cheap, accessible, and reliable diagnostic tests would make a big difference to health outcomes across the world. The biggest tragedy is that her actions have set back that cause by many years.

Theresa May on Science and Industrial Strategy

It’s not often that a UK Prime Minister devotes a whole speech to science, so Theresa May’s speech on Monday – PM speech on science and modern Industrial Strategy – was a significant signal by itself. It’s obvious that Brexit consumes a huge amount of political and government bandwidth at the moment, so it’s interesting that the Prime Minister wants to associate herself with the science and industrial strategy agenda, perhaps to emphasise that Brexit is not completely all-consuming, and that there is some space left for domestic policy initiatives.

Beyond the signal, though, I thought there was quite a lot of substance as well. The speech comes at an important moment – the UK government’s science and innovation funding agencies have just been through a major reorganisation, with seven discipline-based research councils, the innovation agency Innovate UK, and a body responsible for the research environment in English universities, coming together in a single organisation, UK Research and Innovation. UKRI began life on 1 April, and it launched its first major strategy document on 14th May. This document isn’t a strategy, though – it’s a “Strategic Prospectus” – a statement of some fundamental principles, together with a commitment to develop a strategy over the months to come. The PM’s speech didn’t mention UKRI at all – somewhat curiously, I thought. Nonetheless, the speech is an important statement of the direction UKRI will be expected to take.

Perhaps the most important commitment was the PM’s stress on the 2.4% R&D intensity target, together with a recognition that this needs a substantial increase in private sector funding, catalysed by state investment. I’ve already written about how stretching this target will be. My estimate is that it will need a £7 billion increase in government spending by 2027 – going well beyond the £2.2 billion increase to 2021 already announced – and a £14 billion increase in private sector spending. These are big numbers; to achieve them will require a significant shifting of the UK’s economic landscape (for the better, I believe). My suspicion is that the attempt to achieve them will have a very big influence on the way UKRI operates, perhaps a bigger influence than people yet realise.

Another important departure from science and innovation policy up to now was the insistence that everywhere in the UK should benefit from it – “backing businesses and building infrastructure not just in London and the South East but across every part of our country”. This needs to include those cities that were pioneers of innovation in the 19th century, but which since have suffered the effects of deindustrialisation. This of course speaks to the profound regional imbalances in R&D expenditure that I highlighted in my recent blogpost Making UKRI work for the whole UK.

Can old cities find new economic tricks? The PM’s speech pointed to some examples – from jute to video games in Dundee, fish to offshore wind in Hull, coal to compound semiconductors in Cardiff. Cities and regions do need to specialise. The PM’s speech didn’t mention any specific mechanisms for encouraging and supporting this, but on the same day UKRI announced a new “place-based” funding competition – the “Strength in Places Fund”, which represents a valuable first step. The aim is to develop interventions on a scale to make a material difference to local and regional economies, and it’s great news that the very distinguished economist Dame Kate Barker – who chaired the Industrial Strategy Commission – has been persuaded to chair the assessment panel.

One interesting section of the speech went some way to sharpening up thinking about “Grand Challenges” and “Missions” as organising principles for research. Last November’s Industrial Strategy White Paper introduced four “Grand Challenges” for the UK – on AI and data, the future of mobility, clean growth, and the ageing society. Within these “Grand Challenges”, the intention is to define more specific “missions”. These are more concrete than the Grand Challenges, with some hard targets. Politicians like to announce targets, because they make good headlines – artificial intelligence is “to help prevent 22,000 cancer deaths a year by 2033”, according to the BBC’s (somewhat inaccurate) trail of the speech. I think targets are helpful because they focus policy-makers’ minds on scale. It’s been a besetting sin of UK innovation policy to identify the right things to do, but then to execute them on a scale wholly inadequate to the task.

The PM’s speech announced four such “missions”, and I thought these examples were quite good ones. Two of these missions – around better diagnostics, and support for independent living for older people – are good examples of putting innovation for health and social care at the centre of industrial strategy, as the Industrial Strategy Commission recommended in its final report. Some questions remain.

One simple question that there needs to be an answer for when we talk about “mission-led innovation” is this – who will be the customer for the products that the innovation produces? The government – or the NHS – is the obvious answer to this question in the context of health and social care – but obvious doesn’t mean straightforward. We’ve seen many years of people (including me) saying we should use government procurement much more to drive innovation, to depressingly little effect. What this highlights is that the problems and barriers are as much organisational and cultural as technological. I’m entirely prepared to believe that machine learning techniques could potentially be very helpful in speeding up cancer diagnoses, but realising their benefits will take changes in organisation and working practices.

The other outstanding issue is how these “Grand Challenges” and “missions” are chosen. Is it going to be on the basis of which businessman last caught the ear of a minister, or what a SPAD read in the back of the Economist this week? On the contrary, one would hope that these decisions would be made by aggregating the collective intelligence of a very diverse range of people. This should include scientists, technologists and people from industry, but also the people who see the problems that need to be overcome in their everyday working lives. The UKRI Strategic Prospectus talks warmly about the need for public engagement, including the need to “listen and respond to a diverse range of views and aspirations about what people want research and innovation to do for them”. This needs to be converted into real mechanisms for including these voices in the formulation of these grand challenges and missions.

The Prime Minister may have wanted to give a speech about science to get away from Brexit for a few minutes, but of course there’s no escape, and the consequences of Brexit for our science and innovation system are uncertain and likely to be serious. So it was important and welcome that the PM – particularly this Prime Minister, a former Home Secretary – spelt out the importance of international collaboration, the huge contribution of the many overseas scientists who have chosen to base their careers and lives in the UK, and the value that overseas students bring to our universities. But warm words are not enough, and there needs to be a change in climate and culture in the Home Office on this issue.

One place where we have heard many warm words, but as yet little concrete action, has been in the question of the future relationship between the UK and the EU’s research and innovation programmes. Here it was enormously welcome to hear the Prime Minister state unequivocally that the UK wishes to be fully associated with the successor to Horizon 2020, and is prepared to pay for that. Stating a wish doesn’t make it happen, of course, but it is huge progress to hear that this is now a negotiating goal for the UK government. This is one goal that should be politically unproblematic, and should benefit both sides, so we must hope it can be realised.

The Second Coming of UK Industrial Strategy

I have a longish piece in the Winter issue of the US science policy journal “Issues in Science and Technology”, which aims to place our current debates about industrial strategy in the UK in a longer historical context. This is now online here: The Second Coming of UK Industrial Strategy. Here is the introduction to the article:

The United Kingdom dismantled industrial policies in the 1980s; today it must rebuild them to create a social-industrial complex.

Industrial strategy, as a strand of economic management, was killed forever by the turn to market liberalism in the 1980s. At least, that’s how it seemed in the United Kingdom, where the government of Margaret Thatcher regarded industrial strategy as a central part of the failed post-war consensus that its mission was to overturn. The rhetoric was about uncompetitive industries producing poor-quality products, kept afloat by oceans of taxpayers’ cash. The British automobile industry was the leading exhibit, not at all implausibly, for those of us who remember those dreadful vehicles, perhaps most notoriously exemplified by the Austin Allegro.

Meanwhile, such things as the Anglo-French supersonic passenger aircraft Concorde and the Advanced Gas-cooled Reactor program (the flagship of the state-controlled and -owned civil nuclear industry) were subjected to serious academic critique and deemed technical successes but economic disasters. They exemplified, it was argued, the outcomes of technical overreach in the absence of market discipline. With these grim examples in mind, over the next three decades the British state consciously withdrew from direct sponsorship of technological innovation.

In this new consensus, which coincided with a rapid shift in the shape of the British economy away from manufacturing and toward services, technological innovation was to be left to the market. The role of the state was to support “basic science,” carried out largely in academic contexts. Rather than an industrial strategy, there was a science policy. This focused on the supply side—given a strong academic research base, a supply of trained people, and some support for technology transfer, good science, it was thought, would translate automatically into economic growth and prosperity.

And yet today, the term industrial strategy has once again become speakable. The current Conservative government has published a white paper—a major policy statement—on industrial strategy, and the opposition Labour Party presses it to go further and faster.

This new mood has been a while developing. It began with the 2007-8 financial crisis. The economic recovery following that crisis has been the slowest in a century; a decade on, with historically low productivity growth, stagnant wage growth, and no change to profound regional economic inequalities, coupled with souring politics and the dislocation of the United Kingdom’s withdrawal from the European Union, many people now sense that the UK economic model is broken.

Given this picture, several questions are worth asking. How did we get here? How have views about industrial strategy and science and innovation policy changed, and to what effect? Going forward, what might a modern UK industrial strategy look like? And what might other industrialized nations experiencing similar political and economic challenges learn from these experiences?

Read the rest of the article here.

Making UK Research and Innovation work for the whole UK

The first of April saw the formal launch of UK Research and Innovation, the new body which will be responsible for the bulk of public science and innovation funding in the UK. All seven research councils, the innovation funding agency InnovateUK, and the research arm of the body formerly responsible for university funding in England, to be renamed Research England, have been folded into this single body, with a budget of more than £6 billion a year.

Expectations for this new body are very high. Its formation has been linked to the government’s decision to increase research funding substantially, with extra funding rising to £2.3 billion by 2021/2. This new money is explicitly linked to the need to increase the productivity of the UK economy. The government has also committed to a 10-year target of raising the overall R&D intensity of the economy from its current 1.7% to 2.4% of GDP. As I’ve discussed earlier, this is a very challenging target that will require a major change in behaviour from the UK’s private sector, as well as substantial increases in public sector R&D. UKRI has the task of ensuring that the extra public sector investment is made in ways that maximise increases in private sector R&D. All of this, of course, takes place against the background of Brexit, and the need for the UK to rebuild its business model.

There’s one factor that, unless urgently addressed, will hold UKRI back from its mission of making a significant difference to the UK’s overall productivity problems and raising the economy’s R&D intensity. That is the extraordinary and unhealthy concentration of publicly funded R&D in a relatively small part of the country – London, the Southeast, and East Anglia. Entirely uncoincidentally, these are the parts of the country with the most productive economies already. As the Industrial Strategy Commission (of which I was a member) stressed in its final report last year, unless the UK fixes its gross regional economic disparities it will never be able to prosper.

No-one has done more to bring attention to the UK’s unbalanced R&D geography than Tom Forth; everyone should read his recent article on how we should use the increase in funding to redress the balance. One key point that Tom has stressed is the geographical relationship – or lack of it – between public and private sector research. Industry spends roughly twice as much as the government on research, so reaching the 2.4% R&D intensity target will not be possible without major increases in private sector spending – roughly £14 billion a year, by my estimate. Yet classical economics tells us that firms will always underinvest in R&D, because they are unable to capture the full economic benefit of their spending, much of which “spills over” to benefit the rest of the economy. That’s the logic which convinces even HM Treasury that the state ought to support R&D.
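The arithmetic behind that £14 billion figure can be sketched roughly. In this back-of-envelope version, the GDP level, the assumed growth rate and the 2:1 private-to-public split are my own illustrative round numbers, so the output only approximates the figures quoted here and in my earlier post:

```python
# Back-of-envelope version of the 2.4% R&D intensity arithmetic. The GDP level,
# real growth rate and the 2:1 private:public split are illustrative assumptions;
# the point is the rough scale of the numbers, not a precise forecast.

gdp_now = 2_000          # UK GDP, £bn (round figure)
growth = 0.015           # assumed real GDP growth per year
years = 10               # roughly 2017 to 2027

gdp_2027 = gdp_now * (1 + growth) ** years
rd_now = 0.017 * gdp_now          # current R&D at 1.7% of GDP
rd_target = 0.024 * gdp_2027      # target R&D at 2.4% of GDP in 2027

extra = rd_target - rd_now
extra_private = extra * 2 / 3     # industry spends roughly twice what government does
extra_public = extra * 1 / 3

print(f"total extra R&D needed: ~£{extra:.0f}bn a year")
print(f"  of which private:     ~£{extra_private:.0f}bn")
print(f"  of which public:      ~£{extra_public:.0f}bn")
```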

Yet Tom Forth’s work – especially his plots of public against private R&D spending – shows how badly government spending on R&D is matched to the demand of industry, as measured by where industry actually invests its own money.

R&D funding in the business and non-business sectors (government, higher education and charity), by NUTS2 regions. 2014 figures, by sector of performance, from Eurostat.

My first graph shows how unbalanced the UK’s research landscape is. This shows R&D spending – both private and public – broken down sub-regionally. The first thing that’s obvious is the dominance of three sub-regions – Oxford and its environs, Cambridge and its sub-region, and part of London – Inner West London. These three sub-regions – out of a total of 40 in the UK – account for 31% of total UK R&D spending, and an even higher fraction of spending in the government, HE and charity sectors – 41%.

There is a striking difference between these three sub-regions, though, and that is their split between public and private R&D (taking public R&D here to mean R&D carried out in government-owned laboratories, universities and the non-profit sector). Overall in the UK, the value of business R&D stands at 1.89 times the value of public R&D. East Anglia – dominated by Cambridge – does even better than this, with private sector R&D coming in at more than twice public sector R&D. This is a science-based cluster that works, with high levels of public R&D being rewarded at above average rates by private sector R&D.

The Oxford, Berks and Bucks sub-region does slightly less well in converting its very large public investment into private R&D, with a multiplier a little below the national average at 1.72.

The real anomaly, though, is Inner London (West). This sub-region receives by far the largest amount of public R&D spending of any in the UK – nearly 20% of the entire public funding for R&D. Yet the rate of return on this, in terms of private sector R&D, is only 0.46, far below the national average.
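
To be explicit about how these multipliers are derived, here is a minimal sketch; the numbers below are placeholders for illustration, not the actual Eurostat figures, and “public” means R&D performed in government, higher education and non-profit institutions.

```python
# Minimal sketch of the business-to-public R&D multiplier used above.
# The values are illustrative placeholders, not the real Eurostat 2014 figures.

regional_rd_eur_m = {
    # region: (business R&D, public R&D) -- public = government + HE + non-profit
    "Example cluster A": (3000, 1400),
    "Example cluster B": (4000, 2300),
    "Example capital sub-region": (1400, 3000),
}

for region, (business, public) in regional_rd_eur_m.items():
    print(f"{region}: business/public multiplier = {business / public:.2f}")
```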

Working down the list, we find 9 sub-regions with respectable levels of total R&D. These include Bristol, Hampshire, Derby, Bedford, Surrey and the West Midlands, Worcestershire and Cheshire. With the exception of East Scotland, all these sub-regions are characterised by above average ratios of private to public sector R&D. Two sub-regions stand out for significant private sector R&D and almost no public sector activity – Cheshire, with its historic concentration of chemical and pharmaceutical industries, and Warwickshire, Herefordshire and Worcestershire.

Then we come to the long tail, with much lower investment in R&D, either public or private. All of Wales, Northern Ireland, the North of England, Southwest England beyond Bristol, outer east and southeast London and Kent, Lincolnshire – this is pretty much a map of left-behind Britain.

This brings us to the question: why should we worry about regional disparities in R&D? Let’s put aside for the moment any question of fairness – what it comes back to is productivity. If we plot R&D intensity against regional GDP, we find quite a respectable correlation. (Slightly to my surprise, the correlation is strongest if you plot total R&D – both public and private – rather than just business R&D. I’ve omitted London entirely, as it is such an outlier on both measures.)

Sub-regional GDP per capita against total R&D (public and private) per capita. 2014 data, from Eurostat.
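
A plot like this is straightforward to reproduce from the Eurostat tables; here is a minimal sketch, assuming the data has been assembled into a CSV with one row per NUTS2 sub-region (the file name and column names are hypothetical).

```python
# Sketch of the sub-regional GDP vs R&D scatter described in the text.
# Assumes a hypothetical CSV with columns: region, gdp_per_capita, rd_per_capita_total.
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("uk_nuts2_rd_gdp_2014.csv")            # placeholder file name
df = df[~df["region"].str.contains("London")]           # drop London, an outlier on both axes

corr = df["gdp_per_capita"].corr(df["rd_per_capita_total"])
print(f"Correlation, GDP per capita vs total R&D per capita: {corr:.2f}")

plt.scatter(df["rd_per_capita_total"], df["gdp_per_capita"])
plt.xlabel("Total R&D per capita (public + private), 2014")
plt.ylabel("GDP per capita, 2014")
plt.show()
```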

Of course, the relationship is not a straightforward one: higher R&D intensities are associated with the presence of high-productivity firms at the technology frontier, there may be a more general effect of higher skill levels in regions with higher R&D, and there may well be other factors, direct and indirect, at play too. We can also see outliers. NE Scotland has very high productivity but mid-level R&D investment, no doubt because of the importance of the oil industry, while East Anglia has relatively low productivity given its very high levels of R&D. I suspect the latter reflects the relatively concentrated nature of the Cambridge cluster, whose effect doesn’t really penetrate far into a large and relatively poor rural and coastal hinterland.

As the Industrial Strategy Commission argued, just as a matter of arithmetic averages, we cannot expect the productivity of the country as a whole to grow if such a large fraction is structurally lagging. So what should we do? Different places demand different approaches, and Tom Forth has some suggestions.

On London, I agree with Tom – the over-investment of public R&D in London is a grotesque misallocation of public resources, but the realities of political economy mean it’s likely to stay that way. At the least, we certainly need to stop building new institutions there. One thing the data does highlight, though, is how concentrated even within London the investment is, so initiatives like UCL East, aiming to spread the benefits of the investment to less favoured parts of the capital, are to be welcomed.

The eight sub-regions with high private sector R&D and public underinvestment offer a clear rationale for further public investment. One needs to be aware of the arbitrary nature of these sub-regional boundaries – the strength of the Cheshire cluster is a very good reason to have a strong Chemistry department in the University of Liverpool, for example – but these regions are likely to provide some very strong investment cases for following the private sector money to support existing clusters.

I somewhat disagree with Tom on the case of Oxford and Cambridge (and here of course my biases may be showing). Tom believes that funding should be frozen in these places until they agree to allow more growth. As far as Cambridge is concerned, I’m not sure this is quite right – there seems to be a huge amount of growth happening in that city at the moment, with significant numbers of new-build apartments going up in the city, and a string of new suburbs like Eddington being built around its fringes, complete with new schools and supermarkets. The issue here is the completely inadequate transport infrastructure to get into and around the city; it’s these infrastructure problems that are stopping the further growth of what is the UK’s most successful science-based cluster, and are stopping the spread of its benefits to its less prosperous hinterland.

But this leaves the bigger problem – how can public R&D investments be used to raise productivity and economic growth in what is the majority of the country, where levels of R&D investment, both public and private, are far too small? It’s easy to imagine arbitrary and ill-thought-through investments imposed on regions with no understanding of the potential that their economic history and current industrial base will support. But there are counter-examples: in my own city, Sheffield, I think there has been a well-thought-through policy based on promoting advanced manufacturing through investments in translational research and skills, which is bearing fruit through the twin routes of attracting inward investment from firms at the technological frontier and improving the performance of the existing business base.

Where does this leave the new organisation, UK Research and Innovation, whose job should be to rectify these issues? It’s not going to be easy, given that the cultures of the organisations UKRI is being built from have been positively opposed to place-based policy. The research councils have focused on “research excellence” as the sole criterion for funding, while the policy of InnovateUK has been to be led by industry. But “place-blind” policies inevitably lead to research concentration, through the well-known Matthew effect. I know that EPSRC at least has been seriously grappling with these issues in the last couple of years, while there is real expertise in Research England on the economic potential of universities in their cities and regions, so there is something to build on.

The early actions of the new organisation are not encouraging. The move back from Swindon to London is a retrograde step, suggesting that UKRI’s first priority is keeping ministers happy and keeping ahead of Whitehall office politics. When the Technology Strategy Board (the predecessor organisation to InnovateUK) was first set up as a free-standing funding agency, it was moved out of London to Swindon to emphasise that it served business, not Whitehall, and it was the better for it.

Moreover, the main board of UKRI is conspicuous for its lack of geographical diversity – out of 16 members, just one – Aberdeen’s Ian Diamond – is from outside London and the southeast. Allowing this situation to arise was a telling and worrying oversight.

One urgent concrete step UKRI should take is to create a high-level advisory board to hold its feet to the fire on a plan to rebalance R&D expenditure across the country. UKRI is to produce a strategy within the next month or two, and it is to be hoped that addressing the regional balance issue forms a central part of this strategy. This board should include senior representatives from the Devolved Administrations, and economic development leads from the metro mayors’ offices and combined authorities in the regions of England. At an operational level, UKRI needs to get its staff out of their London home, perhaps with regional specialists seconded to economic development units in the regions and nations.

The stakes here are high. UKRI has been set up with great expectations – the substantial injection of extra research spending and the 2.4% R&D target are signals that the government expects UKRI to deliver. If it does not produce tangible, positive effects on the wider economy – across the whole of the UK – UKRI will rightfully be judged to have failed: it will have failed the country, and it will have failed UK science. Let’s hope it can rise to the challenge.

Technological innovation in the linear age

We’re living in an age where technology is accelerating exponentially, but people’s habits of thought are stuck in an age where progress was only linear. This is the conventional wisdom of the futurists and the Davos-going classes – but it’s wrong. It may have been useful to say this 30 years ago: then we were just starting on an astonishing quarter century of year-on-year, exponential increases in computing power. In fact, the conventional wisdom is doubly wrong – now that that exponential growth in computing power has come to an end, those people who lived through that atypical period are perhaps the least well equipped to deal with what comes next. The exponential age of computing power that the combination of Moore’s law and Dennard scaling gave us came to an end in the mid-2000s, but technological progress will continue. The character of that progress is different, though – dare I say it, less exponential, more linear. Now, if you need more computing power, you aren’t going to be able to wait a year or two for Moore’s law to do its work; you’re much more likely to add another core to your CPU, or another server to your datacenter. This transition is going to have big implications for business and our economy, which I don’t see being taken very seriously yet.

Just how much faster have computers got? According to the standard textbook on computer architecture, a high-end microprocessor today has nearly 50,000 times the performance of a 1978 mini-computer, at perhaps 0.25% of the cost. But the rate of increase in computing power hasn’t been uniform. A remarkable plot in this book – Computer Architecture: A Quantitative Approach (6th edn) by John Hennessy & David Patterson – makes this clear.

In the early stages of the microprocessor revolution, between 1978 and 1986, computing power was increasing at a very healthy 25% a year – a doubling time of 3 years. It was around 1986 that the rate of change really took off – between 1986 and 2003 computer power increased at an astonishing 52% a year, a doubling time of just a year and a half.

This pace of advance was checked in 2004. The rapid advance had come about from the combination of two mutually reinforcing factors. The well-known Moore’s law dictated the pace at which the transistors in microprocessors were miniaturised. More transistors per chip gives you more computing power. But there was a less well-known factor reinforcing this – Dennard scaling – which says that smaller transistors allow your computer to run faster. It was this second factor, Dennard scaling, which broke down around 2004, as I discussed in a post last year.

With Moore’s law in operation, but Dennard scaling at an end, between 2003 and 2011 computer power continued to grow, but at the slower rate of 23% a year – back to a 3-year doubling time. But after 2011, according to Hennessy and Patterson, the growth rate slowed further – down to 3.5% a year since 2015. In principle, this corresponds to a doubling time of 20 years – but, as we’ll see, we’re unlikely to see this happen.
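
These doubling times follow directly from the quoted annual growth rates: a growth rate of r per year doubles performance in ln 2 / ln(1 + r) years. A quick sketch of the arithmetic:

```python
# Doubling times implied by the annual performance growth rates quoted above.
import math

growth_rates = {
    "1978-1986": 0.25,
    "1986-2003": 0.52,
    "2003-2011": 0.23,
    "since 2015": 0.035,
}

for period, rate in growth_rates.items():
    doubling_years = math.log(2) / math.log(1 + rate)
    print(f"{period}: {rate:.1%} a year -> doubling in about {doubling_years:.1f} years")
```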

This is a generational change in the environment for technological innovation, and as I discussed in my previous post, I’m surprised that its economic implications aren’t being discussed more. There have been signs of this stagnation in everyday life – I think people are much more likely to think twice about replacing their four-year-old laptop, say, than they were a decade ago, as the benefits of these upgrades become less obvious. But the stagnation has also been disguised by the growth of cloud computing.

The impressive feats of pattern recognition that allow applications like Alexa and Siri to recognise and respond to voice commands provide a good example of the way personal computing devices give the user the impression of great computing power, when in fact the intensive computation that these applications rely on takes place, not in the user’s device, but “in the cloud”. What “in the cloud” means, of course, is that the computation is carried out by the warehouse-scale computers that make up the cloud providers’ server farms.

The end of the era of exponential growth in computing power does not, of course, mean the end of innovation in computing. Rather than relying on single, general purpose CPUs to carry out many different tasks, we’ll see many more integrated circuits built with bespoke architectures optimised for specific purposes. Graphics processing units are one example: developed to drive higher quality video displays, they have proved well suited to the highly parallel computations that machine learning demands. And without automatic speed gains from progress in hardware, there’ll need to be much more attention given to software optimisation.

What will the economic implications be of moving into this new era? The economics of producing microprocessors will change. The cost of CPUs at the moment is dominated by the amortisation of the huge capital cost of the plant needed to make them. Older plants, whose capital costs are already written off, will find their lives being prolonged, so the cost of CPUs a generation or two behind the leading edge will plummet. This collapse in price of CPUs will be a big driver for the “internet of things”. And it will lead to the final end of Moore’s law, as the cost of new generations becomes prohibitive, squeezed between the collapse in price of less advanced processors and the diminishing returns in performance for new generations.

In considering the applications of computers, habits learnt in earlier times will need to be rethought. In the golden age of technological acceleration, between 1986 and 2003, if one had a business plan that looked plausible in principle but that relied on more computer speed than was currently available, one could argue that another few cycles of Moore’s law would soon sort out that difficulty. At the rates of technological progress in computing prevailing then, you’d only need to wait five years or so for the available computing power to increase by a factor of ten.

That’s not going to be the case now. A technology that is limited by the availability of local computing power – as opposed to computer power in the cloud – will only be able to surmount that hurdle by adding more processors, or by waiting for essentially linear growth in computer power. One example of an emerging technology that might fall into this category would be truly autonomous self-driving vehicles, though I don’t know myself whether this is the case.

The more general macro-economic implications are even less certain. One might be tempted to associate the marked slowing in productivity growth that the developed world saw in the mid-2000s with the breakdown in Dennard scaling and the end of the fastest period of growth in computer power. But I’m not confident that this stacks up, given the widespread roll-out of existing technology, coupled with much greater connectivity through broadband and mobile, that was happening at that time. That roll-out, of course, has still got further to go.

This paper – by Neil Thompson – does attempt to quantify the productivity hit to ICT-using firms caused by the end of Dennard scaling in 2004, finding a permanent hit to total factor productivity of between 0.5 and 0.7 percentage points for those firms that were unable to adapt their software to the new multicore architectures introduced at the time.

What of the future? It seems inconceivable that the end of the biggest driving force in technological progress over the last forty years would not have some significant macroeconomic impact, but I have seen little or no discussion of this from economists (if any readers know different, I would be very interested to hear about it). This seems to be a significant oversight.

Of course, it is the nature of all periods of exponential growth in particular technologies to come to an end, when they run up against physical or economic limits. What guarantees continued economic growth is the appearance of entirely new technologies. Steam power grew in efficiency exponentially through much of the 19th century, and when that growth levelled out (due to the physical limits of Carnot’s law) new technologies – the internal combustion engine and electric motors – came into play to drive growth further. So what new technologies might take over from silicon CMOS based integrated circuits to drive growth from here?

To restrict the discussion to computing, there are at least two ways of trying to look to the future. We can look at those areas where the laws of physics permit further progress, and where the economic demand to drive that progress is present. One obvious deficiency of our current computing technology is its energy efficiency – or lack of it. There is a fundamental physical limit on the energy consumption of computing – the Landauer limit – and we’re currently orders of magnitude away from that. So there’s plenty of room at the bottom here, as it were – and as I discussed in my earlier post, if we are to increase the available computing power of the world simply by building more data centres using today’s technology, before long this will consume a significant fraction of the world’s energy. So much lower power computing is both physically possible and economically (and environmentally) needed.
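
To give a sense of how much room there is: the Landauer limit at room temperature is kT ln 2, about 3 × 10⁻²¹ joules per bit erased. A rough sketch of the comparison, in which the energy per logic operation for today’s hardware is an assumed, illustrative order of magnitude rather than a measured figure:

```python
# Rough comparison of the Landauer limit with today's energy cost per operation.
# The "energy per operation today" figure is an assumed order of magnitude for illustration.
import math

k_B = 1.380649e-23                 # Boltzmann constant, J/K
T = 300.0                          # room temperature, K

landauer_limit = k_B * T * math.log(2)     # ~2.9e-21 J per bit erased
energy_per_op_today = 1e-12                # assumed ~1 pJ per logic operation

headroom = energy_per_op_today / landauer_limit
print(f"Landauer limit at 300 K: {landauer_limit:.2e} J per bit")
print(f"Headroom: roughly {math.log10(headroom):.0f} orders of magnitude")
```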

We can also look at those technologies that currently exist only in the laboratory, but which look like they have a fighting chance of moving to commercial scale sometime soon. Here the obvious candidate is quantum computing; there really does seem to be a groundswell of informed opinion that quantum computing’s time has come. In physics labs around the world there’s a real wave of excitement at the point where condensed matter physics meets nanotechnology – in the superconducting properties of nanowires, for example. Experimentalists are chasing the predicted existence of a whole zoo of quasi-particles (that is, quantised collective excitations) with interesting properties, with topics such as topological insulators and Majorana fermion states now enormously fashionable. The fact that companies such as Google and Microsoft have been hoovering up the world’s leading research groups in this area gives further cause to suspect that something might be going on.

The consensus about quantum computing among experts that I’ve spoken to is that this isn’t going to lead soon to new platforms for general purpose computing (not least because the leading candidate technologies still need liquid helium temperatures), but that it may give users a competitive edge in specialised uses such as large database searches and cryptography. We shall see (though one might want to hesitate before making big long-term bets which rely on current methods of cryptography remaining unbreakable – some cryptocurrencies, for example).

Finally, one should not forget that information and computing isn’t the only place where innovation takes place – a huge amount of economic growth was driven by technological change before computers were invented, and perhaps new, non-information-based innovation might drive another future wave of economic growth.

For now, what we can say is that the age of exponential growth of computer power is over. It gave us an extraordinary 40 years, but in our world all exponentials come to an end, and we’re now firmly in the final stage of the s-curve. So, until the next thing comes along, welcome to the linear age of innovation.