The Irony Of AWS Being Intel’s Latest Savior

Intel needs a whole lot of big wins for both its chip design and selling business and its increasingly arm’s length foundry business if it is going to turn itself around and be competitive with AMD and Nvidia on the left hand and with Taiwan Semiconductor Manufacturing Co on its right.

So far, what Intel is able to demonstrate is a bunch of small wins, which are steps in the right direction but about all both the company’s top brass and Wall Street can expect based on how far behind Intel fell in chip manufacturing techniques and therefore with its server and client CPUs, GPUs, and other kinds of compute engines.

We think that Pat Gelsinger, Intel’s prodigal son from EMC and VMware who has come home to save the company, has been doing a some pretty effective fast talking to the hyperscalers and cloud builders as well as to interested parties in the political sphere. And Gelsinger can be thankful that he didn’t get the chief executive officer job back in 2009 because the mess he has been cleaning up since he returned to the company in February 2021 is largely not his own, but rather those of former CEOs Paul Otellini, who could have been replaced by Renee James (now CEO of archrival Arm CPU provider Ampere Computing) but was not. Intel instead opted for Brian Krzanich in 2013, and the company was making money hand over until 2019 and it let its whole chip making business go to hell because of it, getting stuck in the La Brea 14 nanometer process pits in the process.

Roadmap after roadmap had to be torn up and redrawn, leaving the market wide open for AMD to take the pole position in CPUs and the Arm collective to coalesce – thanks in large measure to the desires of the hyperscalers and cloud builders to be rid of the X86 pricing tyranny Intel exhibited from 2010 through 2019 and to be in control of their own CPU architectures. A distracted but rich Intel fell far short with its many-core “Knights” family of accelerators and its “Ponte Vecchio” and “Rialto Bridge” GPUs, the former not really seeing the AI wave and the latter using a chiplet design and associated packaging that proved difficult to yield.

And to top it all off, Intel bought Nervana Systems, and then bought Habana Labs to replace them with the Gaudi line, and now Intel can’t make enough of the Gaudi matrix engines to create enough revenue to matter – $500 million in Gaudi 3 accelerator sales are expected this year, which is 32,000 devices, against more than $100 billion in datacenter sales for Nvidia in calendar 2024 and $5 billion in GPU sales and maybe $7 billion in CPU sales for AMD this year.

Don’t get us started on the past decade and a half of Intel’s attempts to build a datacenter networking business. . . . This obviously has not worked out, much to the detriment to the company. Networking represented 16.8 percent of Nvidia’s datacenter revenues in the trailing ten quarters the represent the GenAI boom. The share is coming down as Ethernet is gaining steam for AI clusters.

We believe in redemption, as we said as much in early August when going over Intel’s shocking financial results in The Resurrection Of Intel Will Take More Than Three Days. And the partnership just announced with Amazon Web Services is a step in the right direction. But we do not think it will lead to very much in the way of profits, even if it does result in billions of dollars in revenues. We are not sure how much of that purported and unspecified amount of revenue talked about in the deal this week is actually incremental.

Much is being made of the deal with AWS, so let’s break this down a bit.

Intel said in its statement that it is producing a custom Xeon 6 processor based on the Intel 3 process for the world’s largest cloud builder. Intel 3 is a refined version of the Intel 4 process, which is based on 7 nanometer geometries with some tweaks that make it more like a 5 nanometer process. The “Sierra Forest” Xeon 6700 and 6900 series announced back in June, the first of the Xeon 6 devices to come to market, use the Intel 3 process. And so will the impending “Granite Rapids” Xeon 6 chips, which have a smaller number of beefier X86 cores than the Sierra Rapids. We do not know if AWS is getting custom Sierra Forest or Granite Rapids chips – or both. We think the latter, and very likely with some modest tweaks to boost AI serving and database serving performance.

The nature of the customization is not clear, but we strongly suspect that it has more to do with the price of those X86 server CPUs than it does for any specific technical features. Yes, Intel has made custom processors for the big clouds for more than a decade, we know. Sometimes unneeded features are deactivated, sometimes the clock speeds are capped or boosted – the clouds were getting all-core boosted CPUs for years, and before everyone else could get their hands on these and also get more deterministic performance for their workloads. And, because the hyperscalers and cloud builders represented 30 percent of X86 CPU shipments worldwide and now represent in excess of 50 percent, they not only got better processors, but they got much better prices.

We do not think this is a change in the behavior of AWS at all, which has hewed closer to the Intel line than either Microsoft or Google ever did with their clouds. Google is well known for its enthusiasm for AMD server CPUs, and Microsoft has learned the same. All three are much more enamored of their own Arm server CPUs these days – which is without a doubt due in part to Intel not keeping its Xeon chips on a proper process roadmap, thus making them less competitive by necessity.

It is interesting that the AWS announcement did not say that Intel was being chosen as the foundry for future Trainium and Inferentia AI compute engines, but rather that it was “co-developing next-generation AI fabric chips on Intel 18A.”

Presumably, what Intel and AWS are referring to is breaking the Inferentia and Trainium processors into chiplets and having Intel etch the NeuronLink interconnect chips that were formerly on the Inferentia and Trainium monolithic dies free so they can be etched separately and, usually, on less expensive processes. A lot of the general business press and some of the IT trade press read what Intel and AWS said and mistakenly, we think, reported that Intel got the gig to co-design and etch AWS Trainium or Inferentia chips. If Intel is etching the whole Trainium and/or Inferentia chips, it did not describe this properly – and it did not give interviews on these announcements for further clarification.

Anyway, this AI fabric chip will be implemented in Intel’s 18A process, the first process to go into wide production using its RibbonFET transistor design and more or less at parity to TSMC’s 2N processes. (The 20A process that was dropped for server chips was more like TSMC’s 3N process.)

Intel also said the following in its announcement: “Intel and AWS intend to explore the potential for further designs to be produced by Intel based on Intel 18A and future process nodes including Intel 18AP and Intel 14A, which is expected to be produced in Intel’s Ohio facilities, as well as the migration of existing Intel designs to these platforms.”

So as we read this, there is a chance for Intel to get more chip fabbing deals with AWS at the 14A (or 1.4 nanometer) node, which is being brought up in the Intel Foundry operations in New Albany, Ohio.

As part of the announcements, Gelsinger also announced that Intel Foundry was not just having its financials broken out separately now, as happened a few quarters ago, but would be broken out as a independent subsidiary of Intel, much as the Altera FPGA business has already been. By being a subsidiary, Intel Foundry will have its own profit and loss statements and its own balance sheet, and it will be able to seek more independent investment from outside parties. (Hey, private equity firms can invest in NFL football teams now, so why not chip factories?)

Nothing was stopping Intel from seeking more equity investment to build out its foundry business, and it already has before making it a subsidiary. The US government gave Intel $8.5 billion in cash and incentives as part of the CHIPS Act. Apollo Global Management invested $11 billion in Intel’s Ireland fabs and Brookfield Asset Management invested $15 billion in Intel’s Arizona fabs, to get 49 percent stakes in new facilities being built there.

Making Intel Foundry a separate subsidiary might place stronger firewalls between the foundry and the rest of Intel, which very likely means seeding it with cash as well as keeping stuff confidential that foundry customers are doing. We think potential foundry customers want both, particularly with Intel Foundry hemorrhaging $8 billion in operating losses in the trailing twelve months and no reason to believe it will not continue.

Intel Foundry will be helped by the $3 billion in direct funding to make chips for the military that was announced as part of the Secure Enclave program within the CHIPS Act. This funding includes dough for the Rapid Assured Microelectronics Prototypes – Commercial (RAMP-C) and State-of-the-Art Heterogeneous Integration Prototype (SHIP) programs, and it is meant to ensure that chips of military significance are made indigenously. This $3 billion was not a surprise, given that Intel is the only indigenous chip maker. (IBM Microelectronics used to build a lot of chips for the US military, but it sold out to GlobalFoundries a decade ago; it still operates foundries in New York and Vermont, and they may also still make such chips for the Department of Defense.)

Intel is looking to shed 15 percent of its 116,500-strong work force, to cut its costs by $10 billion, and cutting its dividend payments to shareholders rocked its stock in early August, but the news of some of its actions happening helped the stock a little this week. Intel is about halfway towards it goal of shedding 15,000 workers by the end of the year. Intel is looking to consolidate its office space and will exit two-third of its office space by the end of 2025. (This sounds like a difficult thing to do.)

Gelsinger said in a memo to Intel employees that the Ireland fab will be its main hub in Europe and that it was pausing its manufacturing projects in Poland and Germany for two years, and that chip design and packaging operations in Malaysia would continue, but other manufacturing operations would halt.

“There are no changes to our other manufacturing locations,” Gelsinger said. “We remain committed to our U.S. manufacturing investments and are moving forward with our projects in Arizona, Oregon, New Mexico, and Ohio. We remain well-positioned to scale up production around the world based on market demand as we grow our foundry business.”

Ah, but will that market demand materialize?

There was some other re-org housekeeping news items. The edge and automotive business will be moved into the Client Computing Group, which makes chips for PCs and, in theory, other clients. The Network and Edge business will be focusing on “networking and telco” and presumably should just be called the Networking Group. Integrated photonics, which had presumably been part of NEX, is being moved into the Data Center & AI group and Intel is working on a roadmap to better serve the datacenter. Whatever that means. (Paring it down, more like.)

In the end, the hyperscalers and cloud builders were Intel’s biggest customers, but the malleability of the Arm architecture and the volume economics of the situation are going to make it very hard to Intel to win revenues at a reasonable profitability for its Xeons, and its process roadmaps are going to have to be successful for many years before they will trust Intel Foundry to be their etcher and packager. Intel’s foundry costs are so much higher than TSMC’s for any given process that it is hard to believe anyone would do it except for patriotic reasons – even Intel has to go to TSMC right now to get the best etching.

Sign up to our Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

6 Comments

  1. I wonder if by their visceral (or not) reaction upon flashing “++++” onto a screen, can one tell if an employee has been with Intel for a significant amount of time?

  2. It sure takes time, investment, and trust, to get a leading edge fab up and running successfully. This expanding strategic collaboration between AWS and Intel should help ensure that IFS gets there profitably, and sooner rather than later, here for AI fabric chips and custom Xeons. I can only hope that others join the fray as well, especially for cloud-oriented ARM chips, TPUs, GPUs, and the likes. Those are supply-constrained and face packaging bottlenecks in the global supply chain (eg. CoWoS), that should give ample room and opportunities for IFS to shine IMHO.

  3. The AWS nod was important for Pat’s relationship with FTC chair Gina Raimando. Federal funds are at stake here, so the worst reflection on her would be the IFS investment turns into another Solyndra. Pat had some good news to digest before the re-org stuff.

    Beyond that its bold for AWS to commit at this point to an build a product in an area Intel has been struggling with for years, so good for them. I’m sure that brought some relief to Gina. But in order to become a supplier in good standing with other key fabless guys (AMD, Nvidia for example) Intel is going to need to sever the IFS business completely, which isn’t in the cards for now.

  4. Why would it be costlier to build chips in the US if the factory is FULLY automated. The advantage that Asia has is the lower labour cost right?

    • It is a perplexing question, isn’t it? Same reason servers are cheaper to build in Shenzen than in Guadalaraja, which is cheaper than Fremont. It might have something to do with the ecosystem of engineering and parts expertise that builds up in places mentioned above.

      Perhaps there are more experts in foundries in Taiwan than there are experts in the United States? Whatever expertise IBM and AMD had and sold to GlobalFoundries is now way outdated. When all of the engineers and parts suppliers are local, the feedback loops are shorter and the expertise is leveraged quicker to solve problems. I think it has to be more than just cheaper labor. It has to also be better labor. Getting a foundry up to yield on a process quicker means losing less money on it. The cost is idleness, and as Intel’s results show, idleness can cost billions of dollars a quarter.

  5. “if it is going to turn itself around and be competitive with AMD ”
    Last time I looked, Intel still has 75% of x86 market.
    They are a node ahead with Lunar Lake using TSM N3.
    They are a node ahead with Granite Rapids and Sierra Forest using Intel-3.
    They continue to lead with introduction of Wifi7, and TB5 coming on Arrow Lake.
    They continue to lead with AMX tiled matrix acceleration on Granite Rapids.
    The integrated GPU and NPU on Lunar Lake push its performance to 120 TOPS for AI processing.

Leave a Reply

Your email address will not be published.


*


This site uses Akismet to reduce spam. Learn how your comment data is processed.