Tag: inference
Taalas Etches AI Models Onto Transistors To Rocket Boost Inference
Microsoft Takes On Other Clouds With “Braga” Maia 200 AI Compute Engines
Is Nvidia Assembling The Parts For Its Next Inference Platform?
Rebellions AI Puts Together An HBM And Arm Alliance To Take On Nvidia
What Is Cerebras Going To Do With That $1.1 Billion In New Funding?
Google Shows Off Its Inference Scale And Prowess
Skimpy HBM Memory Opens Up The Way For AI Inference Memory Godbox
With “Ironwood” TPU, Google Pushes The AI Accelerator To The Floor
Future Proofing Inference Servers With PCI-Express Switches
Cerebras Trains Llama Models To Leap Over GPUs
Cerebras Needs Wall Street Money To Expand Beyond One Core Customer
The Battle Begins For AI Inference Compute In The Datacenter
The First AI Benchmarks Pitting AMD Against Nvidia
Stacking Up Intel Gaudi Against Nvidia GPUs For AI
Talking AI Costs And Addressable Markets With SambaNova
How AWS Can Undercut Nvidia With Homegrown AI Compute Engines
Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years
Big Blue Can Still Catch The AI Wave If It Hurries
Optimizing AI Inference Is As Vital As Building AI Training Beasts
Chiplet Cloud Can Bring The Cost Of LLMs Way Down
Meta Platforms Crafts Homegrown AI Inference Chip, AI Training Next
A Peek Into The Future Of AI Inference At Nvidia
Chip Roadmaps Unfold, Crisscrossing And Interconnecting, At AMD
Intel Pits New Gaudi2 AI Training Engine Against Nvidia GPUs
The Performance Of MLPerf As A Ubiquitous Benchmark Is Lacking
Doing The Math On CPU-Native AI Inference
Ampere Computing Buys An AI Inference Performance Leap
Further Funding Flows to Canadian AI Inference Hardware
SoC-Driven Inference Datacenters Becoming New Reality
Python Could Reset the AI Inference Playing Field
Feeding The Datacenter Inference Beast A Heavy Diet Of FPGAs
Inside Facebook’s Future Rack And Microserver Iron
MLPerf Inference Results Offer Glimpse into AI Chip Performance
Nvidia Shows Off Tech Chops With RC18 Inference Chip
Habana Takes Training And Inference Down Different Paths
Will Analog AI Make Mythic a Unicorn?
AI Chip Startup Puts Inference Cards on the Table
Where The FPGA Hits The Server Road For Inference Acceleration
Teasing Out The Bang For The Buck Of Inference Engines
Google Rounds Out Insight into TPU Architecture and Inference
Facebook Sounds Opening Bell for AI Inference Hardware Makers
First Wave of Spiking Neural Network Hardware Hits
Xilinx Unveils xDNN FPGA Architecture for AI Inference
A Mythic Approach to Deep Learning Inference
Even at the Edge, Scale is the Real Challenge
Google Research Pushing Neural Networks Out of the Datacenter
Does Google’s TPU Investment Make Sense Going Forward?
Nvidia Pushes Deep Learning Inference With New Pascal GPUs