NeMO Archives - The Next Platform

Code

Nvidia NeMo Microservices For AI Agents Hits The Market

April 23, 2025 Jeffrey Burt Comments Off

Last year, amid all the talk of the “Blackwell” datacenter GPUs that were launched at last year’s GPU Technicval Conference, Nvidia also introduced the idea of Nvidia Inference Microservices, or NIMs, which are prepackaged enterprise-grade generative AI software stacks that companies can use as virtual copilots to add custom AI software to their own applications. …

Using NIM Guardrails To Keep Agentic AI From Jumping To Wrong Conclusions

January 16, 2025 Jeffrey Burt 1

AI agents are the latest evolution in the relatively short life span of generative AI, and while some organizations are still trying to figure out how the emerging technology fits in their operations, others are making strides into agentic AI. …

Nvidia’s “Grace” Arm CPU Holds Its Own Against X86 For HPC

February 6, 2024 Timothy Prickett Morgan 14

In many ways, the “Grace” CG100 server processor created by Nvidia – its first true server CPU and a very useful adjunct for extending the memory space of its “Hopper” GH100 GPU accelerators – was designed perfectly for HPC simulation and modeling workloads. …

Finding NeMo Features for Fresh LLM Building Boost

December 5, 2023 Nicole Hemsoth Prickett 2

This week Nvidia shared details about upcoming updates to its platform for building, tuning, and deploying generative AI models. …

Keeping Large Language Models From Running Off The Rails

April 26, 2023 Jeffrey Burt Comments Off

The heady, exciting days of ChatGPT and other generative AI and large-language models (LLMs) is beginning to give way to the understanding that enterprises will need to get a tight grasp on how these models are being used in their operations or they will risk privacy, security, legal, and other problems down the road. …