Category: Top story

NVIDIA NeMo Agent Toolkit (NAT): set up the agent orchestrator locally

by Maker | Jun 16, 2026 | Large Language Models, News, Top story

The two halves of a voice agent are in place: with Parakeet (Part 2) and Canary (Part 3) the agent...

NVIDIA Magpie TTS locally: German speech output as a microservice

by Maker | Jun 14, 2026 | Large Language Models, News, Top story

The speech recognition is in place: with Parakeet (Part 2) and Canary (Part 3) I covered the input...

NVIDIA Canary locally: multilingual speech recognition and translation as a NIM

by Maker | Jun 14, 2026 | Large Language Models, News, Top story

In Part 2 I set up Parakeet as a streaming-capable ASR NIM for German and ran it as a live service...

NVIDIA NIM locally: running German speech recognition as a microservice

by Maker | Jun 14, 2026 | Large Language Models, News, Top story

In my last post I ran NVIDIA Nemotron ASR Streaming directly with NeMo locally. That was the...

Install NVIDIA Nemotron ASR Streaming Locally – Step-by-Step Guide

by Maker | Jun 13, 2026 | Large Language Models, News, Top story

Real-time speech recognition is one of the building blocks I absolutely want to self-host for...

Hermes Agent as a Native Desktop App on Windows

by Maker | Jun 9, 2026 | AI Pipeline, Large Language Models, Top story

Nous Research has given its open-source Hermes Agent something with the “Surface...

llama-benchy: llama-bench-style LLM benchmarks for any OpenAI-compatible endpoint

by Maker | Jun 8, 2026 | Large Language Models, News, Top story

If, like me, you run your models locally and sovereignly, you know the problem: I want to know how...

Fully Local Web Search – How I Wean My Hermes Agent off the Cloud Drip

by Maker | May 30, 2026 | AI Pipeline, Large Language Models, Top story

My Hermes Agent has been able to search the web and extract page content for a while now. Until...

Animated Architecture Diagrams with D2 and ffmpeg – the Foundation for an AI-powered Diagram Generator

by Maker | May 29, 2026 | AI Pipeline, Software, Top story

Anyone documenting complex infrastructures quickly hits the limits of static diagrams and MS...

Firecrawl in Your Own LAN: Sovereign Web Scraping with a Local Ollama Server

by Maker | May 26, 2026 | Large Language Models, News, Top story

I have been itching to do this for quite a while: Firecrawl self-hosted, in my own LAN, with no cloud...

Sovereign AI on a 10-year-old office PC: NemoClaw on a Dell OptiPlex 5040 with my own Ollama server

by Maker | May 25, 2026 | GenAI Agents, Large Language Models, News, Top story

Why NemoClaw is moving into my AI workshop: Over the past few weeks I’ve shown you in several...

Sovereign AI in the Home Network: Running the Hermes Agent Dashboard Securely as a systemd Service

by Maker | May 24, 2026 | Large Language Models, News, Top story

My Hermes Agent has been running productively on my application server for several weeks: as a...

Latest Posts

Local Voice Agent: Wiring ASR, LLM and TTS into a Loop with NVIDIA Pipecat Locally
Every building block is now available locally: ASR (Parakeet, Canary), TTS (Magpie), an LLM via my Ollama server and the orchestrator (NAT, Part 5). Now I’m connecting them into a continuous, interruptible speech loop. This will be my first small local voice agent. What I’m aiming for is a kind of general-purpose agent: I speak,…
NVIDIA NeMo Agent Toolkit (NAT): set up the agent orchestrator locally
The two halves of a voice agent are in place: with Parakeet (Part 2) and Canary (Part 3) the agent listens, with Magpie (Part 4) it answers. What’s still missing is the brain: the layer that turns recognized text into a decision and triggers the matching answer or action. That’s exactly what I take on…
NVIDIA Magpie TTS locally: German speech output as a microservice
The speech recognition is in place: with Parakeet (Part 2) and Canary (Part 3) I covered the input direction, speech to text, natively on NVIDIA. Now comes the opposite direction, the voice output. In this post I run NVIDIA Magpie TTS as a local NIM and have German text read aloud naturally. This is the…
NVIDIA Canary locally: multilingual speech recognition and translation as a NIM
In Part 2 I set up Parakeet as a streaming-capable ASR NIM for German and ran it as a live service with low latency. In this post I take on the sister model: NVIDIA Canary as a NIM. Canary does not shine at latency but at accuracy, and it can do something Parakeet cannot: translate.…
NVIDIA NIM locally: running German speech recognition as a microservice
In my last post I ran NVIDIA Nemotron ASR Streaming directly with NeMo locally. That was the “bare” route via the framework. In this post I go one step further and dive into NVIDIA NIM. NIM stands for NVIDIA Inference Microservices, the microservice variant NVIDIA uses to ship its models as ready-made, optimized containers. The…

Ferry on AI Pipeline – Tensorflow Object Detection Training GUI Run | May 20, 2025
The tutorial offers a clear and practical guide for setting up and running the Tensorflow Object Detection Training Suite. Could…
Georg Mill on How to Install and Use OpenAI’s Whisper Locally for Automatic Transcription and Translation | December 31, 2024
This works using a very old laptop with an old GPU >>> print(torch.cuda.is_available()) True >>> print(torch.version.cuda) 12.6 >>> print(torch.cuda.device_count()) 1 >>>…
Maker on YOLOv5 – Training a neural network for PFM-1 antipersonnel mine detection | December 30, 2024
Hello Valentin, I will not share anything related to my work on detecting mines or UXO. Best regards, Maker
Valentin GRATEAU on YOLOv5 – Training a neural network for PFM-1 antipersonnel mine detection | January 11, 2024
Hello, We are a group of students at ESILV working on a project that aims to prove the availability of…