Fully Local Web Search – How I Wean My Hermes Agent off the Cloud Drip
My Hermes Agent has been able to search the web and extract page content for a while now. Until...
by Maker | May 30, 2026 | AI Pipeline, Large Language Models, Top story
My Hermes Agent has been able to search the web and extract page content for a while now. Until...
by Maker | May 29, 2026 | AI Pipeline, Software, Top story
Anyone documenting complex infrastructures quickly hits the limits of static diagrams and MS...
by Maker | May 26, 2026 | Large Language Models, News, Top story
I have been itching to do this for quite a while: Firecrawl self-hosted, on my own LAN, with no cloud...
by Maker | May 25, 2026 | GenAI Agents, Large Language Models, News, Top story
Why NemoClaw is moving into my AI workshop: Over the past few weeks I’ve shown you in several...
by Maker | May 24, 2026 | Large Language Models, News, Top story
My Hermes Agent has been running productively on my application server for several weeks: as a...
by Maker | May 23, 2026 | Large Language Models, News, Top story
In my last posts I built the inference layer (Ollama, TensorRT-LLM) and the orchestrator layer...
by Maker | May 22, 2026 | Hardware, Large Language Models, Top story
After I showed you in Part 4 of my ESP-Claw series how I got my ESP-Claw agent talking to my local...
by Maker | May 17, 2026 | Large Language Models, News, Top story
After my last post, where I dissected the ReAct loop in detail and built the first custom GPU...
by Maker | May 17, 2026 | Large Language Models, Top story
Agent orchestration is conceptually exactly the leap that turns “LLM inference” into...
by Maker | May 16, 2026 | GenAI Agents, Large Language Models, Top story
In my four-part TensorRT-LLM series I showed how I optimize inference performance on the RTX A6000...
by Maker | May 16, 2026 | Hardware, Large Language Models, Top story
Whether I later want to run TensorRT-LLM, Ollama, vLLM, or any other container-based inference...
by Maker | May 16, 2026 | Large Language Models, Top story
In the first three posts of this little series I explained why I’m tackling TensorRT-LLM on...





The tutorial offers a clear and practical guide for setting up and running the TensorFlow Object Detection Training Suite. Could…
This works using a very old laptop with an old GPU >>> print(torch.cuda.is_available()) True >>> print(torch.version.cuda) 12.6 >>> print(torch.cuda.device_count()) 1 >>>…
Hello Valentin, I will not share anything related to my work on detecting mines or UXOs. Best regards, Maker
Hello, We are a group of students at ESILV working on a project that aims to prove the availability of…