An MCP Server for Multi-GPU Monitoring – Step by Step with Python, pynvml and EMA Smoothing
In my last posts I built the inference layer (Ollama, TensorRT-LLM) and the orchestrator layer...
Read MorePosted by Maker | May 23, 2026 | Large Language Models, News, Top story |
In my last posts I built the inference layer (Ollama, TensorRT-LLM) and the orchestrator layer...
Read MorePosted by Maker | May 22, 2026 | Hardware, Large Language Models, Top story |
After I showed you in Part 4 of my ESP-Claw series how I got my ESP-Claw agent talking to my local...
Read MorePosted by Maker | May 17, 2026 | Large Language Models, News, Top story |
After my last post, where I dissected the ReAct loop in detail and built the first custom GPU...
Read MorePosted by Maker | May 17, 2026 | Large Language Models, Top story |
Agent orchestration is conceptually exactly the leap that turns “LLM inference” into...
Read MorePosted by Maker | May 16, 2026 | GenAI Agents, Large Language Models, Top story |
In my four-part TensorRT-LLM series I showed how I optimize inference performance on the RTX A6000...
Read More




The tutorial offers a clear and practical guide for setting up and running the Tensorflow Object Detection Training Suite. Could…
This works using an very old laptop with old GPU >>> print(torch.cuda.is_available()) True >>> print(torch.version.cuda) 12.6 >>> print(torch.cuda.device_count()) 1 >>>…
Hello Valentin, I will not share anything related to my work on detecting mines or UXO's. Best regards, Maker
Hello, We are a group of students at ESILV working on a project that aim to prove the availability of…