Sashiko — The AI That Catches Kernel Bugs Humans Already Missed

Posted on Mon 08 June 2026 in GenAI Engineering • Tagged with agentic-ai, linux-kernel, code-review, rust, llm, sashiko

Sashiko (刺し子, "little stabs") borrows its name from a Japanese reinforcement-stitching technique — fabric repaired and strengthened at its points of wear. The metaphor is the whole pitch: an agentic system that stitches over the weak spots in proposed Linux kernel patches before they land. It's written in Rust …


Continue reading

100 Real GenAI Engineer Interview Questions

Posted on Wed 03 June 2026 in GenAI • Tagged with genai, interview, llm, rag, agents, mlops, compliance

Training & Adaptation Strategy

  1. What approaches exist for training or adapting an LLM? — Pretraining, fine-tuning, instruction tuning, prompt engineering, RAG.

  2. Base model vs instruction-tuned model? — Pure next-token predictor vs one aligned to follow instructions.

  3. When would you choose fine-tuning over RAG? — Stable domain knowledge, style/format control, latency sensitivity.

  4. When would …


Continue reading

50 Basic GenAI Engineer Interview Questions

Posted on Tue 02 June 2026 in GenAI • Tagged with genai, interview, llm, rag, fine-tuning, mlops

A starter question bank for screening entry-level GenAI engineers. Grouped by theme, covering fundamentals through production concerns.

Fundamentals

  1. What is generative AI vs discriminative AI? — Generative models learn to produce new data; discriminative models learn decision boundaries to classify or predict.

  2. What is a large language model (LLM)? — A neural …


Continue reading

Exploring NemoClaw — NVIDIA's Local AI Agent Sandbox

Posted on Fri 01 May 2026 in GenAI • Tagged with GenAI, LLM, NVIDIA, NemoClaw, Ollama, Docker

NemoClaw is NVIDIA's agent sandbox that lets you run AI assistants locally using your own inference backend — Ollama, llama.cpp, or cloud providers. It bundles OpenShell as a gateway and OpenClaw as the agent runtime, all orchestrated through Docker containers.

Here's a walkthrough of setting it up from scratch.

Setup …


Continue reading

Using Free Cloud-Based LLMs via Ollama on Ubuntu

Posted on Sun 19 April 2026 in GenAI Engineering • Tagged with ollama, llm, ubuntu, cloud-llm, local-ai, kactii, linux


Ollama is a lightweight, open-source LLM runner. Its :cloud model suffix lets you route prompts to free-tier hosted models — no GPU, no paid API key required. Useful for learning, prototyping, and small projects on modest hardware.

This post covers the full Ubuntu setup: manual install, service startup, chatting with Kimi …


Continue reading

LLM Inference Optimization: What Actually Makes Your Model Fast

Posted on Sat 18 April 2026 in GenAI • Tagged with LLM, Inference, Optimization, Quantization, KV Cache, Speculative Decoding, Flash Attention

When you send a prompt to an LLM, three layers shape how fast you get a response: the hardware (GPUs, TPUs, LPUs), the model size and architecture, and the inference engine strategies sitting on top. Most of the latency battle is fought at that third layer — and the core problem …


Continue reading

AgentLegatus — Terraform for AI Agents

Posted on Wed 15 April 2026 in Agents • Tagged with agentlegatus, multi-agent, llm, open-source, orchestration, python, langgraph, vendor-agnostic

The multi-agent space is fracturing fast. Teams pick LangGraph one quarter, switch to CrewAI the next, then discover Google ADK or AWS Strands and wonder if they should migrate again. Every switch costs weeks — ripping out abstractions, rewriting orchestration logic, re-testing state management. This is the exact problem Terraform solved …


Continue reading

GitHub Spec Kit: A Practical Introduction to Spec-Driven Development

Posted on Tue 14 April 2026 in GenAI • Tagged with GenAI, LLM, SpecKit, GitHub, SDD, AIcoding, Copilot, AgentDev

"The issue isn't the coding agent's coding ability, but our approach.
We treat coding agents like search engines when we should be treating them
more like literal-minded pair programmers."

— Den Delimarsky, GitHub Principal Product Manager


The Problem With Vibe Coding

If you've used an AI coding assistant, you've experienced vibe …


Continue reading

Agentic System Design Concepts - Patterns Every AI Engineer Should Know

Posted on Sat 11 April 2026 in GenAI • Tagged with GenAI, AI-agents, LLM, agentic-systems, design-patterns, reliability

Building reliable AI agents isn't just about picking the right model — it's about the patterns you wire around it. Here's a concise reference of 15 agentic system design concepts worth knowing. Two lines each — just enough to understand what they do and why they matter.

Resilience & Failure Isolation

Agent Circuit …


Continue reading

Every Claude Code Concept You Need to Know

Posted on Sat 11 April 2026 in GenAI • Tagged with GenAI, Claude-Code, LLM, agents, developer-tools, local-AI

Claude Code is not a chatbot. It lives in your terminal, reads your actual files, writes code, runs commands, and executes multi-step workflows — all with your permission. Here are 30 concepts you need to understand it properly. No fluff, no hand-holding.

The 30 Concepts

1. The Terminal — Claude Code doesn't …


Continue reading