RajaCSP

GraphRAG for Mainframe Abend Troubleshooting with AgentScope

Posted on Fri 12 June 2026 in GenAI • Tagged with GenAI, RAG, KnowledgeGraph, AgentScope, Mainframe, GraphRAG, Python, COBOL

Most mainframe troubleshooting RAGs fail at the same place: retrieval. An abend code like S0C7 is a near-exact lookup, not a fuzzy semantic match — but vector search happily returns the S0C4 chunk because the embeddings sit close together. And job dependencies are graph-shaped: an abend in step 3 cascades to …
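A minimal sketch of the two ideas in this teaser, assuming a toy in-memory setup: abend codes are pulled out of the query and resolved by exact key lookup before any vector search runs, and the job's step dependencies are walked as a graph to list what a failure cascades into. The names `ABEND_NOTES`, `DOWNSTREAM`, `extract_abend_codes`, `exact_lookup`, and `downstream_steps` are hypothetical illustrations, not code from the post or from AgentScope.

```python
import re
from collections import deque

# Hypothetical knowledge base keyed by exact abend code (illustrative only).
ABEND_NOTES = {
    "S0C7": "Data exception: a numeric field holds non-numeric data.",
    "S0C4": "Protection exception: the program touched storage it does not own.",
}

# Hypothetical job graph: each step maps to the steps that depend on it.
DOWNSTREAM = {
    "STEP1": ["STEP2"],
    "STEP2": ["STEP3"],
    "STEP3": ["STEP4", "STEP5"],
    "STEP4": [],
    "STEP5": [],
}

# System abends: S + three hex digits, at least one of them a decimal digit
# (so ordinary words such as SAFE are skipped). User abends: U + four digits.
ABEND_PATTERN = re.compile(r"\b(?:S(?=[0-9A-F]{0,2}\d)[0-9A-F]{3}|U\d{4})\b")


def extract_abend_codes(query):
    """Return every abend code that appears verbatim in the query."""
    return ABEND_PATTERN.findall(query)


def exact_lookup(query):
    """Resolve abend codes by exact key match.

    An empty result is the signal to fall back to fuzzy (vector) retrieval,
    which is left out of this sketch.
    """
    return [(code, ABEND_NOTES[code])
            for code in extract_abend_codes(query)
            if code in ABEND_NOTES]


def downstream_steps(failed_step):
    """Breadth-first walk of the steps affected by a failure in failed_step."""
    seen = []
    queue = deque(DOWNSTREAM.get(failed_step, []))
    while queue:
        step = queue.popleft()
        if step not in seen:
            seen.append(step)
            queue.extend(DOWNSTREAM.get(step, []))
    return seen


if __name__ == "__main__":
    print(exact_lookup("Job PAYROLL failed with S0C7 in STEP3"))
    print(downstream_steps("STEP3"))
```

Because the lookup is keyed on the literal code, a query about S0C7 can never surface the S0C4 entry, however close the two sit in embedding space.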

100 Real GenAI Engineer Interview Questions

Posted on Wed 03 June 2026 in GenAI • Tagged with genai, interview, llm, rag, agents, mlops, compliance

Training & Adaptation Strategy

What approaches exist for training or adapting an LLM? — Pretraining, fine-tuning, instruction tuning, prompt engineering, RAG.
Base model vs instruction-tuned model? — Pure next-token predictor vs one aligned to follow instructions.
When would you choose fine-tuning over RAG? — Stable domain knowledge, style/format control, latency sensitivity.
When would …

50 Basic GenAI Engineer Interview Questions

Posted on Tue 02 June 2026 in GenAI • Tagged with genai, interview, llm, rag, fine-tuning, mlops

A starter question bank for screening entry-level GenAI engineers. Grouped by theme, covering fundamentals through production concerns.

Fundamentals

What is generative AI vs discriminative AI? — Generative models learn to produce new data; discriminative models learn decision boundaries to classify or predict.
What is a large language model (LLM)? — A neural …

Exploring NemoClaw — NVIDIA's Local AI Agent Sandbox

Posted on Fri 01 May 2026 in GenAI • Tagged with GenAI, LLM, NVIDIA, NemoClaw, Ollama, Docker

NemoClaw is NVIDIA's agent sandbox that lets you run AI assistants locally using your own inference backend — Ollama, llama.cpp, or cloud providers. It bundles OpenShell as a gateway and OpenClaw as the agent runtime, all orchestrated through Docker containers.

Here's a walkthrough of setting it up from scratch.

Setup …

LLM Inference Optimization: What Actually Makes Your Model Fast

Posted on Sat 18 April 2026 in GenAI • Tagged with LLM, Inference, Optimization, Quantization, KV Cache, Speculative Decoding, Flash Attention

When you send a prompt to an LLM, three layers shape how fast you get a response: the hardware (GPUs, TPUs, LPUs), the model size and architecture, and the inference engine strategies sitting on top. Most of the latency battle is fought at that third layer — and the core problem …

Six Terms You Must Know Before Building Agentic AI Systems

Posted on Fri 17 April 2026 in GenAI • Tagged with Agentic AI, MCP, RAG, Multi-Agent, Memory, Agent Architecture

If you are shipping agentic systems in 2026 and these six terms are fuzzy, you are building on sand. This is the foundational vocabulary — not hype, not vendor marketing, just the concepts that actually show up when you are designing, debugging, or scaling an agent pipeline.

The Six Terms

Model …

GitHub Spec Kit: A Practical Introduction to Spec-Driven Development

Posted on Tue 14 April 2026 in GenAI • Tagged with GenAI, LLM, SpecKit, GitHub, SDD, AIcoding, Copilot, AgentDev

"The issue isn't the coding agent's coding ability, but our approach.
We treat coding agents like search engines when we should be treating them
more like literal-minded pair programmers."
— Den Delimarsky, GitHub Principal Product Manager

The Problem With Vibe Coding

If you've used an AI coding assistant, you've experienced vibe …

Agentic System Design Concepts - Patterns Every AI Engineer Should Know

Posted on Sat 11 April 2026 in GenAI • Tagged with GenAI, AI-agents, LLM, agentic-systems, design-patterns, reliability

Building reliable AI agents isn't just about picking the right model — it's about the patterns you wire around it. Here's a concise reference of 15 agentic system design concepts worth knowing. Two lines each — just enough to understand what they do and why they matter.

Resilience & Failure Isolation

Agent Circuit …

Every Claude Code Concept You Need to Know

Posted on Sat 11 April 2026 in GenAI • Tagged with GenAI, Claude-Code, LLM, agents, developer-tools, local-AI

Claude Code is not a chatbot. It lives in your terminal, reads your actual files, writes code, runs commands, and executes multi-step workflows — all with your permission. Here are 30 concepts you need to understand it properly. No fluff, no hand-holding.

The 30 Concepts

1. The Terminal — Claude Code doesn't …

AI Agent Directory - Few-Shot LLM Models

Posted on Fri 10 April 2026 in GenAI • Tagged with GenAI, AI-agents, LLM, startups, directory

The AI agent ecosystem is growing fast. Here's a quick directory of notable AI startups and a couple of few-shot LLM models worth knowing about. Two lines each — just enough to know what they do and why they matter.

AI Agent Directory

Can of Soup — An AI-powered app that lets …