> For the complete documentation index, see [llms.txt](https://blog.htunnthuthu.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://blog.htunnthuthu.com/readme.md).

# Blog Content

**Principal Architect · GenAI & AIOps · Platform Engineering** — *Veni, Vidi, Vici* 🥷💻

After 10+ years of leadership at the intersection of high-velocity cloud platforms, automated security, data ecosystems, and agentic AI, I built this repository to share what actually works in production. I translate complex technical innovation — GenAI, AIOps, agentic systems, and LLMOps — into resilient enterprise infrastructure that drives business growth while maintaining strict governance. This is not documentation; it's an honest record of problems I had to solve.

## What You'll Find Here

This repository contains over **430+ articles** organized into comprehensive learning paths, covering everything from programming fundamentals to advanced cloud architecture and AI implementation.

### 🚀 Programming & Development

**Foundation & Core Skills:**

* Programming fundamentals with Python and TypeScript
* **Database 101** - Complete PostgreSQL guide
* **ORM 101** - Prisma with TypeScript and PostgreSQL
* **OOP 101** - Object-Oriented Programming mastery
* **Functional Programming 101** - Functional paradigms with TypeScript

**API Development:**

* **REST API 101** - Building production-ready APIs (7-part series)
* **gRPC 101** - High-performance microservices (7-part series)

**Testing & Quality:**

* **Software Testing 101** - From unit to E2E testing (7-part series)
* **Code Debugging 101** - VS Code debugging techniques
* **Code Review and Refactoring 101** - Maintaining code quality
* **Pair Programming 101** - Collaborative development

**Infrastructure as Code:**

* **Version Control 101** - Git and GitHub workflows
* **GitHub Actions 101** - CI/CD pipeline automation

**Specialized Topics:**

* **Vector Database 101** - pgvector with PostgreSQL (7-part series)
* **Rust 101** - Systems programming with Rust
* **Mathematics for Programming 101** - Mathematical foundations (7-part series)
* **Software Engineering 101** - Best practices and production readiness
* **Programming Issues 101** - Common errors and solutions

### 🤖 Artificial Intelligence & Machine Learning

**Core AI Development:**

* Building intelligent agents with LLMs and Azure OpenAI
* Production-ready RAG systems
* LLM-powered observability and automation
* AI-powered chatbots for microservices
* MCP vs ACP: Understanding the difference between the two protocols
* Building MCP servers from scratch using Rust
* Building ACP agents from scratch using Rust
* What is LangChain and LangGraph? — Composing LLM pipelines and stateful agent workflows
* **Self-Hosting OpenClaw on AWS Lightsail** — Running a personal AI assistant (OpenAI/Claude) across 20+ messaging channels on a $7/month VPS, with full security hardening and systemd daemon setup

**ML Frameworks & Tools:**

* **Hugging Face Transformers 101** - NLP model deployment
* **PyTorch 101** - Deep learning fundamentals
* **LLM API Development 101** - FastAPI with Claude
* **MLOps 101** - Kubeflow and production ML systems

### ☁️ Cloud Engineering & Infrastructure

**Infrastructure Automation:**

* **Terraform 101** - Infrastructure as Code (11-part series)
* **Kubernetes 101** - Container orchestration (12-part series)
* **Ansible 101** - Configuration management (22-part series)
* **Ansible Automation Platform 101** - Enterprise automation
* **Chef 101** - Configuration management
* **Helm 101** - Kubernetes package management
* **GitOps 101** - ArgoCD and GitOps workflows
* **Cloud Landing Zone 101** - Enterprise cloud foundations

### 🔄 DevOps & Site Reliability Engineering

**Observability & Monitoring:**

* **OpenTelemetry 101** - Distributed tracing and metrics (15-part series)
* **ELK Stack 101** - Logging and analytics
* **Prometheus 101** - Monitoring TypeScript applications (9-part series)

**SRE Practices:**

* **SRE 101** - Complete reliability guide (7-part series)
* **Release and Reliability Engineering 101** - Deployment strategies
* SLIs, SLOs, SLAs, and error budgets
* Incident management and toil reduction

**Microsoft Administration:**

* **Microsoft Administration 101** - Active Directory, ADFS, PKI, IIS

### 🔒 DevSecOps & Security

**Security Integration:**

* **DevSecOps 101** - Security-first development (14-part series)
* SAST, DAST, IAST, and SCA
* Container security and SBOM
* Secrets management and policy as code

**Identity & Access Management:**

* **MS Entra 101** - Microsoft identity platform (8-part series)
* OAuth 2.0, OIDC, SAML, and JWT
* SSO implementations and tenant restrictions

### 🏗️ Architecture & System Design

**Architecture Patterns:**

* **Software Architecture 101** - Core architectural concepts
* **Microservice Architecture 101** - Service decomposition (12-part series)
* **System Design 101** - Scalability and distributed systems (11-part series)
* Domain-Driven Design and event-driven architecture

**Design Patterns:**

* **Software Design 101** - SOLID principles and design patterns
* Creational, structural, and behavioral patterns
* Integration and communication patterns

### 📊 Data Engineering & Analytics

* **Data Engineering 101** - ETL/ELT pipelines (13-part series)
* Medallion architecture and data modeling
* **Python for Data Engineering 101** - Building real ETL pipelines with AWS Glue, S3, Athena, Redshift, Step Functions, and EventBridge — from local scripts to production-grade distributed jobs
* **DSA 101** - Data structures and algorithms (15-part series)

### 🎯 Fundamentals

* **Agile Development 101** - Scrum, Kanban, and workflows
* **Platform Engineering 101** - Internal developer platforms
* **Serverless 101** - AWS Lambda and event-driven architecture
* **Coding Interview 101** - Interview preparation

### 💻 Language-Specific Series

* **TypeScript 101** - Type-safe development (18-part series)
* **Golang 101** - Go programming (16-part series)
* **PowerShell 101** - Automation scripting (14-part series)

## Why I Created This Resource

This knowledge base represents what I wish I had when navigating these complex domains. My goal is to provide practical, experience-based guidance beyond theoretical concepts—real approaches that have succeeded (and sometimes failed) in production environments.

Each series is structured as a progressive learning path, taking you from fundamentals to production-ready implementations. Whether you're starting your tech journey or deepening your expertise, you'll find hands-on examples, real-world scenarios, and lessons learned from building production systems.

I regularly update this content with new series and emerging practices. The articles reflect my actual experience with these technologies, including the mistakes I made and how I overcame them.

*—Sharing practical knowledge from years of building production systems*

## 👨‍💻 About the Author

I'm **Htunn Thu Thu** — a **Principal Architect & Advisor** with 10+ years of leadership at the intersection of high-velocity cloud platforms, automated security, data ecosystems, and applied AI.

I spearhead organisation-wide transformations, establish global engineering standards, and act as a technical force multiplier — bridging executive leadership and engineering teams to balance innovation with regulatory compliance. As an active open-source author, I build tools that automate compliance, streamline platform delivery, and securely integrate AI with enterprise infrastructure.

### What I Focus On

| Domain                                | Scope                                                                                                                                                                                                                         |
| ------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **GenAI · Agentic Systems · AIOps**   | LLMOps, MLOps, governance · Agentic systems with tool use, planning & memory · MCP servers · Multi-provider LLM strategy (OpenAI, Anthropic, Azure OpenAI, GitHub Models, Ollama) · RAG, fine-tuning, evaluation & guardrails |
| **Platform Engineering · IDPs · SRE** | Self-service Internal Developer Platforms · Vendor-agnostic multi-cloud (AWS & Azure, Terraform, Ansible) · SRE practices — SLOs, error budgets, golden paths                                                                 |
| **Enterprise DevSecOps**              | Supply-chain security (SBOM, SAST/DAST, SCA) · CSPM, CWPP, CNAPP, ZTNA · OWASP LLM Top 10 · Zero-trust IAM                                                                                                                    |
| **Engineering Leadership & OSS**      | Technical authority across global business units · Mentoring senior engineers · OSS tooling that removes dev↔ops friction                                                                                                     |

### Selected Impact

* **DevSecOps at scale** — Automated supply-chain security (SBOM, SAST/DAST) across enterprise SDLCs, delivering **60%+ reduction in MTTR**
* **Platform engineering** — Built and scaled Internal Developer Platforms enabling self-service infrastructure, **reducing deployment time \~70%**
* **Multi-cloud strategy** — Defined vendor-agnostic AWS + Azure architectures preventing lock-in while optimising large-scale cloud spend
* **GenAI productionisation** — Translated AI research into governed enterprise systems with LLMOps, observability, and model risk management
* **OSS authorship** — Published production-grade tooling in Python, Go, and Rust spanning AI agents, compliance, and security testing

### Selected Open Source

| Project                                                                   | Description                                                                                                              | Stack                  |
| ------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------ | ---------------------- |
| [aiops-orchestrator](https://github.com/Htunn/aiops-orchestrator)         | Production-ready multi-channel AI agent — AIOps, Kubernetes management, security scanning, human-in-the-loop remediation | Python · FastAPI · MCP |
| [hybrid-inference-app](https://github.com/Htunn/hybrid-inference-app)     | Mobile-first PWA for hybrid LLM inference — local Ollama + Google Gemini 2.5 Flash via secure FastAPI proxy              | Python · FastAPI       |
| [simple-cicd-translator](https://github.com/Htunn/simple-cicd-translator) | Agentic service translating CI/CD pipelines between Jenkins, GitLab CI, and GitHub Actions via pluggable LLMs            | Python · Multi-LLM     |
| [simple-exploit](https://github.com/Htunn/simple-exploit)                 | AI-powered pentest framework with dual AI engines — GitHub Models (GPT-4o/5) and Ollama (Gemma/Llama 3)                  | Go · Ollama            |
| [simple-waf-scanner](https://github.com/Htunn/simple-waf-scanner)         | High-performance WAF detection and authorised security testing toolkit                                                   | Rust                   |
| [simple-service-bus](https://github.com/Htunn/simple-service-bus)         | Decentralised ESB with multi-protocol support (HTTP, gRPC, AMQP, MQTT, Kafka, WebSocket, GraphQL)                        | Go · TypeScript        |
| [ansible-inspec](https://github.com/Htunn/ansible-inspec)                 | Combines Ansible automation with Chef InSpec for continuous compliance and security validation                           | Python                 |

This repository is my working knowledge base — not polished documentation, but honest write-ups from real projects: what worked, what failed, and the reasoning behind the decisions. Every article reflects a problem I actually had to solve.

### Let's Connect

* 🔗 [LinkedIn](https://www.linkedin.com/in/htunnthuthu/)
* 🐙 [GitHub](https://github.com/Htunn)
* ✍️ [Blog — Tech With Htunn](https://blog.htunnthuthu.com/)
* 🐳 [Docker Hub](https://hub.docker.com/u/htunnthuthu)
* 🏅 [Credly Certifications](https://www.credly.com/users/htunn)

> *Translating frontier AI into governed, enterprise-grade platforms — secure, observable, and built to scale.*


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://blog.htunnthuthu.com/readme.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
