> For the complete documentation index, see [llms.txt](https://blog.htunnthuthu.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://blog.htunnthuthu.com/readme.md).

# Blog Content

**Principal Architect · GenAI & AIOps · Platform Engineering** — *Veni, Vidi, Vici* 🥷💻

After 10+ years of leadership at the intersection of high-velocity cloud platforms, automated security, data ecosystems, and agentic AI, I built this repository to share what actually works in production. I translate complex technical innovation — GenAI, AIOps, agentic systems, and LLMOps — into resilient enterprise infrastructure that drives business growth while maintaining strict governance. This is not documentation; it's an honest record of problems I had to solve.

## What You'll Find Here

This repository contains over **430+ articles** organized into comprehensive learning paths, covering everything from programming fundamentals to advanced cloud architecture and AI implementation.

### 🚀 Programming & Development

**Foundation & Core Skills:**

* Programming fundamentals with Python and TypeScript
* **Database 101** - Complete PostgreSQL guide
* **ORM 101** - Prisma with TypeScript and PostgreSQL
* **OOP 101** - Object-Oriented Programming mastery
* **Functional Programming 101** - Functional paradigms with TypeScript

**API Development:**

* **REST API 101** - Building production-ready APIs (7-part series)
* **gRPC 101** - High-performance microservices (7-part series)

**Testing & Quality:**

* **Software Testing 101** - From unit to E2E testing (7-part series)
* **Code Debugging 101** - VS Code debugging techniques
* **Code Review and Refactoring 101** - Maintaining code quality
* **Pair Programming 101** - Collaborative development

**Infrastructure as Code:**

* **Version Control 101** - Git and GitHub workflows
* **GitHub Actions 101** - CI/CD pipeline automation

**Specialized Topics:**

* **Vector Database 101** - pgvector with PostgreSQL (7-part series)
* **Rust 101** - Systems programming with Rust
* **Mathematics for Programming 101** - Mathematical foundations (7-part series)
* **Software Engineering 101** - Best practices and production readiness
* **Programming Issues 101** - Common errors and solutions

### 🤖 Artificial Intelligence & Machine Learning

**Core AI Development:**

* Building intelligent agents with LLMs and Azure OpenAI
* Production-ready RAG systems
* LLM-powered observability and automation
* AI-powered chatbots for microservices
* MCP vs ACP: Understanding the difference between the two protocols
* Building MCP servers from scratch using Rust
* Building ACP agents from scratch using Rust
* What is LangChain and LangGraph? — Composing LLM pipelines and stateful agent workflows
* **Self-Hosting OpenClaw on AWS Lightsail** — Running a personal AI assistant (OpenAI/Claude) across 20+ messaging channels on a $7/month VPS, with full security hardening and systemd daemon setup

**ML Frameworks & Tools:**

* **Hugging Face Transformers 101** - NLP model deployment
* **PyTorch 101** - Deep learning fundamentals
* **LLM API Development 101** - FastAPI with Claude
* **MLOps 101** - Kubeflow and production ML systems

### ☁️ Cloud Engineering & Infrastructure

**Infrastructure Automation:**

* **Terraform 101** - Infrastructure as Code (11-part series)
* **Kubernetes 101** - Container orchestration (12-part series)
* **Ansible 101** - Configuration management (22-part series)
* **Ansible Automation Platform 101** - Enterprise automation
* **Chef 101** - Configuration management
* **Helm 101** - Kubernetes package management
* **GitOps 101** - ArgoCD and GitOps workflows
* **Cloud Landing Zone 101** - Enterprise cloud foundations

### 🔄 DevOps & Site Reliability Engineering

**Observability & Monitoring:**

* **OpenTelemetry 101** - Distributed tracing and metrics (15-part series)
* **ELK Stack 101** - Logging and analytics
* **Prometheus 101** - Monitoring TypeScript applications (9-part series)

**SRE Practices:**

* **SRE 101** - Complete reliability guide (7-part series)
* **Release and Reliability Engineering 101** - Deployment strategies
* SLIs, SLOs, SLAs, and error budgets
* Incident management and toil reduction

**Microsoft Administration:**

* **Microsoft Administration 101** - Active Directory, ADFS, PKI, IIS

### 🔒 DevSecOps & Security

**Security Integration:**

* **DevSecOps 101** - Security-first development (14-part series)
* SAST, DAST, IAST, and SCA
* Container security and SBOM
* Secrets management and policy as code

**Identity & Access Management:**

* **MS Entra 101** - Microsoft identity platform (8-part series)
* OAuth 2.0, OIDC, SAML, and JWT
* SSO implementations and tenant restrictions

### 🏗️ Architecture & System Design

**Architecture Patterns:**

* **Software Architecture 101** - Core architectural concepts
* **Microservice Architecture 101** - Service decomposition (12-part series)
* **System Design 101** - Scalability and distributed systems (11-part series)
* Domain-Driven Design and event-driven architecture

**Design Patterns:**

* **Software Design 101** - SOLID principles and design patterns
* Creational, structural, and behavioral patterns
* Integration and communication patterns

### 📊 Data Engineering & Analytics

* **Data Engineering 101** - ETL/ELT pipelines (13-part series)
* Medallion architecture and data modeling
* **Python for Data Engineering 101** - Building real ETL pipelines with AWS Glue, S3, Athena, Redshift, Step Functions, and EventBridge — from local scripts to production-grade distributed jobs
* **DSA 101** - Data structures and algorithms (15-part series)

### 🎯 Fundamentals

* **Agile Development 101** - Scrum, Kanban, and workflows
* **Platform Engineering 101** - Internal developer platforms
* **Serverless 101** - AWS Lambda and event-driven architecture
* **Coding Interview 101** - Interview preparation

### 💻 Language-Specific Series

* **TypeScript 101** - Type-safe development (18-part series)
* **Golang 101** - Go programming (16-part series)
* **PowerShell 101** - Automation scripting (14-part series)

## Why I Created This Resource

This knowledge base represents what I wish I had when navigating these complex domains. My goal is to provide practical, experience-based guidance beyond theoretical concepts—real approaches that have succeeded (and sometimes failed) in production environments.

Each series is structured as a progressive learning path, taking you from fundamentals to production-ready implementations. Whether you're starting your tech journey or deepening your expertise, you'll find hands-on examples, real-world scenarios, and lessons learned from building production systems.

I regularly update this content with new series and emerging practices. The articles reflect my actual experience with these technologies, including the mistakes I made and how I overcame them.

*—Sharing practical knowledge from years of building production systems*

## 👨‍💻 About the Author

I'm **Htunn Thu Thu** — a **Principal Architect & Advisor** with 10+ years of leadership at the intersection of high-velocity cloud platforms, automated security, data ecosystems, and applied AI.

I spearhead organisation-wide transformations, establish global engineering standards, and act as a technical force multiplier — bridging executive leadership and engineering teams to balance innovation with regulatory compliance. As an active open-source author, I build tools that automate compliance, streamline platform delivery, and securely integrate AI with enterprise infrastructure.

### What I Focus On

| Domain                                | Scope                                                                                                                                                                                                                         |
| ------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **GenAI · Agentic Systems · AIOps**   | LLMOps, MLOps, governance · Agentic systems with tool use, planning & memory · MCP servers · Multi-provider LLM strategy (OpenAI, Anthropic, Azure OpenAI, GitHub Models, Ollama) · RAG, fine-tuning, evaluation & guardrails |
| **Platform Engineering · IDPs · SRE** | Self-service Internal Developer Platforms · Vendor-agnostic multi-cloud (AWS & Azure, Terraform, Ansible) · SRE practices — SLOs, error budgets, golden paths                                                                 |
| **Enterprise DevSecOps**              | Supply-chain security (SBOM, SAST/DAST, SCA) · CSPM, CWPP, CNAPP, ZTNA · OWASP LLM Top 10 · Zero-trust IAM                                                                                                                    |
| **Engineering Leadership & OSS**      | Technical authority across global business units · Mentoring senior engineers · OSS tooling that removes dev↔ops friction                                                                                                     |

### Selected Impact

* **DevSecOps at scale** — Automated supply-chain security (SBOM, SAST/DAST) across enterprise SDLCs, delivering **60%+ reduction in MTTR**
* **Platform engineering** — Built and scaled Internal Developer Platforms enabling self-service infrastructure, **reducing deployment time \~70%**
* **Multi-cloud strategy** — Defined vendor-agnostic AWS + Azure architectures preventing lock-in while optimising large-scale cloud spend
* **GenAI productionisation** — Translated AI research into governed enterprise systems with LLMOps, observability, and model risk management
* **OSS authorship** — Published production-grade tooling in Python, Go, and Rust spanning AI agents, compliance, and security testing

### Selected Open Source

| Project                                                                   | Description                                                                                                              | Stack                  |
| ------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------ | ---------------------- |
| [aiops-orchestrator](https://github.com/Htunn/aiops-orchestrator)         | Production-ready multi-channel AI agent — AIOps, Kubernetes management, security scanning, human-in-the-loop remediation | Python · FastAPI · MCP |
| [hybrid-inference-app](https://github.com/Htunn/hybrid-inference-app)     | Mobile-first PWA for hybrid LLM inference — local Ollama + Google Gemini 2.5 Flash via secure FastAPI proxy              | Python · FastAPI       |
| [simple-cicd-translator](https://github.com/Htunn/simple-cicd-translator) | Agentic service translating CI/CD pipelines between Jenkins, GitLab CI, and GitHub Actions via pluggable LLMs            | Python · Multi-LLM     |
| [simple-exploit](https://github.com/Htunn/simple-exploit)                 | AI-powered pentest framework with dual AI engines — GitHub Models (GPT-4o/5) and Ollama (Gemma/Llama 3)                  | Go · Ollama            |
| [simple-waf-scanner](https://github.com/Htunn/simple-waf-scanner)         | High-performance WAF detection and authorised security testing toolkit                                                   | Rust                   |
| [simple-service-bus](https://github.com/Htunn/simple-service-bus)         | Decentralised ESB with multi-protocol support (HTTP, gRPC, AMQP, MQTT, Kafka, WebSocket, GraphQL)                        | Go · TypeScript        |
| [ansible-inspec](https://github.com/Htunn/ansible-inspec)                 | Combines Ansible automation with Chef InSpec for continuous compliance and security validation                           | Python                 |

This repository is my working knowledge base — not polished documentation, but honest write-ups from real projects: what worked, what failed, and the reasoning behind the decisions. Every article reflects a problem I actually had to solve.

### Let's Connect

* 🔗 [LinkedIn](https://www.linkedin.com/in/htunnthuthu/)
* 🐙 [GitHub](https://github.com/Htunn)
* ✍️ [Blog — Tech With Htunn](https://blog.htunnthuthu.com/)
* 🐳 [Docker Hub](https://hub.docker.com/u/htunnthuthu)
* 🏅 [Credly Certifications](https://www.credly.com/users/htunn)

> *Translating frontier AI into governed, enterprise-grade platforms — secure, observable, and built to scale.*
