Part 12: GitOps at Scale — ArgoCD Orchestrating the Full Platform
The GitOps Repo Structure
go-reliable-gitops/
├── argocd/
│ ├── apps/ # Individual ArgoCD Application manifests
│ │ ├── infrastructure/
│ │ │ ├── cert-manager.yaml
│ │ │ ├── external-secrets.yaml
│ │ │ ├── ingress-nginx.yaml
│ │ │ ├── kube-prometheus-stack.yaml
│ │ │ ├── loki.yaml
│ │ │ ├── tempo.yaml
│ │ │ └── otel-collector.yaml
│ │ ├── kubeflow/
│ │ │ ├── cert-manager.yaml
│ │ │ ├── istio.yaml
│ │ │ ├── kubeflow-pipelines.yaml
│ │ │ ├── katib.yaml
│ │ │ └── kserve.yaml
│ │ └── mlops/
│ │ ├── mlflow.yaml
│ │ └── prometheus-pushgateway.yaml
│ ├── appsets/ # ApplicationSets (parameterized multi-app)
│ │ ├── microservices.yaml # Covers all 4 services × 2 envs
│ │ ├── kubeflow.yaml # KubeFlow wave-ordered deployment
│ │ └── governance-jobs.yaml # Drift detector, model evaluator CronJobs
│ └── root-app.yaml # The app-of-apps
├── environments/
│ ├── staging/
│ │ ├── api-gateway/values.yaml
│ │ ├── order-service/values.yaml
│ │ ├── notification-worker/values.yaml
│ │ └── ml-inference-gateway/values.yaml
│ └── production/
│ ├── api-gateway/values.yaml
│ ├── order-service/values.yaml
│ ├── notification-worker/values.yaml
│ └── ml-inference-gateway/values.yaml
├── infrastructure/
│ ├── cert-manager/values.yaml
│ ├── ingress-nginx/values.yaml
│ ├── kube-prometheus-stack/
│ │ ├── values.yaml
│ │ └── dashboards/ # Grafana dashboard JSON files as ConfigMaps
│ ├── loki/values.yaml
│ ├── tempo/values.yaml
│ ├── otel-collector/
│ │ ├── values.yaml
│ │ └── config.yaml
│ ├── mlflow/values.yaml
│ ├── vllm/
│ │ ├── deployment.yaml
│ │ ├── rollout.yaml
│ │ └── service.yaml
│ ├── kserve/
│ │ └── recommendation-model.yaml
│ └── drift-detector/
│ └── cronjob.yaml
├── runbooks/ # Incident runbooks (referenced by alerts)
│ ├── high-error-rate.md
│ ├── order-service-down.md
│ ├── notification-backlog.md
│ └── model-drift.md
└── external-secrets/ # ExternalSecret CRDs (not values themselves)
├── production/
│ ├── app-secrets.yaml
│ └── mlflow-credentials.yaml
└── staging/
└── app-secrets.yamlThe Root App-of-Apps
Multi-Environment ApplicationSet
Governance Jobs ApplicationSet
Argo Rollouts Across All Services
ArgoCD Project-Level Access Control
The One Discipline That Made GitOps Work
PreviousPart 11: ModelOps — Governance, Drift Detection, and Production LifecycleNextPart 13: Reliability at Every Layer — The Complete Platform Reference
Last updated