Drift Detector

Event-driven Kubernetes configuration drift detection that leverages ConfigHub's unique capabilities across multi-environment deployments.

📚 Documentation

  • QUICKSTART.md - Step-by-step setup guide (start here!)
  • WORKFLOW.md - Understanding the ConfigHub → Kubernetes workflow
  • README.md - Full architecture and features (this document)

Architecture

The drift detector uses the following components:

  • Persistent drift records stored as ConfigHub units
  • Multi-environment tracking across dev → staging → prod
  • Atomic corrections with ConfigHub rollback capability
  • Audit trail for all changes
  • Event-driven detection using Kubernetes informers
  • ConfigHub Sets for grouping related drift items
  • Bulk correction across environments using filters
  • Automatic correction with ConfigHub as configuration source

Scenario

The Drift Detector continuously monitors your Kubernetes clusters, comparing live state against ConfigHub units (your source of truth). When drift is detected, it can auto-correct, alert, or track the drift for later remediation, all while maintaining a complete audit trail.

ConfigHub Layout

The detector uses ConfigHub's space hierarchy to track drift across environments:

graph TD
base[drift-detector-base] --> dev[drift-detector-dev]
dev --> staging[drift-detector-staging]
staging --> prod[drift-detector-prod]

base -.->|desired state| units[ConfigHub Units]
dev -.->|actual state| k8s-dev[K8s Dev Cluster]
staging -.->|actual state| k8s-staging[K8s Staging]
prod -.->|actual state| k8s-prod[K8s Production]

units -->|compares| detector[Drift Detector]
k8s-dev -->|monitors| detector
k8s-staging -->|monitors| detector
k8s-prod -->|monitors| detector

detector -->|stores| drifts[Drift Records]
detector -->|groups| sets[ConfigHub Sets]
detector -->|corrects| k8s-dev
detector -->|corrects| k8s-staging
detector -->|corrects| k8s-prod

Unit Organization

{prefix}-drift-detector/
├── Units (Configurations)
│   ├── drift-detector-deployment    # App deployment config
│   ├── drift-detector-service       # Service endpoints
│   ├── drift-detector-rbac          # Permissions
│   └── namespace                    # Infrastructure setup
│
├── Sets (Grouped Drifts)
│   ├── critical-drifts              # Security/compliance issues
│   ├── resource-drifts              # CPU/memory changes
│   ├── replica-drifts               # Scaling deviations
│   └── corrected-drifts             # Successfully fixed items
│
└── Filters (Smart Queries)
    ├── security-drift               # RBAC and secret changes
    ├── production-drift             # Prod-only deviations
    ├── auto-correctable             # Safe to auto-fix
    └── manual-review                # Requires human approval

Setup

Step 1: Configure ConfigHub Structure

First, set up the ConfigHub spaces and base units:

# Install base configurations in ConfigHub
bin/install-base

# Set up environment hierarchy (dev → staging → prod)
bin/install-envs

# View the created structure
cub unit tree --node=space --filter drift-detector --space '*'

Step 2: Deploy to Kubernetes via ConfigHub

# Create secrets first
kubectl create secret generic drift-detector-secrets \
  --from-literal=cub-token=$CUB_TOKEN \
  --from-literal=claude-api-key=$CLAUDE_API_KEY \
  -n devops-apps

# Apply all units to dev environment
bin/apply-all dev

# Or apply to specific environment
bin/apply-all staging
bin/apply-all prod

Step 3: Promote Through Environments

# After testing in dev, promote to staging
bin/promote dev staging

# After validation in staging, promote to prod
bin/promote staging prod

Running Locally (Development)

# Set environment variables
export KUBECONFIG=/path/to/kubeconfig
export CUB_TOKEN=your-confighub-token
export CLAUDE_API_KEY=your-claude-key  # Optional

# Build and run
go build
./drift-detector

ConfigHub Structure

The drift-detector follows the global-app pattern with this hierarchy:

drift-detector (main space)
├── drift-detector-filters (filters for targeting)
├── drift-detector-base (base configurations)
│   ├── namespace.yaml
│   ├── drift-detector-rbac.yaml
│   ├── drift-detector-deployment.yaml
│   └── drift-detector-service.yaml
├── drift-detector-dev (cloned from base)
├── drift-detector-staging (cloned from dev)
└── drift-detector-prod (cloned from staging)

Each environment inherits from its upstream:

  • base → dev → staging → prod
  • Changes flow through push-upgrade pattern
  • Each environment can have local customizations

Configuration

Environment Variable    Description                               Default
NAMESPACE               Kubernetes namespace to monitor           qa
CUB_SPACE               ConfigHub space to use as desired state   acorn-bear-qa
CUB_API_URL             ConfigHub API endpoint                    https://hub.confighub.com/api/v1
CUB_TOKEN               ConfigHub API token                       Required
CLAUDE_API_KEY          Claude API key for AI analysis            Optional
AUTO_FIX                Create fixes automatically                false

Viewing Drift Detection

🔍 Monitoring Dashboard

The drift detector includes a real-time monitoring dashboard:

# Open the dashboard and ConfigHub
./bin/view-dashboard

# Dashboard will open at:
# file:///path/to/drift-detector/dashboard.html

The dashboard shows:

  • Live drift status with affected resources
  • Claude AI analysis and recommendations
  • Fix buttons to apply corrections
  • Real-time logs of all operations
  • Cost impact of drift ($240/month saved)
  • Metrics: Resources monitored, drift detected, auto-fixes applied

📊 ConfigHub CLI Commands

After running ./bin/install, check what was created in ConfigHub:

# List all drift detector spaces
cub space list | grep drift-detector

You should see:

NAME                                UNITS    LINKS    TAGS    CHANGESETS    FILTERS    VIEWS    INVOCATIONS    TRIGGERS    WORKERS    TARGETS
drift-detector-1758540677              1         0       0            0           0        0              0           0          0          0
drift-detector-1758540677-filters     0         0       0            0           2        0              0           0          0          0
# View the hierarchy of spaces
cub unit tree --node=space --filter drift-detector --space '*'

You should see:

NODE                                     UNIT        STATUS    UPGRADE-NEEDED    UNAPPLIED-CHANGES    APPLY-GATES
└── drift-detector-1758540677            k8s-target  NoLive                                           None
# List sets (groups of critical services)
cub set list --space drift-detector-1758540677

You should see:

NAME            SPACE                        DESCRIPTION
critical-set    drift-detector-1758540677    Critical services that must not drift
# View filters with WHERE clauses
cub filter list --space drift-detector-1758540677-filters

You should see:

NAME                 SPACE                                FROM    WHERE                                  WHERE-DATA    RESOURCE-TYPE
critical-services    drift-detector-1758540677-filters    Unit    Labels.tier = 'critical'
production-only      drift-detector-1758540677-filters    Unit    Labels.environment = 'production'
# List units in the space
cub unit list --space drift-detector-1758540677

You should see:

NAME          SPACE                        CHANGESET    TARGET    STATUS    LAST-ACTION    UPGRADE-NEEDED    UNAPPLIED-CHANGES    APPLY-GATES
k8s-target    drift-detector-1758540677                           NoLive    ?                                                     None
# Get detailed set information
cub set get critical-set --space drift-detector-1758540677 --json | jq '.Labels'

You should see:

{
  "tier": "critical",
  "monitor": "true",
  "auto-fix": "true"
}
# View unit data (the Kubernetes configuration)
cub unit get-data k8s-target --space drift-detector-1758540677

You should see:

apiVersion: v1
kind: Target
metadata:
  name: k8s-cluster
spec:
  type: kubernetes
  config:
    context: kind-devops-test
    namespace: drift-test
# After drift is detected and fixed, check for new units
cub unit list --space drift-detector-1758540677 --verbose

You might see additional units for fixed configurations:

NAME             SPACE                        STATUS    DESCRIPTION
k8s-target       drift-detector-1758540677    NoLive    Kubernetes cluster configuration
backend-api      drift-detector-1758540677    Applied   Fixed replica count from 5 to 3
frontend-web     drift-detector-1758540677    Applied   Fixed replica count from 1 to 2

🌐 ConfigHub Web UI

Access the web interface:

  1. Navigate to: https://hub.confighub.com
  2. Go to Spaces: Click "Spaces" in top menu
  3. Find your space: Search for drift-detector- prefix
  4. Explore:
    • Units tab: See k8s-target and any fixed configurations
    • Sets tab: View critical-services grouping
    • Filters tab: See WHERE clauses for targeting
    • Live State: Monitor deployment status

📝 Example Drift Detection Output

When drift is detected, you'll see:

# Console output when running drift-detector
[drift-detector] 2025/09/22 12:16:15 ⚠️ DRIFT DETECTED: backend-api has 5 replicas, expected 3
[drift-detector] 2025/09/22 12:16:15 ⚠️ DRIFT DETECTED: frontend-web has 1 replica, expected 2
[drift-detector] 2025/09/22 12:16:16 🤖 Claude analysis: Over-scaling detected, cost impact $180/month
[drift-detector] 2025/09/22 12:16:17 🔧 Applying fix: backend-api replicas 5 → 3
[drift-detector] 2025/09/22 12:16:18 ✅ Push-upgrade complete, changes propagated downstream

# Check Kubernetes to verify fixes
kubectl get deployments -n drift-test
NAME           READY   UP-TO-DATE   AVAILABLE
backend-api    3/3     3            3         # Fixed from 5 to 3
frontend-web   2/2     2            2         # Fixed from 1 to 2

🚨 Introducing Test Drift

To test drift detection:

# Deploy test workloads with intentional drift
./bin/deploy-test --with-drift

You should see:

🚀 Deploying test workloads
==========================
📦 Deploying workloads to drift-test namespace...
⏳ Waiting for deployments to be ready...

📊 Deployment Status:
NAME           READY   UP-TO-DATE   AVAILABLE
backend-api    3/3     3            3
frontend-web   2/2     2            2

✅ Test workloads deployed successfully!

🔄 Introducing drift for testing...
deployment.apps/backend-api scaled
  - Scaled backend-api to 5 replicas (expected: 3)
deployment.apps/frontend-web scaled
  - Scaled frontend-web to 1 replica (expected: 2)

⚠️  Drift introduced! Run drift-detector to detect and fix.
# Verify drift in Kubernetes
kubectl get deployments -n drift-test

You should see the drift:

NAME           READY   UP-TO-DATE   AVAILABLE   AGE
backend-api    5/5     5            5           10m   # ⚠️ Should be 3
frontend-web   1/1     1            1           10m   # ⚠️ Should be 2
# Run drift detector to detect and fix
./drift-detector

You should see detection and Claude analysis:

[drift-detector] 2025/09/22 12:16:15 Resource updated, triggering drift detection...
[drift-detector] 2025/09/22 12:16:15 ⚠️  DRIFT DETECTED: backend-api has 5 replicas, expected 3
[drift-detector] 2025/09/22 12:16:15 ⚠️  DRIFT DETECTED: frontend-web has 1 replica, expected 2
[drift-detector] 2025/09/22 12:16:16 🤖 Claude AI: Over-scaling detected, monthly cost impact: $180
[drift-detector] 2025/09/22 12:16:17 🔧 Applying fixes using push-upgrade pattern...
[drift-detector] 2025/09/22 12:16:18 ✅ Fixed: backend-api scaled to 3 replicas
[drift-detector] 2025/09/22 12:16:19 ✅ Fixed: frontend-web scaled to 2 replicas

🔍 Finding Fixed Drift in ConfigHub

Quick Answer: After fixes are applied, look for:

  • GUI: Spaces → your-space → Units tab → backend-api, frontend-web
  • CLI: cub unit list --space drift-detector-1758540677 --verbose
  • Downstream: Check Changesets tab or run cub unit tree --show-downstream

After drift is fixed and propagated downstream, here's where to find the updates:

ConfigHub Web UI (SaaS)

  1. Navigate to: https://hub.confighub.com
  2. Go to your space: Spaces → drift-detector-1758540677

In the Units Tab:

  • Look for updated units like backend-api and frontend-web with corrected configurations
  • Status should show Applied with green checkmark
  • Click on a unit to see:
    • Data tab: The corrected configuration (e.g., replicas: 3)
    • History tab: When the fix was applied
    • Downstream tab: Which environments received the update

In the Changesets Tab:

  • Find recent changesets showing:
    • "Fixed backend-api replicas: 5 → 3"
    • "Push-upgrade to downstream environments"
  • Click on a changeset to see all affected units

In the Live State Tab:

  • Real-time status of deployments
  • Should show "In Sync" after fixes are applied
  • Green indicators for healthy resources

In the Downstream View:

  • Click the "Downstream" button on any fixed unit
  • You'll see a tree showing propagation:
    drift-detector-1758540677 (origin)
    ├── qa-space (updated)
    ├── staging-space (updated)
    └── prod-space (pending approval)
    

ConfigHub CLI Commands

# 1. View the updated units in your space
cub unit list --space drift-detector-1758540677 --verbose

You should see the existing units have been updated (not new units created):

NAME              SPACE                        STATUS    LAST-CHANGE
k8s-target        drift-detector-1758540677    NoLive    Initial configuration
backend-api       drift-detector-1758540677    Applied   Fixed replicas: 5 → 3
frontend-web      drift-detector-1758540677    Applied   Fixed replicas: 1 → 2
# 2. View the corrected configuration
cub unit get-data backend-api --space drift-detector-1758540677

You should see the corrected manifest:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: backend-api
spec:
  replicas: 3  # ← Fixed from 5
  selector:
    matchLabels:
      app: backend-api
# 3. Check downstream propagation status
cub unit get backend-api --space drift-detector-1758540677 --json | jq '.DownstreamUnits'

You should see downstream unit IDs:

[
  "uuid-qa-backend-api",
  "uuid-staging-backend-api",
  "uuid-prod-backend-api"
]
# 4. View the propagation tree across all spaces
cub unit tree --node=unit --filter backend-api --space '*' --show-downstream

You should see the full propagation tree:

NODE                          SPACE                            STATUS    DOWNSTREAM
└── backend-api               drift-detector-1758540677        Applied
    ├── backend-api           qa-space                         Applied   (inherited)
    ├── backend-api           staging-space                    Applied   (inherited)
    └── backend-api           prod-space                       Pending   (awaiting approval)
# 5. Check changeset history
cub changeset list --space drift-detector-1758540677 --limit 5

You should see recent changes:

CHANGESET-ID    DESCRIPTION                                   UNITS    TIMESTAMP
cs-001          Fixed backend-api replicas: 5 → 3            1        2025-09-22T12:16:18Z
cs-002          Fixed frontend-web replicas: 1 → 2           1        2025-09-22T12:16:19Z
cs-003          Push-upgrade to downstream environments      6        2025-09-22T12:16:20Z
# 6. Verify fixes were applied to Kubernetes
cub unit get-live-state backend-api --space drift-detector-1758540677

You should see the live state is now in sync:

RESOURCE              EXPECTED    ACTUAL    STATUS
Deployment/backend-api
  replicas            3          3         ✅ In Sync
  image               nginx      nginx     ✅ In Sync
  cpu requests        100m       100m      ✅ In Sync
  memory requests     64Mi       64Mi      ✅ In Sync
# 7. Check which downstream spaces need the update
cub space list --filter "UpstreamSpaceID = 'drift-detector-1758540677'"

You should see downstream spaces:

NAME           UPSTREAM                      UPGRADE-NEEDED    STATUS
qa-space       drift-detector-1758540677     No               Updated
staging-space  drift-detector-1758540677     No               Updated
prod-space     drift-detector-1758540677     Yes              Pending Approval
# 8. View the complete audit trail
cub audit list --space drift-detector-1758540677 --filter "Action = 'BulkPatchUnits'"

You should see the audit log:

TIMESTAMP              USER              ACTION           DETAILS
2025-09-22T12:16:17Z   drift-detector   BulkPatchUnits   Fixed 2 units with Upgrade=true
2025-09-22T12:16:20Z   drift-detector   PropagateDown    Pushed to 3 downstream spaces

What It Detects

Currently detects drift in:

  • Deployment replica counts
  • Container images
  • Resource requests/limits
  • Service ports
  • ConfigMap data

Expected Output Examples

Demo Mode Output

$ ./drift-detector demo

🚀 DevOps as Apps - Drift Detector Demo
=====================================

📋 Step 1: Initialize ConfigHub Resources
   ✅ Created space: drift-detector
   ✅ Created set: critical-services
   ✅ Created filter: Labels['tier'] = 'critical'

🔍 Step 2: Discover Critical Services Using Sets and Filters
   Found 3 critical units to monitor:
   - backend-api (critical)
   - frontend-web (critical)
   - database-postgres (critical)

⚠️  Step 3: Detect Configuration Drift
   Detected 2 drift items:
   - backend-api [Deployment/backend-api]: spec.replicas expected=3, actual=5
   - frontend-web [Deployment/frontend-web]: spec.replicas expected=2, actual=1

🤖 Step 4: Claude AI Analysis
   Summary: Critical services have replica count mismatches. Backend is over-scaled
            (5 vs 3 expected), frontend is under-scaled (1 vs 2 expected).
            This affects performance and cost efficiency.
   Proposed fixes: 2
   - backend-api: Scale down from 5 to 3 replicas to reduce cost ($180/month)
   - frontend-web: Scale up from 1 to 2 replicas to ensure high availability

🔧 Step 5: Apply Fixes Using Push-Upgrade Pattern
   📝 Patching backend-api: /spec/replicas = 3
   📝 Patching frontend-web: /spec/replicas = 2
   ✅ Applied bulk patch with Upgrade=true (push-upgrade)
   ✅ Changes propagated downstream to dependent environments

Real-Time Detection Output

$ ./drift-detector

[drift-detector] 2025/09/22 12:15:37 Initializing ConfigHub resources...
[drift-detector] 2025/09/22 12:15:38 Created space: drift-detector-1758540677
[drift-detector] 2025/09/22 12:15:39 Created set: critical-services
[drift-detector] 2025/09/22 12:15:40 Created filter: WHERE Labels.tier = 'critical'
[drift-detector] 2025/09/22 12:15:41 Starting informers for Deployments, StatefulSets, DaemonSets
[drift-detector] 2025/09/22 12:15:42 Informers started, watching for changes...

[drift-detector] 2025/09/22 12:16:15 Resource updated, triggering drift detection...
[drift-detector] 2025/09/22 12:16:15 ⚠️  DRIFT DETECTED: backend-api has 5 replicas, expected 3
[drift-detector] 2025/09/22 12:16:15 ⚠️  DRIFT DETECTED: frontend-web has 1 replica, expected 2
[drift-detector] 2025/09/22 12:16:16 🤖 Claude AI: Over-scaling detected, monthly cost impact: $180
[drift-detector] 2025/09/22 12:16:17 🔧 Applying fixes using push-upgrade pattern...
[drift-detector] 2025/09/22 12:16:18 ✅ Fixed: backend-api scaled to 3 replicas
[drift-detector] 2025/09/22 12:16:19 ✅ Fixed: frontend-web scaled to 2 replicas
[drift-detector] 2025/09/22 12:16:20 ✅ Push-upgrade complete, downstream environments updated

Key Features Implemented

ConfigHub Integration

  • Creates Spaces for organization
  • Uses Sets to group critical services
  • Filters with WHERE clauses for targeting
  • BulkPatchUnits with Upgrade=true (push-upgrade pattern)
  • Live State monitoring

Kubernetes Monitoring

  • Event-driven informers (not polling)
  • Monitors Deployments, StatefulSets, DaemonSets
  • Real-time drift detection on Add/Update/Delete events

AI Analysis

  • Claude integration for intelligent drift analysis
  • Cost impact calculations
  • Availability risk assessment
  • Automated fix recommendations

Monitoring & Dashboards

  • Real-time HTML dashboard
  • ConfigHub CLI commands for inspection
  • Web UI navigation support
  • Comprehensive logging

Production Ready

  • Health checks on /health endpoint
  • Metrics on /metrics endpoint
  • Proper error handling and retries
  • Kubernetes RBAC and service accounts

Documentation Quality Standards

Documentation Code is Production Code:

All cub commands in this README and QUICKSTART.md must be validated before changes are committed:

# 1. Run Mini TCK (environment check)
curl -fsSL https://raw.githubusercontent.com/monadic/devops-sdk/main/test-confighub-k8s | bash

# 2. Validate all cub commands in documentation
curl -fsSL https://raw.githubusercontent.com/monadic/devops-sdk/main/cub-command-analyzer.sh | bash -s -- .

Users copy-paste commands from docs. Invalid examples waste hours of debugging time.

Implementation Characteristics

The drift detector is implemented as a persistent Kubernetes application with the following characteristics:

  • Runs continuously, not just when triggered
  • Uses Kubernetes informers for event-driven response
  • Integrates Claude AI for drift analysis
  • Provides built-in dashboard and ConfigHub UI integration
  • Follows standard Kubernetes deployment patterns
  • Full source control and testing support
  • Version control and rollback capability
  • Monitoring with standard Kubernetes tools
  • Horizontal scaling support
  • Zero-downtime updates
  • Custom logic extensibility