Now

What I'm up to

Last updated: 8 March 2026

🔧 Working on

Hardening a Kubernetes-based data platform on AWS EKS — improving observability with OpenTelemetry and Grafana stack
Migrating batch pipelines to Apache Kafka streaming architecture
Evaluating Apache Iceberg as a table format migration path from legacy Hive-based storage
Building this site — documenting the journey publicly for the first time

📚 Learning

Kubeflow — getting hands-on with ML pipeline orchestration on Kubernetes
MLflow — experiment tracking and model registry patterns in production
Apache Flink — stateful stream processing beyond Kafka consumer groups
Terraform modules — making infrastructure reusable across environments

✍️ Writing about

The real operational cost of running a Cloudera CDP cluster vs cloud-native alternatives
Dremio as a query engine for a data lakehouse — when it works, when it doesn't
A practical guide to Apache Iceberg table maintenance at scale

💭 Thinking about

The gap between "ML platform" and "platform that can run ML workloads" — they are not the same thing
How much platform complexity is justified before the platform becomes the product
Writing more publicly — the friction of perfectionism vs the value of shipping