Hi, I'm Debasish Mohanty

Kubernetes Platform Engineer — Autoscaling & Distributed Systems

I build control-plane systems that make decisions under real production constraints — autoscaling, scheduling behavior, and system reliability.

Contributor to Kubernetes Autoscaler · Falco · Helm

Kubernetes Control Plane · Autoscaling Systems · Distributed Systems · System Reliability · Observability

View Systems I Built View Resume

About Me

I build control-plane systems for cloud-native infrastructure — where decisions are made under real production constraints.

My focus is system correctness: autoscaling behavior, scheduling interactions, and reliability — not just deploying infrastructure.

I’ve contributed to Kubernetes Autoscaler and Falco, fixing production-impacting bugs in scaling behavior and runtime system correctness, now merged into upstream release branches.

I build systems that operate under real-world conditions:

Autoscaling decisions under noisy and delayed metrics
Failure handling: retries, idempotency, and backpressure
Control-plane logic using CRDs, controllers, and reconciliation loops

I build systems that behave correctly under load — not just systems that “work”.

Control Plane & Distributed Systems

Production-grade systems focused on autoscaling, control-plane behavior, and distributed system reliability

SmartScaler — Kubernetes Autoscaling Operator

Kubernetes-native autoscaling operator implementing a deterministic multi-signal control loop across CPU utilization, node pressure, and cost for stable scaling decisions.

Deterministic autoscaling · Metric smoothing & staleness filtering · Conflict-safe scaling with retries

GoKubernetesCRDsControllersPrometheusDocker

Code

KubeJobs — Distributed Job Processing System

Fault-tolerant distributed job system built on Redis Streams with reliable execution, retry semantics, and worker recovery.

Reliable execution under failure · Backpressure control · DLQ + retry handling

GoRedis StreamsKubernetesPrometheusDocker

Code

Kubernetes Autoscaler — Control Plane Contribution

Fixed production-impacting scaling bug in Kubernetes Autoscaler where node groups ignored configured minSize, causing silent under-provisioning.

Merged upstream · Backported to release branches · Restored correct scaling behavior

GoKubernetesAutoscalingControl Plane

Code

Open Source Impact

Contributions to Kubernetes control-plane systems, autoscaling behavior, and distributed system reliability

Kubernetes Autoscaler (kubernetes/autoscaler)

Fixed control-plane bug where node groups ignored configured minSize, causing silent under-provisioning in production clusters.

Restored correct scaling behavior · Merged upstream · Backported to release branches (1.32, 1.33)

View Contribution

Kubernetes Autoscaler (VPA Helm)

Restored missing */scale RBAC in VPA Helm chart, enabling admission controller to correctly resolve CRD selectors.

Unblocked autoscaling behavior in Helm-based deployments · Fixed broken control-plane interaction

View Contribution

Falco (falcosecurity/libs)

Fixed integer overflow in memory telemetry (VMSIZE/VMRSS) affecting processes >4GB.

Restored correctness of runtime security metrics · Eliminated overflow edge cases · Merged into production releases

View Contribution

Falco (falcosecurity/libs)

Resolved file descriptor leak in libscap by enforcing error-path cleanup in syscall-level thread inspection.

Prevented resource leaks in long-running agents · Improved runtime stability

View Contribution

Helm (helm/helm)

Improved OCI registry client correctness by aligning documentation with actual authentication and request behavior.

Reduced ambiguity in registry interactions · Improved developer reliability

View Contribution

Professional Experience

Building control-plane systems, autoscaling logic, and distributed infrastructure for real-world production environments

Open Source Engineer — Kubernetes Control Plane & Systems

Cloud Native Computing Foundation (CNCF)

Jan 2026 – Present

Remote

•Contributed to Kubernetes Autoscaler, Falco, and Helm, focusing on control-plane correctness and system reliability
•Fixed Kubernetes Autoscaler bug where node groups ignored configured minSize, causing silent under-provisioning in production clusters
•Restored missing */scale RBAC in VPA Helm chart, enabling admission controller to correctly resolve CRD selectors
•Eliminated integer overflow in Falco memory telemetry (VMSIZE/VMRSS) using 64-bit arithmetic, restoring correctness for >4GB processes
•Resolved file descriptor leak in Falco libscap by enforcing error-path cleanup, preventing resource exhaustion in long-running agents
•Improved Helm OCI registry client correctness by aligning documentation with actual authentication and request behavior
•All contributions merged upstream and included in production release branches after maintainer review and CI validation

Cloud Infrastructure Engineer

TechEazy Consulting

Jun 2025 – July 2025

Remote

•Provisioned AWS infrastructure using Terraform (VPC, EC2, IAM, S3) with environment isolation
•Designed CI/CD pipelines enabling automated dev → production deployment promotion
•Implemented deployment validation using health checks and retry mechanisms
•Optimized cloud costs via automated EC2 lifecycle management and scheduled shutdown systems
•Managed environment-specific configurations using Terraform workspaces

Cloud Infrastructure & Platform Engineer

Elevate Labs

May 2025 – Jun 2025

Remote

•Implemented Istio-based canary deployments with traffic splitting and progressive rollout strategies
•Built GitOps workflows using ArgoCD for automated synchronization and deployment consistency
•Designed full observability stack using Prometheus, Grafana, Loki, and Jaeger
•Implemented alert-driven self-healing systems using Prometheus alerts and automation scripts
•Worked across containerized environments using Kubernetes (K3s/Minikube) and Docker

Skills & Expertise

Control-plane systems, autoscaling behavior, and distributed system reliability in cloud-native environments

Kubernetes Control Plane

Designing and debugging control-plane components including autoscaling, controllers, and reconciliation loops

KubernetesCRDs & ControllersAutoscalingReconciliation LoopsRBAC & Admission Control

Distributed Systems

Designing systems with failure handling, retries, and consistency guarantees under real-world conditions

Distributed SystemsRetry & BackoffIdempotencyBackpressure HandlingFault Tolerance

Systems Debugging & Reliability

Debugging production-scale systems and resolving correctness issues in control-plane and runtime behavior

Systems DebuggingPerformance AnalysisMemory & Resource ManagementFailure AnalysisLinux Internals

Observability

Monitoring and analyzing system behavior using metrics, logs, and tracing

PrometheusGrafanaJaegerOpenTelemetryMetrics & Logging

Programming & Systems Engineering

Building backend systems and infrastructure components for cloud-native and distributed environments

GoPythonCREST APIsSystem Design

Autoscaling Systems

Designing and implementing autoscaling strategies based on multi-signal metrics and control loops

Horizontal ScalingMetric SmoothingScaling PoliciesCooldown HandlingCost-Aware Scaling

Resume

Explore my work in Kubernetes control-plane systems, autoscaling behavior, and distributed system reliability

Kubernetes Control Plane Engineer — Autoscaling & Distributed Systems

Kubernetes Autoscaling · Control Plane Systems · Distributed Systems

Key Highlights

• Fixed Kubernetes Autoscaler bug causing silent under-provisioning due to ignored minSize (merged & backported)
• Restored missing RBAC in VPA Helm chart, enabling correct admission controller behavior
• Fixed integer overflow in Falco memory telemetry (>4GB processes), restoring correctness of runtime metrics
• Built SmartScaler — deterministic Kubernetes autoscaling operator using multi-signal control loops

Core Focus

• Kubernetes Control Plane (CRDs, controllers, autoscaling)
• Distributed Systems (retries, backoff, idempotency, failure handling)
• Systems Debugging & Reliability
• Observability (metrics, logs, tracing)

Download Resume

Get in Touch

Let’s discuss control-plane systems, autoscaling strategies, or distributed system challenges

Email

debasishm8765@gmail.com

Phone

+91 6372692682

Location

Bhubaneswar, Odisha (India)