Senior DevOps & Platform Infrastructure Engineer

I build the infrastructure
cities run on.

And I build it sovereign — infrastructure you own, open source end to end, with no hyperscaler dependency and no proprietary lock-in.

Owned, bare-metal infrastructure · Linux + Kubernetes + GitOps · 100,000+ IoT sensors in production · zero vendor lock-in

I'm Belhadj Kessas. I design and operate digital infrastructure that organizations truly own — Linux, Kubernetes and GitOps on bare metal they control, open source from the kernel up. The flagship proof: one of France's largest municipal Smart City deployments, at Montpellier Méditerranée Métropole, serving 500,000+ citizens — 100,000+ IoT sensors in production across air quality, waste management, water and energy metering, and mobility. Not a pilot, not a demo. A live city, running on infrastructure it owns.

Email me GitHub LinkedIn

Montpellier, France · CKA — Certified Kubernetes Administrator, in progress (CNCF, 2026)

100,000+ IoT sensors in production

500,000+ citizens served by the platform

100% open source, owned end to end

0 hyperscaler dependencies

4 Kubernetes clusters, GitOps-managed

1 min from silent gateway to alert fired

Owned, not rented

Migrated off the cloud, onto infrastructure we own

I moved this platform off managed cloud — from Docker containers on rented VMs to a fully on-premise, GitOps-driven multi-cluster architecture the organization owns outright. A deliberate sovereignty decision: the data lives on infrastructure we control, every change is auditable in Git, and the entire stack is open source, from the Linux kernel up.

Today it's four bare-metal Kubernetes clusters with every layer declared in Git. A git push triggers a self-hosted runner that calls the hypervisor API and stands up a complete cluster — RKE2, Cilium, MetalLB, ArgoCD — with zero manual steps. Destroy it, push again, get an identical one back.

No hyperscaler dependency. No proprietary lock-in. Infrastructure you own, can audit, and can operate without me.

commit → cluster

$ git push origin main
→ self-hosted runner picks up the pipeline
→ hypervisor API provisions bare-metal VMs
→ RKE2 bootstraps the cluster
→ Cilium · MetalLB · ArgoCD roll out
✓ complete cluster — 0 manual steps

engineer

git push

GitLab CI

OpenTofu + Ansible

hypervisor API

RKE2 cluster

ArgoCD sync

running apps

Edge to cloud

One platform, sensor to dashboard

sense 100k+ LoRaWAN sensors air quality · street lighting · water & energy · mobility

backhaul City-wide gateways RF planning at city scale · VLAN-segmented transport

ingest LoRaWAN network core multi-tenant network server on K8s · automated device provisioning · OTA at fleet scale

run 4× bare-metal Kubernetes RKE2 · Cilium · MetalLB · ArgoCD · Rook-Ceph

serve Data & dashboards MQTT → queues → time-series · Grafana · partner self-service via Git

production RKE2 on VMware vSphere the live city workload — 100k+ sensors, every citizen-facing service

pre-production RKE2 on Proxmox identical promotion target — changes prove themselves here first

monitoring Dedicated 3-node RKE2 the observability substrate, on its own failure domain — detailed below

gpu edge lab Jetson Orin cluster YOLO computer-vision inference experiments, GPU-aware scheduling — building toward MLOps on sovereign hardware

Shipped 2026

Observability with its own cluster

In 2026 I shipped a centralized, multi-tenant observability platform on a dedicated three-node cluster: Mimir for long-term metrics and Loki for logs, both backed by Rook-Ceph S3 object storage. Grafana Alloy collectors on production and pre-production remote-write into it, separated into three isolated tenants — production, pre-production, and the monitoring cluster watching itself. One Grafana federates every data source.

Provisioned the same way as everything else — OpenTofu → Ansible → GitLab CI → RKE2 → ArgoCD — and fed by real production traffic from 100,000+ sensors.

The architectural point is separation of concerns: the observability substrate lives on its own cluster, so losing a monitored cluster never means losing the ability to see it.

Grafanaone pane, every data source

Mimirlong-term metrics

Lokilogs

Rook-Ceph S3object storage

↑ remote-write

Alloy@ production

Alloy@ pre-production

3 isolated tenants: prod · pre-prod · monitoring itself

What I master

Depth where it counts

Kubernetes & platform engineering

RKE2/Rancher multi-cluster on bare metal — production on VMware vSphere, pre-production on Proxmox. No managed control plane, no cloud dependency. MetalLB, Rook-Ceph, Cilium (eBPF), Envoy Gateway, Helm.

CKA in progress — CNCF, expected 2026

GitOps & infrastructure as code

ArgoCD, GitLab CI/CD, OpenTofu, Terraform, Ansible. Full lifecycle as code: cluster provisioning, app rollout, drift detection, environment promotion. Built a self-service platform where external partners deploy via Git without ever touching cluster internals.

Owned, auditable, hyperscaler-free

Large-scale IoT & networking

End-to-end LoRaWAN at city scale: RF planning, gateway deployment, VLAN segmentation, a multi-tenant LoRaWAN network server on Kubernetes. Automated device provisioning and OTA updates for a 100k+ fleet. Edge-to-cloud pipelines that survive partial outages.

Observability & SRE

Designed a dual-layer telemetry stack from zero — Zabbix outside the clusters; a centralized multi-tenant Mimir + Loki platform with Grafana Alloy collectors inside, on its own dedicated cluster. Proactive degradation thresholds, not post-failure alarms.

Detection in minutes, not days

Programming & emerging

Python (Django), Rust, Bash. Computer vision on the GPU edge lab: YOLO inference experiments on a Jetson Orin cluster, GPU-aware Kubernetes scheduling. Prototyping on-prem Kubeflow for AI-assisted log analysis.

Proof, not promises

Independently verifiable

Cerema — French state institution, Ministry of Ecological Transition Featured in the national Smart City Data Governance case study smart-city.cerema.fr ↗ LoRa Alliance — the global LoRaWAN standards body Listed as Technical Contact for Montpellier Méditerranée Métropole lora-alliance.org ↗ Open source Production-ready Helm charts for running a LoRaWAN network stack on Kubernetes — published so other municipalities can reproduce the architecture github.com/beladjioo ↗

One minute, not days

T+0:00 A LoRa gateway goes silent — physical power failure.
T+1 min Monitoring fires. Not when sensor data goes missing — when the gateway stops responding.
Same hour A technician is on site and finds the fault — resolved before the data gap mattered.

“The real measure of an observability platform isn't how many incidents it helps resolve — it's how many it prevents.”

Beyond the day job

Radio is the passion

openhertz.org — personal project · 100% free & open source OpenHertz — learn radio by doing it, with a real SDR Radio is my passion, and OpenHertz is what I built with it: a gamified platform where you plug in a $30 RTL-SDR and decode live FM and ADS-B aircraft entirely in the browser — WebUSB, a Rust and TypeScript DSP engine, a bilingual learning library, and amateur-radio exam prep. Designed, built, and operated end to end: the curriculum, the app, and the infrastructure it runs on. openhertz.org ↗

Contact

Let's talk.

If you're building infrastructure people depend on — or you just want to talk radio and Kubernetes — I'd like to hear from you.

contact@belhadj.dev

linkedin.com/in/belhadj-k github.com/beladjioo

I build the infrastructurecities run on.