Matt – AWS, Kubernetes, Terraform, experts in Lemon.io

Matt

From United Kingdom (UTC+1)

Platform Engineer|Strong senior

DevOps|Strong senior

Site Reliability Engineer|Strong senior

Tech lead

Skills and seniority verified on Jul 1, 2026

Matt – AWS, Kubernetes, Terraform

Matt is a Staff-level Platform and SRE Engineer with extensive experience designing cloud-native platforms on Kubernetes, AWS, and GCP using Terraform and Go. He has led large-scale infrastructure migrations, developed Kubernetes operators, and built internal developer platforms that improve scalability and engineering productivity. Matt combines hands-on technical expertise with strong leadership, mentoring engineers and collaborating effectively with senior stakeholders across engineering and product. He excels at modernizing legacy infrastructure, driving reliability initiatives, and solving complex distributed systems challenges.

16 years of commercial experience in

AI

Cloud computing

Data analytics

Edtech

Healthtech

Management

Networking

Product management

Collaboration tools

Dev tools

Platforms

SaaS

Video streaming software

Software development

Main technologies

AWS

10 years

Kubernetes

7 years

Terraform

8 years

GCP

7 years

Python

6.5 years

Ansible

6.5 years

Linux

14 years

Additional skills

Grafana

CI/CD

Chef

Docker

Ceph

Direct hire

Possible

Ready to get matched with vetted developers fast?

Let’s get started today!

Experience Highlights

Staff Site Reliability Engineer

Apr 2026 - Jul 20263 months

Project Overview

Worked on internal tooling, management, and reliability of multi-petabyte ClickHouse clusters, including migrating Kafka-fed ingestion clusters processing 100,000s of events per second from EC2 to Kubernetes.

Responsibilities:

Led the migration of EKS-based deployments to EC2;
Served as the primary developer of an internal ClickHouse query proxy and routing layer;
Participated in the on-call rotation, supporting all ClickHouse clusters and ingestion pipelines.

Project Tech stack:

ClickHouse

AWS

Kubernetes

Ansible

Python

Golang

Amazon EC2

Amazon S3

Docker

Terraform

Terragrunt

GitHub

GitHub Actions

Prometheus

Grafana

PostgreSQL

Apache Kafka

Dagster

Amazon RDS

Staff Platform Engineer

Oct 2025 - May 20266 months

Project Overview

Served as European Tech Lead for the Platform Infrastructure team, leading the development of a Go-based Kubernetes operator for automated management of Neo4j clusters, databases, users, and infrastructure capacity.

Responsibilities:

Served as the European Tech Lead for the Platform Infrastructure team;
Designed and led the development of a production-grade Kubernetes operator for managing Neo4j clusters and databases;
Led the re-architecture of Kubernetes cluster deployments and management using Argo CD;
Defined platform engineering strategy in collaboration with other tech leads, product management, and senior stakeholders;
Partnered with product management, senior stakeholders, and engineering leadership to strengthen team trust, psychological safety, and cross-functional collaboration;
Mentored senior engineers.

Project Tech stack:

Neo4j

Kubernetes

Golang

AWS

GCP

Terraform

Terragrunt

Amazon EC2

GitHub

GitHub Actions

Jira

Prometheus

Grafana

PostgreSQL

Apache Kafka

gRPC

Lead Platform Engineer

Oct 2024 - Oct 20251 year

Project Overview

Led the development of a Kubernetes-native internal developer platform supporting on-premises and cloud deployments, including secrets management, IAM, observability, developer tooling, GitHub-based authentication, and application containerization.

Responsibilities:

Spearheaded the introduction of Platform Engineering as the founding engineer and lead of the Platform team;
Owned the team roadmap and identified key pain points and areas for Platform support;
Established mTLS-encrypted network meshing between cloud-based Kubernetes clusters and on-prem/colocated bare metal hardware;
Led the implementation and delivery of new platform solutions as the project owner;
Managed and mentored engineers on the Platform team;
Deployed secrets management solutions via OpenBao, including supporting tooling and integrations;
Centralized identity management via Keycloak as a primary auth solution for teams and services;
Delivered Kubernetes deployments as-a-service;
Created supporting tooling projects primarily in Go, including a templating engine for configuration files and secrets setups.

Project Tech stack:

Kubernetes

Golang

Terraform

Grafana

Vault

AWS

Microservices

Prometheus

Ansible

.NET

C#

Python

GitHub

GitHub Actions

DigitalOcean

ClickHouse

MariaDB

CDN

InfluxDB

Nginx

Platform Tech Lead

May 2023 - Oct 20241 year 5 months

Project Overview

Architected and delivered a customer-facing control plane for dedicated PostgreSQL deployments while leading the migration of microservices from AWS ECS to Kubernetes as part of a cell-based platform rearchitecture.

Responsibilities:

Owned the roadmap for the Platform team;
Liaised with other teams to promote best practices and solve workflow pain points;
Spearheaded a year-long project developing a control plane for dedicated database hardware on Kubernetes;
Created a custom Kubernetes operator handling custom event emission and routing;
Established dynamic network and routing flows with Istio and supporting tooling;
Supported the migration of existing services from ECS to Kubernetes;
Contributed to mentoring, cost optimization, SLI/SLOs, alerting, on-call, CI/CD assessment and expansion, and hiring roadmaps;
Delivered infrastructure as code with Pulumi and Terraform.

Project Tech stack:

Kubernetes

Terraform

Pulumi

Golang

CI

CD

AWS

Amazon EC2

Amazon RDS

Amazon S3

Amazon SQS

Microservices

API Gateway

Typescript

GitHub

GitHub Actions

PostgreSQL

Apache Kafka

ElasticSearch

Cloud Architecture

DevOps

Nginx

gRPC

Staff Platform Engineer

Jan 2023 - Apr 20232 months

Project Overview

Led the migration of Ruby-based clinical services from AWS ECS to EKS and implemented Argo CD-based continuous delivery, environment promotion, and developer tooling for secrets management.

Responsibilities:

Served as the technical lead for the Platform Engineering team;
Helped define and establish the Platform Engineering technical roadmap;
Improved key pain points in the SDLC, infrastructure management, and reliability architectural review;
Led secrets management improvements;
Set the technical direction and excellence for the team;
Created a proposal for migration to Kubernetes;
Executed and led the Kubernetes migration as the primary SME;
Improved CI/CD strategy and execution;
Rearchitected secrets management;
Moved the engineering organization towards a GitOps model of operation.

Project Tech stack:

Kubernetes

Terraform

CI

CD

Grafana

Amazon ECS

Amazon EC2

Amazon S3

Golang

Ruby

Ruby on Rails

Jira

Confluence

Nginx

Senior SRE, Delivery:Orchestration

May 2022 - Nov 20225 months

Project Overview

Enhanced and optimized delivery pipelines for self-managed and SaaS platform offerings, improving the safety, efficiency, and scalability of feature delivery.

Responsibilities:

Created workflows, frameworks, architecture, and automation for Engineering teams to reach production effectively and efficiently;
Enabled developer groups to adopt self-serve deployments and releases through tooling creation and product contribution;
Collaborated extensively with multiple internal teams in asynchronous and synchronous forms;
Used Kubernetes, Chef, Terraform, Prometheus, Thanos, Grafana, and GitLab CI/CD capabilities in depth.

Project Tech stack:

Kubernetes

Chef

Terraform

Prometheus

Grafana

CI

CD

Ruby

GitLab

GitLab CI

CD

Amazon EC2

GCP

GCP Compute Engine

Director of Core Engineering

May 2021 - Apr 202210 months

Project Overview

Served as Founding SRE and later Director at an early-stage startup, building the observability platform, establishing SLIs/SLOs for critical user journeys, and leading engineering organization growth and team transformation.

Responsibilities:

Joined as the founding member of a new SRE team responsible for reliability, operational excellence, and production readiness of platform services and infrastructure;
Expanded the team to include principal engineers across stack disciplines;
Introduced Honeycomb as the primary observability platform;
Led workshops on Honeycomb and worked with cross-functional teams to design dashboards and representative SLOs;
Created a proof-of-concept Cortex-backed Prometheus and Grafana deployment on Kubernetes;
Oversaw the management and administration of Kubernetes clusters running in GKE;
Introduced Jsonnet and Tanka for Kubernetes resource declaration and management;
Brought GCP configuration and resources under Terraform management from scratch;
Wrote and introduced a production readiness review stage for new services and larger feature sets;
Oversaw the Core Engineering team across SRE, development, and security after promotion.

Project Tech stack:

Kubernetes

GCP

Terraform

Grafana

Prometheus

Typescript

MongoDB

PostgreSQL

JSON

Nginx

Principal SRE

Apr 2017 - May 20214 years 1 month

Project Overview

Led platform engineering initiatives across observability, security, deployments, performance, tooling, and infrastructure during SaaS expansion from 7 to over 50 cloud regions across four cloud providers, establishing the foundation of the observability team and leading an internal guild of over 30 engineers driving reliability improvements.

Responsibilities:

Worked on performance testing, CDN migrations, and extensive Terraform work;
Served as a founding member of the Cloud Observability Team;
Took responsibility for internal logging and metrics ingest pipelines, APM integrations, and dashboard and alerting creation and curation;
Led the ZooKeeper Guild, a cross-functional group of over 30 engineers;
Created a metrics, monitoring, and alerting strategy that became the foundation of the Cloud Observability team;
Became a subject-matter expert on ZooKeeper and drove platform stability improvements;
Helped move from a single centralised logging and metrics ingest solution to over 30 Elasticsearch deployments across multiple regions using Beats and Logstash;
Planned and executed a migration from Akamai to Google Cloud CDN;
Co-developed a dedicated standalone testing environment for a primary datastore using Terraform, Java, and Clojure;
Mentored less senior engineers and collaborated closely on roadmap planning and cross-team efforts.

Project Tech stack:

Terraform

AWS

GCP

Microsoft Azure

Grafana

Docker

Python

Scala

Java

Ansible

Vault

ElasticSearch

Logstash

Kibana

Clojure

Golang

Nginx

Systems Engineering Lead

Dec 2014 - Apr 20172 years 3 months

Project Overview

Served as Lead Engineer, managing a team of 6+ Systems Engineers developing reusable Terraform and Puppet modules in collaboration with Architecture for a heavily regulated healthtech environment.

Responsibilities:

Managed and mentored team members;
Provided technical leadership for a team of 5+ systems engineers and supported hiring and team growth;
Created and expanded team standards for software quality, security, and shared responsibility;
Introduced infrastructure as code and orchestration using Terraform for AWS infrastructure and account configuration;
Completed extensive security reviews of infrastructure, systems, and software;
Architected and executed large migration projects, including the introduction of microservices architectures;
Introduced CI through Jenkins and static analysis with CheckMarx;
Re-architected existing software and systems for improved reliability and stability;
Migrated a large development team to Vagrant-managed VMs on personal laptops;
Created standardized log aggregation and monitoring with a central Elasticsearch deployment running on Docker;
Improved Solr architecture by moving to a ZooKeeper-managed multi-node deployment;
Optimized AWS account performance and cost;
Performed advanced system administration across Linux and Windows hosts;
Automated infrastructure lifecycles, deployments, support tasks, and security operations across AWS accounts and products.

Project Tech stack:

AWS

Terraform

Jenkins

Docker

ElasticSearch

Ansible

Puppet

Apache Solr

MySQL

PostgreSQL

RabbitMQ

Redis

Python

Bash

Vagrant

Vault

Nginx

Keep in mind, the experience summary might exclude non-relevant projects

Education

2010

Theology

BA (Hons)

Languages

English

Advanced

Hire Matt or someone with similar qualifications in days

All developers are ready for interview and are are just waiting for your request

Copyright © 2026 lemon.io. All rights reserved.

Terms of use Privacy policy