Logo
Mateus – Kubernetes, Terraform, Ansible, experts in Lemon.io

Mateus

From Brazil (UTC-3)flag

Site Reliability Engineer|Middle
DevOps|Middle-to-senior
Platform Engineer|Middle-to-senior

Mateus – Kubernetes, Terraform, Ansible

Mateus is a strong DevOps/Platform Engineer with solid hands-on experience across the full infrastructure automation stack — Terraform, Kubernetes, AWS, and Linux. He's taken real ownership in startup environments, leading re-architecture efforts and building GitOps workflows from scratch. Self-sufficient and process-minded, he doesn't just identify gaps — he closes them.

10 years of commercial experience in
AI
Architecture
Banking
Beauty
Construction
Credit and lending
E-learning
Govtech
Mental healthcare
Sports
Telecommunications
Open source
Enterprise software
Software development
AI platform
Agentic automation
Main technologies
Kubernetes
6 years
Terraform
6 years
Ansible
1 year
CI/CD
6 years
AWS
5 years
GCP
3.5 years
Python
6.5 years
Microsoft Azure
3 years
Linux
8 years
GPU
1.5 years
Additional skills
Prometheus
Grafana
Helm
Azure DevOps
SonarQube
Jenkins
ElasticSearch
Splunk
Golang
Bare metal provisioning
Nvidia GPU
SQL
Direct hire
Possible
Ready to get matched with vetted developers fast?
Let’s get started today!

Experience Highlights

DevOps Engineer
Jul 2025 - Apr 20268 months
Project Overview

An AI cluster orchestration platform for managing multiple Slurm and Kubernetes environments across several cloud providers. It provides fleet health monitoring and an alerting strategy designed to help maintain SLA targets.

Responsibilities:
  • Created and managed CI/CD pipelines in GitLab CI;
  • Used Terraform to deploy Kubernetes and Slurm clusters with hundreds of GB of NVIDIA GPUs (H100, H200, GB200, GB300);
  • Deployed Kubernetes and Slurm clusters in AWS, GCP, Azure, Nebius, and Oracle;
  • Troubleshot Linux environments;
  • Managed Slurm job operations;
  • Implemented TailScale, enabling users to access the Slurm Login Node securely;
  • Implemented ArgoCD using Kustomize for bootstrap;
  • Used Pulumi (with Python) to deploy AWS infrastructure for internal tools;
  • Integrated Grafana with Terraform to improve observability by creating dashboards for system metrics and Slurm fleet health;
  • Used Alloy to manage data pipelines from Kubernetes to Grafana Cloud;
  • Created a maintenance strategy to reduce false positive alerts through automation;
  • Implemented a Slurm exporter written in Golang to enrich observability across multiple cloud providers.
Project Tech stack:
AI
GCP
AWS
Microsoft Azure
Oracle
Firebase DB and Storage
Kubernetes
Terraform
Terragrunt
DevOps Engineer
Nov 2024 - Jul 20258 months
Project Overview

A fitness app platform focused on global delivery and cost-optimized infrastructure across Azure services. It relies on a CDN strategy and cloud architecture improvements to support user experience at scale while keeping infrastructure costs under control.

Responsibilities:
  • Managed multiple subscriptions with Terraform;
  • Modularized Terraform code for deployment across multiple environments;
  • Implemented a CI/CD pipeline in Azure DevOps to deploy Terraform and application code;
  • Improved Datadog monitoring capabilities through code (IaC);
  • Reduced tenant costs by 25% by optimizing AKS and Azure Functions;
  • Reduced database costs by 15% for Azure Cosmos DB by optimizing RU (Request Units);
  • Improved user experience by creating a CDN cache layer in Azure Front Door;
  • Used APIM to version developer APIs, integrating them with the Azure Front Door solution.
Project Tech stack:
Azure DevOps
Azure Functions
Microsoft Azure
Terraform
Kubernetes
DevOps Engineer
Feb 2022 - Jul 20231 year 5 months
Project Overview

An infrastructure and delivery platform for an international banking environment, focused on Terraform- and Bicep-based provisioning with Azure DevOps as the CI/CD tool. It supported secure, reliable, and governed infrastructure delivery across Azure and GCP.

Responsibilities:
  • Architected, implemented, and documented a disaster recovery strategy to ensure data security and reliability;
  • Produced post-mortem documentation to improve operational processes and systems;
  • Applied policies with infrastructure as code and CI/CD to streamline governance and automate compliance;
  • Applied security practices with SonarQube, Fortify, infrastructure as code, and CI/CD;
  • Facilitated agile, scalable, and efficient infrastructure management in Azure and GCP using Terraform and Bicep;
  • Implemented GitOps using ArgoCD on GKE;
  • Enhanced visibility and control over software deployments using CI/CD techniques and observability;
  • Enhanced security with automation and CI/CD techniques using Python scripts;
  • Used Azure DevOps to boost team productivity and support high-quality software releases.
Project Tech stack:
Terraform
GCP
Microsoft Azure
Kubernetes
CI
CD
Python
Azure DevOps
SonarQube

Education

2019
Computer Engineering
Bachelor's degree

Languages

English
Advanced

Hire Mateus or someone with similar qualifications in days
All developers are ready for interview and are are just waiting for your requestdream dev illustration
Copyright © 2026 lemon.io. All rights reserved.