Logo
Diego – AWS, GCP, Kubernetes, experts in Lemon.io

Diego

From United States (UTC-4)flag

DevOps|Senior
Site Reliability Engineer
Lemon.io stats
1
offers now 🔥

Diego – AWS, GCP, Kubernetes

With 8+ years in the DevOps field, Diego brings a wealth of experience in cloud infrastructure, automation, and CI/CD pipelines. He has worked with AWS, Kubernetes, and Terraform to design and manage high-performance systems while applying GitOps principles. Diego’s experience spans both enterprise and startup environments, including several AI-focused ventures. As a Team Lead, he has mentored Sysadmins and DevOps enginners and is passionate about knowledge-sharing and fostering team growth.

20 years of commercial experience in
AI
Cloud computing
Healthcare
Healthtech
Main technologies
AWS
8 years
GCP
8 years
Kubernetes
8 years
Terraform
8 years
Microsoft Azure
4 years
Additional skills
CI/CD
GitLab CI/CD
Bitbucket Pipelines
Docker
Ansible
Apache Kafka
LLM
Amazon CloudFront
RabbitMQ
Grafana
Prometheus
AWS Lambda
Java
ElasticSearch
Kafka
Amazon RDS
Terragrunt
PyTorch
Python
AI
Bash
Datadog
MLOps
AWS CloudFormation
Linux
Azure DevOps
Direct hire
Possible
Ready to get matched with vetted developers fast?
Let’s get started today!

Experience Highlights

Senior DevOps Engineer
Sep 2024 - Jan 20261 year 4 months
Project Overview

It's a startup focused on providing 100+ lab tests and expert medical insights into areas such as heart health, hormones, cancer, and more.

Responsibilities:
  • Designed and provisioned GCP infrastructure (Cloud Run, Cloud Functions, VPCs, subnets, databases) from scratch using Terraform, accelerating the startup's initial product launch.
  • Led architecture and implementation of GKE Clusters for the company's next-generation platform, migrating microservices from Cloud Run to Kubernetes.
  • Built multi-environment CI/CD pipelines (Cloud Build + GitHub Actions) reducing deployment time and improving security posture across all environments.
  • Initiated a multi-region viability study to support the company's global expansion roadmap.
Project Tech stack:
GCP
Terraform
RabbitMQ
Python
PyTorch
Senior DevOps Engineer (part-time)
May 2024 - Jan 20258 months
Project Overview

Migration from On-Primeses to Azure.

Responsibilities:
  • Designed and delivered a production-grade Kubernetes platform on top of the existing VMware environment, replacing ad-hoc VM management with a repeatable, automated provisioning pipeline.
  • Authored Terraform configurations to provision and lifecycle-manage VMware VMs (compute, memory, disk, networking) used as Kubernetes control-plane and worker nodes — enabling consistent, version-controlled infrastructure from day one.
  • Developed and maintained Ansible roles to fully automate the installation and configuration of RKE2 (Rancher Kubernetes Engine 2) clusters, covering node bootstrapping, CNI setup, TLS configuration, and post-install hardening.
  • Structured Ansible inventory and role hierarchy to support multi-environment deployments (dev, staging, production) from a single, parameterised codebase — minimising configuration drift across clusters.
  • Integrated RKE2 clusters with Keycloak as the OIDC identity provider, enabling centralised SSO-based authentication; configured API server OIDC flags and designed Keycloak realms, clients, and group-to-RBAC-role mappings fully automated via Ansible.
Project Tech stack:
Azure DevOps
Microsoft Azure
Kubernetes
CI
CD
Senior DevOps Engineer
Sep 2023 - Aug 202411 months
Project Overview

It's an AI-powered enterprise analytics platform that enables seamless automation and unification of workflows.

Responsibilities:
  • Architected and delivered end-to-end deployment of Large Language Models on AWS EKS, covering model serving, containerization, Helm chart authoring, and production rollout across multiple environments.
  • Configured GPU-enabled node groups and instance type selection on EKS to match model inference requirements, balancing throughput and latency SLAs.
  • Established model serving infrastructure using containerized inference endpoints, integrating with Redbird's AI agent and analytics pipeline.
  • Implemented KEDA (Kubernetes Event-Driven Autoscaling) to scale LLM inference pods based on real-time queue depth and request throughput metrics, eliminating idle compute and reducing pod-level costs.
  • Deployed Karpenter for intelligent, just-in-time EC2 node provisioning — replacing the static managed node groups and enabling right-sized instance selection (including Spot and Graviton) per workload profile.
  • Combined KEDA + Karpenter to achieve scale-to-zero during off-peak windows, delivering a meaningful reduction in monthly AWS compute spend while maintaining P99 latency targets during peak inference load.
  • Managed all cloud infrastructure via Terraform (IaC), ensuring consistent, auditable, and repeatable deployments across dev, staging, and production environments on AWS.
  • Built and maintained CI/CD pipelines (GitHub Actions + ArgoCD) for automated model and application updates, with progressive delivery and rollback capabilities.
  • Instrumented the platform with Prometheus and Grafana dashboards covering inference latency, GPU utilization, pod autoscaling events, and node provisioning time.
  • The same infrastructure created in AWS was created in Azure, deployed the Application in the AKS, for support of a client who requests to create a VPC peering, to connect his internal applications with the Redbird solution. Terraform was used to deploy all the infrastructure, like VPC, Subnets, and AKS.
Project Tech stack:
Kubernetes
AWS
Amazon CloudFront
LLM
Terraform
RabbitMQ
Prometheus
Grafana
Microsoft Azure
Senior DevOps Engineer
Jun 2022 - Dec 20231 year 6 months
Project Overview

Worked internally in the Client Recruitics, helping to manage the AWS with 24 Accounts and K8s Clusters.

Responsibilities:
  • Owned AWS infrastructure operations for Recruitics: monitoring, capacity planning, performance tuning, and incident response.
  • Migrated legacy application workloads to Kubernetes (EKS), improving scaling agility and resource efficiency.
  • Modernized CI/CD by migrating all pipeline operations from Jenkins to Bitbucket Pipelines, reducing build times and maintenance overhead.
  • Facilitated migration of legacy AWS accounts to purpose-built, governance-aligned accounts with Kubernetes and modern tooling.
Project Tech stack:
Kubernetes
AWS
AI
Terraform
Terragrunt
Senior DevOps Engineer
Feb 2021 - May 20221 year 3 months
Project Overview

It's a project designed to support clients in managing daily collaboration with the development team and internal projects.

Responsibilities:
  • Participated in the migration of the ElasticSearch environment to Kubernetes;
  • Contributed to deploying Karpenter across more than 20 EKS clusters.
Project Tech stack:
AWS
Kubernetes
Terraform
Terragrunt
Helm
Kafka
ElasticSearch
Amazon RDS
Senior DevOps Engineer
Dec 2018 - Jul 20212 years 6 months
Project Overview

It's an AI-powered platform that streamlines clinical documentation and enhances clinician efficiency through EHR integration.

Responsibilities:
  • Migrated the application from EC2 to Kubernetes, deploying an EKS cluster with all necessary management tools;
  • Developed Terraform code to provision AWS accounts following best practices;
  • Utilized Helm templates for multi-environment application deployments;
  • Optimized scalability and reliability, significantly reducing timeouts.
Project Tech stack:
AWS
Kubernetes
Terraform
Amazon CloudFront
CDN
Senior DevOps Engineer
Dec 2018 - Nov 201911 months
Project Overview

It's a project to migrate the Java application from the EC2 servers to Kubernetes Cluster.

Responsibilities:
  • Reduced the time of the scalability from 15 minutes to 35 seconds.
  • Lowered the cost by scaling from 450 EC2 instances to 150 EKS nodes.
  • Enhanced the performance based on the autoscalling.
Project Tech stack:
AWS
Kubernetes
Apache Kafka
Terraform
Java
CDN
AWS Lambda
DevOps System Engineer
Jul 2014 - Dec 20184 years 5 months
Project Overview

It's a sustainable brand in the retail of coatings.

Responsibilities:
  • Analyzed technologies used and developed processes to improve and expand existing workflows.
  • Established milestones for department contributions and developed processes to facilitate collaboration.
  • Provided detailed specifications for proposed solutions, including materials, manpower, and time required.
  • Collaborated with Developers on building applications and managing DevOps infrastructure, successfully monitoring progress and implementation.
  • Migrated the data center from on-premises to AWS.
  • Migrated applications to Kubernetes.
Project Tech stack:
AWS
Kubernetes
Oracle
Terraform
Helm

Education

2011
System Information
Bachelor

Languages

Portuguese
Advanced
English
Advanced

Hire Diego or someone with similar qualifications in days
All developers are ready for interview and are are just waiting for your requestdream dev illustration
Copyright © 2026 lemon.io. All rights reserved.