Alex

From Brazil (UTC-3)

AI Engineer|Senior

Lemon.io stats

1

projects done

66

hours worked

2

offers now 🔥

Skills and seniority verified on Sep 22, 2025

Alex – Python, LLM, AI

Alex is a senior AI Engineer with strong expertise in NLP, LLMs, and data science applications across industries. His hands-on experience in building RAG projects and enterprise knowledge solutions highlights his technical proficiency. His vast AI portfolio includes a multi-agent system for causal measurement across advertising platforms, AI chatbots, a semantic search system, and recommendation engines.

12 years of commercial experience in

Advertising

Analytics

Fintech

Healthcare

Chatbots

Enterprise software

Main technologies

Python

12 years

LLM

4 years

AWS

5 years

NLP

4 years

Additional skills

GCP

AWS Lambda

Node.js

RAG

LangChain

OpenAI API

Pinecone

Amazon ECS

PostgreSQL

Amazon S3

Amazon RDS

FastAPI

AWS SageMaker

NumPy

PyTorch

Redis

Snowflake

Pydantic

Kafka

Tensorflow

Vertex AI

ElasticSearch

Airflow

LangGraph

LLaMA

Direct hire

Possible

Ready to get matched with vetted developers fast?

Let’s get started today!

Experience Highlights

Machine Learning Engineer / Project Lead

Dec 2024 - Aug 20257 months

Project Overview

At this company, marketing teams lacked the ability to measure the true causal impact of digital advertising campaigns across channels like Meta, Google Ads, and Amazon. Existing tools provided fragmented insights and often misattributed success.

Alex built a multi-agent causal measurement platform to unify cross-platform data, apply causal inference techniques, and provide actionable insights on incremental ROI. Specialized agents collected and normalized data, ran causal experiments (geo-lift, conversion lift), and analyzed treatment effects. Context-aware sentiment agents assessed ad placement safety and emotional alignment. An orchestration layer coordinated all agents, while validation loops ensured accuracy and robustness.

The system translated technical causal outputs into executive-friendly insights via natural language generation and dashboards, driving millions in efficiency gains. Results included +45% measurement accuracy over last-click attribution, $3.2M in waste reduction, and a 28% ROI lift. The system became the foundation for an annual $15M efficiency gain.

Responsibilities:

Designed and implemented a multi-agent system for causal measurement across advertising platforms.
Built specialized data collection agents for Meta, Google Ads, Amazon, and analytics platforms.
Developed normalization pipeline for cross-platform metrics (CPM, CPC, ROAS).
Implemented causal inference methods (Bayesian time-series, causal forests, DAGs, geo-lift tests).
Built transformer-based agents for context & sentiment analysis of ad placements.
Designed orchestration and decision engine with Apache Airflow and optimization models.
Deployed system on AWS ECS with Kafka event streaming, Redis caching, and Snowflake data storage.
Set up monitoring with Datadog dashboards and anomaly detection in causal estimates.
Created natural language summaries and dashboards for executives with Plotly Dash.
Achieved 92% accuracy in incremental conversion prediction and reduced decision latency from 2 weeks to 2 hours.
Scaled the system to process 50M+ events daily, including TV attribution and retail media.

Project Tech stack:

Apache Airflow

Redis

Kafka

Amazon ECS

Snowflake

OpenAI API

NumPy

Python

Pydantic

Plotly

PyTorch

SQL

PostgreSQL

AI Engineer / Project Lead

Jul 2024 - Jul 20251 year

Project Overview

An internal AI chatbot for a large enterprise client. The chatbot needed to provide accurate answers about internal documentation and policies while responding in the distinct tone of company executives (e.g., CTO, Design Director) to build trust and engagement.

The solution combined Retrieval-Augmented Generation (RAG) with fine-tuned persona adapters. Alex designed a retrieval layer using semantic embeddings and Pinecone for real-time answers (<500ms latency), and fine-tuned LoRA adapters per persona to replicate communication styles. An evaluation framework ensured accuracy and tone alignment, while feedback loops allowed continuous improvement.

The chatbot significantly reduced HR and IT support tickets, increased internal NPS, and gained executive sponsorship for expansion into new teams and personas

Responsibilities:

Designed and implemented the RAG pipeline with Pinecone and OpenAI embeddings.
Built fine-tuned LoRA persona adapters using synthetic + real data.
Integrated persona routing into LangChain’s generation module.
Implemented evaluation pipeline (LLM-as-a-judge, accuracy benchmarks, latency tracking).
Designed an active learning loop with user feedback and retraining.
Ensured production safety with content moderation and prompt filters.
Achieved 85%+ persona tone accuracy (vs. <60% baseline).
Reduced support ticket volume by 40% and improved NPS by +25 points.

Project Tech stack:

Python

LangChain

Pinecone

OpenAI API

RAG

AWS

AWS Lambda

Typescript

Node.js

Machine Learning Engineer / Project Lead

Nov 2024 - Jun 20256 months

Project Overview

At this company, support teams struggled to navigate a large internal knowledge base due to keyword-based search limitations. Alex built a semantic search system powered by RAG and LLMs to provide context-aware answers using natural language queries.

The pipeline combined embeddings and Pinecone for retrieval, dynamic prompting via LangChain, and a reranker for higher precision. Guardrails detected hallucinations and unsafe outputs, with fallback to links or human escalation. The system was deployed on AWS (SageMaker + Lambda), integrated into the support agents’ UI, and included monitoring with human-in-the-loop review for continuous improvement.

The project improved retrieval precision by ~35%, reduced resolution times by 25%, and increased agent satisfaction by 40%. The same architecture was later scaled to HR and finance teams

Responsibilities:

Collected and processed ~100K internal support articles, FAQs, and chat logs.
Designed semantic chunking with recursive text splitting.
Tuned embeddings on in-domain data using PEFT.
Implemented RAG pipeline with Pinecone + LangChain.
Added a reranker transformer for precision improvement.
Built guardrails for hallucination and unsafe output detection.
Deployed production system on AWS (SageMaker, Lambda, Step Functions).
Designed monitoring with human-in-the-loop feedback and a retraining loop.
Achieved +35% retrieval precision and -25% resolution time.
Scaled architecture to multiple departments (HR, Finance).

Project Tech stack:

OpenAI API

Pinecone

LangChain

AWS Lambda

AWS SageMaker

Amazon ECS

Python

Generative AI Engineer

Jul 2021 - Jul 20243 years

Project Overview

At Brazil’s largest beauty retailer, Alex's work centered on designing and implementing high-performance Retrieval-Augmented Generation (RAG) systems that seamlessly integrated generative AI models, driving insights and enhancing data accessibility across the organization. Focused on maximizing value, Alex delivered robust data pipelines that ensured reliable, efficient, and impactful data-driven solutions for various business needs.

Responsibilities:

Developed and optimized generative AI solutions, including intelligent chatbots and recommendation engines, leveraging LLMs and prompt engineering to improve response relevance and user engagement.
Integrated vector databases (e.g., Pinecone, ElasticSearch) to enhance embedding storage and retrieval, driving increased accuracy and scalability in search and recommendation functionalities.
Built and maintained robust data pipelines to support high-volume data retrieval across internal and external systems, ensuring seamless and low-latency information flow.
Employed Docker and Kubernetes for containerized deployment of back-end services, establishing stable and resilient infrastructure across distributed environments.
Collaborated with data science and engineering teams to create and deploy reusable libraries and frameworks, streamlining generative AI model integration with the back-end. Achievements:
Successfully delivered 10+ high-impact generative AI projects, resulting in a 20% improvement in operational efficiency and a measurable increase in profitability.
Optimized data pipeline architecture, reducing processing times by 25%, which enabled faster, more efficient real-time data access.

Project Tech stack:

Python

Node.js

Typescript

LangChain

Pinecone

Supabase

ElasticSearch

Hugging Face

Tensorflow

PyTorch

GCP

Vertex AI

Microsoft Azure

AWS

Databricks

AI Engineer – Lead

Dec 2022 - Nov 202310 months

Project Overview

At Brazil’s largest beauty retailer, knowledge management was a major bottleneck across 4,000+ franchise stores and corporate teams. Employees wasted time searching for product information, policies, and procedures across fragmented systems, creating inefficiencies, duplicate tickets, and inconsistent customer service.

The solution was an Enterprise Knowledge Assistant integrated directly into Slack, the company’s main communication platform. This assistant retrieved accurate information from 50,000+ internal documents and handled 10,000+ daily queries in Portuguese with <2s response times.

Key features included:

Multi-agent architecture with semantic retrieval, contextual enrichment, and continuous learning.
Native Slack bot with slash commands, interactive responses, multilingual support, and proactive notifications.
Scalable AWS deployment with ECS, Lambda, RDS, and Pinecone vector DB.
Robust monitoring, analytics dashboard, and user feedback loops.

The project reduced response times from 45 minutes to under 2 seconds, cut duplicate tickets by 60%, improved sales conversion, and became the AI template for other business units.

Responsibilities:

Designed and implemented a multi-agent architecture using LangGraph.
Built a scalable knowledge ingestion pipeline for 50,000+ documents (PDFs, images, structured data).
Integrated Slack bot with advanced features: commands, contextual responses, and multilingual switching.
Deployed auto-scaling AWS infrastructure (ECS Fargate, Lambda, RDS, S3, CloudFront).
Ensured full LGPD compliance and enterprise security standards.
Created real-time analytics dashboards for usage, response quality, and business impact.
Reduced response time from 45 min to <1.8s; achieved 92% accuracy in Portuguese queries.
Drove adoption by 85% of employees within 3 months.

Project Tech stack:

Python

FastAPI

LangChain

Typescript

AWS

Pinecone

PostgreSQL

CloudWatch

Amazon CloudFront

AWS Lambda

Amazon RDS

Amazon S3

Amazon ECS

Senior Data Scientist

Jan 2020 - Jul 20211 year 5 months

Project Overview

Alex worked as a Data Scientist specializing in demand forecasting and inventory management. His role involved analyzing data to uncover patterns and trends affecting product demand, collaborating with cross-functional teams to develop data-driven solutions, and overseeing the entire lifecycle of forecasting models. Alex provided actionable insights to optimize inventory management, reduce stockouts, and enhance operational efficiency.

Responsibilities:

Conduct exploratory data analysis to identify patterns, trends, and relevant insights.
Managed the complete lifecycle of forecasting models, from data collection and preparation to evaluation and deployment in production.
Provide actionable insights and data-driven recommendations to optimize inventory management and reduce stockouts in stores.

Project Tech stack:

Python

PySpark

SQL

Airflow

Databricks

Bitbucket

Keep in mind, the experience summary might exclude non-relevant projects

Education

2023

Artificial Intelligence

Postgraduate Degree

2021

Data Sciences

Postgraduate Degree

2020

Computer Systems Technology

Degree

Languages

English

Advanced

Hire Alex or someone with similar qualifications in days

All developers are ready for interview and are are just waiting for your request