Daniel
From Portugal (UTC+1)
Lemon.io stats
1
offers now 🔥Daniel – Python, AWS, LLM
Daniel is a Senior Data Scientist and AI/ML Engineer with 8 years of Python experience specializing in NLP, LLMs, and RAG systems. He builds and deploys production ML solutions end-to-end, with hands-on experience across AWS and GCP. His work focuses on email security, text processing, and scalable ML pipelines handling real-world data and high-volume inference. Daniel is comfortable translating business goals into technical systems and collaborating directly with stakeholders. He also mentors junior engineers and works effectively in startup and client-facing environments.
8 years of commercial experience in
Main technologies
Additional skills
Direct hire
PossibleReady to get matched with vetted developers fast?
Let’s get started today!Experience Highlights
Senior Data Scientist
Deployed an LLM-based phishing detection ranking system using AWS Bedrock, enabling Sales Engineers to identify high-value threat cases; processes hundreds of detections daily with structured, reliable LLM outputs
- Deployed an LLM service through AWS Bedrock;
- Written an API as a wrapper to that LLM so it can be queried/answered;
- Implemented a RAG system that queries a client-side API vector db in Elastic Search to find the necessary record;
- Wrote another client-side API that maintains the vector db - deleted, inserts, updates records - as well as answers to query requests.
Senior Data Scientist
Designed and deployed NLP systems for email security, including a few-shot topic classifier (SetFit + contrastive learning) detecting phishing/BEC without traditional indicators and an anomaly detection pipeline for sensitive data leaks and policy violations, serving millions of inferences daily via AWS SageMaker.
- Translated business requirements into project and technological solutions;
- Successfully conducted a thorough investigation on the dataset and what solution is for our use case;
- Model research and validation;
- Model deployment and service (through an API);
- Model monitoring and alerting.
Senior Data Scientist
Built an impersonation detection system for email security that prevented any lookalike attacks for VIP people and known contacts.
- Built and maintained Big Data pipelines for data collection and transformation;
- Developed and engineered features for machine learning models;
- Conducted model research, experimentation, and training;
- Designed and implemented APIs for model serving;
- Deployed, monitored, and maintained ML models in production.
Data Scientist II
Built a semantic editor recommendation system that matched 100+ daily paper submissions with 15,000+ editors using dense embeddings and vector similarity search (early RAG-style retrieval), combining semantic similarity with editor availability and responsiveness for sub-second ranking.
- Built an embeddings-based retrieval system;
- Developed a behavioral model to support the first system;
- Matched 100+ daily paper submissions to a pool of 15,000+ editors;
- Designed and implemented APIs for query processing and indexing;
- Deployed, monitored, and maintained the system in production.