Daniel – AWS, Python, LLM
Daniel is a Senior Data Scientist and AI/ML Engineer with 8 years of Python experience specializing in NLP, LLMs, and RAG systems. He builds and deploys production ML solutions end-to-end, with hands-on experience across AWS and GCP. His work focuses on email security, text processing, and scalable ML pipelines handling real-world data and high-volume inference. Daniel is comfortable translating business goals into technical systems and collaborating directly with stakeholders. He also mentors junior engineers and works effectively in startup and client-facing environments.
8 years of commercial experience in
Main technologies
Additional skills
Direct hire
PossibleReady to get matched with vetted developers fast?
Let’s get started today!Experience Highlights
Senior Data Scientist
Deployed an LLM-based phishing detection ranking system using AWS Bedrock, enabling Sales Engineers to identify high-value threat cases; processes hundreds of detections daily with structured, reliable LLM outputs
- Deployed an LLM service through AWS Bedrock;
- Written an API as a wrapper to that LLM so it can be queried/answered;
- Implemented a RAG system that queries a client-side API vector db in Elastic Search to find the necessary record;
- Wrote another client-side API that maintains the vector db - deleted, inserts, updates records - as well as answers to query requests.
Senior Data Scientist
Designed and deployed NLP systems for email security, including a few-shot topic classifier (SetFit + contrastive learning) detecting phishing/BEC without traditional indicators and an anomaly detection pipeline for sensitive data leaks and policy violations, serving millions of inferences daily via AWS SageMaker.
- Translated business requirements into project and technological solutions;
- Successfully conducted a thorough investigation on the dataset and what solution is for our use case;
- Was responsible for model research and validation;
- Deployed the model and set up monitoring and alerting.
Senior Data Scientist
Built an impersonation detection system for email security that prevented any lookalike attacks for VIP people and known contacts.
- Built and maintained Big Data pipelines for data collection and transformation;
- Developed and engineered features for machine learning models;
- Conducted model research, experimentation, and training;
- Designed and implemented APIs for model serving;
- Deployed, monitored, and maintained ML models in production.
Data Scientist II
Built a semantic editor recommendation system that matched 100+ daily paper submissions with 15,000+ editors using dense embeddings and vector similarity search (early RAG-style retrieval), combining semantic similarity with editor availability and responsiveness for sub-second ranking.
- Built an embeddings-based retrieval system;
- Developed a behavioral model to support the first system;
- Matched 100+ daily paper submissions to a pool of 15,000+ editors;
- Designed and implemented APIs for query processing and indexing;
- Deployed, monitored, and maintained the system in production.