Logo
Oluwatosin – PyTorch, AWS, Python, experts in Lemon.io

Oluwatosin

From United Kingdom (UTC+1)flag

AI Engineer|Senior
Machine Learning Engineer|Senior
Back-end Web Developer|Senior

Oluwatosin – PyTorch, AWS, Python

Tosin is a highly experienced and versatile Senior AI/ML Engineer with a proven track record in scaling AI and machine learning products from research prototypes to production-ready solutions. Specializes in bridging the gap between interdisciplinary research and commercial applications, particularly in multimedia, health, and accounting domains. Has successfully delivered multiple NLP, computer vision, and multimodal AI solutions, including LLM- and RAG-driven products for document, speech, and data-heavy workloads. Comfortable leading projects, aligning product and engineering goals, making him an ideal candidate to drive both technical innovation and team growth. Highly adaptable, can rapidly integrate AI solutions into evolving product landscapes, ensuring practical, scalable, and impactful results.

7 years of commercial experience in
Accounting
AI
Edtech
Healthcare
Healthtech
Machine learning
Media
B2B
AI software
Chatbots
Customer support software
NLP software
Main technologies
PyTorch
6.5 years
AWS
4.5 years
Python
8 years
LLM
4 years
MLOps
6 years
Additional skills
GPT
Tensorflow
Deep Learning
FastAPI
Chroma
React
PostgreSQL
Kubernetes
Airflow
RAG
GCP
Terraform
Amazon S3
Computer Vision
Hugging Face
OpenCV
NLP
LangChain
Jira
Git
ElasticSearch
Pandas
Docker
Flask
Scikit-learn
Direct hire
Possible
Ready to get matched with vetted developers fast?
Let’s get started today!

Experience Highlights

CTO
Oct 2022 - Jul 20252 years 8 months
Project Overview

Developed a Generative AI–powered bookkeeping agent that automates the extraction of accounting entries, assigns GL and tax codes, and performs bank reconciliation. The system processes around 10 invoices every 3 minutes and supports multiple communication channels, including WhatsApp, email, and Slack. Designed for accounting firms, SMEs, and startups, the solution delivers high efficiency—up to 10× faster and 4× more cost-effective than traditional bookkeeping workflows.

Responsibilities:
  • Led the development of an AI-powered accounting agentic system and designed end-to-end AI/ML architecture;
  • Built scalable document-processing infrastructure handling tens of thousands of invoices monthly, including a custom VLM that outperformed traditional OCR;
  • Developed a multi-agent system for accounting tasks using open-source foundation models and implemented RAG/GraphRAG pipelines for accurate extraction and reasoning;
  • Optimized and scaled LLM/VLM inference using Kubernetes, GPU multi-LoRA adapters, quantization, flash attention, and CUDA-optimized engines (vLLM, LMDeploy);
  • Designed ETL and automation workflows with Airflow for continual fine-tuning and model improvement;
  • Created a custom orchestration layer (replacing LangChain and others) and built an in-house vector database for semantic search;
  • Delivered a production transactional accounting agent using React, Python, FastAPI, and Slack SDK, deployed via AWS ECS, Docker, and Terraform;
  • Established MLOps practices, CI/CD pipelines, observability (Grafana, Elasticsearch), and grew the engineering and AI teams from the ground up.
Project Tech stack:
Python
PyTorch
Hugging Face
FastAPI
GPT
LangChain
LangGraph
Terraform
AWS
Kubernetes
Amazon S3
Chroma
ClickHouse
PostgreSQL
Vector Databases
RAG
Terragrunt
MLflow
Grafana
CloudWatch
Amazon ECS
Deep Learning
NLP
OpenCV
Computer Vision
GCP
Cohere
Airflow
React
AWS SageMaker
Jira
Git
GitHub Actions
GitHub
Docker
Senior Data Scientist
Nov 2021 - Oct 202211 months
Project Overview

AI-powered contact center platform that analyzes conversations across digital and voice channels, delivers real-time coaching to advisors, and provides leaders with actionable insights into 100% of customer demand and performance metrics.

Responsibilities:
  • Built and deployed transcription (Speech2Text wav2vec-large-xlsr model) technology for optimizing call center productivity for client queries;
  • Used SQL best practices to design PostgreSQL back-end databases for insights;
  • Streamlined ETL processes, reducing errors due to manual data entry;
  • Built a multi-tenancy speech text classifier model using the Transformer DistilRoBerta model for client call center data, which were poorly mixed up during database setup for multiple customers' environments in Client Server;
  • Built and deployed a continuous customer prioritization algorithm for managing customer churn rates through continuous training and inference using Ludwig and Ray clusters for a state-of-the-art MLOps Pipeline;
  • Performed QA analysis of data-centric frontend product features, such as metrics, filters, and weights of call center agent performance with customers by querying MongoDB and transformation using pandas and numpy.
Project Tech stack:
Python
PyTorch
Hugging Face
NLP
Ray
FastAPI
AWS
AWS SageMaker
Jira
Git
GitHub
MLflow
Weights & Biases
GitHub Actions
Docker
NLP Engineer
Apr 2020 - Oct 20211 year 6 months
Project Overview

AI-driven platform for assessing news article credibility and fact-checking. The system uses transformer models to detect article stance and credibility, incorporates an evidence retrieval pipeline for verification, and includes automated CI/CD workflows for continuous model updates and deployment.

Responsibilities:
  • Developed and deployed a news article credibility algorithm using DistilRoBERTa, applied commercially for fact-checking major events such as the 2020 US presidential debate and monitoring misinformation during Euro 2020;
  • Built a stance detection model to identify agreement and disagreement, mitigating coordinated attacks and manipulative threats;
  • Created an evidence retrieval system combining BM25 ranking and semantic search with embeddings, powering the fact-checking pipeline by sourcing claims from media and social platforms;
  • Implemented CI/CD pipelines for continuous machine learning deployment on GCP;
  • Developed an aspect-based topic modeling system to extract and generate topic terms from social media content, enhancing data analysis and monitoring.
Project Tech stack:
Python
Hugging Face
PyTorch
Twitter API
FastAPI
AWS
Cloud Computing
Weights & Biases
Tensorflow
GitHub
Docker
AI
GPT
Pandas
ElasticSearch
Data Science Lead
May 2018 - Mar 20201 year 10 months
Project Overview

AI-powered assistive technology solutions designed to support inclusive education and enhance independence for individuals with visual impairments or reading disabilities, such as dyslexia and ADHD. The core product includes a mobile app and smart reading glasses to facilitate accessible reading and learning experiences.

Responsibilities:
  • Built a rule-based Clinical Decision Support system to detect early signs of blindness;
  • Developed multisensory deep learning algorithms for embedded smart reading glasses;
  • Implemented Optical Character Recognition (OCR) for real-time text capture;
  • Created Text-to-Speech models using RNNs to provide audio output for visually impaired users;
  • Integrated hardware and software components to optimize the performance of smart reading devices;
  • Conducted testing and fine-tuning to ensure accessibility, accuracy, and responsiveness of the system.
Project Tech stack:
Python
Pandas
PyTorch
Scikit-learn
Azure Functions
Flask
GitHub

Education

2019
Data Science
MSc

Languages

English
Advanced

Hire Oluwatosin or someone with similar qualifications in days
All developers are ready for interview and are are just waiting for your requestdream dev illustration
Copyright © 2025 lemon.io. All rights reserved.