Oluwatosin – PyTorch, AWS, Python, experts in Lemon.io

Oluwatosin

From United Kingdom (UTC+1)

AI Engineer|Senior

Machine Learning Engineer|Senior

Back-end Web Developer|Senior

Lemon.io stats

2

projects done

796

hours worked

1

offers now 🔥

Skills and seniority verified on Nov 18, 2025

Oluwatosin – PyTorch, AWS, Python

Tosin is a highly experienced and versatile Senior AI/ML Engineer with a proven track record in scaling AI and machine learning products from research prototypes to production-ready solutions. Specializes in bridging the gap between interdisciplinary research and commercial applications, particularly in multimedia, health, and accounting domains. Has successfully delivered multiple NLP, computer vision, and multimodal AI solutions, including LLM- and RAG-driven products for document, speech, and data-heavy workloads. Comfortable leading projects, aligning product and engineering goals, making him an ideal candidate to drive both technical innovation and team growth. Highly adaptable, can rapidly integrate AI solutions into evolving product landscapes, ensuring practical, scalable, and impactful results.

8 years of commercial experience in

Accounting

AI

Business intelligence

Edtech

Healthcare

Healthtech

Machine learning

Media

B2B

AI software

Chatbots

Customer support software

NLP software

SaaS

Main technologies

PyTorch

6.5 years

AWS

4.5 years

Python

8 years

LLM

4 years

MLOps

6 years

Additional skills

GPT

Tensorflow

Deep Learning

FastAPI

Chroma.js

React

PostgreSQL

Kubernetes

RAG

GCP

Terraform

Amazon S3

Vector Databases

NLP

LangGraph

LangChain

Docker

SQL

Typescript

ETL

Django

Chat bots

Data analysis

Next.js

Redis

Docker Compose

Claude Code

Cursor

Direct hire

Possible

Ready to get matched with vetted developers fast?

Let’s get started today!

Experience Highlights

Senior AI Engineer

Oct 2025 - Ongoing7 months

Project Overview

A founder‑first business clarity platform that turns scattered metrics into simple, actionable guidance for first‑time founders. It emphasizes guided setup, plain language, and visual insights over complex dashboards, with multi‑tenant architecture, outcomes/goals tracking, scorecards, and an AI assistant (“The Operator”) for metric questions.

Responsibilities:

Built a tool‑based chat pipeline that registers Operator tools, streams responses, tracks tool calls/results, and enforces required tool usage for data‑backed answers;
Added system‑prompt loading from DB with a resilient fallback, plus automatic knowledge‑base context injection for concept questions;
Centralized tool registration and instantiation for account‑scoped tool execution, supporting both RubyLLM native tools and legacy OpenAI function format;
Implemented a “get_metrics_summary” tool that performs metric search, fetches recent time‑series summaries, computes trends, detects bottlenecks, and auto‑pulls knowledge‑base guidance for recommendations;
Built a daily update generator that gathers revenue/vitals/goal status aligned with dashboard logic, assembles structured context, and produces a formatted update via LLM prompts;
Added tool‑call monitoring, reasoning step tracking, UI broadcasts, and a “direct response” bypass for daily updates to guarantee formatting and avoid double responses.

Project Tech stack:

Ruby on Rails

React

LLM

PostgreSQL

Vector Databases

GCP

Apache Airflow

Senior Full-stack AI Engineer

Nov 2025 - Dec 20251 month

Project Overview

Production “medical AI infrastructure” stack: a GPU-accelerated, evidence-grounded medical intelligence backend + web app that serves a medical agent. It runs as a FastAPI API with a React/Vite frontend and externalizes state to Redis + Postgres, connects to vLLM for LLM inference, and uses retrieval over FAISS plus Qdrant.

Responsibilities:

Built a hybrid RAG pipeline over FAISS (PubMed abstracts) + Qdrant (medical codes) with parallel retrieval + prompt assembly, feeding vLLM for grounded generation;
Implemented CIVERA repository-first guardrails that classify evidence completeness and dynamically constrain LLM outputs to retrieved sources (strict mode to avoid unconstrained generation);
Workspaces monorepo migration (refactor/monorepo). Reorganized into src/api (FastAPI), src/web (Vite/React), infrastructure/docker, scripts/, tests/, and organized Docs/;
DevX and ops hardening via unified Docker Compose workflow + health checks; standard dev scripts; structured test layout (unit/integration/e2e markers); clearer “how to run” docs and architecture handoff.

Project Tech stack:

Python

Vector Databases

React

Qdrant

Docker

Docker Compose

LLM

FastAPI

FastMCP

RAG

Vite

Redis

CTO

Sep 2022 - Jun 20252 years 9 months

Project Overview

Developed a Generative AI–powered bookkeeping agent that automates the extraction of accounting entries, assigns GL and tax codes, and performs bank reconciliation. The system processes around 10 invoices every 3 minutes and supports multiple communication channels, including WhatsApp, email, and Slack. Designed for accounting firms, SMEs, and startups, the solution delivers high efficiency—up to 10× faster and 4× more cost-effective than traditional bookkeeping workflows.

Responsibilities:

Led the development of an AI-powered accounting agentic system and designed end-to-end AI/ML architecture;
Built scalable document-processing infrastructure handling tens of thousands of invoices monthly, including a custom VLM that outperformed traditional OCR;
Developed a multi-agent system for accounting tasks using open-source foundation models and implemented RAG/GraphRAG pipelines for accurate extraction and reasoning;
Optimized and scaled LLM/VLM inference using Kubernetes, GPU multi-LoRA adapters, quantization, flash attention, and CUDA-optimized engines (vLLM, LMDeploy);
Designed ETL and automation workflows with Airflow for continual fine-tuning and model improvement;
Created a custom orchestration layer (replacing LangChain and others) and built an in-house vector database for semantic search;
Delivered a production transactional accounting agent using React, Python, FastAPI, and Slack SDK, deployed via AWS ECS, Docker, and Terraform;
Established MLOps practices, CI/CD pipelines, observability (Grafana, Elasticsearch), and grew the engineering and AI teams from the ground up;
Architected and deployed Label Studio and Argilla for structured annotation workflows automation across the AI team;
Integrated Labelbox for enterprise-grade labeling at scale;
Enabled a human-in-the-loop feedback loop that fed directly into Airflow-orchestrated fine-tuning pipelines;
Managed data annotator sourcing and management;
Reduced dependency on external annotation vendors by bringing labeling fully in-house over time.
Managed SoC2, ISO compliance via Drata. Achieved a successful audit and license within a year.

Project Tech stack:

Python

PyTorch

Hugging Face

NLP

LLM

Computer Vision

AWS

GCP

Vector Databases

PostgreSQL

React

Typescript

FastAPI

MCP Server

AI Trainer

Aug 2024 - Feb 20256 months

Project Overview

RLHF preference pair annotation for code and mathematics domains, contributing to foundation model alignment and fine-tuning pipelines for one of the world's leading AI data companies.

Responsibilities:

Evaluated and ranked model-generated code and math outputs to produce high-quality preference pairs for LLM fine-tuning;
Contributed to alignment pipelines used in training state-of-the-art foundation models;
Developed domain judgment for code correctness and mathematical reasoning at scale.

Project Tech stack:

Core Data

Python

Data Science

Senior Data Scientist

Nov 2021 - Oct 202211 months

Project Overview

AI-powered contact center platform that analyzes conversations across digital and voice channels, delivers real-time coaching to advisors, and provides leaders with actionable insights into 100% of customer demand and performance metrics.

Responsibilities:

Built and deployed transcription (Speech2Text wav2vec-large-xlsr model) technology for optimizing call center productivity for client queries;
Used SQL best practices to design PostgreSQL back-end databases for insights;
Streamlined ETL processes, reducing errors due to manual data entry;
Built a multi-tenancy speech text classifier model using the Transformer DistilRoBerta model for client call center data, which were poorly mixed up during database setup for multiple customers' environments in Client Server;
Built and deployed a continuous customer prioritization algorithm for managing customer churn rates through continuous training and inference using Ludwig and Ray clusters for a state-of-the-art MLOps Pipeline;
Performed QA analysis of data-centric frontend product features, such as metrics, filters, and weights of call center agent performance with customers by querying MongoDB and transformation using pandas and numpy.

Project Tech stack:

Python

PyTorch

Hugging Face

NLP

Ray

FastAPI

AWS

AWS SageMaker

Jira

Git

GitHub

MLflow

Weights & Biases

GitHub Actions

Docker

NoSQL

ETL

Data analysis

NLP Engineer

Apr 2020 - Oct 20211 year 6 months

Project Overview

AI-driven platform for assessing news article credibility and fact-checking. The system uses transformer models to detect article stance and credibility, incorporates an evidence retrieval pipeline for verification, and includes automated CI/CD workflows for continuous model updates and deployment.

Responsibilities:

Developed and deployed a news article credibility algorithm using DistilRoBERTa, applied commercially for fact-checking major events such as the 2020 US presidential debate and monitoring misinformation during Euro 2020;
Built a stance detection model to identify agreement and disagreement, mitigating coordinated attacks and manipulative threats;
Created an evidence retrieval system combining BM25 ranking and semantic search with embeddings, powering the fact-checking pipeline by sourcing claims from media and social platforms;
Implemented CI/CD pipelines for continuous machine learning deployment on GCP;
Developed an aspect-based topic modeling system to extract and generate topic terms from social media content, enhancing data analysis and monitoring.

Project Tech stack:

Python

Hugging Face

PyTorch

Twitter API

FastAPI

AWS

Cloud Computing

Weights & Biases

Tensorflow

GitHub

Docker

AI

GPT

Pandas

ElasticSearch

ETL

REST API

Data Science Lead

May 2018 - Mar 20201 year 10 months

Project Overview

AI-powered assistive technology solutions designed to support inclusive education and enhance independence for individuals with visual impairments or reading disabilities, such as dyslexia and ADHD. The core product includes a mobile app and smart reading glasses to facilitate accessible reading and learning experiences.

Responsibilities:

Built a rule-based Clinical Decision Support system to detect early signs of blindness;
Developed multisensory deep learning algorithms for embedded smart reading glasses;
Implemented Optical Character Recognition (OCR) for real-time text capture;
Created Text-to-Speech models using RNNs to provide audio output for visually impaired users;
Integrated hardware and software components to optimize the performance of smart reading devices;
Conducted testing and fine-tuning to ensure accessibility, accuracy, and responsiveness of the system.

Project Tech stack:

Python

Pandas

PyTorch

Scikit-learn

Azure Functions

Flask

GitHub

Django

Keep in mind, the experience summary might exclude non-relevant projects

Education

2019

Data Science

MSc

Languages

English

Advanced

Hire Oluwatosin or someone with similar qualifications in days

All developers are ready for interview and are are just waiting for your request

Copyright © 2026 lemon.io. All rights reserved.

Terms of use Privacy policy