
Varun
From United States
Varun – AI, LLM, AWS
Varun is a versatile Senior AI/ML and Backend Engineer with extensive experience delivering production-grade AI systems at scale. He brings strong expertise in LLMs, transformers, and retrieval-augmented generation, alongside a solid foundation in classical ML and backend engineering with Python. With hands-on experience designing scalable architectures, fine-tuning models, and deploying enterprise-grade AI solutions for high-volume use cases, Varun excels at bridging research concepts and practical business applications. He is well-suited to founding engineer roles or senior AI/ML positions at startups and scale-ups.
11 years of commercial experience
Main technologies
Additional skills
Direct hire: Possible

Experience Highlights
Sr. Data Scientist / AI/ML Engineer
One of the world’s largest financial services companies, serving millions of customers across payments, lending, and banking products. The project was an Agentic AI platform designed to modernize dispute resolution by applying large language models and retrieval-augmented generation to financial document analysis and decision support. The product enabled compliant and more accurate handling of disputes by combining GPT-4 with enterprise retrieval systems, ensuring low-latency decisioning across millions of financial transactions.
- Led an Agentic AI initiative with Azure OpenAI GPT-4, LangChain, and RAG to modernize dispute resolution and improve customer experience.
- Designed and deployed FastAPI microservices on AKS with Cognitive Search and PGVector, enabling low-latency, compliant financial decisioning (a minimal sketch of the retrieval flow follows this list).
- Fine-tuned GPT-4 with LoRA on Azure ML using financial corpora, strengthening classification accuracy and reducing escalations.
- Built preprocessing pipelines with Dask on Databricks to enrich embeddings, improve recall, and reduce token usage in RAG workflows.
- Managed end-to-end MLOps pipelines with Azure ML and DevOps, integrating blue-green rollouts, rollback gates, and SOC2-aligned governance in GitHub.
- Secured production deployments with Azure Key Vault and Managed Identity, ensuring compliant secrets management and zero audit exceptions.
- Applied LangSmith, OpenAI Evals, and Application Insights to monitor LangChain pipelines, enforce Responsible AI, and catch regressions early.
- Built real-time streaming ingestion with Event Hubs and Synapse Spark, powering fraud alerts and improving grounding quality for RAG systems.
- Documented prompt-engineering playbooks in Confluence to optimize templates and reduce token costs while safeguarding answer quality.
- Delivered executive Power BI dashboards linked with LangChain traces, giving leadership visibility into KPIs, costs, and model performance.
- Led Agile delivery in Jira across cross-functional teams, delivering production increments consistently with strong acceptance rates.
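To give a flavor of the retrieval flow behind the first two bullets, here is a minimal sketch, assuming an Azure OpenAI deployment and a PGVector table named `dispute_docs`; the table, deployment names, and connection string are hypothetical, and the production system would add Managed Identity auth, retries, and tracing on top.

```python
# Minimal RAG sketch: embed the query, pull the nearest dispute documents
# from a pgvector table, then ask GPT-4 to decide with that context.
# Table and deployment names are illustrative, not the production ones.
import psycopg2
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://<your-resource>.openai.azure.com",
    api_key="<key>",  # in production this would come from Key Vault
    api_version="2024-02-01",
)

def retrieve(query: str, k: int = 5) -> list[str]:
    """Return the k dispute documents closest to the query embedding."""
    emb = client.embeddings.create(
        model="text-embedding-ada-002",  # embedding deployment name
        input=query,
    ).data[0].embedding
    with psycopg2.connect("dbname=disputes") as conn, conn.cursor() as cur:
        cur.execute(
            "SELECT content FROM dispute_docs "
            "ORDER BY embedding <-> %s::vector LIMIT %s",
            (str(emb), k),
        )
        return [row[0] for row in cur.fetchall()]

def decide(dispute_text: str) -> str:
    """Ground a GPT-4 decision in the retrieved dispute context."""
    context = "\n---\n".join(retrieve(dispute_text))
    resp = client.chat.completions.create(
        model="gpt-4",  # Azure deployment name
        messages=[
            {"role": "system",
             "content": "Decide the dispute using ONLY the context below.\n"
                        f"Context:\n{context}"},
            {"role": "user", "content": dispute_text},
        ],
        temperature=0,  # deterministic output for auditable decisions
    )
    return resp.choices[0].message.content
```

Temperature 0 and strict grounding in retrieved context keep decisions reproducible, which matters when every outcome may be reviewed for compliance.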
Data Scientist / AI/ML Engineer
A leading U.S. health insurance and healthcare services provider. The project was a claims automation and healthcare AI platform designed to help insurers and clinicians streamline claims processing, detect fraud, and improve decision support. The product enabled end-to-end digitization and analysis of claims data, integrating OCR, NLP, and ML pipelines to convert handwritten and structured claims into actionable insights. It supported faster reimbursement cycles, clinical summarization, and compliance-ready data pipelines for sensitive health records.
- Built and fine-tuned BERT, BioBERT, and ClinicalBERT models on AWS SageMaker to extract ICD-10 codes and triage healthcare claims.
- Integrated OpenAI GPT-3 APIs and early RAG pipelines with Amazon Kendra and FAISS to power clinical summarization and policy search assistants.
- Designed OCR and PHI redaction workflows using AWS Textract, Python, and Comprehend Medical to digitize handwritten claims and ensure secure data handling.
- Developed data preprocessing pipelines in Pandas, NumPy, and AWS Glue to cleanse messy claims and improve downstream feature quality.
- Built asynchronous batch processing pipelines with Python workers and AWS SQS to handle high-volume OCR claims efficiently (sketched after this list).
- Deployed low-latency adjudication services via Flask, AWS API Gateway, and SQS to streamline enterprise-scale claim processing.
- Delivered Tableau dashboards powered by Amazon Redshift, providing leadership visibility into SLA compliance, fraud alerts, and triage KPIs.
- Managed MLOps lifecycle with SageMaker Pipelines, MLflow, and CodePipeline, ensuring reproducible, compliant, and production-ready AI workflows.
- Implemented observability and explainability with SHAP, CloudWatch, Prometheus, and Grafana to maintain trust, transparency, and resilience in AI-driven decisions.
- Documented audit artifacts in GitHub and facilitated staged rollouts with rollback safeguards, supporting compliance and operational reliability.
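A minimal sketch of the asynchronous OCR pattern from the SQS bullet above, assuming a queue URL, S3 bucket, and message shape that are purely illustrative; PHI redaction and NLP triage would happen downstream of the OCR step.

```python
# Long-poll SQS for claim-document messages, OCR each scanned page
# (PNG/JPEG) with Textract, and delete the message only after success.
# Queue URL, bucket, and message shape are illustrative assumptions.
import json
import boto3

sqs = boto3.client("sqs")
s3 = boto3.client("s3")
textract = boto3.client("textract")

QUEUE_URL = "https://sqs.us-east-1.amazonaws.com/123456789012/claims-ocr"  # hypothetical

def process_queue() -> None:
    while True:
        resp = sqs.receive_message(
            QueueUrl=QUEUE_URL,
            MaxNumberOfMessages=10,
            WaitTimeSeconds=20,  # long polling cuts empty receives
        )
        for msg in resp.get("Messages", []):
            body = json.loads(msg["Body"])  # e.g. {"bucket": ..., "key": ...}
            doc = s3.get_object(Bucket=body["bucket"], Key=body["key"])
            ocr = textract.detect_document_text(
                Document={"Bytes": doc["Body"].read()}
            )
            lines = [b["Text"] for b in ocr["Blocks"]
                     if b["BlockType"] == "LINE"]
            print("\n".join(lines))  # downstream: PHI redaction, triage
            # Delete only after success so failures are retried via the queue
            sqs.delete_message(
                QueueUrl=QUEUE_URL, ReceiptHandle=msg["ReceiptHandle"]
            )
```

Deleting the message only after successful processing gives at-least-once semantics: if a worker crashes mid-claim, the message simply reappears after the visibility timeout.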
Data Scientist
A large private hospital network serving millions of patients across specialties. The project was a healthcare data science and analytics platform designed to help doctors, insurers, and administrators predict patient risks, optimize resources, and detect fraud in claims. The product enabled data-driven healthcare decisions by integrating EHR records, billing data, and lab results into unified pipelines and applying predictive modeling, forecasting, and fraud detection.
- Developed predictive models in Python, R, and Scikit-learn for readmission risk, ICU demand forecasting, and chronic disease modeling.
- Designed fraud detection pipelines using XGBoost, PostgreSQL, and AWS Athena to identify anomalies in claims data.
- Automated EHR data preprocessing with FHIR/HL7 standards, Pandas, and AWS Glue to streamline feature engineering and modeling workflows.
- Applied unsupervised learning techniques such as K-Means and PCA for patient segmentation based on comorbidity and medication adherence patterns (see the sketch after this list).
- Built interactive Power BI and Tableau dashboards for hospital administrators and insurers to track utilization, costs, and fraud alerts.
- Used SHAP and statistical validation methods to improve model explainability and ensure confidence in decision-making.
- Implemented batch scoring scripts on AWS EC2 with Cron and Glue to operationalize regular model inference on healthcare data.
- Maintained reproducible workflows using Git, Jupyter, and Markdown while collaborating with teams through Jira, Slack, and Confluence.
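A compact sketch of the segmentation approach from the K-Means/PCA bullet, with hypothetical feature names (comorbidity_count, adherence_rate, and so on) standing in for the real EHR-derived features.

```python
# Patient segmentation sketch: scale features, cluster with K-Means,
# and project to 2-D with PCA for inspection. Feature names are illustrative.
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

# Hypothetical per-patient features (comorbidity count, adherence rate, ...)
df = pd.read_csv("patients.csv")
cols = ["comorbidity_count", "adherence_rate", "age", "num_prescriptions"]

X = StandardScaler().fit_transform(df[cols])  # K-Means is scale-sensitive

kmeans = KMeans(n_clusters=4, n_init=10, random_state=42)
df["segment"] = kmeans.fit_predict(X)

# 2-D projection for visual sanity checks of the clusters
df[["pc1", "pc2"]] = PCA(n_components=2).fit_transform(X)

print(df.groupby("segment")[cols].mean())  # per-segment feature profiles
```

Standardizing before clustering keeps any single wide-range feature (for example, age) from dominating the distance metric.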
Data Scientist
One of India’s largest supermarket and hypermarket chains serving millions of daily shoppers. The project focused on developing early-stage data science and analytics solutions to optimize customer segmentation, inventory forecasting, and marketing effectiveness. The product was a set of analytics and predictive modeling solutions built on retail transaction, POS, and inventory data.
- Built data cleaning and transformation pipelines with Python, R, SQL, and Excel, standardizing POS and inventory data for reliable downstream analytics.
- Developed early-stage regression, classification, and ARIMA forecasting models in scikit-learn and R, improving demand forecasting and customer segmentation accuracy (a minimal ARIMA example follows this list).
- Performed feature extraction and correlation analysis using Pandas, Excel, and RStudio, identifying key drivers for marketing and inventory decisions.
- Leveraged Amazon S3 and EC2 with Hive queries for batch training and reporting, supporting large-scale data storage and compute needs.
- Designed dashboards and reports in Tableau, Excel, Matplotlib, and Seaborn, enabling stakeholders to track promotions, stock turnover, and customer behavior.
- Conducted EDA, statistical tests, and dataset validation with Pandas, R, and SQL, ensuring quality inputs and trustworthy results.
- Maintained code reproducibility and documentation using Jupyter, RMarkdown, Word, and PowerPoint, supporting collaboration and audit readiness.
- Supported dashboard enhancements and deployment reviews, contributing to production scoring tools and peer-reviewed Python workflows.
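A minimal sketch of the ARIMA demand forecast from the bullet above; the file, column names, and (p, d, q) order are illustrative assumptions rather than the production configuration.

```python
# Minimal ARIMA demand-forecast sketch with statsmodels.
# File, column names, and the (p, d, q) order are illustrative assumptions.
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

# Hypothetical weekly unit sales for one SKU, indexed by week
sales = (
    pd.read_csv("weekly_sales.csv", parse_dates=["week"])
    .set_index("week")["units_sold"]
    .asfreq("W")  # enforce a regular weekly frequency
)

# Order chosen here for illustration; in practice select via AIC/ACF/PACF
model = ARIMA(sales, order=(1, 1, 1)).fit()

forecast = model.forecast(steps=8)  # next 8 weeks of expected demand
print(forecast)
```

Differencing once (d=1) handles the trend typical of retail demand series; seasonal promotions would call for a seasonal extension such as SARIMA.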