Hemed

From Turkey (UTC+3)

Data ScientistSenior

Machine Learning EngineerSenior

AI Engineer

11 years of commercial experience

Administration

Architecture

Biotech

Consumer services

Customer support

Healthcare

Information services

Insurance

Legal tech

Machine learning

Media

Real estate

Travel

AI software

NLP software

Platforms

Lemon.io stats

3

projects done

800

hours worked

2

offers now 🔥

Skills and seniority verified on Aug 8, 2022

Hemed – Python, Flask, Machine learning

Senior Data Scientist with solid theoretical foundations (Ph.D.), hands-on technical experience, and proven people skills. Hemed has achieved remarkable results working on a diverse range of machine learning models, including non-linear optimization, time series forecasting, recommendation and ranking systems, text-to-speech conversion, and machine translation. Undoubtedly, this knowledgeable engineer with a bright personality will be a brilliant addition to any team.

Main technologies

Python

6 years

Flask

6 years

Machine learning

6 years

Additional skills

AWS

ETL

NumPy

Pandas

Snowflake

PostgreSQL

Tensorflow

PyTorch

GCP

Microsoft Azure

Deep Learning

Chroma

Transformer models

DevOps

Docker

Kubernetes

LangChain

LLM

Django

OpenAI API

Vertex AI

AWS CloudFormation

Hugging Face

MLOps

Direct hire

Possible

Ready to get matched with vetted developers fast?

Let’s get started today!

Experience Highlights

Chief AI Scientist

Jul 2024 - Mar 20258 months

Project Overview

An AI Workforce for Medical Practices aiming to increase practice revenue and profits by reducing claim submission overhead and filling gaps in the workforce in minutes, not months.

Responsibilities:

Created the ML solutions architecture;
Led the AI development team;
Owned the MLOPs pipeline from data acquisition to model deployment and monitoring.

Project Tech stack:

Vertex AI

GPT-4

Python

GCP

MLOps Engineer

Jan 2024 - Jun 20244 months

Project Overview

A real estate risk management firm that uses AI to measure, mitigate, and profit from insurable risk.

Responsibilities:

Hemed designed, developed, and maintained the complete life cycle of a model that predicts risk scores from customer data for precise risk assessment. The system facilitated continuous model improvement based on new data.

Project Tech stack:

AWS SageMaker

AWS CloudFormation

AI scientist

Aug 2023 - Mar 20246 months

Project Overview

A burgeoning platform specializing in text-to-speech conversion. In this endeavor, the team focused on enhancing the core product of the company: a sophisticated text-to-speech engine.

Responsibilities:

Hemed's work was centered on the analysis of various elements within a given movie or play script in PDF format. Particularly, he worked on:

Extraction of Text: He employed OCR technologies to seamlessly extract textual content from the script PDF, ensuring accuracy and fidelity.
Parsing and Segmentation: Leveraging cutting-edge parsing algorithms, he delineated distinct sections within the extracted text, including scenes, dialogues, and speaker attributions.
Voice Allocation: Employing state-of-the-art techniques, his work discerned the unique voices of each character involved in the conversation and allocated distinct vocal characteristics accordingly, enhancing the immersive experience for the audience.
Soundscape Integration: Recognizing the importance of ambiance and setting, Hemed curated background soundscapes and location effects tailored to each scene, seamlessly embedding them into the conversation to enrich the auditory experience.
Release: Finally, he presented a fully realized spoken rendition of the original written play, adapted to preserve its essence while elevating it to an immersive audio format.

Project Tech stack:

Django

LLM

OpenAI API

Head of AI

Jan 2023 - May 20233 months

Project Overview

An up-and-coming healthcare startup that launched a new product that makes the life of medical coders easy by automatically assigning relevant medical codes to handwritten discharge summaries.

Responsibilities:

Studied design, requirement elicitation, and data collection;
Trained four code-prediction models (Information retrieval based, dictionary-based, sequence-to-sequence, and multilabel classification models);
Evaluated the code prediction models on several datasets;
Created the back-end code prediction API leveraging the best-performing model.

Project Tech stack:

PyTorch

Tensorflow

PostgreSQL

GCP

Microsoft Azure

Tech lead

Nov 2022 - Jan 20232 months

Project Overview

A lawsuit is a long (5-100 pages) document detailing a legal case. The goal of this project is to train a summarization model and expose an API endpoint in such a way that, when given a lawsuit, the document returns its short (less than 500 words) summary. The summary should contain all important case details and be written in such a language that a person not trained in the legal domain can understand.

Responsibilities:

Created front-end and back-end architecture;
Conducted data studies to find the best datasets for training and evaluating the summarization task;
Evaluated different pre-trained summarization models to see which one best fits our needs;
Debugged and resolved the application issues.

Project Tech stack:

API

Chroma

Deep Learning

FastAPI

Selenium

Data Scientist / Full-stack Developer

Dec 2021 - Jan 20221 month

Project Overview

A tool for indexing Vimeo videos. The tool makes it possible to search Vimeo videos not only by the apparent text content (i.e., video title, tags, description, and create date) but also by the spoken words and phrases in the video itself. It was originally written for Viz media (the American manga publisher).

Responsibilities:

Implemented the API integration for fetching and updating video metadata to and from Vimeo;
Developed a text-to-speech model to generate subtitles for videos that did not contain subtitles automatically;
Implemented the whoosh index to index over 5000 videos that were uploaded anywhere from 2012-2022. The videos were of 3 minutes to 1hr in length;
Created a front-end web interface to interact with the indexer. The indexer allowed for automatic updating of the index with newly uploaded videos and searching for videos containing desired search phrases;
Hosted the index on the AWS-EBS instance.

Project Tech stack:

Flask

Python

HTML

CSS

REST API

Vimeo API

Mozilla deepscpeech

Whoosh index

Tech Lead

Aug 2021 - Nov 20212 months

Project Overview

A title block is an information box usually found in the bottom right-hand corner of an architectural drawing. The block indicates drawing details such as the title, author name, scale, version, and date of the drawing. In this project, Hemed developed a desktop application for automating the extraction of the title block information on PDFs/images of the drawings and automatically populating the fields in the cloud-based database. The main challenge was that different Architects or Engineering firms have different orientations or placements of the title block on the drawing sheets. The end result needed to be pretty smart to identify the location of the title block, its orientation, and the right attributes in it. The project was for a client from the Architectural Engineering and Construction (AEC) industry.

Responsibilities:

Trained a YOLO model to identify the location of the title block in any given architectural drawing;
Implemented a tesseract backend to re-orient the PDF and extract title block text rightly;
Implemented the TKinterbased desktop application as an interface for the process.

Project Tech stack:

SQL

Python

Git

pytesseract

Optical character recognition (OCR)

YOLO

TKinter

Amazon RDS

Senior Data Scientist

Apr 2021 - Jun 20212 months

Project Overview

An Australia-based travelling agency needed a way to scrape the web for all wonderful destinations. The goal was to build a mobile application that, when a user searches for places to visit in a particular location, they are presented with an exhaustive list of such destinations. The list would include the location name, address, images of the place, heading, and description.

Responsibilities:

Developed a dynamic web scraper that fits all potential web pages as a flask API endpoint hosted as an AWS Lambda instance;
Ran an Entity recognition model to identify location names in the scrapped text;
Implemented the Flutter-based mobile application to interact with the API.

Project Tech stack:

Beautiful soap

Selenium

Flask

Flutter

Lambda

Google places API

Spacy

Lead Machine Learning Engineer

Jan 2021 - Feb 20211 month

Project Overview

The tool extracts Frequently Asked Questions (FAQs) from a given email service. A client company receives hundreds of support emails every week. Each email thread contains back-and-forth conversations between the developers' team, the customer support team, and the customer. The tool swift through millions of email threads to identify the commonly asked questions. The questions are then well-contracted and presented on the FAQ page of the organization's website.

Responsibilities:

Managed the API connections between Outlook and the local development environment;
Trained a question identification/classification model by using transfer learning;
Ran the clustering algorithm based on a sentence transformer to identify similar questions;
Designed and implemented an ETL system to continuously fetch new emails and identify whether and where they fit in the FAQ database.

Project Tech stack:

BERT

Transformer models

Microsoft outook API

RAKE

Text clustering algorithms

Keywords extraction models

NoSQL

ETL

Keep in mind, the experience summary might exclude non-relevant projects

Education

2018

Computer Science

Master's

Languages

English

Advanced

Hire Hemed or someone with similar qualifications in days

All developers are ready for interview and are are just waiting for your request