Burcin Sarac, Developer in Istanbul, Turkey
Burcin is available for hire
Hire Burcin

Burcin Sarac

Verified Expert  in Engineering

Data Scientist and Software Developer

Location
Istanbul, Turkey
Toptal Member Since
August 13, 2021

Burcin是一位经验丰富的数据科学家和人工智能开发人员,拥有该领域的硕士学位,并获得了机器学习和人工智能的认证. With a strong command of Python and its ecosystem, he has extensive hands-on experience across various AI and ML technologies. Burcin's current focus lies in the advancements of large language models (LLMs), 专注于任务自动化以及在云环境中开发和部署人工智能产品, particularly on Google Cloud Platform (GCP).

Portfolio

n11.com
Google Cloud ML, Google Cloud Platform (GCP), Natural Language Processing (NLP)...
Onyx Relations Corp
Artificial Intelligence (AI), GPT, Twitter API, Reddit API...
Sole Entrepreneurship in US
Trading, Artificial Intelligence (AI), Data Science, Data Analysis...

Experience

Availability

Part-time

Preferred Environment

Ubuntu 20.04, Python 3, Jupyter Notebook, PyCharm, Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Google Cloud Platform (GCP), Amazon Web Services (AWS)

The most amazing...

...我建立的是一个倾向模型,可以用更低的成本来改善营销运作. It improved customer return rates by about 20%.

Work Experience

Senior Data Scientist

2022 - 2024
n11.com
  • Constructed customer data pipelines for daily, weekly, and monthly generated features based on customer transactions. Scheduled jobs to generate tables in BigQuery using Python.
  • 重新设计并改进了流失模型,以检测流失并使用客户交易作为原始数据计算客户生命周期价值.
  • 根据使用平台活动日志和事务的客户行为对客户进行细分.
  • Developed and deployed a custom chatbot using customer interaction data. 在GCP中创建了自定义模型端点,在Cloud Run中创建了API,并为计划的模型再训练操作设计了Kubeflow管道.
  • 设计了一个HTML页面,使用办公室屏幕跟踪实时订单数量,并使用HTML动画来庆祝目标命中, CSS, and JavaScript together with FastAPI in the back end.
  • 作为团队的一员,在一个定制的内部推荐系统开发项目中工作,并为整个项目生命周期的设计做出贡献, including the API design.
  • Designed and developed fraud and counterfeit product detection approaches, including image recognition, TF-IDF, lemmatization stemming and text embedding generation, and keyword extraction.
Technologies: Google Cloud ML, Google Cloud Platform (GCP), GPT, Natural Language Processing (NLP), Python 3, Python, Google BigQuery, BigQuery, Apache Airflow, Cron, Cloud Dataflow, Machine Learning, Deep Learning, Unsupervised Learning, Customer Segmentation, Classification, Data Analysis, Data Science, Data Engineering, Data Pipelines, Natural Language Toolkit (NLTK), SpaCy, Artificial Intelligence (AI), Google Cloud Functions, Google Cloud, Kubeflow, APIs, Flask, Chatbots, HTML, CSS, JavaScript, OpenAI, ChatGPT, TensorFlow, Time Series, Beautiful Soup, Clustering, Supervised Machine Learning, Scikit-learn, Apache Beam, Large Language Models (LLMs)

AI Developer

2023 - 2023
Onyx Relations Corp
  • Developed a bot capable of posting about specific topics, press releases, and engaging with users on social media platforms.
  • 集成和利用LLM/GPT技术,实现对用户交互的有机和上下文相关的响应.
  • Implemented functionalities to detect and respond to relevant threads, discussions, and trends across Twitter and Reddit.
  • Deployed all the processes to Google Cloud Platform using various technologies, such as Cloud Run, Cloud Functions, BigQuery, and Cloud Scheduler, among others.
Technologies: Artificial Intelligence (AI), GPT, Twitter API, Reddit API, Generative Pre-trained Transformers (GPT), OpenAI, OpenAI GPT-4 API, Web Scraping, Natural Language Processing (NLP), Automation, Google Cloud Platform (GCP), Google Cloud Functions, Google Cloud, Docker, BigQuery, Machine Learning Operations (MLOps), ChatGPT, TensorFlow, Beautiful Soup, Scikit-learn, Large Language Models (LLMs)

Data Scientist | AI Developer

2023 - 2023
Sole Entrepreneurship in US
  • 利用价格相关数据跟踪美国股市趋势策略,开发并进行回测.
  • 通过连接股票市场api,使用Python根据回测结果自动执行成功交易策略.
  • Deployed all fully automated trading bots on the cloud, allowing the user to change parameters and start/stop them through a clean front screen.
  • 创建单独的BigQuery表来记录每个交易机器人的关闭交易,并通过过滤选项将交易结果可视化,让用户使用Looker Studio分析机器人的性能.
Technologies: Trading, Artificial Intelligence (AI), Data Science, Data Analysis, Algorithmic Trading, Trend Analysis, Google Cloud, Google Cloud Platform (GCP), Google BigQuery, Looker, API Integration, Finance APIs, Finance, Time Series, Scikit-learn

Senior Applied Scientist

2022 - 2022
Magnify
  • 在一个售后自动化和编排平台开发项目中担任ML模型开发人员. Segmented customers based on Salesforce platform usage attributes.
  • Gathered, transformed, 并总结特征,定义了一种基于规则的客户流失算法,以检测客户之间可能的流失.
  • Connected to AWS VM Instance using SSH from the local machine, set up ML Flow experiment tracking records in an S3 bucket in AWS, and generated experiment track reports using Prefect.
Technologies: Python 3, Machine Learning Operations (MLOps), Clustering, Unsupervised Learning, Amazon SageMaker, Amazon Web Services (AWS), Artificial Intelligence (AI), Data Engineering, Python, Statistics, Data Science, Scikit-learn, Docker, Time Series

Senior Data Scientist

2021 - 2022
Intertech (Emirates NBD Bank)
  • 开发NLP模型,使用索赔文档总结文本,对客户请求进行分类,并将其转发给相关部门.
  • Summarized effort logs of employees were collected as time series data, and then future efforts were estimated for planning future employee capacity requirements.
  • 建立异常检测模型,检测发票支付中的异常情况,并实施电子邮件警报系统,以便相关团队及时干预.
  • Constructed pipelines for gathering data from various sources such as relational databases and HTML or Excel files to generate reports; these were published via Power BI.
Technologies: Python 3, Microsoft SQL Server, Microsoft Power BI, Financial Modeling, Trend Forecasting, Natural Language Processing (NLP), GPT, Natural Language Understanding (NLU), Data Analysis, Microsoft Azure, Data Visualization, Artificial Intelligence (AI), Data Engineering, Python, ETL, SQL, Data Pipelines, Data Analytics, Data Science, Statistics, Natural Language Toolkit (NLTK), SpaCy, Time Series, Clustering, Supervised Machine Learning, Scikit-learn

Senior Data Scientist

2020 - 2021
Sekerbank (Samruk — Kazyna Invest LLP)
  • 建立并提出零售贷款产品和贷款账户的倾向模型,以确定客户购买这些产品的倾向.
  • 开发并实现了基于资产对零售客户进行细分的聚类算法, liabilities, and product ownership.
  • 整理和分类客户对产品和服务的投诉文本,生成每周报告.
  • 开发基于客户产品所有权的购物篮分析项目,以改进营销活动.
  • Constructed pipelines for the parsing and analysis of customer data for daily, weekly, and monthly executive reports to automatize report preparation.
Technologies: Python 3, Oracle SQL, Predictive Modeling, Classification, Trend Forecasting, Machine Learning, Supervised Machine Learning, Machine Learning Operations (MLOps), Data Engineering, SQL, Python, Data Science, Data Analysis, Data Analytics, Data Pipelines, ETL, Scikit-learn, Pandas, Forecasting, Natural Language Toolkit (NLTK), Artificial Intelligence (AI), Time Series, Clustering

Data Scientist

2019 - 2020
Vakifbank
  • 为零售和中小企业客户开发和部署产品倾向模型,以检测客户是否可能购买,并改进营销计划的客户定位.
  • Constructed a customer segmentation model based on the customer's balance account, transactions, credit cards, and loan usage behaviors.
  • 调查和更新当前使用的预测模型,以提高预测性能并简化结果.
  • 改进的报告生成管道,使基于客户数据的准备过程自动化.
Technologies: Python 3, Oracle SQL, Classification, Machine Learning Operations (MLOps), Clustering, Unsupervised Learning, Supervised Learning, Python, Statistics, Data Pipelines, Data Science, Data Analysis, Data Analytics, SQL, ETL, Machine Learning, Financial Modeling, Trading, Algorithmic Trading, Artificial Intelligence (AI), Finance, Finance APIs, Time Series, Supervised Machine Learning, Scikit-learn

Lyrics Generator | A Web Scrapping and Lyric Generation Project

http://github.com/burcins/LyricsGenerator
In this self-developed project, I aimed to generate lyrics by using lyrics of the entire discography of a given performer. I developed my model using Bob Dylan lyrics, but it is open for new trials.

In the first step, 我通过Beautiful Soup软件包解析了网页上的歌词,然后进行了清理,并为模型开发做好了准备. After that, 我创建了一个有几个层的双向LSTM模型,然后用一百次迭代来训练它. Eventually, I provided the initial words for the trained model and it predicted an additional 100 words.

Twitter Sentiment Analysis

http://github.com/burcins/Twitter-Sentiment-Analysis
在这个项目中,我的目标是获得最新的Twitter tweet和干净的字符串. Afterward, 我会对每条推文逐一进行情绪分析,并为它们分配分数,以确定推文的积极或消极.

ATM Cash Demand Forecasting

http://github.com/burcins/Time-Series-Forecasting
该项目的主要目的是通过使用一年的每日存款和取款日志来预测下个月atm的每日现金需求.

The dataset included three features: Cash In, Cash Out, and Date. It also contains 1,186 observations in total which correspond to 1,186 days starting from 01/01/2016 to 03/31/2019. Eventually, 预计将分别预测2019年4月1日至2019年4月30日之间的现金流入和现金流出价值.

Term Deposit Propensity Prediction

http://github.com/burcins/Term-Deposit-Propensity-Prediction
项目的主要目标是建立一个端到端的机器学习项目,利用呼叫中心的数据来预测客户的定期存款购买倾向. In other words, we tried to predict the probability of customers purchasing a term deposit. In addition, 最后一部分用于客户聚类,以识别更有可能购买投资产品的客户.

该数据包含40,000条客户数据,具有14个特征,包括定期存款所有权.

Text Summarizer

http://huggingface.co/spaces/Burcin/ExtractiveSummarizer
For this project, my primary aim was to summarize texts based on their content. I developed a model and deployed it to Hugging Face with an interface. This interface allows users to summarize Wikipedia content. 唯一的要求是从维基百科中获取主题及其收集的内容. For summarization, this model uses two different extractive summarization methods. The number of sentences in the output depends on the length of the original text.

Languages

Python 3, Python, SQL, SAS, R, HTML, CSS, JavaScript

Libraries/APIs

Pandas, Scikit-learn, Twitter API, TensorFlow, Beautiful Soup, Natural Language Toolkit (NLTK), SpaCy, Reddit API

Tools

BigQuery, PyCharm, Microsoft Power BI, Amazon SageMaker, Apache Airflow, Cron, Cloud Dataflow, Grafana, Looker, Apache Beam

Paradigms

Data Science, ETL, Automation

Platforms

Jupyter Notebook, Google Cloud Platform (GCP), Docker, Kubeflow, Amazon Web Services (AWS)

Storage

Google Cloud, Microsoft SQL Server, Oracle SQL, MySQL, Data Pipelines, MongoDB, Cassandra, Redis, NoSQL

Other

Machine Learning, Time Series, Classification, Clustering, Unsupervised Learning, Supervised Machine Learning, Data Analysis, Supervised Learning, Artificial Intelligence (AI), Data Analytics, Regression, Google BigQuery, Data Processing Automation, Google Cloud Functions, Finance, Ubuntu 20.04, Deep Learning, Statistics, Natural Language Processing (NLP), Text Classification, Web Scraping, Machine Learning Operations (MLOps), Time Series Analysis, Financial Modeling, Trend Forecasting, Microsoft Azure, Data Visualization, Data Engineering, Trading, Algorithmic Trading, Financial Markets, Capital Markets, Stock Market, Stock Trading, Stock Exchange, Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNN), Long Short-term Memory (LSTM), Text Categorization, GPT, OpenAI, OpenAI GPT-4 API, ChatGPT, Finance APIs, APIs, Chatbots, Large Language Models (LLMs), Predictive Modeling, Natural Language Understanding (NLU), Forecasting, Stock Price Analysis, Stock Market Techinical Analysis, Financial Marketing, Big Data, Social Media Analytics, Sequence Models, Data Cleaning, Google Cloud ML, Customer Segmentation, MLflow, Prefect, Trend Analysis, API Integration, Generative Pre-trained Transformers (GPT)

Frameworks

Flask

2018 - 2020

Master's Degree in Business Analytics

Athens University of Economics and Business - Athens, Greece

2011 - 2013

Master's Degree in Capital Markets

Marmara University - Istanbul, TURKEY

SEPTEMBER 2022 - PRESENT

MLOps Zoomcamp

DataTalks.Club

NOVEMBER 2020 - PRESENT

Natural Language Processing Specialization

Coursera

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring