Henrik Sergoyan,德国巴伐利亚州慕尼黑的开发人员
Henrik is available for hire
Hire Henrik

Henrik Sergoyan

Verified Expert  in Engineering

数据科学家和机器学习开发人员

Location
慕尼黑,巴伐利亚,德国
Toptal Member Since
November 5, 2021

Henrik是一位拥有超过六年专业经验的数据科学家. His primary expertise includes and is not limited to natural language processing, forecasting algorithms, 表格数据的梯度增强算法, data scraping, 和机器学习操作(MLOps). 作为一名资深数据科学家, Henrik使用SQL和NoSQL数据库, including MongoDB, 并为他的工作带来了强大的项目管理技能和卓越.

Portfolio

Toptal Client
Python,数据科学,MongoDB, Jupyter Notebook, ETL,数据可视化...
车站赌场有限责任公司-主要
机器学习,SQL, Python, Linux, RapidMiner, Windows...
Fozzy Group
Python, PyCharm, MySQL,时间序列分析...

Experience

Availability

Part-time

Preferred Environment

Windows, MacOS, Slack, PyCharm, Jupyter Notebook, Visual Studio Code (VS Code)

The most amazing...

...thing I've developed is an end-to-end data science pipeline for two different platforms used by the Armenian government.

Work Experience

高级数据科学顾问

2022 - 2022
Toptal Client
  • Developed compound Aggregation pipelines in MongoDB to process a large number of nested documents in a given collection.
  • Created a system that identifies bugs in the data processing stage where structured information was derived from PDF reportings of charity organizations. With the help of my system, we could detect and fix all inconsistencies in the database.
  • Created a user-friendly Streamlit dashboard (MVP) that serves as a user's charity navigator. I've developed interactive visualization (Sankey diagram) for each charity that shows the flow of money (from revenue to expenses) across the year.
Technologies: Python,数据科学,MongoDB, Jupyter Notebook, ETL,数据可视化, Streamlit, Data, Data Analysis, API Integration, Analytics, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis

机器学习专家

2021 - 2022
车站赌场有限责任公司-主要
  • Developed a system that identifies customers who are going to leave the building (in the 15-minute interval), taking into account 42 variables that describe the past and current behavior of the client.
  • 开发复杂的SQL查询,从SQL数据库中提取实时数据.
  • 使用RapidMiner部署将模型留在生产环境中的机会.
Technologies: 机器学习,SQL, Python, Linux, RapidMiner, Windows, 渐变增强树, Deep Learning, Deep Neural Networks, Predictive Learning, Data, Data Analysis, API Integration, Data Science, Analytics, NoSQL, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis

Senior Data Scientist

2021 - 2021
Fozzy Group
  • 创建并实施促销产品的销售预测模型.
  • Deployed a promotional forecasting model and implemented a monitoring system for that model.
  • Assisted in improving the recommender system of the Ukrainian biggest grocery stores, 包括特征工程和建模.
  • 为销售预测模型创建了Power BI仪表板,以分析错误.
  • Led the model deployment by communicating with relevant stakeholders to identify business needs, 创建系统架构, 并协助后端团队以最优化的方式部署我们的模型.
Technologies: Python, PyCharm, MySQL,时间序列分析, 机器学习操作(MLOps), 推荐系统, Microsoft Power BI, LightGBM, CatBoost, XGBoost, Graylog, RabbitMQ, Flask, REST, Windows, Slack, Jupyter Notebook, Data Mining, Data Engineering, SQL, ETL, Machine Learning, 人工智能(AI), 数据科学产品经理, Azure SQL, Ensemble Methods, BERT, TensorFlow, Data Science, Deep Learning, Keras, Statistics, PySpark, 亚马逊网络服务(AWS), Dashboards, RStudio Shiny, Tableau, Predictive Learning, 渐变增强树, Reporting, Data Analytics, Data Analysis, Data Reporting, Web Scraping, Time Series, BigQuery, Statistical Analysis, Model Development, Pandas, PyTorch, Software Engineering, Mathematics, Data Visualization, Source Code Review, Task Analysis, Interviewing, Data, API Integration, Predictive Analytics, Analytics, NoSQL, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis, NumPy

高级数据科学顾问

2019 - 2021
亚美尼亚国家可持续发展目标创新实验室|联合国开发计划署办事处
  • 开发了有史以来第一个人工智能驱动的实时工具travelinsights.数据分析,使用人工智能来收集数据, analyze, 并将Tripadvisor上关于亚美尼亚的旅游评论可视化, Facebook, and Booking.com.
  • 创建了实时平台Edu2Work,刮了60多个,000个在线欧博体育app下载, extract and standardize relevant information from the unstructured job descriptions, 并在仪表板上显示分析结果.
  • 开发监控平台sdglab的数据科学部分.am/ zh /亚美尼亚可持续发展目标监测项目. This is a user-friendly, AI-powered, open-access interactive online tool for data analytics.
  • Built a citizen request classification model to increase the Armenian government's operational efficiency, assigning requests made by Armenian citizens to the corresponding ministries and departments.
  • 管理一个数据科学团队. 从项目初期开始参与项目策划, 为每个任务制定了工作分解结构(WBS), and managed communication between the data science team and the lab executives.
Technologies: Python, GPT, 生成预训练变压器(GPT), 自然语言处理(NLP), TensorFlow, Google Cloud, BERT, Transformers, 零射击学习(ZSL), Few-shot Learning, Word2Vec, Clustering, GRAPH, FbProphet, CATS Forecasting, Ensemble Methods, Data Scraping, ETL, MongoDB, Selenium, Social Media APIs, Project Design, Design Thinking, 敏捷项目管理, Windows, MacOS, Slack, PyCharm, Jupyter Notebook, Data Mining, Unsupervised Learning, Data Engineering, 机器学习操作(MLOps), Machine Learning, 人工智能(AI), 数据科学产品经理, Data Science, 命名实体识别(NER), Deep Learning, Keras, Scikit-learn, Dashboards, RStudio Shiny, Linux, Predictive Learning, 渐变增强树, Deep Neural Networks, Reporting, Data Analytics, 谷歌云平台(GCP), Data Analysis, Data Reporting, Web Scraping, Time Series, Statistical Analysis, Model Development, Pandas, PyTorch, Software Engineering, Mathematics, Data Visualization, Technical Hiring, Code Review, Source Code Review, Task Analysis, Interviewing, Team Management, Data, API Integration, Predictive Analytics, Office 365, Analytics, NoSQL, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis, NumPy

Teaching Associate

2019 - 2020
亚美尼亚美国大学
  • Supervised a team of senior students for their Capstone project focusing on real estate market analytics in Armenia. 开发数据提取模型, 室内设计分类, distance calculation, 以及最优的价格估计.
  • Conducted weekly problem-solving sessions with 20 BSc and MSc students for the Statistics course. 根据所讨论的主题,解释了一组独特问题的解决方案.
  • Assisted in creating the syllabus and agenda for the Natural Language Processing and Statistics courses.
  • 指导学生完成顶点项目, 一些与房地产市场相关的新闻分析.
技术:统计数据, Bayesian Statistics, 自然语言处理(NLP), GPT, 生成预训练变压器(GPT), University Teaching, Supervisor, Real Estate, Web Scraping, Data Collection, BigQuery, Statistical Analysis, PyTorch, Mathematics, Technical Hiring, Code Review, Task Analysis, Interviewing, Data, GIS, RStudio, Predictive Analytics, Office 365, Sports, Data Science, Analytics, NoSQL, Database Management, 统计编程, Sentiment Analysis, NumPy

Data Scientist

2018 - 2019
Ameriabank
  • 为银行员工创建并部署了一个基于人工智能的虚拟助手. Reduced the operational efficiency of the bank's internal communications by 120%.
  • Developed forecasting algorithms for financial market indicators, commodities, prices, and sales.
  • Performed customer segmentation analysis based on their transactions and activity.
Technologies: Python, SQL, 自然语言处理(NLP), 生成预训练变压器(GPT), GPT, Windows, Slack, PyCharm, Jupyter Notebook, Data Mining, Data Scraping, Unsupervised Learning, Data Engineering, ETL, Machine Learning, 人工智能(AI), Ensemble Methods, 零射击学习(ZSL), BERT, TensorFlow, Google Cloud ML, Data Science, 命名实体识别(NER), Statistics, Bayesian Statistics, Scikit-learn, Dashboards, RStudio Shiny, Linux, Predictive Learning, 渐变增强树, Reporting, Data Analytics, 谷歌云平台(GCP), Sports, Data Analysis, Data Reporting, Web Scraping, Data Collection, Time Series, Statistical Analysis, Model Development, Pandas, Mathematics, Data Visualization, Code Review, Source Code Review, Task Analysis, Team Management, RStudio, Predictive Analytics, Office 365, Analytics, NoSQL, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis, NumPy

数据科学家|统计学家

2017 - 2018
ClinChoice
  • Recognized inconsistencies in datasets while preparing SAS programs before a database lock.
  • 开发SAS程序生成表格, listings, and graphs according to the specifications indicated in the statistical analysis plan (SAP).
  • Created, validated, and documented the SAS programs by good clinical programming practices and according to applicable guidelines and the client's standard operating procedures.
Technologies: SAS, SAS SQL, Windows, Slack, Data Mining, ETL, Ensemble Methods, BERT, Bayesian Statistics, R, Predictive Learning, Reporting, Data Analytics, Data Analysis, Data Reporting, Web Scraping, Data Collection, Statistical Analysis, Pandas, RStudio, Predictive Analytics, Office 365, NoSQL, Database Management, 统计编程, Statistical Modeling, Sentiment Analysis, NumPy

劳动力市场信息平台| Edu2Work

http://edu2work.am/
The Edu2Work platform was developed in response to the dynamic nature of the labor market and the ongoing mismatch between the demand and supply of talent in Armenia. The platform employs cutting-edge natural language processing (NLP) models to gather and analyze thousands of online job postings from a range of commercial websites. By doing so, 它提供全面的, 亚美尼亚劳动力市场的最新数据, 使个人能够做出明智的职业决定.
The development of Edu2Work involved the design and implementation of an end-to-end data science pipeline, 包含高效和灵活的数据摄取, 信息提取与标准化, 数据可视化. Core NLP tasks performed during the project included job title standardization according to European standards, 行业分类, 技能提取和分类(软/硬), 和学位提取(BSc), MSc, PhD, None). These tasks were instrumental in enabling the platform to provide high-quality labor market data in a user-friendly and accessible format.

促销预测

In this project, I've developed an end-to-end pipeline for forecasting the sales model of promotional products in the largest retail stores in Ukraine. The model considers over 30 features to accurately predict the sales of products planned to be in a promotion. 在内部部署之后, the model has increased the operational efficiency of the commerce team deciding on the type and amount of promotion, 后勤团队, 在每个分支机构分配足够的资源.

旅游分析平台

http://www.travelinsights.ai/
I developed an AI-powered real-time data analytics tool for the tourism sector in Armenia. The online tool uses travel storytelling and artificial intelligence to collect, analyze, 并将Tripadvisor上关于亚美尼亚的旅游评论可视化, Facebook, and Booking.com. 通过实时分析和可视化的游客评论, the tool reveals actual travel preferences and on-the-ground issues in Armenia. With one scroll, policymakers, businesses, or tourists can explore insights from all over the world about different regions and locations of Armenia.
2022 - 2022

Ph.D. 数据科学学位

亚美尼亚欧洲大学-埃里温

2020 - 2022

数据科学数学硕士学位

慕尼黑工业大学-慕尼黑,德国

2019 - 2021

统计学硕士学位

埃里温州立大学-埃里温

2015 - 2019

计算机科学学士学位

亚美尼亚美国大学-埃里温,亚美尼亚

Libraries/APIs

CatBoost, XGBoost, Pandas, NumPy, TensorFlow, Keras, Scikit-learn, PyTorch, Social Media APIs, PySpark

Tools

Slack, PyCharm, 命名实体识别(NER), Visual Studio, Tableau, BigQuery, GIS, Microsoft Power BI, Graylog, RabbitMQ, Supervisor, AutoML

Frameworks

LightGBM, Selenium, RStudio Shiny, Flask, Streamlit

Languages

Python, R, SQL, SAS

Paradigms

ETL, Data Science, Design Thinking, 敏捷项目管理, REST, Automation

Platforms

MacOS, Jupyter Notebook, RStudio, Windows, Linux, Azure, 亚马逊网络服务(AWS), 谷歌云平台(GCP), Visual Studio Code (VS Code), RapidMiner

Storage

数据库管理,MongoDB, MySQL, Google Cloud, SAS SQL, NoSQL, Azure SQL

Other

Data Mining, Data Scraping, 自然语言处理(NLP), Word2Vec, FbProphet, Ensemble Methods, Machine Learning, 人工智能(AI), Deep Learning, Statistics, Dashboards, 渐变增强树, Reporting, Data Analytics, Fantasy Sports, Data Analysis, Data Reporting, Web Scraping, Data Collection, Time Series, Statistical Analysis, Model Development, Mathematics, Data Visualization, Task Analysis, Interviewing, Data, Predictive Analytics, Sports, Football, Analytics, 统计编程, Statistical Modeling, Sentiment Analysis, GPT, 生成预训练变压器(GPT), Unsupervised Learning, Data Engineering, 计算统计数据, 机器学习操作(MLOps), Dash, Time Series Analysis, BERT, Transformers, 零射击学习(ZSL), Few-shot Learning, Project Design, 数据科学产品经理, Bayesian Statistics, Predictive Learning, Deep Neural Networks, University Teaching, Real Estate, Technical Hiring, Code Review, Source Code Review, Team Management, API Integration, Office 365, Google Cloud ML, 推荐系统, CATS Forecasting, Software Engineering, Agile Data Science, Graphs, Clustering, GRAPH, AppFolio

有效的合作

如何使用Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

开始你的无风险人才试验

与你选择的人才一起工作,试用最多两周. 只有当你决定雇佣他们时才付钱.

对顶尖人才的需求很大.

Start hiring