Sebastián Castaño,德国柏林的开发人员
Sebastián is available for hire
Hire Sebastián

Sebastián Castaño

Verified Expert  in Engineering

数据科学家和机器学习开发人员

Location
Berlin, Germany
Toptal Member Since
September 13, 2021

Sebastián has a PhD in machine learning and data science and a decade of experience in interdisciplinary projects in medicine, banking, marketing, and consumer products, among others. 他的专长包括设计数据收集系统, 分析和建模复杂的数据, 开发和部署机器学习管道. 作为一个经验丰富的研究者和教育家, Sebastián constantly delivers compelling data-driven insights and intuitive tools for technical and non-technical colleagues.

Portfolio

Stealth Startup
生成预训练变压器(GPT)...
全球食品和饮料公司
Python, Machine Learning, SQL, TensorFlow, Docker, 亚马逊网络服务(AWS)...
D-fine
Machine Learning, SQL, Python, R, Java, Neural Networks, Statistical Modeling...

Experience

Availability

Part-time

Preferred Environment

Windows, Linux, Spyder, PyCharm, Jupyter Notebook, Scikit-learn, Visual Studio Code (VS Code), Git, Docker

The most amazing...

...我开发的项目是一个基于ml的, closed-loop system for optimizing brain stimulation therapy in Parkinson's disease and essential tremor patients.

Work Experience

Co-founder

2022 - PRESENT
Stealth Startup
  • Co-founded a company that develops a knowledge management framework for IT teams.
  • Developed and implemented a system for the analysis of multi-domain corpora using transformer-based NLP models.
  • Designed and deployed AWS infrastructure for servicing a neural search engine.
技术:自然语言处理(NLP), 生成预训练变压器(GPT), 亚马逊网络服务(AWS), SQL, Docker, Generative Systems, OpenAI, Text Processing

机器学习工程师

2021 - 2022
全球食品和饮料公司
  • Set up and rolled out an existing media mix model for a new geographical market and product family as a member of an MLOps team.
  • Designed, implemented, and deployed a PoC for a media mix model based on Bayesian statistical modeling as a member of an ML R&D team.
  • 领导由五名机器学习工程师和数据科学家组成的团队. The team developed, benchmarked, productized, 并为营销团队部署了下一代媒体组合模式.
Technologies: Python, Machine Learning, SQL, TensorFlow, Docker, 亚马逊网络服务(AWS), Bayesian Statistics

Consultant

2021 - 2021
D-fine
  • 验证了一家大型德国银行的信用风险和会计模型.
  • Performed predictive and prescriptive statistical analysis of soccer players' data for injury prediction and talent development at a Bundesliga team.
  • 为一家大型欧洲银行部署数据管理系统.
  • 开发MLOps管道, 包括神经网络的架构优化, for an in-house project.
技术:机器学习, SQL, Python, R, Java, Neural Networks, Statistical Modeling, Predictive Analytics, Bayesian Statistics, Data Analysis, Data Analytics, Data Science, 机器学习操作(MLOps), Scikit-learn, Probability Theory, PyTorch, Statistics, Statistical Methods, Seaborn, Matplotlib, 数据库管理系统(DBMS), Data Warehousing, CI/CD Pipelines, Jupyter, Statistical Analysis, 人工智能(AI), NumPy, Consulting

博士研究助理

2014 - 2020
University of Freiburg
  • 开发了第一个基于机器学习的, closed-loop, 脑深部电刺激系统在自由活动患者中的应用. The projects related to this achievement were carried out in close collaboration with clinicians and industry partners.
  • Established data-driven adaptive deep brain stimulation as a novel research field in the university.
  • Published seven research articles in peer-reviewed scientific journals and 10+ contributions to scientific workshops and conferences in the fields of machine learning, data science, and neuroscience.
  • Supported the machine learning lecture of the Master in Computer Science program for five years with the conception of exercises and exams and tutoring. 该讲座的平均出席人数为每学期100人左右.
  • Supervised a team of 2-5 (paid) master's research assistants in their supporting tasks at the lab.
  • Supervised 15+ students in their master's and bachelor's theses in the research lab.
技术:机器学习, 数字信号处理, Data Science, Research, Neuroscience, Statistics, Data Analysis, Data Analytics, Writing & Editing, Matplotlib, Pandas, Scikit-learn, Time Series Analysis, Control Theory, Probability Theory, Linear Algebra, Deep Learning, Reinforcement Learning, Python, MATLAB, Technical Writing, PyTorch, Statistical Methods, Data Engineering, Seaborn, Neural Networks, Predictive Analytics, Statistical Modeling, Jupyter, AutoML, 人工智能(AI), NumPy, Consulting, 分类算法

Research Assistant

2012 - 2014
哥伦比亚国立大学
  • 作为唯一的讲师讲授模拟电子学课程. 除了准备讲座, exercises sheets, and exams, 我监督学生项目的执行.
  • Supervised one (paid) undergraduate teaching assistant for the lecture on analog electronics.
  • 开发了神经信号源定位的新方法, 结果在同行评议的期刊上发表了一篇科学论文.
技术:贝叶斯统计, Linear Algebra, Research, Neuroscience, MATLAB, Python, Probability Theory, Statistical Analysis, 人工智能(AI), NumPy

基于NLP方法的研究发现引擎

Conception and development of a framework for knowledge management and discovery in machine learning teams. 该产品被定义为机器学习研究的发现引擎, tackling the problem of information overflow like more than 100 machine learning papers uploaded daily to arXiv.

Key Activities
• Implemented a PoC system consisting of a discovery engine for machine learning research using large language models.
•设计和部署服务于发现引擎的云基础设施.
•创建商业模式和市场策略.
• Conducted user discovery and development interviews with more than 50 interviewees.

消费类产品的媒体组合模型

Development, implementation, and deployment of a marketing mix model based on Bayesian statistical modeling for the company's global operations.

Key Activities
•实施基于最新研究论文的学习模式.
•根据可用数据的特定属性定制模型.
•将模型部署到云上,供MLOps团队使用.
•利用来自业务部门的反馈反复改进模型.
• Presented results to several non-technical stakeholders in the business unit.

ML-based Adaptive Deep Brain Stimulation System for Essential Tremor Patients

http://www.frontiersin.org/articles/10.3389/fnhum.2020.541625/full
The first machine learning-based adaptive deep brain stimulation system implemented in freely moving essential tremor patients. This project was executed in cooperation with colleagues at the University of Washington.

Key Activities
• Established the cooperation between our research lab at the University of Freiburg (brain state decoding lab) and the University of Washington (biorobotics lab).
•构思并开发底层机器学习, control, 以及数字信号处理方法.
• Deployed the algorithms on a host PC and the embedded system of the patients' neurostimulators.
•执行数据收集实验.
•对收集的数据进行离线分析.
•撰写并编辑了同行评审出版物的最终手稿.

某足球队伤病数据分析

Descriptive and prescriptive analysis of injury data for a Bundesliga soccer team. 在我分析了数据并将结果呈现给所有利益相关者之后, 包括非技术人员, the project became part of the team's research department to investigate injury prevention and talent development.

Key Activities
•准备和清理来自多个数据库的数据.
•对数据进行描述性和规范性分析.
• Presented the results to all stakeholders, 包括非技术人员.

数据管理系统的部署

Handover of a data management system that a large European bank used for the warehousing and analysis of credit risk data.

Key Activities
•配置和部署UAT和生产环境.
•为后端和前端实现了CI/CD管道.

从大脑信号中解码帕金森病症状

http://www.sciencedirect.com/science/article/pii/S2213158220302138
A novel supervised ML approach for decoding the intensity of Parkinson's disease symptoms from brain signals. We collected data from seven patients undergoing deep brain stimulation therapy and showed that our novel ML approach improved the decoding performance of the symptoms.

Key Activities
•设计并执行数据收集实验.
•对数据进行预处理并进行探索性数据分析.
•构思并实现了新的ML方法.
• Validated the novel ML method with the collected data and a benchmark against state-of-the-art models, 包括深度卷积神经网络.
•应用AutoML对所有考虑的模型进行超参数优化.
• Wrote and edited the final manuscript published in a peer-reviewed journal.

神经科学中ML的数据增强框架

http://www.frontiersin.org/articles/10.3389/fninf.2019.00055/full
The novel framework developed in this project allows for an objective evaluation and benchmarking of novel ML algorithms to analyze neurological data.

我们应对了以下挑战:
• Scarcity of data available when using data-driven methods in the analysis of brain signals.
•可用标签不可靠
•原始信号中的高水平噪声.

Key Activities
• Conceived the idea.
•执行数据分析.
•撰写了发表在同行评议期刊上的科学手稿.

Languages

Python, SQL, R, Java, C#

Libraries/APIs

Scikit-learn, Matplotlib, Pandas, NumPy, PyTorch, React, TensorFlow

Tools

Spyder, MATLAB, Git, Seaborn, Jupyter, AutoML, PyCharm, Excel 2010

Paradigms

数据科学,用户验收测试(UAT)

Platforms

Windows, Linux, Jupyter Notebook, Docker, JBoss EAP, 亚马逊网络服务(AWS), Visual Studio Code (VS Code)

Other

数字信号处理, Programming, Time Series Analysis, Linear Algebra, Machine Learning, Neuroscience, Deep Learning, Research, Technical Writing, Statistics, Statistical Methods, Data Analytics, Data Analysis, Data Engineering, Neural Networks, Statistical Modeling, Predictive Analytics, Writing & Editing, Statistical Analysis, 人工智能(AI), Consulting, 分类算法, Probability Theory, Reinforcement Learning, Bayesian Statistics, Algorithms, Electronics, 自然语言处理(NLP), OpenAI, Text Processing, 生成预训练变压器(GPT), Circuit Design, Control Theory, Calculus, 机器学习操作(MLOps), Data Warehousing, CI/CD Pipelines, Generative Systems, 大型语言模型(llm), Business Planning, IT Project Management, 自然语言理解(NLU), Customer Research

Storage

数据库管理系统(DBMS)

2014 - 2020

PhD in Computer Science

弗莱堡大学-德国弗莱堡

2012 - 2014

工程硕士学位

哥伦比亚国立大学-马尼萨莱斯,哥伦比亚

2007 - 2012

电子工程工程师学位

哥伦比亚国立大学-马尼萨莱斯,哥伦比亚

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

开始你的无风险人才试验

与你选择的人才一起工作,试用最多两周. 只有当你决定雇佣他们时才付钱.

对顶尖人才的需求很大.

Start hiring