Sebastián Castaño
Verified Expert in Engineering
数据科学家和机器学习开发人员
Sebastián has a PhD in machine learning and data science and a decade of experience in interdisciplinary projects in medicine, banking, marketing, and consumer products, among others. 他的专长包括设计数据收集系统, 分析和建模复杂的数据, 开发和部署机器学习管道. 作为一个经验丰富的研究者和教育家, Sebastián constantly delivers compelling data-driven insights and intuitive tools for technical and non-technical colleagues.
Portfolio
Experience
Availability
Preferred Environment
Windows, Linux, Spyder, PyCharm, Jupyter Notebook, Scikit-learn, Visual Studio Code (VS Code), Git, Docker
The most amazing...
...我开发的项目是一个基于ml的, closed-loop system for optimizing brain stimulation therapy in Parkinson's disease and essential tremor patients.
Work Experience
Co-founder
Stealth Startup
- Co-founded a company that develops a knowledge management framework for IT teams.
- Developed and implemented a system for the analysis of multi-domain corpora using transformer-based NLP models.
- Designed and deployed AWS infrastructure for servicing a neural search engine.
机器学习工程师
全球食品和饮料公司
- Set up and rolled out an existing media mix model for a new geographical market and product family as a member of an MLOps team.
- Designed, implemented, and deployed a PoC for a media mix model based on Bayesian statistical modeling as a member of an ML R&D team.
- 领导由五名机器学习工程师和数据科学家组成的团队. The team developed, benchmarked, productized, 并为营销团队部署了下一代媒体组合模式.
Consultant
D-fine
- 验证了一家大型德国银行的信用风险和会计模型.
- Performed predictive and prescriptive statistical analysis of soccer players' data for injury prediction and talent development at a Bundesliga team.
- 为一家大型欧洲银行部署数据管理系统.
- 开发MLOps管道, 包括神经网络的架构优化, for an in-house project.
博士研究助理
University of Freiburg
- 开发了第一个基于机器学习的, closed-loop, 脑深部电刺激系统在自由活动患者中的应用. The projects related to this achievement were carried out in close collaboration with clinicians and industry partners.
- Established data-driven adaptive deep brain stimulation as a novel research field in the university.
- Published seven research articles in peer-reviewed scientific journals and 10+ contributions to scientific workshops and conferences in the fields of machine learning, data science, and neuroscience.
- Supported the machine learning lecture of the Master in Computer Science program for five years with the conception of exercises and exams and tutoring. 该讲座的平均出席人数为每学期100人左右.
- Supervised a team of 2-5 (paid) master's research assistants in their supporting tasks at the lab.
- Supervised 15+ students in their master's and bachelor's theses in the research lab.
Research Assistant
哥伦比亚国立大学
- 作为唯一的讲师讲授模拟电子学课程. 除了准备讲座, exercises sheets, and exams, 我监督学生项目的执行.
- Supervised one (paid) undergraduate teaching assistant for the lecture on analog electronics.
- 开发了神经信号源定位的新方法, 结果在同行评议的期刊上发表了一篇科学论文.
Experience
基于NLP方法的研究发现引擎
Key Activities
• Implemented a PoC system consisting of a discovery engine for machine learning research using large language models.
•设计和部署服务于发现引擎的云基础设施.
•创建商业模式和市场策略.
• Conducted user discovery and development interviews with more than 50 interviewees.
消费类产品的媒体组合模型
Key Activities
•实施基于最新研究论文的学习模式.
•根据可用数据的特定属性定制模型.
•将模型部署到云上,供MLOps团队使用.
•利用来自业务部门的反馈反复改进模型.
• Presented results to several non-technical stakeholders in the business unit.
ML-based Adaptive Deep Brain Stimulation System for Essential Tremor Patients
http://www.frontiersin.org/articles/10.3389/fnhum.2020.541625/fullKey Activities
• Established the cooperation between our research lab at the University of Freiburg (brain state decoding lab) and the University of Washington (biorobotics lab).
•构思并开发底层机器学习, control, 以及数字信号处理方法.
• Deployed the algorithms on a host PC and the embedded system of the patients' neurostimulators.
•执行数据收集实验.
•对收集的数据进行离线分析.
•撰写并编辑了同行评审出版物的最终手稿.
某足球队伤病数据分析
Key Activities
•准备和清理来自多个数据库的数据.
•对数据进行描述性和规范性分析.
• Presented the results to all stakeholders, 包括非技术人员.
数据管理系统的部署
Key Activities
•配置和部署UAT和生产环境.
•为后端和前端实现了CI/CD管道.
从大脑信号中解码帕金森病症状
http://www.sciencedirect.com/science/article/pii/S2213158220302138Key Activities
•设计并执行数据收集实验.
•对数据进行预处理并进行探索性数据分析.
•构思并实现了新的ML方法.
• Validated the novel ML method with the collected data and a benchmark against state-of-the-art models, 包括深度卷积神经网络.
•应用AutoML对所有考虑的模型进行超参数优化.
• Wrote and edited the final manuscript published in a peer-reviewed journal.
神经科学中ML的数据增强框架
http://www.frontiersin.org/articles/10.3389/fninf.2019.00055/full我们应对了以下挑战:
• Scarcity of data available when using data-driven methods in the analysis of brain signals.
•可用标签不可靠
•原始信号中的高水平噪声.
Key Activities
• Conceived the idea.
•执行数据分析.
•撰写了发表在同行评议期刊上的科学手稿.
Skills
Languages
Python, SQL, R, Java, C#
Libraries/APIs
Scikit-learn, Matplotlib, Pandas, NumPy, PyTorch, React, TensorFlow
Tools
Spyder, MATLAB, Git, Seaborn, Jupyter, AutoML, PyCharm, Excel 2010
Paradigms
数据科学,用户验收测试(UAT)
Platforms
Windows, Linux, Jupyter Notebook, Docker, JBoss EAP, 亚马逊网络服务(AWS), Visual Studio Code (VS Code)
Other
数字信号处理, Programming, Time Series Analysis, Linear Algebra, Machine Learning, Neuroscience, Deep Learning, Research, Technical Writing, Statistics, Statistical Methods, Data Analytics, Data Analysis, Data Engineering, Neural Networks, Statistical Modeling, Predictive Analytics, Writing & Editing, Statistical Analysis, 人工智能(AI), Consulting, 分类算法, Probability Theory, Reinforcement Learning, Bayesian Statistics, Algorithms, Electronics, 自然语言处理(NLP), OpenAI, Text Processing, 生成预训练变压器(GPT), Circuit Design, Control Theory, Calculus, 机器学习操作(MLOps), Data Warehousing, CI/CD Pipelines, Generative Systems, 大型语言模型(llm), Business Planning, IT Project Management, 自然语言理解(NLU), Customer Research
Storage
数据库管理系统(DBMS)
Education
PhD in Computer Science
弗莱堡大学-德国弗莱堡
工程硕士学位
哥伦比亚国立大学-马尼萨莱斯,哥伦比亚
电子工程工程师学位
哥伦比亚国立大学-马尼萨莱斯,哥伦比亚
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
开始你的无风险人才试验
对顶尖人才的需求很大.
Start hiring